on Spark (MapReduce deterministic, idempotent tasks), scales out and is fault-tolerant, supports low-latency, interactive queries through in-memory computation, supports both SQL and complex analytics such as machine learning, is compatible with Apache Hive (storage, serdes, UDFs, types, metadata).