processing. Write applications quickly in Java, Scala, Python, and R. Combine SQL, streaming, and complex analytics. Spark runs on Hadoop, Mesos, standalone, or in the cloud. It can access diverse data sources including HDFS, Cassandra, HBase, and S3.
distributed stream and batch data processing. Flink’s core is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations over data streams.
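As a rough illustration of how a single streaming dataflow can serve both batch and stream input, here is a minimal plain-Python sketch. It does not use Flink's API. The operator names (`tokenize`, `running_count`, `pipeline`) and the `endless_source` generator are hypothetical, chosen only for this example, and distribution and fault tolerance are left out entirely.

```python
from collections import Counter
from itertools import islice
from typing import Iterable, Iterator, Tuple


def tokenize(lines: Iterable[str]) -> Iterator[str]:
    """Operator 1: split each incoming line into lowercase words."""
    for line in lines:
        for word in line.lower().split():
            yield word


def running_count(words: Iterable[str]) -> Iterator[Tuple[str, int]]:
    """Operator 2: emit an updated (word, count) pair for every word seen."""
    counts: Counter = Counter()
    for word in words:
        counts[word] += 1
        yield word, counts[word]


def pipeline(source: Iterable[str]) -> Iterator[Tuple[str, int]]:
    """Chain the operators; records flow through one at a time."""
    return running_count(tokenize(source))


def endless_source() -> Iterator[str]:
    """Hypothetical unbounded source that never finishes on its own."""
    while True:
        yield "spark flink"


# Bounded input (batch): drain the pipeline and keep the final counts.
batch_result = dict(pipeline(["flink streams data", "flink batches data"]))

# Unbounded input (stream): the same pipeline, consumed incrementally.
stream_result = list(islice(pipeline(endless_source()), 4))
```

In this sketch the batch case is simply the stream case with an input that happens to end, so the same operators are reused unchanged. A real engine would also partition the records across machines and checkpoint operator state.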
in Java and Scala, 2. DataSet API for static data embedded in Java, Scala, and Python, and 3. Table API with a SQL-like expression language embedded in Java and Scala.
“must have” in the Hadoop toolkit and has entered the mainstream. Within the big data landscape there are multiple approaches to accessing, analyzing, and manipulating data in Hadoop. ANSI SQL completeness (and the ability to tolerate machine-generated SQL), developer and analyst skillsets, and architecture tradeoffs
massive storage for any kind of data, enormous processing power, and the ability to handle virtually limitless concurrent tasks or jobs. This includes both unstructured and structured data.
conversation. On Twitter it repeatedly posted incendiary and racist remarks, and spewed rants about things like drug taking and getting into trouble. http://gigazine.net/news/20160329-reason-microso -tay- crazy/