Agenda
• Refreshing knowledge on Java 8 Streams
• Distribute and Conquer
• Distributed Data
• Distributed Streams
• How we did all this
Slide 4
Slide 4 text
Java 8 Streams
Slide 5
Slide 5 text
Java 8 Streams…
• An abstraction that represents a sequence of
elements
• Not a data structure
• Conveys elements from a source through a pipeline
of operations
• Operations don’t modify the source
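To make these points concrete, here is a minimal sketch of a stream pipeline. The class name, method name, and sample names are illustrative assumptions, not taken from the talk. It shows a source list being conveyed through `filter`, `map`, and `sorted` and collected into a new list while the source list stays unchanged.

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

// Illustrative class; the name and the sample data are assumptions.
class StreamBasics {

    // Pipeline: source -> filter -> map -> sorted -> collect.
    static List<String> longNamesUpperCased(List<String> names) {
        return names.stream()                       // the list is the source, the stream holds no data itself
                .filter(name -> name.length() > 4)  // intermediate operation, evaluated lazily
                .map(String::toUpperCase)           // intermediate operation, evaluated lazily
                .sorted()                           // natural (alphabetical) order
                .collect(Collectors.toList());      // terminal operation, triggers the traversal
    }

    public static void main(String[] args) {
        List<String> names = Arrays.asList("Pierre", "Anna", "Natasha", "Andrew");
        List<String> result = longNamesUpperCased(names);

        System.out.println(result); // [ANDREW, NATASHA, PIERRE]
        System.out.println(names);  // [Pierre, Anna, Natasha, Andrew] (source is untouched)
    }
}
```

Nothing is computed until the terminal `collect` call runs, which is why a stream is described as a pipeline rather than a data structure.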
Slide 6
Slide 6 text
Why should I care about
the Stream API?
• You’re a Java developer
Slide 7
Slide 7 text
What does a regular Java developer think about Scala?
advanced
Slide 8
Slide 8 text
Why should I care about
the Stream API?
• You’re a Java developer
• Many Java developers know Java
• It’s all about data processing
Solution
• Serializable versions of the interfaces
• Introducing DistributedStream
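A rough sketch of why serializable interfaces matter: a lambda assigned to a plain `java.util.function.Function` is not serializable, so it cannot be shipped to another node. Declaring an interface that also extends `Serializable` fixes that. The interface name `SerializableFunction` and the helper methods below are hypothetical and only illustrate the idea; the interfaces that actually back DistributedStream may look different.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.util.function.Function;

// Illustrative class; all names here are assumptions, not the library's API.
class SerializableLambdas {

    // Hypothetical interface: a Function whose lambdas can be serialized.
    interface SerializableFunction<T, R> extends Function<T, R>, Serializable { }

    // Serializes the object to bytes and reads it back, standing in for
    // "send the function over the network to another member".
    @SuppressWarnings("unchecked")
    static <T extends Serializable> T roundTrip(T obj) {
        try {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
                out.writeObject(obj);
            }
            try (ObjectInputStream in =
                         new ObjectInputStream(new ByteArrayInputStream(bytes.toByteArray()))) {
                return (T) in.readObject();
            }
        } catch (IOException | ClassNotFoundException e) {
            throw new IllegalStateException("serialization round trip failed", e);
        }
    }

    // Applies a deserialized copy of the function instead of the original.
    static int lengthViaCopy(String word) {
        SerializableFunction<String, Integer> length = String::length;
        return roundTrip(length).apply(word);
    }

    public static void main(String[] args) {
        System.out.println(lengthViaCopy("Pierre")); // 6
    }
}
```

The same round trip with a lambda typed as a plain `Function` would fail with a `NotSerializableException`, which is the problem the slide's solution addresses.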
Slide 29
Slide 29 text
Slide 30
Slide 30 text
No content
Slide 31
Slide 31 text
Jet Streams
Slide 32
Slide 32 text
No content
Slide 33
Slide 33 text
What’s Hazelcast Jet?
• General purpose distributed data processing
framework
• Uses a directed acyclic graph (DAG) to model data flow
• Built on top of Hazelcast IMDG
• Comparable to Apache Spark or Apache Flink
Slide 34
Slide 34 text
No content
Slide 35
Slide 35 text
DAG
Slide 36
Slide 36 text
Job Execution
Slide 37
Slide 37 text
No content
Slide 38
Slide 38 text
Future (It’s bright!)
• Memory module for processing big data
• Higher-level streaming and batching APIs
• Reactive Streams
• Distributed classloading
• Integrations (HDFS/YARN/Mesos)
Slide 39
Slide 39 text
Your fuel, our Jet Engine
• Public release – Feb 7th.
• Developer Preview today - yay!
• http://hazelcast.org/jet-signup
• Send me a note [email protected]
• Follow @hazelcast and @gamussa (duh!!)
• Your questions #hazelcast #hazelcastjet
Slide 40
Slide 40 text
Conclusion
• The Java Stream API provides a very wide range of data
processing tools
• War and Peace is a big book (a lot of data)!
• Now we’re pretty sure that Andrew and Pierre are
the main characters
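The kind of analysis behind that last point can be sketched as a word count with the plain (non-distributed) Stream API. This is an assumption about the approach, not the code from the talk; the class name, method names, and the sample sentence are illustrative.

```java
import java.util.Map;
import java.util.function.Function;
import java.util.regex.Pattern;
import java.util.stream.Collectors;

// Illustrative class; names and sample text are assumptions.
class WordCount {

    private static final Pattern NON_WORD = Pattern.compile("\\W+");

    // Splits the text into lower-cased words and counts each one.
    static Map<String, Long> countWords(String text) {
        return NON_WORD.splitAsStream(text.toLowerCase())
                .filter(word -> !word.isEmpty())
                .collect(Collectors.groupingBy(Function.identity(), Collectors.counting()));
    }

    // Returns the word with the highest count, or "" for an empty map.
    static String mostFrequent(Map<String, Long> counts) {
        return counts.entrySet().stream()
                .max(Map.Entry.comparingByValue())
                .map(Map.Entry::getKey)
                .orElse("");
    }

    public static void main(String[] args) {
        Map<String, Long> counts = countWords("Pierre met Andrew. Pierre smiled.");
        System.out.println(counts.get("pierre"));  // 2
        System.out.println(mostFrequent(counts));  // pierre
    }
}
```

On a whole book, the same pipeline would be fed the lines of the text instead of a single sentence.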