Slide 5
RDD API
• API intentionally very similar to standard Scala collections
(Array[A], List[A], etc)
• map, flatMap, filter, groupBy, count, foreach, union, zip, ...
• Additional operations such as (outer) join, cogroup, top-N,
reduceByKey, pipe, ...
• Transformations are lazily evaluated when required by actions
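
A minimal sketch of the points above, not taken from the talk itself. It runs the same word count once on a plain Scala Seq and once on an RDD, to show how closely the two APIs line up and where laziness comes in. The local master setting, the app name, the object name RddApiSketch and the sample sentences are all assumptions made for illustration, and Spark is assumed to be on the classpath.

```scala
import org.apache.spark.{SparkConf, SparkContext}
// Older Spark releases need this import for pair-RDD operations such as
// reduceByKey; on newer releases it is redundant but harmless.
import org.apache.spark.SparkContext._

object RddApiSketch {
  def main(args: Array[String]): Unit = {
    // Illustrative settings: run locally with two threads.
    val conf = new SparkConf().setMaster("local[2]").setAppName("rdd-api-sketch")
    val sc = new SparkContext(conf)
    try {
      val lines = Seq("to be or not to be", "that is the question")

      // Plain Scala collection: every step runs eagerly, in this JVM.
      val localCounts: Map[String, Int] =
        lines.flatMap(_.split(" "))
          .groupBy(identity)
          .map { case (word, occurrences) => (word, occurrences.size) }

      // RDD: same style of calls, but these are transformations.
      // They only describe the computation; nothing is executed yet.
      val words = sc.parallelize(lines).flatMap(_.split(" "))
      val counts = words.map(word => (word, 1)).reduceByKey(_ + _)

      // Actions (collect, count) are what trigger evaluation.
      val rddCounts: Map[String, Int] = counts.collect().toMap
      val totalWords: Long = words.count()

      // Both versions should agree on the result.
      assert(rddCounts == localCounts)
      println(s"total words: $totalWords")
      println(rddCounts.toSeq.sortBy(_._1).mkString(", "))
    } finally {
      sc.stop()
    }
  }
}
```

reduceByKey stands in for groupBy on the RDD side because it is one of the additional pair operations listed above and combines values per key before shuffling. An RDD groupBy followed by a size count would give the same result in this sketch.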
Monday, 12 May 14