Slide 18
Shuffle operations
val fruitsRDD = sc.parallelize(
  List(("apples", 4), ("oranges", 5), ("apples", 1)))
fruitsRDD.reduceByKey(_ + _) // apples -> 5, oranges -> 5
fruitsRDD.groupByKey()       // ("apples", List(4,1)), ("oranges", List(5))
fruitsRDD.sortByKey()        // ("apples",4), ("apples",1), ("oranges",5)
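
To make the per-key results above concrete without a cluster, here is a minimal sketch that mimics the three operations on an ordinary Scala List. It is only a local analogue, not how Spark implements them: the object name ShuffleSemantics and its helper methods are hypothetical, and nothing here moves data between partitions. It shows why a shuffle is needed at all: each result depends on every value that shares a key, wherever that value happens to live.

```scala
// Local-collection analogue of the slide's operations; no Spark required.
// ShuffleSemantics and its methods are illustrative names, not Spark APIs.
object ShuffleSemantics {
  val fruits: List[(String, Int)] =
    List(("apples", 4), ("oranges", 5), ("apples", 1))

  // Analogue of reduceByKey(_ + _): sum all values that share a key.
  def reduceByKey(pairs: List[(String, Int)]): Map[String, Int] =
    pairs.groupBy(_._1).map { case (key, kvs) => key -> kvs.map(_._2).sum }

  // Analogue of groupByKey: gather all values that share a key.
  def groupByKey(pairs: List[(String, Int)]): Map[String, List[Int]] =
    pairs.groupBy(_._1).map { case (key, kvs) => key -> kvs.map(_._2) }

  // Analogue of sortByKey: order the pairs by key (List.sortBy is stable).
  def sortByKey(pairs: List[(String, Int)]): List[(String, Int)] =
    pairs.sortBy(_._1)

  def main(args: Array[String]): Unit = {
    println(reduceByKey(fruits))
    println(groupByKey(fruits))
    println(sortByKey(fruits))
  }
}
```

On a real RDD the same calls are lazy transformations, so an action such as collect() is needed to see results. Spark also does not guarantee the order of values within a group, and the relative order of equal keys after sortByKey should not be relied on.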