Exercise: Spark’s Hello World using Spark shell and Scala
Run Spark shell and count the number of words in a file using MapReduce pattern.
-
Use
sc.textFileto read the file into memory -
Use
RDD.flatMapfor a mapper step -
Use
reduceByKeyfor a reducer step