BigData / Apache Spark

Difference between map and flatMap transformations.

map(func) returns a new distributed dataset formed by passing each element of the source through a function func. The result has exactly one output element for each input element.

flatMap(func) is similar to map, except that each input item can be mapped to 0 or more output items, so func should return a Seq rather than a single item. The returned sequences are then flattened into a single dataset.
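The difference is easiest to see when the same function is applied both ways. The sketch below is not Spark code: it models the two transformations on ordinary Python lists so it runs without a cluster. The helper names local_map and local_flat_map and the sample lines are made up for illustration; in PySpark the corresponding calls would be rdd.map(func) and rdd.flatMap(func).

```python
from itertools import chain


def local_map(func, items):
    """Hypothetical local model of map: exactly one output per input."""
    return [func(x) for x in items]


def local_flat_map(func, items):
    """Hypothetical local model of flatMap: func returns a sequence
    for each input, and all sequences are concatenated into one list."""
    return list(chain.from_iterable(func(x) for x in items))


# Sample input (made up): three lines, one of them empty.
lines = ["hello world", "", "apache spark rocks"]

# map keeps one result per line, so the output is a list of lists.
mapped = local_map(lambda line: line.split(), lines)

# flatMap flattens the per-line lists; the empty line contributes 0 items.
flat_mapped = local_flat_map(lambda line: line.split(), lines)

print(mapped)       # [['hello', 'world'], [], ['apache', 'spark', 'rocks']]
print(flat_mapped)  # ['hello', 'world', 'apache', 'spark', 'rocks']
```

With map the result always has as many elements as the input (three here), while with flatMap the count depends on what func returns for each item (five words in total, none from the empty line).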

