Function = transformation?

PySpark - Closure

Spark automatically creates closures for functions that run on RDDs at the workers, and for any global variables that those functions use. One closure is sent per worker for every task. closures...
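
The closure behaviour can be illustrated with a small sketch in plain Python that does not use Spark at all. It assumes a hypothetical `run_task` helper that plays the role of a worker, and a `captured` dictionary standing in for the global variables a closure would carry. Each task works on its own copy, so changes made inside a task are not visible to the driver, which is how closure variables are generally understood to behave in Spark.

```python
import copy

# Hypothetical stand-in for a worker running one task (not a Spark API).
def run_task(func, captured, partition):
    # Each task receives its own copy of the captured variables,
    # much as a shipped closure would.
    local_vars = copy.deepcopy(captured)
    return [func(x, local_vars) for x in partition], local_vars

# Variables defined on the "driver" side.
captured = {"offset": 10, "counter": 0}

def add_offset(x, env):
    env["counter"] += 1          # mutates the task's copy only
    return x + env["offset"]

# Two partitions, so two tasks and two copies of the closure.
partitions = [[1, 2], [3, 4, 5]]
results = []
for part in partitions:
    out, _ = run_task(add_offset, captured, part)
    results.extend(out)

print(results)                   # [11, 12, 13, 14, 15]
print(captured["counter"])       # 0, the driver's copy is unchanged
```

In real PySpark code, an accumulator is the usual way to collect such counts back on the driver.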
Spark - (Map|flatMap)

The Spark implementation of the map step of MapReduce. map(func) returns a new distributed dataset that is formed by passing each element of the source through a function. flatMap(func) is similar to map but...
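
The difference between the two can be sketched locally with plain Python lists. This is an analogy, not Spark code: the sample `lines` data and the `split_words` function are assumptions made for illustration, and the comments name the RDD call that each expression mimics.

```python
from itertools import chain

# Illustrative input; on a cluster this would be an RDD of lines.
lines = ["a b", "c d e"]

def split_words(line):
    return line.split(" ")

# Like rdd.map(split_words): one output element per input element.
mapped = [split_words(line) for line in lines]

# Like rdd.flatMap(split_words): the per-element results are flattened
# into a single sequence.
flat_mapped = list(chain.from_iterable(split_words(line) for line in lines))

print(mapped)       # [['a', 'b'], ['c', 'd', 'e']]
print(flat_mapped)  # ['a', 'b', 'c', 'd', 'e']
```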
Spark - Filter Transformation

filter(func) returns a new dataset (RDD) that is formed by selecting those elements of the source for which the function returns true.
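
As a minimal local sketch of the same idea, Python's built-in `filter` applies a predicate in the same way. The `numbers` list and the `is_even` predicate are made up for the example; with an RDD the equivalent call would be `rdd.filter(is_even)`.

```python
# Illustrative input; on a cluster this would be an RDD of integers.
numbers = [1, 2, 3, 4, 5, 6]

def is_even(n):
    # Predicate: elements for which this returns True are kept.
    return n % 2 == 0

# Like rdd.filter(is_even).collect()
evens = list(filter(is_even, numbers))

print(evens)  # [2, 4, 6]
```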

