Table of Contents

RDD - Partition

About

Spark Engine - Partition in RDD

Managememnt

set

rdd = sc.parallelize([1, 2, 3, 4], 2)

get

rdd.getNumPartitions

mapPartitions

Return a new RDD by applying a function to each partition of this RDD.