About
A shuffle moves data, row by row, between partitions.
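The idea can be sketched in plain Python, without Spark. This is only an illustration under stated assumptions: the `shuffle` function, its parameters, and the sample rows are hypothetical, and Python's built-in `hash` stands in for whatever hash function the engine actually uses. Each row is read from its input partition and sent to the output partition chosen by the hash of its key, so rows that share a key end up together.

```python
from collections import defaultdict


def shuffle(partitions, key_fn, num_partitions):
    """Redistribute rows so that rows sharing a key land in the same output partition.

    Hypothetical helper for illustration; not Spark's implementation.
    """
    shuffled = defaultdict(list)
    for partition in partitions:      # read every input partition
        for row in partition:         # move its rows one at a time
            target = hash(key_fn(row)) % num_partitions
            shuffled[target].append(row)
    # Return exactly num_partitions lists, including empty ones.
    return [shuffled[i] for i in range(num_partitions)]


# Three input partitions of (key, value) rows; keys are spread across partitions.
before = [
    [(1, "a"), (2, "b")],
    [(1, "c"), (3, "d")],
    [(2, "e"), (3, "f")],
]

after = shuffle(before, key_fn=lambda row: row[0], num_partitions=2)
```

After the call, `after` holds two partitions and every key appears in only one of them, which is what a join or an aggregation on that key needs.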
Articles Related
Spark DataSet - Partition
Spark Engine - Aggregation
Spark Engine - Join
Conf
spark.sql.shuffle.partitions - Configures the number of partitions to use when shuffling data for joins or aggregations.
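A minimal sketch of setting this property at runtime from PySpark follows. The helper name `with_shuffle_partitions` and the value 8 are assumptions made for illustration; the only Spark call used is `spark.conf.set`, and the `spark` argument is expected to be an existing `SparkSession`. When the property is left unset, Spark uses a default of 200.

```python
SHUFFLE_PARTITIONS_KEY = "spark.sql.shuffle.partitions"


def with_shuffle_partitions(spark, num_partitions):
    """Set the shuffle partition count on an existing SparkSession and return it.

    Hypothetical convenience wrapper around spark.conf.set.
    """
    if num_partitions < 1:
        raise ValueError("num_partitions must be a positive integer")
    # Spark configuration values are stored as strings.
    spark.conf.set(SHUFFLE_PARTITIONS_KEY, str(num_partitions))
    return spark
```

With PySpark installed, a call such as `with_shuffle_partitions(SparkSession.builder.getOrCreate(), 8)` would apply the setting to the joins and aggregations run afterwards in that session.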