About

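A shuffle redistributes data, row by row, between partitions. Operations such as joins and aggregations trigger it so that rows with the same key end up in the same partition.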
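The idea can be sketched in plain Python, without Spark. This is a simplified model and not Spark's implementation: the function name `shuffle_by_key`, the sample rows, and the use of Python's built-in `hash` are all assumptions made for illustration (Spark uses its own hash function and moves the rows over the network).

```python
from collections import defaultdict


def shuffle_by_key(partitions, num_output_partitions):
    """Redistribute (key, value) rows so that equal keys share one partition."""
    output = defaultdict(list)
    for partition in partitions:
        for key, value in partition:
            # Hash partitioning: the target depends only on the key,
            # so every row with the same key lands in the same partition.
            target = hash(key) % num_output_partitions
            output[target].append((key, value))
    return [output[i] for i in range(num_output_partitions)]


# Three input partitions; each key is spread over two of them.
input_partitions = [
    [(1, "a"), (2, "b")],
    [(1, "c"), (3, "d")],
    [(2, "e"), (3, "f")],
]

shuffled = shuffle_by_key(input_partitions, num_output_partitions=2)
for index, rows in enumerate(shuffled):
    print(index, rows)
```

After the shuffle, all rows for a given key sit in a single partition, which is what lets a join or aggregation process each key locally.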

Conf

  • spark.sql.shuffle.partitions - Configures the number of partitions to use when shuffling data for joins or aggregations. The default is 200.
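As a minimal sketch, the setting can be applied from PySpark either when the session is built or later on a running session. The application name and the values 64 and 32 are arbitrary choices for illustration, not recommendations, and the snippet assumes PySpark is installed.

```python
from pyspark.sql import SparkSession

# Set the number of shuffle partitions when the session is created.
spark = (
    SparkSession.builder
    .appName("shuffle-partitions-example")  # hypothetical application name
    .config("spark.sql.shuffle.partitions", "64")  # illustrative value
    .getOrCreate()
)

# It is a runtime SQL setting, so it can also be changed on a live session.
spark.conf.set("spark.sql.shuffle.partitions", "32")  # illustrative value
print(spark.conf.get("spark.sql.shuffle.partitions"))

spark.stop()
```

The value applies to shuffles caused by DataFrame and SQL joins or aggregations. When adaptive query execution is enabled, Spark may coalesce shuffle partitions, so the observed number can be lower than the configured one.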