HashPartitioner
HashPartitioner is a Partitioner that distributes data across a configurable number of partitions (its partitions constructor argument) when shuffling.
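| Property | Description |
|---|---|
| numPartitions | Exactly the configured number of partitions |
| getPartition | 0 for null keys; for non-null keys, the key's hashCode modulo the number of partitions, adjusted so the result is never negative |
| equals | true for another HashPartitioner with the same number of partitions; otherwise false |
| hashCode | Exactly the configured number of partitions |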
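The partitioning rule can be illustrated with a minimal, self-contained sketch. The names `HashPartitionerSketch`, `SketchPartitioner` and `nonNegativeMod` are hypothetical stand-ins introduced here for illustration; this is not Spark's source code and needs no Spark dependency.

```scala
// Illustrative sketch only: a stand-in for a hash-based partitioner, not Spark's implementation.
object HashPartitionerSketch {

  // Maps x into the range [0, mod), even when x is negative.
  def nonNegativeMod(x: Int, mod: Int): Int = {
    val rawMod = x % mod
    if (rawMod < 0) rawMod + mod else rawMod
  }

  // Hypothetical partitioner with a fixed number of partitions.
  final class SketchPartitioner(val numPartitions: Int) {
    require(numPartitions > 0, s"Number of partitions ($numPartitions) must be positive in this sketch.")

    // null keys go to partition 0; other keys are placed by their hashCode.
    def getPartition(key: Any): Int = key match {
      case null => 0
      case _    => nonNegativeMod(key.hashCode, numPartitions)
    }

    // Two partitioners are equal when they have the same number of partitions.
    override def equals(other: Any): Boolean = other match {
      case p: SketchPartitioner => p.numPartitions == numPartitions
      case _                    => false
    }

    override def hashCode: Int = numPartitions
  }

  def main(args: Array[String]): Unit = {
    val partitioner = new SketchPartitioner(4)
    println(partitioner.getPartition(null)) // null key
    println(partitioner.getPartition(-7))   // negative hash still yields a valid partition id
    println(partitioner.getPartition("k1"))
  }
}
```

Adjusting a negative remainder matters because hashCode may be negative on the JVM, and a partition id must always fall between 0 and numPartitions - 1.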
Note: HashPartitioner is the default Partitioner for coalesce transformation with shuffle enabled, e.g. calling repartition.
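It is possible to re-shuffle data even though all the records for a key k1 are already on a single Spark executor (in its BlockManager, to be precise). The target depends only on the key's hash, not on where the records currently reside: when HashPartitioner's result for k1 is 3, the records for k1 go to partition 3 (partition ids are zero-based), which may be processed on a different executor.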
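A small simulation can make the re-shuffle concrete. This is a sketch under stated assumptions: partitions are modelled as plain in-memory sequences, and `ReshuffleSketch`, `targetPartition` and `reshuffle` are hypothetical helpers rather than Spark APIs.

```scala
// Illustrative sketch only: models partitions as in-memory sequences, no Spark involved.
object ReshuffleSketch {

  // Maps x into the range [0, mod), even when x is negative.
  def nonNegativeMod(x: Int, mod: Int): Int = {
    val r = x % mod
    if (r < 0) r + mod else r
  }

  // Target partition for a key under hash partitioning (null keys go to partition 0).
  def targetPartition(key: Any, numPartitions: Int): Int =
    if (key == null) 0 else nonNegativeMod(key.hashCode, numPartitions)

  // Regroups (key, value) records by target partition, ignoring their current location.
  def reshuffle[K, V](current: Seq[Seq[(K, V)]], numPartitions: Int): Map[Int, Seq[(K, V)]] =
    current.flatten.groupBy { case (k, _) => targetPartition(k, numPartitions) }

  def main(args: Array[String]): Unit = {
    // All records for "k1" already sit together in the first partition.
    val current = Seq(Seq("k1" -> 1, "k1" -> 2), Seq("k2" -> 3))
    reshuffle(current, 4).toSeq.sortBy(_._1).foreach { case (partition, records) =>
      println(s"partition $partition: $records")
    }
  }
}
```

In this sketch the records for "k1" start out together in the first partition, yet they are still moved, because their destination is computed from the key's hash alone.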