7. Checking Partition Number and Data Skew
On this page
This step checks whether there are enough partitions and if data is properly balanced across your cluster.
-
Check the total core to partition ratio per leaf for the database that is having trouble:
-
Run SHOW PARTITIONS on the database in question and compare that with the total number of cores in your cluster.
-
Is this ratio at least 1:1? If you have high concurrency, consider increasing the concurrency from 2, 3, or 4 to 1 core to partition ratio.
-
-
Check that your data is balanced across the properly by running on the database in question.
If you see a non-empty set, run on the database. -
Check that your data is redundant by running .
If you see a non-empty set, run -
Data Skew: see the Detecting and Resolving Data Skew.
Next Steps
-
If there are enough partitions, and no data skew is detected: move onto Step 8.
Last modified: June 22, 2022