Tune the spark.sql.shuffle.partitions . Partition the input dataset appropriately so each task size is not too big. Use the Spark UI to study ...
確定! 回上一頁