date:20200818

[spark] branch master updated (891c5e6 -> e15ae60)

2020-08-18 Thread gurwls223

This is an automated email from the ASF dual-hosted git repository.

gurwls223 pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.


from 891c5e6  [MINOR][DOCS] Add KMeansSummary and InheritableThread to 
documentation
 add e15ae60  [SPARK-32550][SQL] Make SpecificInternalRow constructors 
faster by using while loops instead of maps

No new revisions were added by this update.

Summary of changes:
 .../catalyst/expressions/SpecificInternalRow.scala | 49 +++---
 1 file changed, 34 insertions(+), 15 deletions(-)


-
To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org
For additional commands, e-mail: commits-h...@spark.apache.org

[spark] branch master updated (f33b64a -> 891c5e6)

2020-08-18 Thread gurwls223

This is an automated email from the ASF dual-hosted git repository.

gurwls223 pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.


from f33b64a  [SPARK-32652][SQL] ObjectSerializerPruning fails for 
RowEncoder
 add 891c5e6  [MINOR][DOCS] Add KMeansSummary and InheritableThread to 
documentation

No new revisions were added by this update.

Summary of changes:
 python/docs/source/reference/pyspark.ml.rst | 1 +
 python/docs/source/reference/pyspark.rst| 1 +
 python/pyspark/ml/clustering.py | 2 +-
 3 files changed, 3 insertions(+), 1 deletion(-)



[spark] branch branch-3.0 updated (a36514e -> 753d414)

2020-08-18 Thread gurwls223

This is an automated email from the ASF dual-hosted git repository.

gurwls223 pushed a change to branch branch-3.0
in repository https://gitbox.apache.org/repos/asf/spark.git.


from a36514e  [3.0][SPARK-32518][CORE] 
CoarseGrainedSchedulerBackend.maxNumConcurrentTasks should consider all kinds 
of resources
 add 753d414  [SPARK-32652][SQL] ObjectSerializerPruning fails for 
RowEncoder

No new revisions were added by this update.

Summary of changes:
 .../spark/sql/catalyst/optimizer/objects.scala | 10 ++-
 .../optimizer/ObjectSerializerPruningSuite.scala   | 33 +-
 .../spark/sql/DatasetOptimizationSuite.scala   | 13 +++--
 3 files changed, 52 insertions(+), 4 deletions(-)



[spark] branch master updated (1b39215 -> f33b64a)

2020-08-18 Thread gurwls223

This is an automated email from the ASF dual-hosted git repository.

gurwls223 pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.


from 1b39215  [SPARK-32018][FOLLOWUP][DOC] Add migration guide for decimal 
value overflow in sum aggregation
 add f33b64a  [SPARK-32652][SQL] ObjectSerializerPruning fails for 
RowEncoder

No new revisions were added by this update.

Summary of changes:
 .../spark/sql/catalyst/optimizer/objects.scala | 10 ++-
 .../optimizer/ObjectSerializerPruningSuite.scala   | 33 +-
 .../spark/sql/DatasetOptimizationSuite.scala   | 13 +++--
 3 files changed, 52 insertions(+), 4 deletions(-)



[spark] branch master updated (70964e7 -> bfd8c34)

2020-08-18 Thread gurwls223

This is an automated email from the ASF dual-hosted git repository.

gurwls223 pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.


from 70964e7  [SPARK-21040][CORE][FOLLOW-UP] Only calculate 
executorKillTime when speculation is enabled
 add bfd8c34  [SPARK-32645][INFRA] Upload unit-tests.log as an artifact

No new revisions were added by this update.

Summary of changes:
 .github/workflows/build_and_test.yml | 6 ++
 1 file changed, 6 insertions(+)



[spark] branch master updated (bfd8c34 -> 1b39215)

2020-08-18 Thread gengliang

This is an automated email from the ASF dual-hosted git repository.

gengliang pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.


from bfd8c34  [SPARK-32645][INFRA] Upload unit-tests.log as an artifact
 add 1b39215  [SPARK-32018][FOLLOWUP][DOC] Add migration guide for decimal 
value overflow in sum aggregation

No new revisions were added by this update.

Summary of changes:
 docs/sql-migration-guide.md | 2 ++
 1 file changed, 2 insertions(+)



[spark] branch master updated (babb654 -> 70964e7)

2020-08-18 Thread wenchen

This is an automated email from the ASF dual-hosted git repository.

wenchen pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.


from babb654  [SPARK-32647][INFRA] Report SparkR test results with JUnit 
reporter
 add 70964e7  [SPARK-21040][CORE][FOLLOW-UP] Only calculate 
executorKillTime when speculation is enabled

No new revisions were added by this update.

Summary of changes:
 .../scala/org/apache/spark/scheduler/TaskSetManager.scala | 15 +--
 1 file changed, 9 insertions(+), 6 deletions(-)



[spark] branch master updated (d0dfe49 -> babb654)

2020-08-18 Thread gurwls223

This is an automated email from the ASF dual-hosted git repository.

gurwls223 pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.


from d0dfe49  [MINOR][INFRA] Rename master.yml to build_and_test.yml
 add babb654  [SPARK-32647][INFRA] Report SparkR test results with JUnit 
reporter

No new revisions were added by this update.

Summary of changes:
 R/pkg/tests/run-all.R | 9 ++---
 appveyor.yml  | 4 ++--
 2 files changed, 8 insertions(+), 5 deletions(-)



[spark] branch master updated (21e0dd04 -> d0dfe49)

2020-08-18 Thread gengliang

This is an automated email from the ASF dual-hosted git repository.

gengliang pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.


from 21e0dd04 [SPARK-32119][FOLLOWUP][DOC] Update monitoring doc following 
the improvement in SPARK-32119
 add d0dfe49  [MINOR][INFRA] Rename master.yml to build_and_test.yml

No new revisions were added by this update.

Summary of changes:
 .github/workflows/{master.yml => build_and_test.yml} | 0
 1 file changed, 0 insertions(+), 0 deletions(-)
 rename .github/workflows/{master.yml => build_and_test.yml} (100%)



[spark] branch master updated: [SPARK-32119][FOLLOWUP][DOC] Update monitoring doc following the improvement in SPARK-32119

2020-08-18 Thread sarutak

This is an automated email from the ASF dual-hosted git repository.

sarutak pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git


The following commit(s) were added to refs/heads/master by this push:
 new 21e0dd04 [SPARK-32119][FOLLOWUP][DOC] Update monitoring doc following 
the improvement in SPARK-32119
21e0dd04 is described below

commit 21e0dd0461ef6496bea1436687fae46463f310b2
Author: Luca Canali 
AuthorDate: Tue Aug 18 18:53:34 2020 +0900

[SPARK-32119][FOLLOWUP][DOC] Update monitoring doc following the 
improvement in SPARK-32119

### What changes were proposed in this pull request?
Update monitoring doc following the improvement/fix in SPARK-32119.

### Why are the changes needed?
SPARK-32119 removes the limitation listed in the monitoring doc: 
"Distribution of the jar files containing the plugin code is currently not done 
by Spark."

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
Not relevant

Closes #29463 from LucaCanali/followupSPARK32119.

Authored-by: Luca Canali 
Signed-off-by: Kousuke Saruta 
---
 docs/monitoring.md | 6 --
 1 file changed, 6 deletions(-)

diff --git a/docs/monitoring.md b/docs/monitoring.md
index 31fc160..97948f6 100644
--- a/docs/monitoring.md
+++ b/docs/monitoring.md
@@ -1366,9 +1366,3 @@ Both take a comma-separated list of class names that implement the
 possible for one list to be placed in the Spark default config file, allowing users to
 easily add other plugins from the command line without overwriting the config file's list. Duplicate
 plugins are ignored.
-
-Distribution of the jar files containing the plugin code is currently not done by Spark. The user
-or admin should make sure that the jar files are available to Spark applications, for example, by
-including the plugin jar with the Spark distribution. The exception to this rule is the YARN
-backend, where the --jars command line option (or equivalent config entry) can be
-used to make the plugin code available to both executors and cluster-mode drivers.
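
The context lines of the hunk above describe two comma-separated plugin lists that are combined, with duplicate plugins ignored. A minimal sketch of that merge behaviour follows. The object `PluginListSketch`, its methods, and the `com.example.*` class names are hypothetical and are not Spark's actual implementation; the sketch also assumes the first occurrence of a duplicate is the one kept.

```scala
// Illustrative sketch only: hypothetical names, not Spark's internal code.
object PluginListSketch {
  /** Splits a comma-separated list of class names, trimming whitespace
    * and dropping empty entries (so an empty string yields no plugins). */
  def parse(list: String): Seq[String] =
    list.split(",").map(_.trim).filter(_.nonEmpty).toSeq

  /** Combines the config-file list with the command-line list.
    * `distinct` keeps the first occurrence of each class name
    * (an assumption of this sketch), so duplicates are ignored. */
  def merge(defaultList: String, commandLineList: String): Seq[String] =
    (parse(defaultList) ++ parse(commandLineList)).distinct
}
```

For example, `PluginListSketch.merge("com.example.A,com.example.B", "com.example.B, com.example.C")` yields the three distinct class names in their original order.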


-
To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org
For additional commands, e-mail: commits-h...@spark.apache.org

[spark] branch branch-3.0 updated (6cdc32f -> a36514e)

2020-08-18 Thread wenchen

This is an automated email from the ASF dual-hosted git repository.

wenchen pushed a change to branch branch-3.0
in repository https://gitbox.apache.org/repos/asf/spark.git.


from 6cdc32f  [SPARK-32622][SQL][TEST] Add case-sensitivity test for ORC 
predicate pushdown
 add a36514e  [3.0][SPARK-32518][CORE] 
CoarseGrainedSchedulerBackend.maxNumConcurrentTasks should consider all kinds 
of resources

No new revisions were added by this update.

Summary of changes:
 .../main/scala/org/apache/spark/SparkContext.scala | 10 +++---
 .../org/apache/spark/internal/config/Tests.scala   | 15 +
 .../scheduler/BarrierJobAllocationFailed.scala |  4 +--
 .../org/apache/spark/scheduler/DAGScheduler.scala  | 10 +++---
 .../spark/scheduler/ExecutorResourceInfo.scala |  1 +
 .../apache/spark/scheduler/SchedulerBackend.scala  |  3 +-
 .../apache/spark/scheduler/TaskSchedulerImpl.scala | 36 +-
 .../cluster/CoarseGrainedSchedulerBackend.scala| 22 +++--
 .../spark/BarrierStageOnSubmittedSuite.scala   | 36 ++
 .../scala/org/apache/spark/SparkContextSuite.scala |  2 ++
 10 files changed, 124 insertions(+), 15 deletions(-)


-
To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org
For additional commands, e-mail: commits-h...@spark.apache.org

[spark] branch master updated (b33066f -> 1ac23de)

2020-08-18 Thread wenchen

This is an automated email from the ASF dual-hosted git repository.

wenchen pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.


from b33066f  [SPARK-32622][SQL][TEST] Add case-sensitivity test for ORC 
predicate pushdown
 add 1ac23de  [SPARK-32613][CORE] Fix regressions in DecommissionWorkerSuite

No new revisions were added by this update.

Summary of changes:
 .../executor/CoarseGrainedExecutorBackend.scala| 10 +++-
 .../org/apache/spark/internal/config/package.scala | 10 
 .../org/apache/spark/scheduler/DAGScheduler.scala  | 41 ++
 .../apache/spark/scheduler/TaskSchedulerImpl.scala | 29 --
 .../spark/deploy/DecommissionWorkerSuite.scala | 64 ++
 .../spark/scheduler/TaskSchedulerImplSuite.scala   | 36 +---
 6 files changed, 153 insertions(+), 37 deletions(-)


-
To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org
For additional commands, e-mail: commits-h...@spark.apache.org

[spark] branch branch-3.0 updated: [3.0][SPARK-32518][CORE] CoarseGrainedSchedulerBackend.maxNumConcurrentTasks should consider all kinds of resources

2020-08-18 Thread wenchen

This is an automated email from the ASF dual-hosted git repository.

wenchen pushed a commit to branch branch-3.0
in repository https://gitbox.apache.org/repos/asf/spark.git


The following commit(s) were added to refs/heads/branch-3.0 by this push:
 new a36514e  [3.0][SPARK-32518][CORE] 
CoarseGrainedSchedulerBackend.maxNumConcurrentTasks should consider all kinds 
of resources
a36514e is described below

commit a36514e1fbf6920f0d18a7d8fb690822843eb2a2
Author: yi.wu 
AuthorDate: Tue Aug 18 06:50:05 2020 +

[3.0][SPARK-32518][CORE] 
CoarseGrainedSchedulerBackend.maxNumConcurrentTasks should consider all kinds 
of resources

### What changes were proposed in this pull request?

1.  Make `CoarseGrainedSchedulerBackend.maxNumConcurrentTasks()` consider 
all kinds of resources when calculating the max concurrent tasks

2. Refactor `calculateAvailableSlots()` so that it can be used by 
both `CoarseGrainedSchedulerBackend` and `TaskSchedulerImpl`

### Why are the changes needed?

Currently, `CoarseGrainedSchedulerBackend.maxNumConcurrentTasks()` only 
considers the CPU for the max concurrent tasks. This can cause the application 
to hang when a barrier stage requires extra custom resources but the cluster 
doesn't have enough of them. Without a check for other custom resources in 
`maxNumConcurrentTasks`, the barrier stage can still be submitted to the 
`TaskSchedulerImpl`, but the `TaskSchedulerImpl` won't launch 
tasks for the barrier stage [...]

If the barrier stage doesn't launch all of its tasks in one turn, the 
application fails and suggests that the user disable delay scheduling. However, 
this is actually a misleading suggestion, since the real root cause is 
insufficient resources.
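
The idea of counting slots across all resource types, rather than CPU alone, can be sketched as follows. This is a minimal illustration under stated assumptions, not the code from this commit: the object `SlotSketch`, the `Resources` type alias, and the resource names `"cpus"` and `"gpu"` are all hypothetical.

```scala
// Illustrative sketch only: hypothetical names, not Spark's scheduler code.
object SlotSketch {
  /** Amount of each resource, e.g. Map("cpus" -> 4, "gpu" -> 1). */
  type Resources = Map[String, Int]

  /** Tasks one executor can run at once: the minimum over every resource
    * the task requires, so the scarcest resource sets the limit.
    * A resource the executor lacks counts as 0. */
  def slotsPerExecutor(executor: Resources, taskReq: Resources): Int = {
    require(taskReq.nonEmpty && taskReq.values.forall(_ > 0),
      "each task must require a positive amount of at least one resource")
    taskReq.iterator.map { case (name, perTask) =>
      executor.getOrElse(name, 0) / perTask
    }.min
  }

  /** Maximum number of concurrent tasks across all executors. */
  def maxConcurrentTasks(executors: Seq[Resources], taskReq: Resources): Int =
    executors.map(slotsPerExecutor(_, taskReq)).sum
}
```

With two executors of four cores each, only one of which has a GPU, a CPU-only calculation reports eight slots, while a task that also needs one GPU has a single slot. That gap is the situation the message above describes.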

### Does this PR introduce _any_ user-facing change?

Yes. When a barrier stage requires more custom resources than the 
cluster has, the application previously failed with a misleading suggestion 
to disable delay scheduling. After this PR, the application fails with 
an error message saying that there are not enough resources.
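
The user-facing behaviour described above amounts to a fail-fast check before the stage is submitted. A minimal sketch, assuming a hypothetical helper `checkBarrierStage` and an invented message text (neither is taken from the Spark source):

```scala
// Illustrative sketch only: hypothetical helper and message text.
object BarrierCheckSketch {
  /** A barrier stage must launch all of its tasks together, so it only fits
    * when the cluster offers at least `numTasks` concurrent slots.
    * Returns Right(()) when it fits, or Left(message) naming the shortage. */
  def checkBarrierStage(numTasks: Int, maxConcurrentTasks: Int): Either[String, Unit] =
    if (numTasks > maxConcurrentTasks)
      Left(s"Barrier stage requires $numTasks slots, " +
        s"but only $maxConcurrentTasks are available: not enough resources")
    else
      Right(())
}
```

Returning the reason as a value keeps the sketch easy to test; a real scheduler would more likely fail the job with an exception carrying a similar message.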

### How was this patch tested?

Added a unit test.

Closes #29395 from Ngone51/backport-spark-32518.

Authored-by: yi.wu 
Signed-off-by: Wenchen Fan 
---
 .../main/scala/org/apache/spark/SparkContext.scala | 10 +++---
 .../org/apache/spark/internal/config/Tests.scala   | 15 +
 .../scheduler/BarrierJobAllocationFailed.scala |  4 +--
 .../org/apache/spark/scheduler/DAGScheduler.scala  | 10 +++---
 .../spark/scheduler/ExecutorResourceInfo.scala |  1 +
 .../apache/spark/scheduler/SchedulerBackend.scala  |  3 +-
 .../apache/spark/scheduler/TaskSchedulerImpl.scala | 36 +-
 .../cluster/CoarseGrainedSchedulerBackend.scala| 22 +++--
 .../spark/BarrierStageOnSubmittedSuite.scala   | 36 ++
 .../scala/org/apache/spark/SparkContextSuite.scala |  2 ++
 10 files changed, 124 insertions(+), 15 deletions(-)

diff --git a/core/src/main/scala/org/apache/spark/SparkContext.scala b/core/src/main/scala/org/apache/spark/SparkContext.scala
index 65c08cf..66fe1d7 100644
--- a/core/src/main/scala/org/apache/spark/SparkContext.scala
+++ b/core/src/main/scala/org/apache/spark/SparkContext.scala
@@ -1597,7 +1597,8 @@ class SparkContext(config: SparkConf) extends Logging {
   }
 
   /**
-   * Get the max number of tasks that can be concurrent launched currently.
+   * Get the max number of tasks that can be concurrent launched based on the resources
+   * could be used, even if some of them are being used at the moment.
    * Note that please don't cache the value returned by this method, because the number can change
    * due to add/remove executors.
    *
@@ -2776,8 +2777,9 @@ object SparkContext extends Logging {
   }
   // some cluster managers don't set the EXECUTOR_CORES config by default (standalone
   // and mesos coarse grained), so we can't rely on that config for those.
-  val shouldCheckExecCores = executorCores.isDefined || sc.conf.contains(EXECUTOR_CORES) ||
+  var shouldCheckExecCores = executorCores.isDefined || sc.conf.contains(EXECUTOR_CORES) ||
 (master.equalsIgnoreCase("yarn") || master.startsWith("k8s"))
+  shouldCheckExecCores &= !sc.conf.get(SKIP_VALIDATE_CORES_TESTING)
 
   // Number of cores per executor must meet at least one task requirement.
   if (shouldCheckExecCores && execCores < taskCores) {
@@ -2833,7 +2835,7 @@ object SparkContext extends Logging {
   limitingResourceName = taskReq.resourceName
 }
   }
-  if(!shouldCheckExecCores && Utils.isDynamicAllocationEnabled(sc.conf)) {
+  if(!shouldCheckExecCores) {
 // if we can't rely on the executor cores config throw a warning for user
 logWarning("Please ensure that the number of slots available on your " +
   "executors is limited by the number
