[GitHub] [spark] SparkQA commented on pull request #34429: [SPARK-37150][SQL] Migrate DESCRIBE NAMESPACE to use V2 command by default

2021-10-30 Thread GitBox
SparkQA commented on pull request #34429: URL: https://github.com/apache/spark/pull/34429#issuecomment-955639719 **[Test build #144788 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144788/testReport)** for PR 34429 at commit

[GitHub] [spark] imback82 commented on a change in pull request #34429: [SPARK-37150][SQL] Migrate DESCRIBE NAMESPACE to use V2 command by default

2021-10-30 Thread GitBox
imback82 commented on a change in pull request #34429: URL: https://github.com/apache/spark/pull/34429#discussion_r739759163 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/DescribeNamespaceExec.scala ## @@ -44,10 +44,13 @@ case class

[GitHub] [spark] AmplabJenkins commented on pull request #34450: [SPARK-37171][SQL]Add forany and forall to Datasets/Dataframes

2021-10-30 Thread GitBox
AmplabJenkins commented on pull request #34450: URL: https://github.com/apache/spark/pull/34450#issuecomment-955634907 Can one of the admins verify this patch? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

[GitHub] [spark] dhirennavani opened a new pull request #34450: [SPARK-37171][SQL]Add forany and forall to Datasets/Dataframes

2021-10-30 Thread GitBox
dhirennavani opened a new pull request #34450: URL: https://github.com/apache/spark/pull/34450 Add forany and forall api for Dataframe/Datasets API To provide a higher level of abstraction for Spark customers Yes, forany and forall methods are added to the Dataframe/Dataset

[GitHub] [spark] HyukjinKwon commented on a change in pull request #34418: [SPARK-37129][TESTS] Supplement Java 17 benchmark results created by GitHub Actions machines

2021-10-30 Thread GitBox
HyukjinKwon commented on a change in pull request #34418: URL: https://github.com/apache/spark/pull/34418#discussion_r739746311 ## File path: .github/workflows/benchmark.yml ## @@ -27,7 +27,7 @@ on: required: true default: '*' jdk: -

[GitHub] [spark] HyukjinKwon commented on a change in pull request #34449: [SPARK-37170][PYTHON][DOCS] Pin PySpark version for Binder

2021-10-30 Thread GitBox
HyukjinKwon commented on a change in pull request #34449: URL: https://github.com/apache/spark/pull/34449#discussion_r739745310 ## File path: binder/postBuild ## @@ -21,4 +21,4 @@ # Jupyter notebook. VERSION=$(python -c "exec(open('python/pyspark/version.py').read());

[GitHub] [spark] HyukjinKwon commented on a change in pull request #34449: [SPARK-37170][PYTHON][DOCS] Pin PySpark version for Binder

2021-10-30 Thread GitBox
HyukjinKwon commented on a change in pull request #34449: URL: https://github.com/apache/spark/pull/34449#discussion_r739744937 ## File path: binder/postBuild ## @@ -21,4 +21,4 @@ # Jupyter notebook. VERSION=$(python -c "exec(open('python/pyspark/version.py').read());

[GitHub] [spark] HyukjinKwon commented on a change in pull request #34449: [SPARK-37170][PYTHON][DOCS] Pin PySpark version for Binder

2021-10-30 Thread GitBox
HyukjinKwon commented on a change in pull request #34449: URL: https://github.com/apache/spark/pull/34449#discussion_r739745124 ## File path: binder/postBuild ## @@ -21,4 +21,4 @@ # Jupyter notebook. VERSION=$(python -c "exec(open('python/pyspark/version.py').read());

[GitHub] [spark] HyukjinKwon commented on a change in pull request #34449: [SPARK-37170][PYTHON][DOCS] Pin PySpark version for Binder

2021-10-30 Thread GitBox
HyukjinKwon commented on a change in pull request #34449: URL: https://github.com/apache/spark/pull/34449#discussion_r739744975 ## File path: binder/postBuild ## @@ -21,4 +21,4 @@ # Jupyter notebook. VERSION=$(python -c "exec(open('python/pyspark/version.py').read());

[GitHub] [spark] HyukjinKwon commented on a change in pull request #34449: [SPARK-37170][PYTHON][DOCS] Pin PySpark version for Binder

2021-10-30 Thread GitBox
HyukjinKwon commented on a change in pull request #34449: URL: https://github.com/apache/spark/pull/34449#discussion_r739744937 ## File path: binder/postBuild ## @@ -21,4 +21,4 @@ # Jupyter notebook. VERSION=$(python -c "exec(open('python/pyspark/version.py').read());

[GitHub] [spark] dongjoon-hyun commented on pull request #34432: [SPARK-37134][PYTHON][DOCS] Clarify the options in "Using PySpark Native Features"

2021-10-30 Thread GitBox
dongjoon-hyun commented on pull request #34432: URL: https://github.com/apache/spark/pull/34432#issuecomment-955613475 +1, LGTM. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [spark] dongjoon-hyun commented on pull request #34415: [SPARK-37117][SQL] Fix reading encrypted parquet files with external key material

2021-10-30 Thread GitBox
dongjoon-hyun commented on pull request #34415: URL: https://github.com/apache/spark/pull/34415#issuecomment-955613389 +1, LGTM. Thank you all. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [spark] dongjoon-hyun commented on pull request #34426: [SPARK-37147][SS] MetricsReporter producing NullPointerException when element 'triggerExecution' not present in Map[]

2021-10-30 Thread GitBox
dongjoon-hyun commented on pull request #34426: URL: https://github.com/apache/spark/pull/34426#issuecomment-955613307 +1, LGTM. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [spark] dongjoon-hyun commented on pull request #34417: [MINOR][SS] Remove unused config "pauseBackgroundWorkForCommit" from RocksDB

2021-10-30 Thread GitBox
dongjoon-hyun commented on pull request #34417: URL: https://github.com/apache/spark/pull/34417#issuecomment-955613125 +1, LGTM. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [spark] github-actions[bot] commented on pull request #32558: [SPARK-34953][CORE][SQL] Add the code change for adding the DateType in the infer schema while reading in CSV and JSON

2021-10-30 Thread GitBox
github-actions[bot] commented on pull request #32558: URL: https://github.com/apache/spark/pull/32558#issuecomment-955611433 We're closing this PR because it hasn't been updated in a while. This isn't a judgement on the merit of the PR in any way. It's just a way of keeping the PR queue

[GitHub] [spark] HeartSaVioR closed pull request #34417: [MINOR][SS] Remove unused config "pauseBackgroundWorkForCommit" from RocksDB

2021-10-30 Thread GitBox
HeartSaVioR closed pull request #34417: URL: https://github.com/apache/spark/pull/34417 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [spark] HeartSaVioR commented on pull request #34417: [MINOR][SS] Remove unused config "pauseBackgroundWorkForCommit" from RocksDB

2021-10-30 Thread GitBox
HeartSaVioR commented on pull request #34417: URL: https://github.com/apache/spark/pull/34417#issuecomment-955603934 Thanks! Merging to master/3.2. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [spark] HeartSaVioR commented on pull request #34426: [SPARK-37147][SS] MetricsReporter producing NullPointerException when element 'triggerExecution' not present in Map[]

2021-10-30 Thread GitBox
HeartSaVioR commented on pull request #34426: URL: https://github.com/apache/spark/pull/34426#issuecomment-955603890 Thanks @gitplaneta for your contribution! I merged this to master/3.2 branches. -- This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] HeartSaVioR closed pull request #34426: [SPARK-37147][SS] MetricsReporter producing NullPointerException when element 'triggerExecution' not present in Map[]

2021-10-30 Thread GitBox
HeartSaVioR closed pull request #34426: URL: https://github.com/apache/spark/pull/34426 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34337: [SPARK-37066][SQL] Improve error message to show file path when failed to read next file

2021-10-30 Thread GitBox
AmplabJenkins removed a comment on pull request #34337: URL: https://github.com/apache/spark/pull/34337#issuecomment-955595115 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144787/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34448: [SPARK-37169][SQL] Fix incorrect cast value when cast DateType to NumericType

2021-10-30 Thread GitBox
AmplabJenkins removed a comment on pull request #34448: URL: https://github.com/apache/spark/pull/34448#issuecomment-955595114 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144786/

[GitHub] [spark] AmplabJenkins commented on pull request #34448: [SPARK-37169][SQL] Fix incorrect cast value when cast DateType to NumericType

2021-10-30 Thread GitBox
AmplabJenkins commented on pull request #34448: URL: https://github.com/apache/spark/pull/34448#issuecomment-955595114 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144786/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #34337: [SPARK-37066][SQL] Improve error message to show file path when failed to read next file

2021-10-30 Thread GitBox
AmplabJenkins commented on pull request #34337: URL: https://github.com/apache/spark/pull/34337#issuecomment-955595115 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144787/ -- This

[GitHub] [spark] TheHollidayInn commented on pull request #34447: [MINOR][DOCS] Add import for MultivariateGaussian to Docs

2021-10-30 Thread GitBox
TheHollidayInn commented on pull request #34447: URL: https://github.com/apache/spark/pull/34447#issuecomment-955595015 I will double check. I was going through each item and found this one. -- This is an automated message from the Apache Git Service. To respond to the message, please

[GitHub] [spark] SparkQA removed a comment on pull request #34337: [SPARK-37066][SQL] Improve error message to show file path when failed to read next file

2021-10-30 Thread GitBox
SparkQA removed a comment on pull request #34337: URL: https://github.com/apache/spark/pull/34337#issuecomment-955418901 **[Test build #144787 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144787/testReport)** for PR 34337 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #34448: [SPARK-37169][SQL] Fix incorrect cast value when cast DateType to NumericType

2021-10-30 Thread GitBox
SparkQA removed a comment on pull request #34448: URL: https://github.com/apache/spark/pull/34448#issuecomment-955418678 **[Test build #144786 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144786/testReport)** for PR 34448 at commit

[GitHub] [spark] SparkQA commented on pull request #34448: [SPARK-37169][SQL] Fix incorrect cast value when cast DateType to NumericType

2021-10-30 Thread GitBox
SparkQA commented on pull request #34448: URL: https://github.com/apache/spark/pull/34448#issuecomment-955593685 **[Test build #144786 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144786/testReport)** for PR 34448 at commit

[GitHub] [spark] SparkQA commented on pull request #34337: [SPARK-37066][SQL] Improve error message to show file path when failed to read next file

2021-10-30 Thread GitBox
SparkQA commented on pull request #34337: URL: https://github.com/apache/spark/pull/34337#issuecomment-955593637 **[Test build #144787 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144787/testReport)** for PR 34337 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34449: [SPARK-37170][PYTHON][DOCS] Pin PySpark version for Binder

2021-10-30 Thread GitBox
AmplabJenkins removed a comment on pull request #34449: URL: https://github.com/apache/spark/pull/34449#issuecomment-955580078 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144785/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34089: [SPARK-36837][BUILD] Upgrade Kafka to 3.0.1

2021-10-30 Thread GitBox
AmplabJenkins removed a comment on pull request #34089: URL: https://github.com/apache/spark/pull/34089#issuecomment-955580077 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49257/

[GitHub] [spark] AmplabJenkins commented on pull request #34449: [SPARK-37170][PYTHON][DOCS] Pin PySpark version for Binder

2021-10-30 Thread GitBox
AmplabJenkins commented on pull request #34449: URL: https://github.com/apache/spark/pull/34449#issuecomment-955580078 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144785/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #34089: [SPARK-36837][BUILD] Upgrade Kafka to 3.0.1

2021-10-30 Thread GitBox
AmplabJenkins commented on pull request #34089: URL: https://github.com/apache/spark/pull/34089#issuecomment-955580077 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49257/ --

[GitHub] [spark] SparkQA removed a comment on pull request #34449: [SPARK-37170][PYTHON][DOCS] Pin PySpark version for Binder

2021-10-30 Thread GitBox
SparkQA removed a comment on pull request #34449: URL: https://github.com/apache/spark/pull/34449#issuecomment-955418554 **[Test build #144785 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144785/testReport)** for PR 34449 at commit

[GitHub] [spark] SparkQA commented on pull request #34449: [SPARK-37170][PYTHON][DOCS] Pin PySpark version for Binder

2021-10-30 Thread GitBox
SparkQA commented on pull request #34449: URL: https://github.com/apache/spark/pull/34449#issuecomment-955579323 **[Test build #144785 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144785/testReport)** for PR 34449 at commit

[GitHub] [spark] SparkQA commented on pull request #34089: [SPARK-36837][BUILD] Upgrade Kafka to 3.0.1

2021-10-30 Thread GitBox
SparkQA commented on pull request #34089: URL: https://github.com/apache/spark/pull/34089#issuecomment-955576939 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49257/ -- This is an automated message from the

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34449: [SPARK-37170][PYTHON][DOCS] Pin PySpark version for Binder

2021-10-30 Thread GitBox
AmplabJenkins removed a comment on pull request #34449: URL: https://github.com/apache/spark/pull/34449#issuecomment-955573009 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49254/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34337: [SPARK-37066][SQL] Improve error message to show file path when failed to read next file

2021-10-30 Thread GitBox
AmplabJenkins removed a comment on pull request #34337: URL: https://github.com/apache/spark/pull/34337#issuecomment-955573008 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49256/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34448: [SPARK-37169][SQL] Fix incorrect cast value when cast DateType to NumericType

2021-10-30 Thread GitBox
AmplabJenkins removed a comment on pull request #34448: URL: https://github.com/apache/spark/pull/34448#issuecomment-955573007 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49255/

[GitHub] [spark] AmplabJenkins commented on pull request #34337: [SPARK-37066][SQL] Improve error message to show file path when failed to read next file

2021-10-30 Thread GitBox
AmplabJenkins commented on pull request #34337: URL: https://github.com/apache/spark/pull/34337#issuecomment-955573008 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49256/ --

[GitHub] [spark] AmplabJenkins commented on pull request #34449: [SPARK-37170][PYTHON][DOCS] Pin PySpark version for Binder

2021-10-30 Thread GitBox
AmplabJenkins commented on pull request #34449: URL: https://github.com/apache/spark/pull/34449#issuecomment-955573009 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49254/ --

[GitHub] [spark] AmplabJenkins commented on pull request #34448: [SPARK-37169][SQL] Fix incorrect cast value when cast DateType to NumericType

2021-10-30 Thread GitBox
AmplabJenkins commented on pull request #34448: URL: https://github.com/apache/spark/pull/34448#issuecomment-955573007 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49255/ --

[GitHub] [spark] SparkQA commented on pull request #34448: [SPARK-37169][SQL] Fix incorrect cast value when cast DateType to NumericType

2021-10-30 Thread GitBox
SparkQA commented on pull request #34448: URL: https://github.com/apache/spark/pull/34448#issuecomment-955572257 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49255/ -- This is an automated message from the

[GitHub] [spark] SparkQA commented on pull request #34089: [SPARK-36837][BUILD] Upgrade Kafka to 3.0.1

2021-10-30 Thread GitBox
SparkQA commented on pull request #34089: URL: https://github.com/apache/spark/pull/34089#issuecomment-955571572 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49257/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #34449: [SPARK-37170][PYTHON][DOCS] Pin PySpark version for Binder

2021-10-30 Thread GitBox
SparkQA commented on pull request #34449: URL: https://github.com/apache/spark/pull/34449#issuecomment-955562057 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49254/ -- This is an automated message from the

[GitHub] [spark] SparkQA commented on pull request #34337: [SPARK-37066][SQL] Improve error message to show file path when failed to read next file

2021-10-30 Thread GitBox
SparkQA commented on pull request #34337: URL: https://github.com/apache/spark/pull/34337#issuecomment-955561467 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49256/ -- This is an automated message from the

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34234: [SPARK-36967][CORE] Report accurate shuffle block size if its skewed

2021-10-30 Thread GitBox
AmplabJenkins removed a comment on pull request #34234: URL: https://github.com/apache/spark/pull/34234#issuecomment-955516431 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49253/

[GitHub] [spark] AmplabJenkins commented on pull request #34234: [SPARK-36967][CORE] Report accurate shuffle block size if its skewed

2021-10-30 Thread GitBox
AmplabJenkins commented on pull request #34234: URL: https://github.com/apache/spark/pull/34234#issuecomment-955516431 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49253/ --

[GitHub] [spark] SparkQA commented on pull request #34448: [SPARK-37169][SQL] Fix un-correct cast value when cast DateType to NumericType

2021-10-30 Thread GitBox
SparkQA commented on pull request #34448: URL: https://github.com/apache/spark/pull/34448#issuecomment-955505172 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49255/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #34337: [SPARK-37066][SQL] Improve error message to show file path when failed to read next file

2021-10-30 Thread GitBox
SparkQA commented on pull request #34337: URL: https://github.com/apache/spark/pull/34337#issuecomment-955492808 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49256/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #34449: [SPARK-37170][PYTHON][DOCS] Pin PySpark version for Binder

2021-10-30 Thread GitBox
SparkQA commented on pull request #34449: URL: https://github.com/apache/spark/pull/34449#issuecomment-955492548 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49254/ -- This is an automated message from the Apache

[GitHub] [spark] MaxGekk closed pull request #34412: [SPARK-37138][SQL] Support ANSI Interval types in ApproxCountDistinctForIntervals/ApproximatePercentile/Percentile

2021-10-30 Thread GitBox
MaxGekk closed pull request #34412: URL: https://github.com/apache/spark/pull/34412 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [spark] MaxGekk commented on pull request #34412: [SPARK-37138][SQL] Support ANSI Interval types in ApproxCountDistinctForIntervals/ApproximatePercentile/Percentile

2021-10-30 Thread GitBox
MaxGekk commented on pull request #34412: URL: https://github.com/apache/spark/pull/34412#issuecomment-955487287 +1, LGTM. Merging to master. Thank you, @AngersZh . -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and

[GitHub] [spark] SparkQA commented on pull request #34234: [SPARK-36967][CORE] Report accurate shuffle block size if its skewed

2021-10-30 Thread GitBox
SparkQA commented on pull request #34234: URL: https://github.com/apache/spark/pull/34234#issuecomment-955461575 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49253/ -- This is an automated message from the

[GitHub] [spark] sarutak commented on pull request #34356: [SPARK-36554][SQL][PYTHON] Expose make_date expression in functions.scala

2021-10-30 Thread GitBox
sarutak commented on pull request #34356: URL: https://github.com/apache/spark/pull/34356#issuecomment-955439783 @nicolasazrak Please change `pyspark.sql.rst` together whenever you add APIs for PySpark. Also, could you make sure that the API docs are successfully build and the layout

[GitHub] [spark] sarutak commented on a change in pull request #34356: [SPARK-36554][SQL][PYTHON] Expose make_date expression in functions.scala

2021-10-30 Thread GitBox
sarutak commented on a change in pull request #34356: URL: https://github.com/apache/spark/pull/34356#discussion_r739672789 ## File path: python/pyspark/sql/functions.py ## @@ -2131,6 +2131,32 @@ def weekofyear(col: "ColumnOrName") -> Column: return

[GitHub] [spark] huaxingao commented on pull request #34442: [SPARK-37165][SQL] Add REPEATABLE in TABLESAMPLE to specify seed

2021-10-30 Thread GitBox
huaxingao commented on pull request #34442: URL: https://github.com/apache/spark/pull/34442#issuecomment-955431286 Merged to mater. Thanks a lot for reviewing! @viirya -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use

[GitHub] [spark] huaxingao closed pull request #34442: [SPARK-37165][SQL] Add REPEATABLE in TABLESAMPLE to specify seed

2021-10-30 Thread GitBox
huaxingao closed pull request #34442: URL: https://github.com/apache/spark/pull/34442 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [spark] srowen commented on pull request #34383: [SPARK-37102][BUILD] Removed redundant exclusions in `hadoop-cloud` module

2021-10-30 Thread GitBox
srowen commented on pull request #34383: URL: https://github.com/apache/spark/pull/34383#issuecomment-955423689 Merged to master -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [spark] srowen closed pull request #34383: [SPARK-37102][BUILD] Removed redundant exclusions in `hadoop-cloud` module

2021-10-30 Thread GitBox
srowen closed pull request #34383: URL: https://github.com/apache/spark/pull/34383 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [spark] SparkQA commented on pull request #34337: [SPARK-37066][SQL] Improve error message to show file path when failed to read next file

2021-10-30 Thread GitBox
SparkQA commented on pull request #34337: URL: https://github.com/apache/spark/pull/34337#issuecomment-955418901 **[Test build #144787 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144787/testReport)** for PR 34337 at commit

[GitHub] [spark] SparkQA commented on pull request #34448: [SPARK-37169][SQL] Fix un-correct cast value when cast DateType to NumericType

2021-10-30 Thread GitBox
SparkQA commented on pull request #34448: URL: https://github.com/apache/spark/pull/34448#issuecomment-955418678 **[Test build #144786 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144786/testReport)** for PR 34448 at commit

[GitHub] [spark] SparkQA commented on pull request #34449: [SPARK-37170][PYTHON][DOCS] Pin PySpark version for Binder

2021-10-30 Thread GitBox
SparkQA commented on pull request #34449: URL: https://github.com/apache/spark/pull/34449#issuecomment-955418554 **[Test build #144785 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144785/testReport)** for PR 34449 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34383: [SPARK-37102][BUILD] Removed redundant exclusions in `hadoop-cloud` module

2021-10-30 Thread GitBox
AmplabJenkins removed a comment on pull request #34383: URL: https://github.com/apache/spark/pull/34383#issuecomment-955417074 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144783/

[GitHub] [spark] AmplabJenkins commented on pull request #34383: [SPARK-37102][BUILD] Removed redundant exclusions in `hadoop-cloud` module

2021-10-30 Thread GitBox
AmplabJenkins commented on pull request #34383: URL: https://github.com/apache/spark/pull/34383#issuecomment-955417074 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144783/ -- This

[GitHub] [spark] AngersZhuuuu commented on a change in pull request #34448: [SPARK-37169][SQL] Fix un-correct cast value when cast DateType to NumericType

2021-10-30 Thread GitBox
AngersZh commented on a change in pull request #34448: URL: https://github.com/apache/spark/pull/34448#discussion_r739669531 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/Cast.scala ## @@ -620,8 +620,17 @@ abstract class CastBase

[GitHub] [spark] SparkQA commented on pull request #34234: [SPARK-36967][CORE] Report accurate shuffle block size if its skewed

2021-10-30 Thread GitBox
SparkQA commented on pull request #34234: URL: https://github.com/apache/spark/pull/34234#issuecomment-955388957 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49253/ -- This is an automated message from the Apache

[GitHub] [spark] AngersZhuuuu commented on a change in pull request #34337: [SPARK-37066][SQL] Improve error message to show file path when failed to read next file

2021-10-30 Thread GitBox
AngersZh commented on a change in pull request #34337: URL: https://github.com/apache/spark/pull/34337#discussion_r739668516 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/FilePartitionReader.scala ## @@ -66,17 +64,16 @@ class

[GitHub] [spark] SparkQA removed a comment on pull request #34383: [SPARK-37102][BUILD] Removed redundant exclusions in `hadoop-cloud` module

2021-10-30 Thread GitBox
SparkQA removed a comment on pull request #34383: URL: https://github.com/apache/spark/pull/34383#issuecomment-955208646 **[Test build #144783 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144783/testReport)** for PR 34383 at commit

[GitHub] [spark] SparkQA commented on pull request #34383: [SPARK-37102][BUILD] Removed redundant exclusions in `hadoop-cloud` module

2021-10-30 Thread GitBox
SparkQA commented on pull request #34383: URL: https://github.com/apache/spark/pull/34383#issuecomment-95536 **[Test build #144783 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144783/testReport)** for PR 34383 at commit

[GitHub] [spark] sarutak commented on pull request #34449: [SPARK-37170][PYTHON][DOCS] Pin PySpark version for Binder

2021-10-30 Thread GitBox
sarutak commented on pull request #34449: URL: https://github.com/apache/spark/pull/34449#issuecomment-955352498 cc: @HyukjinKwon -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [spark] sarutak opened a new pull request #34449: [SPARK-37170][PYTHON][DOCS] Pin PySpark version for Binder

2021-10-30 Thread GitBox
sarutak opened a new pull request #34449: URL: https://github.com/apache/spark/pull/34449 ### What changes were proposed in this pull request? This PR proposes to pin the version of PySpark to be installed in the live notebook environment. ### Why are the changes needed?

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34412: [SPARK-37138][SQL] Support ANSI Interval types in ApproxCountDistinctForIntervals/ApproximatePercentile/Percentile

2021-10-30 Thread GitBox
AmplabJenkins removed a comment on pull request #34412: URL: https://github.com/apache/spark/pull/34412#issuecomment-955342418 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144782/

[GitHub] [spark] AmplabJenkins commented on pull request #34412: [SPARK-37138][SQL] Support ANSI Interval types in ApproxCountDistinctForIntervals/ApproximatePercentile/Percentile

2021-10-30 Thread GitBox
AmplabJenkins commented on pull request #34412: URL: https://github.com/apache/spark/pull/34412#issuecomment-955342418 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144782/ -- This

[GitHub] [spark] SparkQA removed a comment on pull request #34412: [SPARK-37138][SQL] Support ANSI Interval types in ApproxCountDistinctForIntervals/ApproximatePercentile/Percentile

2021-10-30 Thread GitBox
SparkQA removed a comment on pull request #34412: URL: https://github.com/apache/spark/pull/34412#issuecomment-955187220 **[Test build #144782 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144782/testReport)** for PR 34412 at commit

[GitHub] [spark] SparkQA commented on pull request #34412: [SPARK-37138][SQL] Support ANSI Interval types in ApproxCountDistinctForIntervals/ApproximatePercentile/Percentile

2021-10-30 Thread GitBox
SparkQA commented on pull request #34412: URL: https://github.com/apache/spark/pull/34412#issuecomment-955338930 **[Test build #144782 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144782/testReport)** for PR 34412 at commit

[GitHub] [spark] nicolasazrak commented on pull request #34356: [SPARK-36554][SQL][PYTHON] Expose make_date expression in functions.scala

2021-10-30 Thread GitBox
nicolasazrak commented on pull request #34356: URL: https://github.com/apache/spark/pull/34356#issuecomment-955337387 @yoda-mon @HyukjinKwon @sarutak can we merge this? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34234: [SPARK-36967][CORE] Report accurate shuffle block size if its skewed

2021-10-30 Thread GitBox
AmplabJenkins removed a comment on pull request #34234: URL: https://github.com/apache/spark/pull/34234#issuecomment-955330033 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144784/

[GitHub] [spark] SparkQA removed a comment on pull request #34234: [SPARK-36967][CORE] Report accurate shuffle block size if its skewed

2021-10-30 Thread GitBox
SparkQA removed a comment on pull request #34234: URL: https://github.com/apache/spark/pull/34234#issuecomment-955320457 **[Test build #144784 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144784/testReport)** for PR 34234 at commit

[GitHub] [spark] SparkQA commented on pull request #34234: [SPARK-36967][CORE] Report accurate shuffle block size if its skewed

2021-10-30 Thread GitBox
SparkQA commented on pull request #34234: URL: https://github.com/apache/spark/pull/34234#issuecomment-955329931 **[Test build #144784 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144784/testReport)** for PR 34234 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #34234: [SPARK-36967][CORE] Report accurate shuffle block size if its skewed

2021-10-30 Thread GitBox
AmplabJenkins commented on pull request #34234: URL: https://github.com/apache/spark/pull/34234#issuecomment-955330033 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144784/ -- This

[GitHub] [spark] SparkQA commented on pull request #34234: [SPARK-36967][CORE] Report accurate shuffle block size if its skewed

2021-10-30 Thread GitBox
SparkQA commented on pull request #34234: URL: https://github.com/apache/spark/pull/34234#issuecomment-955320457 **[Test build #144784 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144784/testReport)** for PR 34234 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #34412: [SPARK-37138][SQL] Support ANSI Interval types in ApproxCountDistinctForIntervals/ApproximatePercentile/Percentile

2021-10-30 Thread GitBox
AmplabJenkins commented on pull request #34412: URL: https://github.com/apache/spark/pull/34412#issuecomment-955315610 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144781/ -- This

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34446: [SPARK-37161][SQL] RowToColumnConverter support AnsiIntervalType

2021-10-30 Thread GitBox
AmplabJenkins removed a comment on pull request #34446: URL: https://github.com/apache/spark/pull/34446#issuecomment-955314540 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144780/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34412: [SPARK-37138][SQL] Support ANSI Interval types in ApproxCountDistinctForIntervals/ApproximatePercentile/Percentile

2021-10-30 Thread GitBox
AmplabJenkins removed a comment on pull request #34412: URL: https://github.com/apache/spark/pull/34412#issuecomment-955315610 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144781/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34383: [SPARK-37102][BUILD] Removed redundant exclusions in `hadoop-cloud` module

2021-10-30 Thread GitBox
AmplabJenkins removed a comment on pull request #34383: URL: https://github.com/apache/spark/pull/34383#issuecomment-955314539 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49252/

[GitHub] [spark] AmplabJenkins commented on pull request #34383: [SPARK-37102][BUILD] Removed redundant exclusions in `hadoop-cloud` module

2021-10-30 Thread GitBox
AmplabJenkins commented on pull request #34383: URL: https://github.com/apache/spark/pull/34383#issuecomment-955314539 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/49252/ --

[GitHub] [spark] AmplabJenkins commented on pull request #34446: [SPARK-37161][SQL] RowToColumnConverter support AnsiIntervalType

2021-10-30 Thread GitBox
AmplabJenkins commented on pull request #34446: URL: https://github.com/apache/spark/pull/34446#issuecomment-955314540 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144780/ -- This

[GitHub] [spark] SparkQA removed a comment on pull request #34412: [SPARK-37138][SQL] Support ANSI Interval types in ApproxCountDistinctForIntervals/ApproximatePercentile/Percentile

2021-10-30 Thread GitBox
SparkQA removed a comment on pull request #34412: URL: https://github.com/apache/spark/pull/34412#issuecomment-955186318 **[Test build #144781 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144781/testReport)** for PR 34412 at commit

[GitHub] [spark] SparkQA commented on pull request #34412: [SPARK-37138][SQL] Support ANSI Interval types in ApproxCountDistinctForIntervals/ApproximatePercentile/Percentile

2021-10-30 Thread GitBox
SparkQA commented on pull request #34412: URL: https://github.com/apache/spark/pull/34412#issuecomment-955311717 **[Test build #144781 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144781/testReport)** for PR 34412 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #34446: [SPARK-37161][SQL] RowToColumnConverter support AnsiIntervalType

2021-10-30 Thread GitBox
SparkQA removed a comment on pull request #34446: URL: https://github.com/apache/spark/pull/34446#issuecomment-955186308 **[Test build #144780 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144780/testReport)** for PR 34446 at commit

[GitHub] [spark] wangyum commented on a change in pull request #33404: [SPARK-36194][SQL] Remove the aggregation from left semi/anti join if the same aggregation has already been done on left side

2021-10-30 Thread GitBox
wangyum commented on a change in pull request #33404: URL: https://github.com/apache/spark/pull/33404#discussion_r739663634 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/DistinctAttributesVisitor.scala ## @@ -0,0 +1,100 @@ +/* + *

[GitHub] [spark] wangyum commented on a change in pull request #33404: [SPARK-36194][SQL] Remove the aggregation from left semi/anti join if the same aggregation has already been done on left side

2021-10-30 Thread GitBox
wangyum commented on a change in pull request #33404: URL: https://github.com/apache/spark/pull/33404#discussion_r739663634 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/DistinctAttributesVisitor.scala ## @@ -0,0 +1,100 @@ +/* + *

[GitHub] [spark] SparkQA commented on pull request #34446: [SPARK-37161][SQL] RowToColumnConverter support AnsiIntervalType

2021-10-30 Thread GitBox
SparkQA commented on pull request #34446: URL: https://github.com/apache/spark/pull/34446#issuecomment-955287925 **[Test build #144780 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144780/testReport)** for PR 34446 at commit

[GitHub] [spark] wankunde commented on a change in pull request #34234: [SPARK-36967][CORE] Report accurate shuffle block size if its skewed

2021-10-30 Thread GitBox
wankunde commented on a change in pull request #34234: URL: https://github.com/apache/spark/pull/34234#discussion_r739662025 ## File path: core/src/test/scala/org/apache/spark/scheduler/MapStatusSuite.scala ## @@ -191,4 +191,61 @@ class MapStatusSuite extends SparkFunSuite {

[GitHub] [spark] wankunde commented on a change in pull request #34234: [SPARK-36967][CORE] Report accurate shuffle block size if its skewed

2021-10-30 Thread GitBox
wankunde commented on a change in pull request #34234: URL: https://github.com/apache/spark/pull/34234#discussion_r739661970 ## File path: core/src/main/scala/org/apache/spark/internal/config/package.scala ## @@ -1178,6 +1178,27 @@ package object config {

[GitHub] [spark] wankunde commented on a change in pull request #34234: [SPARK-36967][CORE] Report accurate shuffle block size if its skewed

2021-10-30 Thread GitBox
wankunde commented on a change in pull request #34234: URL: https://github.com/apache/spark/pull/34234#discussion_r739661964 ## File path: core/src/main/scala/org/apache/spark/scheduler/MapStatus.scala ## @@ -255,9 +255,35 @@ private[spark] object HighlyCompressedMapStatus {

[GitHub] [spark] SparkQA commented on pull request #34383: [SPARK-37102][BUILD] Removed redundant exclusions in `hadoop-cloud` module

2021-10-30 Thread GitBox
SparkQA commented on pull request #34383: URL: https://github.com/apache/spark/pull/34383#issuecomment-955251685 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/49252/ -- This is an automated message from the

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34412: [SPARK-37138][SQL] Support ANSI Interval types in ApproxCountDistinctForIntervals/ApproximatePercentile/Percentile

2021-10-30 Thread GitBox
AmplabJenkins removed a comment on pull request #34412: URL: https://github.com/apache/spark/pull/34412#issuecomment-955241803 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144779/

[GitHub] [spark] AmplabJenkins commented on pull request #34412: [SPARK-37138][SQL] Support ANSI Interval types in ApproxCountDistinctForIntervals/ApproximatePercentile/Percentile

2021-10-30 Thread GitBox
AmplabJenkins commented on pull request #34412: URL: https://github.com/apache/spark/pull/34412#issuecomment-955241803 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144779/ -- This

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34448: [SPARK-37169][SQL] Fix un-correct cast value when cast DateType to NumericType

2021-10-30 Thread GitBox
AmplabJenkins removed a comment on pull request #34448: URL: https://github.com/apache/spark/pull/34448#issuecomment-955241538 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/144778/

  1   2   >