[GitHub] [spark] AmplabJenkins commented on pull request #32914: [SPARK-35763][SS] Add a new copy method to StateStoreCustomMetric

2021-06-15 Thread GitBox


AmplabJenkins commented on pull request #32914:
URL: https://github.com/apache/spark/pull/32914#issuecomment-861233427


   Can one of the admins verify this patch?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #32912: [SPARK-35429][CORE] Remove commons-httpclient due to EOL and CVEs

2021-06-15 Thread GitBox


SparkQA commented on pull request #32912:
URL: https://github.com/apache/spark/pull/32912#issuecomment-861235546


   **[Test build #139797 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139797/testReport)** for PR 32912 at commit [`3c0eb47`](https://github.com/apache/spark/commit/3c0eb4763bf74a67eabb21fc02565790d9505ee0).





[GitHub] [spark] SparkQA commented on pull request #32645: [SPARK-35129][SQL] Construct year-month interval column from integral fields

2021-06-15 Thread GitBox


SparkQA commented on pull request #32645:
URL: https://github.com/apache/spark/pull/32645#issuecomment-861235933


   **[Test build #139798 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139798/testReport)** for PR 32645 at commit [`b9248aa`](https://github.com/apache/spark/commit/b9248aafc830c2f5b4daee9b6a6ed8b5d572d038).





[GitHub] [spark] dongjoon-hyun commented on pull request #32904: [SPARK-33122][FOLLOWUP][SQL] Distinct aggs are not duplicate sensitive

2021-06-15 Thread GitBox


dongjoon-hyun commented on pull request #32904:
URL: https://github.com/apache/spark/pull/32904#issuecomment-861236385









[GitHub] [spark] dongjoon-hyun edited a comment on pull request #32904: [SPARK-33122][FOLLOWUP][SQL] Distinct aggs are not duplicate sensitive

2021-06-15 Thread GitBox


dongjoon-hyun edited a comment on pull request #32904:
URL: https://github.com/apache/spark/pull/32904#issuecomment-861236385


   Hi, @tanelk . Could you take a look at GitHub Action UT failures?





[GitHub] [spark] dongjoon-hyun commented on pull request #32904: [SPARK-33122][FOLLOWUP][SQL] Distinct aggs are not duplicate sensitive

2021-06-15 Thread GitBox


dongjoon-hyun commented on pull request #32904:
URL: https://github.com/apache/spark/pull/32904#issuecomment-861236940


   cc @maropu 





[GitHub] [spark] SparkQA commented on pull request #32904: [SPARK-33122][FOLLOWUP][SQL] Distinct aggs are not duplicate sensitive

2021-06-15 Thread GitBox


SparkQA commented on pull request #32904:
URL: https://github.com/apache/spark/pull/32904#issuecomment-861238629


   **[Test build #139799 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139799/testReport)** for PR 32904 at commit [`3f0cdae`](https://github.com/apache/spark/commit/3f0cdae64db7a3a5dafd628f890e936cf743d16d).





[GitHub] [spark] AmplabJenkins removed a comment on pull request #32904: [SPARK-33122][FOLLOWUP][SQL] Distinct aggs are not duplicate sensitive

2021-06-15 Thread GitBox


AmplabJenkins removed a comment on pull request #32904:
URL: https://github.com/apache/spark/pull/32904#issuecomment-860597774









[GitHub] [spark] AmplabJenkins removed a comment on pull request #32645: [SPARK-35129][SQL] Construct year-month interval column from integral fields

2021-06-15 Thread GitBox


AmplabJenkins removed a comment on pull request #32645:
URL: https://github.com/apache/spark/pull/32645#issuecomment-861233047


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139795/
   





[GitHub] [spark] SparkQA commented on pull request #32881: [SPARK-33298][CORE] Decouple file naming from FileCommitProtocol

2021-06-15 Thread GitBox


SparkQA commented on pull request #32881:
URL: https://github.com/apache/spark/pull/32881#issuecomment-861248200


   **[Test build #139796 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139796/testReport)** for PR 32881 at commit [`1bb6e16`](https://github.com/apache/spark/commit/1bb6e16721eb1b0d76762da3b8349988583ae692).
* This patch **fails Spark unit tests**.
* This patch merges cleanly.
* This patch adds no public classes.





[GitHub] [spark] SparkQA removed a comment on pull request #32881: [SPARK-33298][CORE] Decouple file naming from FileCommitProtocol

2021-06-15 Thread GitBox


SparkQA removed a comment on pull request #32881:
URL: https://github.com/apache/spark/pull/32881#issuecomment-861174335


   **[Test build #139796 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139796/testReport)** for PR 32881 at commit [`1bb6e16`](https://github.com/apache/spark/commit/1bb6e16721eb1b0d76762da3b8349988583ae692).





[GitHub] [spark] SparkQA commented on pull request #32885: [SPARK-35742][SQL] Expression.semanticEquals should be symmetrical

2021-06-15 Thread GitBox


SparkQA commented on pull request #32885:
URL: https://github.com/apache/spark/pull/32885#issuecomment-861256012


   **[Test build #139789 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139789/testReport)** for PR 32885 at commit [`5095fff`](https://github.com/apache/spark/commit/5095fffcd0cd76548eadef802cdb81261029ff12).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.





[GitHub] [spark] SparkQA removed a comment on pull request #32885: [SPARK-35742][SQL] Expression.semanticEquals should be symmetrical

2021-06-15 Thread GitBox


SparkQA removed a comment on pull request #32885:
URL: https://github.com/apache/spark/pull/32885#issuecomment-861132129


   **[Test build #139789 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139789/testReport)** for PR 32885 at commit [`5095fff`](https://github.com/apache/spark/commit/5095fffcd0cd76548eadef802cdb81261029ff12).





[GitHub] [spark] SparkQA commented on pull request #32880: [SPARK-35056][SQL] Group exception messages in execution/streaming

2021-06-15 Thread GitBox


SparkQA commented on pull request #32880:
URL: https://github.com/apache/spark/pull/32880#issuecomment-861257115


   **[Test build #139790 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139790/testReport)** for PR 32880 at commit [`7c8d275`](https://github.com/apache/spark/commit/7c8d275364b1ccf83492e55f0a63f357ec7fb8ad).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.





[GitHub] [spark] SparkQA removed a comment on pull request #32880: [SPARK-35056][SQL] Group exception messages in execution/streaming

2021-06-15 Thread GitBox


SparkQA removed a comment on pull request #32880:
URL: https://github.com/apache/spark/pull/32880#issuecomment-861132135


   **[Test build #139790 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139790/testReport)** for PR 32880 at commit [`7c8d275`](https://github.com/apache/spark/commit/7c8d275364b1ccf83492e55f0a63f357ec7fb8ad).





[GitHub] [spark] gengliangwang opened a new pull request #32915: [SPARK-35764][SQL] Assign pretty names to TimestampWithoutTZType

2021-06-15 Thread GitBox


gengliangwang opened a new pull request #32915:
URL: https://github.com/apache/spark/pull/32915


   
   
   ### What changes were proposed in this pull request?
   
   
   In the PR, I propose to override the typeName() method in 
TimestampWithoutTZType, and assign it a name according to the ANSI SQL standard
   
![image](https://user-images.githubusercontent.com/1097932/122013859-2cf50680-cdf1-11eb-9fcd-0ec1b59fb5c0.png)
   
   ### Why are the changes needed?
   
   To improve Spark SQL user experience, and have readable types in error 
messages.
   
   
   
   ### Does this PR introduce _any_ user-facing change?
   
   
   No, the new timestamp type is not released yet.
   ### How was this patch tested?
   
   Unit test
   





[GitHub] [spark] SparkQA commented on pull request #32645: [SPARK-35129][SQL] Construct year-month interval column from integral fields

2021-06-15 Thread GitBox


SparkQA commented on pull request #32645:
URL: https://github.com/apache/spark/pull/32645#issuecomment-861271831


   Kubernetes integration test starting
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/44324/
   





[GitHub] [spark] SparkQA commented on pull request #32904: [SPARK-33122][FOLLOWUP][SQL] Distinct aggs are not duplicate sensitive

2021-06-15 Thread GitBox


SparkQA commented on pull request #32904:
URL: https://github.com/apache/spark/pull/32904#issuecomment-861274681


   Kubernetes integration test starting
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/44325/
   





[GitHub] [spark] Ngone51 commented on pull request #32385: [WIP][SPARK-35275][CORE] Add checksum for shuffle blocks and diagnose corruption

2021-06-15 Thread GitBox


Ngone51 commented on pull request #32385:
URL: https://github.com/apache/spark/pull/32385#issuecomment-861274556


   Oh, @mridulm, sorry if I confused you here. I have planned to split this PR 
into two separate PRs to ease the review:
   
   * write checksum file (Ready to review 
https://github.com/apache/spark/pull/32401)
   * diagnose corruption (Not done yet)
   
   So please help review the smaller PR there.
   
   And I'll try to resolve your comments in the separate PRs. Thanks!
   
   
   





[GitHub] [spark] AmplabJenkins commented on pull request #32881: [SPARK-33298][CORE] Decouple file naming from FileCommitProtocol

2021-06-15 Thread GitBox


AmplabJenkins commented on pull request #32881:
URL: https://github.com/apache/spark/pull/32881#issuecomment-861276632


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139796/
   





[GitHub] [spark] AmplabJenkins commented on pull request #32880: [SPARK-35056][SQL] Group exception messages in execution/streaming

2021-06-15 Thread GitBox


AmplabJenkins commented on pull request #32880:
URL: https://github.com/apache/spark/pull/32880#issuecomment-861276637


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139790/
   





[GitHub] [spark] AmplabJenkins commented on pull request #32885: [SPARK-35742][SQL] Expression.semanticEquals should be symmetrical

2021-06-15 Thread GitBox


AmplabJenkins commented on pull request #32885:
URL: https://github.com/apache/spark/pull/32885#issuecomment-861276633


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139789/
   





[GitHub] [spark] viirya commented on pull request #32885: [SPARK-35742][SQL] Expression.semanticEquals should be symmetrical

2021-06-15 Thread GitBox


viirya commented on pull request #32885:
URL: https://github.com/apache/spark/pull/32885#issuecomment-861276752


   LGTM





[GitHub] [spark] AmplabJenkins removed a comment on pull request #32881: [SPARK-33298][CORE] Decouple file naming from FileCommitProtocol

2021-06-15 Thread GitBox


AmplabJenkins removed a comment on pull request #32881:
URL: https://github.com/apache/spark/pull/32881#issuecomment-861276632


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139796/
   





[GitHub] [spark] AmplabJenkins removed a comment on pull request #32880: [SPARK-35056][SQL] Group exception messages in execution/streaming

2021-06-15 Thread GitBox


AmplabJenkins removed a comment on pull request #32880:
URL: https://github.com/apache/spark/pull/32880#issuecomment-861276637


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139790/
   





[GitHub] [spark] AmplabJenkins removed a comment on pull request #32885: [SPARK-35742][SQL] Expression.semanticEquals should be symmetrical

2021-06-15 Thread GitBox


AmplabJenkins removed a comment on pull request #32885:
URL: https://github.com/apache/spark/pull/32885#issuecomment-861276633


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139789/
   





[GitHub] [spark] MaxGekk commented on a change in pull request #32645: [SPARK-35129][SQL] Construct year-month interval column from integral fields

2021-06-15 Thread GitBox


MaxGekk commented on a change in pull request #32645:
URL: https://github.com/apache/spark/pull/32645#discussion_r651546525



##
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/intervalExpressions.scala
##
@@ -345,6 +346,66 @@ case class MakeInterval(
 )
 }
 
+// scalastyle:off line.size.limit
+@ExpressionDescription(
+  usage = "_FUNC_(years, months) - Make year-month interval from years, 
months.",
+  arguments = """
+Arguments:
+  * years - the number of years, positive or negative
+  * months - the number of months, positive or negative
+  """,
+  examples = """
+Examples:
+  > SELECT _FUNC_(1, 2);
+   1-2
+  > SELECT _FUNC_(1, 0);
+   1-0
+  > SELECT _FUNC_(0, 1);
+   0-1
+  """,
+  since = "3.2.0",
+  group = "datetime_funcs")
+// scalastyle:on line.size.limit
+case class MakeYMInterval(years: Expression, months: Expression)
+  extends BinaryExpression with ImplicitCastInputTypes with NullIntolerant {
+
+  def this(years: Expression) = this(years, Literal(0))
+  def this() = this(Literal(0))
+
+  override def left: Expression = years
+  override def right: Expression = months
+  override def inputTypes: Seq[AbstractDataType] = Seq(IntegerType, 
IntegerType)
+  override def dataType: DataType = YearMonthIntervalType
+  override def nullable: Boolean = children.exists(_.nullable)
+
+  private def evalIntValue(dt: DataType, value: Any): Long = dt match {
+case _: ByteType | _: ShortType | _: IntegerType => 
value.asInstanceOf[Number].longValue()
+case _: LongType => value.asInstanceOf[Long]
+  }
+
+  override def nullSafeEval(year: Any, month: Any): Any = {
+LongExactNumeric.toInt(Math.addExact(evalIntValue(right.dataType, month),
+  Math.multiplyExact(evalIntValue(left.dataType, year), MONTHS_PER_YEAR)))
+  }
+
+  override def doGenCode(ctx: CodegenContext, ev: ExprCode): ExprCode = {
+defineCodeGen(ctx, ev, (years, months) => {
+  val extractor = LongExactNumeric.getClass.getName.stripSuffix("$")
+  s"""
+ |$extractor.toInt(java.lang.Math.addExact($months,
+ |  java.lang.Math.multiplyExact($years, $MONTHS_PER_YEAR)))
+ |""".stripMargin
+})
+  }
+
+  override def prettyName: String = "make_ym_interval"
+
+  override protected def withNewChildrenInternal(
+  newLeft: Expression,
+  newRight: Expression): Expression =
+copy(years = newLeft, months = newRight)
+}

Review comment:
   ok. I see. Thanks.







[GitHub] [spark] dgd-contributor opened a new pull request #32916: [WIP][SPARK-35064][SQL] group error in spark-catalyst

2021-06-15 Thread GitBox


dgd-contributor opened a new pull request #32916:
URL: https://github.com/apache/spark/pull/32916


   What changes were proposed in this pull request?
   This PR groups exception messages in 
sql/catalyst/src/main/scala/org/apache/spark/sql (except catalyst)
   
   Why are the changes needed?
   It will largely help with the standardization of error messages and their 
maintenance.
   
   Does this PR introduce any user-facing change?
   No. Error messages remain unchanged.
   
   How was this patch tested?
   No new tests - pass all original tests to make sure it doesn't break any 
existing behavior.





[GitHub] [spark] AmplabJenkins commented on pull request #32916: [WIP][SPARK-35064][SQL] group error in spark-catalyst

2021-06-15 Thread GitBox


AmplabJenkins commented on pull request #32916:
URL: https://github.com/apache/spark/pull/32916#issuecomment-861279329


   Can one of the admins verify this patch?





[GitHub] [spark] SparkQA commented on pull request #32915: [SPARK-35764][SQL] Assign pretty names to TimestampWithoutTZType

2021-06-15 Thread GitBox


SparkQA commented on pull request #32915:
URL: https://github.com/apache/spark/pull/32915#issuecomment-861279596


   **[Test build #139800 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139800/testReport)** for PR 32915 at commit [`1e53571`](https://github.com/apache/spark/commit/1e5357158acd7fe371de542c09e69052708e5f89).





[GitHub] [spark] SparkQA commented on pull request #32821: [SPARK-35342][PYTHON] Introduce DecimalOps and make `isnull` method data-type-based

2021-06-15 Thread GitBox


SparkQA commented on pull request #32821:
URL: https://github.com/apache/spark/pull/32821#issuecomment-861279903


   **[Test build #139801 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139801/testReport)** for PR 32821 at commit [`bca1d84`](https://github.com/apache/spark/commit/bca1d84b288a166e367ff0785b298808c670b17b).





[GitHub] [spark] SparkQA commented on pull request #32645: [SPARK-35129][SQL] Construct year-month interval column from integral fields

2021-06-15 Thread GitBox


SparkQA commented on pull request #32645:
URL: https://github.com/apache/spark/pull/32645#issuecomment-861280008


   **[Test build #139802 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139802/testReport)** for PR 32645 at commit [`06c908b`](https://github.com/apache/spark/commit/06c908b487ab218bd14e1e679d179f702f8c3e97).





[GitHub] [spark] MaxGekk commented on a change in pull request #32645: [SPARK-35129][SQL] Construct year-month interval column from integral fields

2021-06-15 Thread GitBox


MaxGekk commented on a change in pull request #32645:
URL: https://github.com/apache/spark/pull/32645#discussion_r651548989



##
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/intervalExpressions.scala
##
@@ -345,6 +346,66 @@ case class MakeInterval(
 )
 }
 
+// scalastyle:off line.size.limit
+@ExpressionDescription(
+  usage = "_FUNC_(years, months) - Make year-month interval from years, 
months.",
+  arguments = """
+Arguments:
+  * years - the number of years, positive or negative
+  * months - the number of months, positive or negative
+  """,
+  examples = """
+Examples:
+  > SELECT _FUNC_(1, 2);
+   1-2
+  > SELECT _FUNC_(1, 0);
+   1-0
+  > SELECT _FUNC_(0, 1);
+   0-1
+  """,
+  since = "3.2.0",
+  group = "datetime_funcs")
+// scalastyle:on line.size.limit
+case class MakeYMInterval(years: Expression, months: Expression)
+  extends BinaryExpression with ImplicitCastInputTypes with NullIntolerant {
+
+  def this(years: Expression) = this(years, Literal(0))
+  def this() = this(Literal(0))
+
+  override def left: Expression = years
+  override def right: Expression = months
+  override def inputTypes: Seq[AbstractDataType] = Seq(IntegerType, 
IntegerType)
+  override def dataType: DataType = YearMonthIntervalType
+  override def nullable: Boolean = children.exists(_.nullable)
+
+  private def evalIntValue(dt: DataType, value: Any): Long = dt match {
+case _: ByteType | _: ShortType | _: IntegerType => 
value.asInstanceOf[Number].longValue()

Review comment:
   Let's follow `MakeInterval`, and handle/accept only `IntegerType` as it is declared by:
   ```scala
   override def inputTypes: Seq[AbstractDataType] = Seq(IntegerType, IntegerType)
   ```
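
To illustrate the suggestion, here is a minimal standalone sketch of what the evaluation could look like once both inputs are known to be `Int`. It is not the code from the PR: the object name `YearMonthIntervalSketch`, the method `makeYMInterval`, and the constant `MonthsPerYear` are hypothetical stand-ins, and `Math.toIntExact` is swapped in for the `LongExactNumeric.toInt` call used in the diff.

```scala
// Hypothetical, standalone sketch; names are illustrative, not Spark's API.
object YearMonthIntervalSketch {
  // Assumed to play the role of MONTHS_PER_YEAR in the diff.
  val MonthsPerYear: Long = 12L

  // Both inputs are plain Ints, matching
  // inputTypes = Seq(IntegerType, IntegerType),
  // so no per-type dispatch (Byte/Short/Long) is needed.
  def makeYMInterval(years: Int, months: Int): Int = {
    // Widen to Long so the multiplication and addition are checked exactly.
    val totalMonths = Math.addExact(
      months.toLong,
      Math.multiplyExact(years.toLong, MonthsPerYear))
    // Throws ArithmeticException if the total does not fit in an Int.
    Math.toIntExact(totalMonths)
  }
}
```

With this shape, `makeYMInterval(1, 2)` yields 14 months, and an input such as `makeYMInterval(Int.MaxValue, 0)` fails with an `ArithmeticException` instead of silently overflowing.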







[GitHub] [spark] SparkQA commented on pull request #32904: [SPARK-33122][FOLLOWUP][SQL] Distinct aggs are not duplicate sensitive

2021-06-15 Thread GitBox


SparkQA commented on pull request #32904:
URL: https://github.com/apache/spark/pull/32904#issuecomment-861281750


   Kubernetes integration test starting
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/44326/
   





[GitHub] [spark] sarutak opened a new pull request #32917: [SPARK-35758][DOCS] Update the document about building Spark with Hadoop

2021-06-15 Thread GitBox


sarutak opened a new pull request #32917:
URL: https://github.com/apache/spark/pull/32917


   ### What changes were proposed in this pull request?
   
   This PR updates the document about building Spark with Hadoop for Hadoop 2.x 
and Hadoop 3.x.
   
   ### Why are the changes needed?
   
   The document currently shows the following build command:
   ```
   ./build/mvn -Pyarn -Dhadoop.version=2.8.5 -DskipTests clean package
   ```
   
   But this command fails because the default build settings are for Hadoop 3.x.
   So, we need to modify the command example.
   
   ### Does this PR introduce _any_ user-facing change?
   
   No.
   
   ### How was this patch tested?
   
   I confirmed that both of these commands finished successfully.
   ```
   ./build/mvn -Pyarn -Dhadoop.version=3.3.0 -DskipTests package
   ./build/mvn -Phadoop-2.7 -Pyarn -Dhadoop.version=2.8.5 -DskipTests package
   ```
   
   I also built the document and confirmed the result.
   This is before:
   
![hadoop-version-before](https://user-images.githubusercontent.com/4736016/122016157-bf020c80-cdfb-11eb-8e74-4840861f8541.png)
   
   And this is after:
   
![hadoop-version-after](https://user-images.githubusercontent.com/4736016/122016188-c75a4780-cdfb-11eb-8427-2f0765e6ff7a.png)
   





[GitHub] [spark] SparkQA commented on pull request #32917: [SPARK-35758][DOCS] Update the document about building Spark with Hadoop

2021-06-15 Thread GitBox


SparkQA commented on pull request #32917:
URL: https://github.com/apache/spark/pull/32917#issuecomment-861282868


   **[Test build #139803 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139803/testReport)**
 for PR 32917 at commit 
[`06e7b30`](https://github.com/apache/spark/commit/06e7b301ecf3c8e96332c5c57725c101ca61bfb4).





[GitHub] [spark] SparkQA commented on pull request #32912: [SPARK-35429][CORE] Remove commons-httpclient due to EOL and CVEs

2021-06-15 Thread GitBox


SparkQA commented on pull request #32912:
URL: https://github.com/apache/spark/pull/32912#issuecomment-861283860


   Kubernetes integration test unable to build dist.
   
   exiting with code: 1
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/44323/
   





[GitHub] [spark] AmplabJenkins commented on pull request #32912: [SPARK-35429][CORE] Remove commons-httpclient due to EOL and CVEs

2021-06-15 Thread GitBox


AmplabJenkins commented on pull request #32912:
URL: https://github.com/apache/spark/pull/32912#issuecomment-861283903


   
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/44323/
   





[GitHub] [spark] AmplabJenkins removed a comment on pull request #32912: [SPARK-35429][CORE] Remove commons-httpclient due to EOL and CVEs

2021-06-15 Thread GitBox


AmplabJenkins removed a comment on pull request #32912:
URL: https://github.com/apache/spark/pull/32912#issuecomment-861283903


   
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/44323/
   





[GitHub] [spark] sarutak commented on pull request #32845: [SPARK-35691][CORE] addFile/addJar/addDirectory should put CanonicalFile

2021-06-15 Thread GitBox


sarutak commented on pull request #32845:
URL: https://github.com/apache/spark/pull/32845#issuecomment-861294700


   retest this please.





[GitHub] [spark] sarutak commented on pull request #32845: [SPARK-35691][CORE] addFile/addJar/addDirectory should put CanonicalFile

2021-06-15 Thread GitBox


sarutak commented on pull request #32845:
URL: https://github.com/apache/spark/pull/32845#issuecomment-861294828


   LGTM. Pending Jenkins.





[GitHub] [spark] SparkQA commented on pull request #32917: [SPARK-35758][DOCS] Update the document about building Spark with Hadoop for Hadoop 2.x and 3.x

2021-06-15 Thread GitBox


SparkQA commented on pull request #32917:
URL: https://github.com/apache/spark/pull/32917#issuecomment-861295559


   **[Test build #139803 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139803/testReport)**
 for PR 32917 at commit 
[`06e7b30`](https://github.com/apache/spark/commit/06e7b301ecf3c8e96332c5c57725c101ca61bfb4).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.





[GitHub] [spark] MaxGekk commented on pull request #32909: [WIP][SPARK-35680][SQL] Add fields to `YearMonthIntervalType`

2021-06-15 Thread GitBox


MaxGekk commented on pull request #32909:
URL: https://github.com/apache/spark/pull/32909#issuecomment-861295970


   @cloud-fan I think it is ready for review. I just need to replace 
`SPARK-X` in the code and open sub-tasks in JIRA. Could you take a look at 
this, please?





[GitHub] [spark] SparkQA removed a comment on pull request #32917: [SPARK-35758][DOCS] Update the document about building Spark with Hadoop for Hadoop 2.x and 3.x

2021-06-15 Thread GitBox


SparkQA removed a comment on pull request #32917:
URL: https://github.com/apache/spark/pull/32917#issuecomment-861282868


   **[Test build #139803 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139803/testReport)**
 for PR 32917 at commit 
[`06e7b30`](https://github.com/apache/spark/commit/06e7b301ecf3c8e96332c5c57725c101ca61bfb4).





[GitHub] [spark] SparkQA commented on pull request #32645: [SPARK-35129][SQL] Construct year-month interval column from integral fields

2021-06-15 Thread GitBox


SparkQA commented on pull request #32645:
URL: https://github.com/apache/spark/pull/32645#issuecomment-861297404


   Kubernetes integration test status success
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/44324/
   





[GitHub] [spark] SparkQA commented on pull request #32904: [SPARK-33122][FOLLOWUP][SQL] Distinct aggs are not duplicate sensitive

2021-06-15 Thread GitBox


SparkQA commented on pull request #32904:
URL: https://github.com/apache/spark/pull/32904#issuecomment-861297951


   Kubernetes integration test status success
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/44325/
   





[GitHub] [spark] SparkQA commented on pull request #32904: [SPARK-33122][FOLLOWUP][SQL] Distinct aggs are not duplicate sensitive

2021-06-15 Thread GitBox


SparkQA commented on pull request #32904:
URL: https://github.com/apache/spark/pull/32904#issuecomment-861309244


   Kubernetes integration test status success
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/44326/
   





[GitHub] [spark] SparkQA commented on pull request #32821: [SPARK-35342][PYTHON] Introduce DecimalOps and make `isnull` method data-type-based

2021-06-15 Thread GitBox


SparkQA commented on pull request #32821:
URL: https://github.com/apache/spark/pull/32821#issuecomment-861312401


   Kubernetes integration test starting
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/44328/
   





[GitHub] [spark] SparkQA commented on pull request #32645: [SPARK-35129][SQL] Construct year-month interval column from integral fields

2021-06-15 Thread GitBox


SparkQA commented on pull request #32645:
URL: https://github.com/apache/spark/pull/32645#issuecomment-861314335


   Kubernetes integration test starting
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/44329/
   





[GitHub] [spark] cloud-fan commented on pull request #32885: [SPARK-35742][SQL] Expression.semanticEquals should be symmetrical

2021-06-15 Thread GitBox


cloud-fan commented on pull request #32885:
URL: https://github.com/apache/spark/pull/32885#issuecomment-861315658


   thanks for the review, merging to master!





[GitHub] [spark] cloud-fan closed pull request #32885: [SPARK-35742][SQL] Expression.semanticEquals should be symmetrical

2021-06-15 Thread GitBox


cloud-fan closed pull request #32885:
URL: https://github.com/apache/spark/pull/32885


   





[GitHub] [spark] sarutak commented on pull request #32845: [SPARK-35691][CORE] addFile/addJar/addDirectory should put CanonicalFile

2021-06-15 Thread GitBox


sarutak commented on pull request #32845:
URL: https://github.com/apache/spark/pull/32845#issuecomment-861316002


   retest this please.





[GitHub] [spark] SparkQA commented on pull request #32915: [SPARK-35764][SQL] Assign pretty names to TimestampWithoutTZType

2021-06-15 Thread GitBox


SparkQA commented on pull request #32915:
URL: https://github.com/apache/spark/pull/32915#issuecomment-861318378


   Kubernetes integration test starting
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/44327/
   





[GitHub] [spark] AmplabJenkins commented on pull request #32904: [SPARK-33122][FOLLOWUP][SQL] Distinct aggs are not duplicate sensitive

2021-06-15 Thread GitBox


AmplabJenkins commented on pull request #32904:
URL: https://github.com/apache/spark/pull/32904#issuecomment-861320182









[GitHub] [spark] AmplabJenkins commented on pull request #32917: [SPARK-35758][DOCS] Update the document about building Spark with Hadoop for Hadoop 2.x and 3.x

2021-06-15 Thread GitBox


AmplabJenkins commented on pull request #32917:
URL: https://github.com/apache/spark/pull/32917#issuecomment-861320181


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139803/
   





[GitHub] [spark] AmplabJenkins commented on pull request #32645: [SPARK-35129][SQL] Construct year-month interval column from integral fields

2021-06-15 Thread GitBox


AmplabJenkins commented on pull request #32645:
URL: https://github.com/apache/spark/pull/32645#issuecomment-861320187


   
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/44324/
   





[GitHub] [spark] SparkQA commented on pull request #32881: [SPARK-33298][CORE] Decouple file naming from FileCommitProtocol

2021-06-15 Thread GitBox


SparkQA commented on pull request #32881:
URL: https://github.com/apache/spark/pull/32881#issuecomment-861321549


   **[Test build #139805 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139805/testReport)**
 for PR 32881 at commit 
[`65346ab`](https://github.com/apache/spark/commit/65346ab1e7dd110777d2243d3337b1d335bb9326).





[GitHub] [spark] SparkQA commented on pull request #32909: [WIP][SPARK-35680][SQL] Add fields to `YearMonthIntervalType`

2021-06-15 Thread GitBox


SparkQA commented on pull request #32909:
URL: https://github.com/apache/spark/pull/32909#issuecomment-861321432


   **[Test build #139804 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139804/testReport)**
 for PR 32909 at commit 
[`efe001c`](https://github.com/apache/spark/commit/efe001cde3a9b0a0b4b8e4f8db0c4334f0e0ca2f).





[GitHub] [spark] SparkQA commented on pull request #32845: [SPARK-35691][CORE] addFile/addJar/addDirectory should put CanonicalFile

2021-06-15 Thread GitBox


SparkQA commented on pull request #32845:
URL: https://github.com/apache/spark/pull/32845#issuecomment-861321691


   **[Test build #139806 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139806/testReport)**
 for PR 32845 at commit 
[`f3ec9a3`](https://github.com/apache/spark/commit/f3ec9a3229e937ddde0ff9f72342043a058147c4).





[GitHub] [spark] SparkQA commented on pull request #32742: [SPARK-35608][SQL] Support AQE optimizer side transformUpWithPruning

2021-06-15 Thread GitBox


SparkQA commented on pull request #32742:
URL: https://github.com/apache/spark/pull/32742#issuecomment-861321820


   **[Test build #139807 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139807/testReport)**
 for PR 32742 at commit 
[`1a770bb`](https://github.com/apache/spark/commit/1a770bb1cee034ad67a256df94e8022dd32bdbb9).





[GitHub] [spark] tanelk commented on pull request #32904: [SPARK-33122][FOLLOWUP][SQL] Distinct aggs are not duplicate sensitive

2021-06-15 Thread GitBox


tanelk commented on pull request #32904:
URL: https://github.com/apache/spark/pull/32904#issuecomment-861325178


   > Hi, @tanelk . Could you take a look at GitHub Action UT failures?
   
   I ran `StreamingAggregationSuite` and 
`KafkaMicroBatchV1SourceWithAdminSuite` locally and all tests passed. Let's wait 
for the Jenkins results.





[GitHub] [spark] AmplabJenkins removed a comment on pull request #32904: [SPARK-33122][FOLLOWUP][SQL] Distinct aggs are not duplicate sensitive

2021-06-15 Thread GitBox


AmplabJenkins removed a comment on pull request #32904:
URL: https://github.com/apache/spark/pull/32904#issuecomment-861320182









[GitHub] [spark] AmplabJenkins removed a comment on pull request #32645: [SPARK-35129][SQL] Construct year-month interval column from integral fields

2021-06-15 Thread GitBox


AmplabJenkins removed a comment on pull request #32645:
URL: https://github.com/apache/spark/pull/32645#issuecomment-861320187


   
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/44324/
   





[GitHub] [spark] AmplabJenkins removed a comment on pull request #32917: [SPARK-35758][DOCS] Update the document about building Spark with Hadoop for Hadoop 2.x and 3.x

2021-06-15 Thread GitBox


AmplabJenkins removed a comment on pull request #32917:
URL: https://github.com/apache/spark/pull/32917#issuecomment-861320181


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139803/
   





[GitHub] [spark] MaxGekk commented on pull request #32915: [SPARK-35764][SQL] Assign pretty names to TimestampWithoutTZType

2021-06-15 Thread GitBox


MaxGekk commented on pull request #32915:
URL: https://github.com/apache/spark/pull/32915#issuecomment-861332617


   +1, LGTM. Merging to master.
   Thank you, @gengliangwang .





[GitHub] [spark] AmplabJenkins commented on pull request #32917: [SPARK-35758][DOCS] Update the document about building Spark with Hadoop for Hadoop 2.x and 3.x

2021-06-15 Thread GitBox


AmplabJenkins commented on pull request #32917:
URL: https://github.com/apache/spark/pull/32917#issuecomment-861333624


   
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/44330/
   





[GitHub] [spark] SparkQA commented on pull request #32917: [SPARK-35758][DOCS] Update the document about building Spark with Hadoop for Hadoop 2.x and 3.x

2021-06-15 Thread GitBox


SparkQA commented on pull request #32917:
URL: https://github.com/apache/spark/pull/32917#issuecomment-861333598


   Kubernetes integration test unable to build dist.
   
   exiting with code: 1
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/44330/
   





[GitHub] [spark] MaxGekk closed pull request #32915: [SPARK-35764][SQL] Assign pretty names to TimestampWithoutTZType

2021-06-15 Thread GitBox


MaxGekk closed pull request #32915:
URL: https://github.com/apache/spark/pull/32915


   





[GitHub] [spark] AmplabJenkins removed a comment on pull request #32917: [SPARK-35758][DOCS] Update the document about building Spark with Hadoop for Hadoop 2.x and 3.x

2021-06-15 Thread GitBox


AmplabJenkins removed a comment on pull request #32917:
URL: https://github.com/apache/spark/pull/32917#issuecomment-861333624


   
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/44330/
   





[GitHub] [spark] MaxGekk commented on pull request #32880: [SPARK-35056][SQL] Group exception messages in execution/streaming

2021-06-15 Thread GitBox


MaxGekk commented on pull request #32880:
URL: https://github.com/apache/spark/pull/32880#issuecomment-861336109


   +1, LGTM. Merging to master.
   Thank you, @beliefer .





[GitHub] [spark] MaxGekk closed pull request #32880: [SPARK-35056][SQL] Group exception messages in execution/streaming

2021-06-15 Thread GitBox


MaxGekk closed pull request #32880:
URL: https://github.com/apache/spark/pull/32880


   





[GitHub] [spark] beliefer commented on pull request #32880: [SPARK-35056][SQL] Group exception messages in execution/streaming

2021-06-15 Thread GitBox


beliefer commented on pull request #32880:
URL: https://github.com/apache/spark/pull/32880#issuecomment-861338050


   @MaxGekk Thank you!





[GitHub] [spark] SparkQA commented on pull request #32821: [SPARK-35342][PYTHON] Introduce DecimalOps and make `isnull` method data-type-based

2021-06-15 Thread GitBox


SparkQA commented on pull request #32821:
URL: https://github.com/apache/spark/pull/32821#issuecomment-861338489


   Kubernetes integration test status success
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/44328/
   





[GitHub] [spark] SparkQA commented on pull request #32645: [SPARK-35129][SQL] Construct year-month interval column from integral fields

2021-06-15 Thread GitBox


SparkQA commented on pull request #32645:
URL: https://github.com/apache/spark/pull/32645#issuecomment-861339534


   Kubernetes integration test status success
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/44329/
   





[GitHub] [spark] beliefer commented on pull request #32916: [WIP][SPARK-35064][SQL] Group error in spark-catalyst

2021-06-15 Thread GitBox


beliefer commented on pull request #32916:
URL: https://github.com/apache/spark/pull/32916#issuecomment-861340459


   @dgd-contributor Did you check all the `Exception`s in package `sql/catalyst/src/main/scala/org/apache/spark/sql`?





[GitHub] [spark] beliefer commented on a change in pull request #32916: [WIP][SPARK-35064][SQL] Group error in spark-catalyst

2021-06-15 Thread GitBox


beliefer commented on a change in pull request #32916:
URL: https://github.com/apache/spark/pull/32916#discussion_r651613992



##
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/connector/catalog/CatalogV2Util.scala
##
@@ -146,7 +146,7 @@ private[sql] object CatalogV2Util {
   Some(parent.copy(dataType = addField(parentType, newField, add.position(
 
 case _ =>
-  throw new IllegalArgumentException(s"Not a struct: ${names.init.last}")

Review comment:
   `IllegalArgumentException` is an assert-like exception; please restore it.
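
One possible reading of the review comment above is that checks which guard internal invariants should keep throwing `IllegalArgumentException` directly, while user-facing failures go through a grouped error helper. The following is a minimal sketch of that contrast under stated assumptions: `Field`, `GroupedErrors`, and both lookup methods are hypothetical names made up for illustration and are not Spark APIs.

```scala
object ErrorStyleSketch {
  // Hypothetical stand-in for a schema field; not Spark's StructField.
  final case class Field(name: String)

  // Hypothetical grouped-error helper, loosely modelled on the idea of
  // collecting error construction in one object.
  object GroupedErrors {
    def fieldNotFound(name: String): Throwable =
      new RuntimeException(s"Field not found: $name")
  }

  // Assert-like style: the check throws IllegalArgumentException in place.
  def findFieldAssertLike(fields: Seq[Field], name: String): Field =
    fields.find(_.name == name).getOrElse {
      throw new IllegalArgumentException("Field not found: " + name)
    }

  // Grouped style: the exception is built by the shared helper instead.
  def findFieldGrouped(fields: Seq[Field], name: String): Field =
    fields.find(_.name == name).getOrElse {
      throw GroupedErrors.fieldNotFound(name)
    }
}
```

Both methods return the same field on success; they differ only in which exception type a caller sees when the lookup fails.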







[GitHub] [spark] beliefer commented on a change in pull request #32916: [WIP][SPARK-35064][SQL] Group error in spark-catalyst

2021-06-15 Thread GitBox


beliefer commented on a change in pull request #32916:
URL: https://github.com/apache/spark/pull/32916#discussion_r651614405



##
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/connector/catalog/CatalogV2Util.scala
##
@@ -171,7 +171,7 @@ private[sql] object CatalogV2Util {
 case update: UpdateColumnPosition =>
   def updateFieldPos(struct: StructType, name: String): StructType = {
 val oldField = struct.fields.find(_.name == name).getOrElse {
-  throw new IllegalArgumentException("Field not found: " + name)

Review comment:
   ditto

##
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/connector/catalog/CatalogV2Util.scala
##
@@ -185,7 +185,7 @@ private[sql] object CatalogV2Util {
 case parentType: StructType =>
   Some(parent.copy(dataType = updateFieldPos(parentType, names.last)))
 case _ =>
-  throw new IllegalArgumentException(s"Not a struct: ${names.init.last}")

Review comment:
   ditto

##
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/connector/catalog/CatalogV2Util.scala
##
@@ -211,7 +211,7 @@ private[sql] object CatalogV2Util {
   val afterCol = position.asInstanceOf[After].column()
   val fieldIndex = schema.fields.indexWhere(_.name == afterCol)
   if (fieldIndex == -1) {
-throw new IllegalArgumentException("AFTER column not found: " + afterCol)

Review comment:
   ditto







[GitHub] [spark] beliefer commented on a change in pull request #32916: [WIP][SPARK-35064][SQL] Group error in spark-catalyst

2021-06-15 Thread GitBox


beliefer commented on a change in pull request #32916:
URL: https://github.com/apache/spark/pull/32916#discussion_r651614603



##
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/connector/catalog/CatalogV2Util.scala
##
@@ -224,7 +224,7 @@ private[sql] object CatalogV2Util {
   update: StructField => Option[StructField]): StructType = {
 
 val pos = struct.getFieldIndex(fieldNames.head)
-.getOrElse(throw new IllegalArgumentException(s"Cannot find field: ${fieldNames.head}"))

Review comment:
   ditto

##
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/connector/catalog/CatalogV2Util.scala
##
@@ -236,15 +236,15 @@ private[sql] object CatalogV2Util {
 
   case (Seq("key"), map @ MapType(keyType, _, _)) =>
 val updated = update(StructField("key", keyType, nullable = false))
-.getOrElse(throw new IllegalArgumentException(s"Cannot delete map key"))

Review comment:
   ditto

##
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/connector/catalog/CatalogV2Util.scala
##
@@ -236,15 +236,15 @@ private[sql] object CatalogV2Util {
 
   case (Seq("key"), map @ MapType(keyType, _, _)) =>
 val updated = update(StructField("key", keyType, nullable = false))
-.getOrElse(throw new IllegalArgumentException(s"Cannot delete map key"))
+.getOrElse(throw QueryExecutionErrors.cannotDelete("map key"))
 Some(field.copy(dataType = map.copy(keyType = updated.dataType)))
 
   case (Seq("key", names @ _*), map @ MapType(keyStruct: StructType, _, _)) =>
 Some(field.copy(dataType = map.copy(keyType = replace(keyStruct, names, update
 
   case (Seq("value"), map @ MapType(_, mapValueType, isNullable)) =>
 val updated = update(StructField("value", mapValueType, nullable = isNullable))
-.getOrElse(throw new IllegalArgumentException(s"Cannot delete map value"))

Review comment:
   ditto

##
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/connector/catalog/CatalogV2Util.scala
##
@@ -254,7 +254,7 @@ private[sql] object CatalogV2Util {
 
   case (Seq("element"), array @ ArrayType(elementType, isNullable)) =>
 val updated = update(StructField("element", elementType, nullable = isNullable))
-.getOrElse(throw new IllegalArgumentException(s"Cannot delete array element"))

Review comment:
   ditto







[GitHub] [spark] beliefer commented on a change in pull request #32916: [WIP][SPARK-35064][SQL] Group error in spark-catalyst

2021-06-15 Thread GitBox


beliefer commented on a change in pull request #32916:
URL: https://github.com/apache/spark/pull/32916#discussion_r651614899



##
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/connector/catalog/CatalogV2Util.scala
##
@@ -263,8 +263,7 @@ private[sql] object CatalogV2Util {
 Some(field.copy(dataType = array.copy(elementType = replace(elementStruct, names, update
 
   case (names, dataType) =>
-throw new IllegalArgumentException(

Review comment:
   ditto







[GitHub] [spark] SparkQA commented on pull request #32915: [SPARK-35764][SQL] Assign pretty names to TimestampWithoutTZType

2021-06-15 Thread GitBox


SparkQA commented on pull request #32915:
URL: https://github.com/apache/spark/pull/32915#issuecomment-861345046


   Kubernetes integration test status success
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/44327/
   





[GitHub] [spark] SparkQA commented on pull request #32645: [SPARK-35129][SQL] Construct year-month interval column from integral fields

2021-06-15 Thread GitBox


SparkQA commented on pull request #32645:
URL: https://github.com/apache/spark/pull/32645#issuecomment-861354363


   **[Test build #139798 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139798/testReport)** for PR 32645 at commit [`b9248aa`](https://github.com/apache/spark/commit/b9248aafc830c2f5b4daee9b6a6ed8b5d572d038).
* This patch **fails Spark unit tests**.
* This patch **does not merge cleanly**.
* This patch adds no public classes.





[GitHub] [spark] SparkQA removed a comment on pull request #32645: [SPARK-35129][SQL] Construct year-month interval column from integral fields

2021-06-15 Thread GitBox


SparkQA removed a comment on pull request #32645:
URL: https://github.com/apache/spark/pull/32645#issuecomment-861235933


   **[Test build #139798 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139798/testReport)** for PR 32645 at commit [`b9248aa`](https://github.com/apache/spark/commit/b9248aafc830c2f5b4daee9b6a6ed8b5d572d038).





[GitHub] [spark] cloud-fan commented on a change in pull request #32470: [SPARK-35712][SQL] Simplify ResolveAggregateFunctions

2021-06-15 Thread GitBox


cloud-fan commented on a change in pull request #32470:
URL: https://github.com/apache/spark/pull/32470#discussion_r651629291



##
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/unresolved.scala
##
@@ -652,3 +652,15 @@ case object UnresolvedSeed extends LeafExpression with Unevaluable {
   override def dataType: DataType = throw new UnresolvedException("dataType")
   override lazy val resolved = false
 }
+
+/**
+ * An intermediate expression to hold a resolved (nested) column. Some rules may need to undo the
+ * column resolution and use this expression to keep the original column name.
+ */
+case class TempResolvedColumn(child: Expression, nameParts: Seq[String]) extends UnaryExpression

Review comment:
   Actually, we don't require it to be a named expression, so it's more 
robust to keep the type as `Expression`.
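
The point about keeping the child typed as `Expression` can be illustrated with a small, self-contained sketch. Everything here is an assumption made for illustration: the `Expression`, `NamedExpression`, `AttributeRef`, and `GetField` types below are simplified stand-ins, not Catalyst's real classes, and only the `TempResolvedColumn(child, nameParts)` shape comes from the diff above.

```scala
object TempResolvedColumnSketch {
  // Simplified, hypothetical stand-ins for an expression hierarchy.
  sealed trait Expression
  sealed trait NamedExpression extends Expression { def name: String }
  final case class AttributeRef(name: String) extends NamedExpression
  // A nested field access that carries no name of its own.
  final case class GetField(child: Expression, field: String) extends Expression

  // Because child is typed as Expression, the wrapper accepts both
  // named and unnamed resolved columns.
  final case class TempResolvedColumn(child: Expression, nameParts: Seq[String])
    extends Expression

  val simple: TempResolvedColumn =
    TempResolvedColumn(AttributeRef("a"), Seq("a"))
  val nested: TempResolvedColumn =
    TempResolvedColumn(GetField(AttributeRef("a"), "b"), Seq("a", "b"))
}
```

Had `child` been declared as `NamedExpression` in this sketch, the `nested` value would not compile, since `GetField` is not a named expression here.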







[GitHub] [spark] SparkQA commented on pull request #32909: [WIP][SPARK-35680][SQL] Add fields to `YearMonthIntervalType`

2021-06-15 Thread GitBox


SparkQA commented on pull request #32909:
URL: https://github.com/apache/spark/pull/32909#issuecomment-861361771


   Kubernetes integration test starting
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/44331/
   








[GitHub] [spark] AmplabJenkins commented on pull request #32645: [SPARK-35129][SQL] Construct year-month interval column from integral fields

2021-06-15 Thread GitBox


AmplabJenkins commented on pull request #32645:
URL: https://github.com/apache/spark/pull/32645#issuecomment-861363317









[GitHub] [spark] AmplabJenkins commented on pull request #32821: [SPARK-35342][PYTHON] Introduce DecimalOps and make `isnull` method data-type-based

2021-06-15 Thread GitBox


AmplabJenkins commented on pull request #32821:
URL: https://github.com/apache/spark/pull/32821#issuecomment-861363319


   
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/44328/
   





[GitHub] [spark] AmplabJenkins commented on pull request #32915: [SPARK-35764][SQL] Assign pretty names to TimestampWithoutTZType

2021-06-15 Thread GitBox


AmplabJenkins commented on pull request #32915:
URL: https://github.com/apache/spark/pull/32915#issuecomment-861363313


   
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/44327/
   





[GitHub] [spark] AmplabJenkins removed a comment on pull request #32645: [SPARK-35129][SQL] Construct year-month interval column from integral fields

2021-06-15 Thread GitBox


AmplabJenkins removed a comment on pull request #32645:
URL: https://github.com/apache/spark/pull/32645#issuecomment-861363316









[GitHub] [spark] AmplabJenkins removed a comment on pull request #32915: [SPARK-35764][SQL] Assign pretty names to TimestampWithoutTZType

2021-06-15 Thread GitBox


AmplabJenkins removed a comment on pull request #32915:
URL: https://github.com/apache/spark/pull/32915#issuecomment-861363313


   
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/44327/
   





[GitHub] [spark] AmplabJenkins removed a comment on pull request #32821: [SPARK-35342][PYTHON] Introduce DecimalOps and make `isnull` method data-type-based

2021-06-15 Thread GitBox


AmplabJenkins removed a comment on pull request #32821:
URL: https://github.com/apache/spark/pull/32821#issuecomment-861363319


   
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/44328/
   





[GitHub] [spark] peter-toth commented on a change in pull request #32885: [SPARK-35742][SQL] Expression.semanticEquals should be symmetrical

2021-06-15 Thread GitBox


peter-toth commented on a change in pull request #32885:
URL: https://github.com/apache/spark/pull/32885#discussion_r651638388



##
File path: 
sql/core/src/main/scala/org/apache/spark/sql/execution/aggregate/HashAggregateExec.scala
##
@@ -73,7 +73,9 @@ case class HashAggregateExec(
   // This is for testing. We force TungstenAggregationIterator to fall back to the unsafe row hash
   // map and/or the sort-based aggregation once it has processed a given number of input rows.
   private val testFallbackStartsAt: Option[(Int, Int)] = {
-sqlContext.getConf("spark.sql.TungstenAggregate.testFallbackStartsAt", null) match {
+Option(sqlContext).map { sc =>

Review comment:
   Sorry for the late comment @cloud-fan, but I think I've run into this issue before:
   https://github.com/apache/spark/pull/28885/files#diff-9b62cef6bfdeb6c802bb120c7a724a974d5067a69585285bebb64c48603f8d6fR105-R108.
   The point is that there might be other nodes where canonicalization on the executor side can cause issues. `SortExec.enableRadixSort` is the other one I found.
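
The `Option(sqlContext).map { sc => ... }` pattern in the diff guards against a context reference that is `null`. A minimal sketch of that null-safe lookup follows, assuming a hypothetical `FakeContext` class and a made-up key `test.fallbackStartsAt`; neither is a Spark API, and the parsing rule (one or two comma-separated integers) is an assumption for illustration only.

```scala
object NullSafeConfSketch {
  // Hypothetical stand-in for a session context; not a Spark class.
  final class FakeContext(conf: Map[String, String]) {
    def getConf(key: String, default: String): String =
      conf.getOrElse(key, default)
  }

  // Returns None when the context is null, or the key is unset or empty.
  // Otherwise parses "a,b" into (a, b), and a single "a" into (a, a).
  def testFallbackStartsAt(ctx: FakeContext): Option[(Int, Int)] =
    Option(ctx) // Option(null) is None, so the later steps are skipped
      .flatMap(c => Option(c.getConf("test.fallbackStartsAt", null)))
      .filter(_.nonEmpty)
      .map { raw =>
        val parts = raw.split(",").map(_.trim.toInt)
        (parts.head, parts.last)
      }
}
```

Wrapping the reference in `Option` first means a field initializer of this shape does not throw a `NullPointerException` when the context is absent; it simply yields `None`.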







[GitHub] [spark] SparkQA commented on pull request #32470: [SPARK-35712][SQL] Simplify ResolveAggregateFunctions

2021-06-15 Thread GitBox


SparkQA commented on pull request #32470:
URL: https://github.com/apache/spark/pull/32470#issuecomment-861365825


   **[Test build #139808 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139808/testReport)** for PR 32470 at commit [`b362a09`](https://github.com/apache/spark/commit/b362a098978be65ab1fc033fe0213a78a467b6ae).





[GitHub] [spark] SparkQA commented on pull request #32912: [SPARK-35429][CORE] Remove commons-httpclient due to EOL and CVEs

2021-06-15 Thread GitBox


SparkQA commented on pull request #32912:
URL: https://github.com/apache/spark/pull/32912#issuecomment-861367891


   **[Test build #139797 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139797/testReport)** for PR 32912 at commit [`3c0eb47`](https://github.com/apache/spark/commit/3c0eb4763bf74a67eabb21fc02565790d9505ee0).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.





[GitHub] [spark] SparkQA commented on pull request #32845: [SPARK-35691][CORE] addFile/addJar/addDirectory should put CanonicalFile

2021-06-15 Thread GitBox


SparkQA commented on pull request #32845:
URL: https://github.com/apache/spark/pull/32845#issuecomment-861367881


   Kubernetes integration test starting
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/44333/
   





[GitHub] [spark] AmplabJenkins commented on pull request #32912: [SPARK-35429][CORE] Remove commons-httpclient due to EOL and CVEs

2021-06-15 Thread GitBox


AmplabJenkins commented on pull request #32912:
URL: https://github.com/apache/spark/pull/32912#issuecomment-861369339


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139797/
   





[GitHub] [spark] SparkQA commented on pull request #32881: [SPARK-33298][CORE] Decouple file naming from FileCommitProtocol

2021-06-15 Thread GitBox


SparkQA commented on pull request #32881:
URL: https://github.com/apache/spark/pull/32881#issuecomment-861370670


   Kubernetes integration test starting
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/44332/
   





[GitHub] [spark] AmplabJenkins removed a comment on pull request #32912: [SPARK-35429][CORE] Remove commons-httpclient due to EOL and CVEs

2021-06-15 Thread GitBox


AmplabJenkins removed a comment on pull request #32912:
URL: https://github.com/apache/spark/pull/32912#issuecomment-861369339


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/139797/
   





[GitHub] [spark] SparkQA removed a comment on pull request #32912: [SPARK-35429][CORE] Remove commons-httpclient due to EOL and CVEs

2021-06-15 Thread GitBox


SparkQA removed a comment on pull request #32912:
URL: https://github.com/apache/spark/pull/32912#issuecomment-861235546


   **[Test build #139797 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/139797/testReport)** for PR 32912 at commit [`3c0eb47`](https://github.com/apache/spark/commit/3c0eb4763bf74a67eabb21fc02565790d9505ee0).





[GitHub] [spark] AmplabJenkins commented on pull request #32742: [SPARK-35608][SQL] Support AQE optimizer side transformUpWithPruning

2021-06-15 Thread GitBox


AmplabJenkins commented on pull request #32742:
URL: https://github.com/apache/spark/pull/32742#issuecomment-861375308


   
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/44334/
   




