Re: [PR] [SPARK-47927][SQL]: Fix nullability attribute in UDF decoder [spark]

2024-04-27 Thread via GitHub
cloud-fan closed pull request #46156: [SPARK-47927][SQL]: Fix nullability attribute in UDF decoder URL: https://github.com/apache/spark/pull/46156 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

Re: [PR] [SPARK-47927][SQL]: Fix nullability attribute in UDF decoder [spark]

2024-04-27 Thread via GitHub
cloud-fan commented on PR #46156: URL: https://github.com/apache/spark/pull/46156#issuecomment-2081341603 thanks, merging to master/3.5/3.4!

Re: [PR] [SPARK-47927][SQL]: Fix nullability attribute in UDF decoder [spark]

2024-04-27 Thread via GitHub
cloud-fan commented on PR #46156: URL: https://github.com/apache/spark/pull/46156#issuecomment-2081341418 good catch!

Re: [PR] [SPARK-48004][SQL] Add WriteFilesExecBase trait for v1 write [spark]

2024-04-27 Thread via GitHub
cloud-fan commented on PR #46240: URL: https://github.com/apache/spark/pull/46240#issuecomment-2081340902 late LGTM

Re: [PR] [SPARK-48002][PYTHON][SS] Add test for observed metrics in PySpark StreamingQueryListener [spark]

2024-04-27 Thread via GitHub
WweiL commented on PR #46237: URL: https://github.com/apache/spark/pull/46237#issuecomment-2081327432 @HyukjinKwon I think we can merge this now : )

Re: [PR] [SPARK-47292][SS] safeMapToJValue should consider null typed values [spark]

2024-04-27 Thread via GitHub
WweiL commented on PR #46260: URL: https://github.com/apache/spark/pull/46260#issuecomment-2081317940 CC @HeartSaVioR PTAL, thank you!

[PR] [SPARK-47292][SS] safeMapToJValue should consider null typed values [spark]

2024-04-27 Thread via GitHub
WweiL opened a new pull request, #46260: URL: https://github.com/apache/spark/pull/46260 ### What changes were proposed in this pull request? Additional null check to the `safeMapToJValue`. Normally we won't create a `StreamingQueryProgress` with map fields as null. It is also

Re: [PR] [SPARK-48021][ML][BUILD][FOLLOWUP] add `--add-modules=jdk.incubator.vector` to maven compile args [spark]

2024-04-27 Thread via GitHub
panbingkun commented on PR #46259: URL: https://github.com/apache/spark/pull/46259#issuecomment-2081311526 > We can manually verify it through Maven test `build/mvn test -pl mllib-local`: > > Before > > ![image](https://private-user-images.githubusercontent.com/1475305/3262271

Re: [PR] [SPARK-48021][ML][BUILD][FOLLOWUP] add `--add-modules=jdk.incubator.vector` to maven compile args [spark]

2024-04-27 Thread via GitHub
LuciferYang commented on PR #46259: URL: https://github.com/apache/spark/pull/46259#issuecomment-2081310548 We can manually verify it through Maven test: Before ![image](https://github.com/apache/spark/assets/1475305/1c002f85-175e-4554-a5a5-b05eab244f9c) there is a WARNI

Re: [PR] [SPARK-48019] Fix incorrect behavior in ColumnVector/ColumnarArray with dictionary and nulls [spark]

2024-04-27 Thread via GitHub
cloud-fan closed pull request #46254: [SPARK-48019] Fix incorrect behavior in ColumnVector/ColumnarArray with dictionary and nulls URL: https://github.com/apache/spark/pull/46254

Re: [PR] [SPARK-48019] Fix incorrect behavior in ColumnVector/ColumnarArray with dictionary and nulls [spark]

2024-04-27 Thread via GitHub
cloud-fan commented on PR #46254: URL: https://github.com/apache/spark/pull/46254#issuecomment-2081305430 thanks, merging to master/3.5!

Re: [PR] [SPARK-48021][ML][BUILD][FOLLOWUP] add `--add-modules=jdk.incubator.vector` to maven compile args [spark]

2024-04-27 Thread via GitHub
LuciferYang commented on PR #46259: URL: https://github.com/apache/spark/pull/46259#issuecomment-2081303527 Yes, we should keep `JavaModuleOptions`, `extraTestJavaArgs` in `SparkBuild.scala`, and `extraTestJavaArgs` in `pom.xml` consistent.

Re: [PR] [SPARK-48021][ML][BUILD][FOLLOWUP] add `--add-modules=jdk.incubator.vector` to maven compile args [spark]

2024-04-27 Thread via GitHub
panbingkun commented on PR #46259: URL: https://github.com/apache/spark/pull/46259#issuecomment-2081303045 cc @LuciferYang

Re: [PR] [SPARK-48021][ML][BUILD] Add `--add-modules=jdk.incubator.vector` to `JavaModuleOptions` [spark]

2024-04-27 Thread via GitHub
panbingkun commented on PR #46246: URL: https://github.com/apache/spark/pull/46246#issuecomment-2081300766 > @panbingkun we should add `--add-modules=jdk.incubator.vector` to `extraJavaTestArgs ` in `pom.xml` too > > https://github.com/apache/spark/blob/64d321926bbcede05d1c145405d503

Re: [PR] [SPARK-48011][Core] Store LogKey name as a value to avoid generating new string instances [spark]

2024-04-27 Thread via GitHub
LuciferYang commented on PR #46249: URL: https://github.com/apache/spark/pull/46249#issuecomment-2081298871 late LGTM

Re: [PR] [SPARK-48021][ML][BUILD] Add `--add-modules=jdk.incubator.vector` to `JavaModuleOptions` [spark]

2024-04-27 Thread via GitHub
LuciferYang commented on PR #46246: URL: https://github.com/apache/spark/pull/46246#issuecomment-2081298219 @panbingkun we should add `--add-modules=jdk.incubator.vector` to `pom.xml` too https://github.com/apache/spark/blob/64d321926bbcede05d1c145405d503b3431f185b/pom.xml#L305-L

[PR] [Only for check Docker Image] Check installed packages on ubuntu 22.04 [spark]

2024-04-27 Thread via GitHub
panbingkun opened a new pull request, #46258: URL: https://github.com/apache/spark/pull/46258 ### What changes were proposed in this pull request? ### Why are the changes needed? ### Does this PR introduce _any_ user-facing change? ### How

Re: [PR] [SPARK-47516][INFRA] Move `remove unused installation package logic` from `each test job` to `create the docker image` [spark]

2024-04-27 Thread via GitHub
panbingkun commented on PR #45659: URL: https://github.com/apache/spark/pull/45659#issuecomment-2081287287 > @panbingkun > > Hi, bingkun, when rebuild the image in https://github.com/zhengruifeng/spark/actions/runs/8857365994/job/24324764602 > > I see such warnings: > >

[PR] [SPARK-48024][PYTHON][CONNECT][TESTS] Enable `UDFParityTests.test_udf_timestamp_ntz` [spark]

2024-04-27 Thread via GitHub
zhengruifeng opened a new pull request, #46257: URL: https://github.com/apache/spark/pull/46257 ### What changes were proposed in this pull request? Enable `UDFParityTests.test_udf_timestamp_ntz` ### Why are the changes needed? for test coverage ### Does this PR in

Re: [PR] [SPARK-48021][ML][BUILD] Add `--add-modules=jdk.incubator.vector` to `JavaModuleOptions` [spark]

2024-04-27 Thread via GitHub
dongjoon-hyun commented on PR #46246: URL: https://github.com/apache/spark/pull/46246#issuecomment-2081266374 Merged to master. Thank you, @panbingkun and all!

Re: [PR] [SPARK-48021][ML][BUILD] Add `--add-modules=jdk.incubator.vector` to `JavaModuleOptions` [spark]

2024-04-27 Thread via GitHub
dongjoon-hyun closed pull request #46246: [SPARK-48021][ML][BUILD] Add `--add-modules=jdk.incubator.vector` to `JavaModuleOptions` URL: https://github.com/apache/spark/pull/46246

Re: [PR] [SPARK-48020][INFRA][PYTHON] Pin 'pandas==2.2.2' [spark]

2024-04-27 Thread via GitHub
dongjoon-hyun commented on PR #46256: URL: https://github.com/apache/spark/pull/46256#issuecomment-2081265499 Thank you all!

Re: [PR] [SPARK-48021][ML][BUILD] Add `--add-modules=jdk.incubator.vector` to `JavaModuleOptions` [spark]

2024-04-27 Thread via GitHub
zhengruifeng commented on PR #46246: URL: https://github.com/apache/spark/pull/46246#issuecomment-2081262259 also cc @WeichenXu123

Re: [PR] [SPARK-46744][SPARK-SHELL][SQL][CONNECT][PYTHON][R] Display clear `exit command` for all spark terminal [spark]

2024-04-27 Thread via GitHub
github-actions[bot] commented on PR #44769: URL: https://github.com/apache/spark/pull/44769#issuecomment-2081261642 We're closing this PR because it hasn't been updated in a while. This isn't a judgement on the merit of the PR in any way. It's just a way of keeping the PR queue manageable.

Re: [PR] [SPARK-44635][CORE] Handle shuffle fetch failures in decommissions [spark]

2024-04-27 Thread via GitHub
github-actions[bot] commented on PR #42296: URL: https://github.com/apache/spark/pull/42296#issuecomment-2081261651 We're closing this PR because it hasn't been updated in a while. This isn't a judgement on the merit of the PR in any way. It's just a way of keeping the PR queue manageable.

Re: [PR] [SPARK-48020][INFRA][PYTHON] Pin 'pandas==2.2.2' [spark]

2024-04-27 Thread via GitHub
zhengruifeng commented on PR #46256: URL: https://github.com/apache/spark/pull/46256#issuecomment-2081261491 thank you @yaooqinn and @HyukjinKwon

Re: [PR] [Don't review, only for test][SPARK-48022][BUILD] Upgrade `jersey` to `3.1.6` [spark]

2024-04-27 Thread via GitHub
panbingkun commented on PR #46252: URL: https://github.com/apache/spark/pull/46252#issuecomment-2081261447 > The below MR may give some hints also to this ticket, bumping Jersey to v3.1.x requires all Spark to comply with EE10 standards, as I have tried during the Jetty 12 upgrade. #45500

Re: [PR] [Don't review, only for test][SPARK-48022][BUILD] Upgrade `jersey` to `3.1.6` [spark]

2024-04-27 Thread via GitHub
HiuKwok commented on PR #46252: URL: https://github.com/apache/spark/pull/46252#issuecomment-2081188223 The below MR may give some hints also to this ticket, bumping Jersey to v3.1.x requires all Spark to comply with EE10 standards, as I have tried during the Jetty 12 upgrade. https://gi

Re: [PR] [SPARK-48021][ML][BUILD] Add `--add-modules=jdk.incubator.vector` to `JavaModuleOptions` [spark]

2024-04-27 Thread via GitHub
panbingkun commented on PR #46246: URL: https://github.com/apache/spark/pull/46246#issuecomment-2080865185 > Thank you for looking into that! Let me know what I should do to update dev.ludovic.netlib further for the needs of Spark Thank all for writing in such `detail` in the previous

Re: [PR] [SPARK-48020][INFRA][PYTHON] Pin 'pandas==2.2.2' [spark]

2024-04-27 Thread via GitHub
yaooqinn closed pull request #46256: [SPARK-48020][INFRA][PYTHON] Pin 'pandas==2.2.2' URL: https://github.com/apache/spark/pull/46256

Re: [PR] [SPARK-48020][INFRA][PYTHON] Pin 'pandas==2.2.2' [spark]

2024-04-27 Thread via GitHub
yaooqinn commented on PR #46256: URL: https://github.com/apache/spark/pull/46256#issuecomment-2080844028 Thank you @zhengruifeng @HyukjinKwon Merged to master.

Re: [PR] [SPARK-48021][ML][BUILD] Add `--add-modules=jdk.incubator.vector` to `JavaModuleOptions` [spark]

2024-04-27 Thread via GitHub
luhenry commented on PR #46246: URL: https://github.com/apache/spark/pull/46246#issuecomment-2080831209 Thank you for looking into that! Let me know what I should do to update dev.ludovic.netlib further for the needs of Spark

Re: [PR] [SPARK-48021][ML][BUILD] Add `--add-modules=jdk.incubator.vector` to `JavaModuleOptions` [spark]

2024-04-27 Thread via GitHub
panbingkun commented on PR #46246: URL: https://github.com/apache/spark/pull/46246#issuecomment-2080813720 > Before this flag was gated on Java 21 - it's OK to set this on earlier versions? OK if so Yes, the JDK version of the above manual test environment (local) is `17`. https://

Re: [PR] [SPARK-48021][ML][BUILD] Add `--add-modules=jdk.incubator.vector` to `JavaModuleOptions` [spark]

2024-04-27 Thread via GitHub
srowen commented on PR #46246: URL: https://github.com/apache/spark/pull/46246#issuecomment-2080797254 Before this flag was gated on Java 21 - it's OK to set this on earlier versions? OK if so

Re: [PR] [SPARK-48021][ML][BUILD] Add `--add-modules=jdk.incubator.vector` to `JavaModuleOptions` [spark]

2024-04-27 Thread via GitHub
panbingkun commented on PR #46246: URL: https://github.com/apache/spark/pull/46246#issuecomment-2080762133 cc @luhenry @srowen @zhengruifeng @dongjoon-hyun @LuciferYang

Re: [PR] [SPARK-47730][K8S] Support `APP_ID` and `EXECUTOR_ID` placeholders in labels [spark]

2024-04-27 Thread via GitHub
jshmchenxi commented on code in PR #46149: URL: https://github.com/apache/spark/pull/46149#discussion_r1581765866 ## resource-managers/kubernetes/core/src/test/scala/org/apache/spark/deploy/k8s/features/BasicDriverFeatureStepSuite.scala: ## @@ -35,7 +35,9 @@ import org.apache.sp

Re: [PR] [SPARK-47730][K8S] Support `APP_ID` and `EXECUTOR_ID` placeholders in labels [spark]

2024-04-27 Thread via GitHub
jshmchenxi commented on PR #46149: URL: https://github.com/apache/spark/pull/46149#issuecomment-2080412508 It's been a busy week, sorry for the delay. I'll address your comments today, thanks! @dongjoon-hyun