[jira] [Created] (SPARK-35820) Support cast between different DayTimeIntervalType

2021-06-18 Thread angerszhu (Jira)
angerszhu created SPARK-35820: - Summary: Support cast between different DayTimeIntervalType Key: SPARK-35820 URL: https://issues.apache.org/jira/browse/SPARK-35820 Project: Spark Issue Type:

[jira] [Commented] (SPARK-35820) Support cast between different DayTimeIntervalType

2021-06-18 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365843#comment-17365843 ] angerszhu commented on SPARK-35820: --- Working on this > Support cast between different

[jira] [Updated] (SPARK-35819) Support cast between different field YearMonthIntervalType

2021-06-18 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-35819: -- Description: Support cast between YearMonthIntervalType year/year year/month month/month > Support

[jira] [Created] (SPARK-35819) Support cast between different field YearMonthIntervalType

2021-06-18 Thread angerszhu (Jira)
angerszhu created SPARK-35819: - Summary: Support cast between different field YearMonthIntervalType Key: SPARK-35819 URL: https://issues.apache.org/jira/browse/SPARK-35819 Project: Spark Issue

[jira] [Commented] (SPARK-35772) Check all year-month interval types in HiveInspectors tests

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365840#comment-17365840 ] Apache Spark commented on SPARK-35772: -- User 'AngersZh' has created a pull request for this

[jira] [Assigned] (SPARK-35772) Check all year-month interval types in HiveInspectors tests

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35772: Assignee: Apache Spark > Check all year-month interval types in HiveInspectors tests >

[jira] [Assigned] (SPARK-35772) Check all year-month interval types in HiveInspectors tests

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35772: Assignee: (was: Apache Spark) > Check all year-month interval types in

[jira] [Assigned] (SPARK-35817) Queries against wide Avro tables can be slow

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35817: Assignee: Apache Spark > Queries against wide Avro tables can be slow >

[jira] [Assigned] (SPARK-35817) Queries against wide Avro tables can be slow

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35817: Assignee: (was: Apache Spark) > Queries against wide Avro tables can be slow >

[jira] [Commented] (SPARK-35817) Queries against wide Avro tables can be slow

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365835#comment-17365835 ] Apache Spark commented on SPARK-35817: -- User 'bersprockets' has created a pull request for this

[jira] [Commented] (SPARK-35817) Queries against wide Avro tables can be slow

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365834#comment-17365834 ] Apache Spark commented on SPARK-35817: -- User 'bersprockets' has created a pull request for this

[jira] [Resolved] (SPARK-35708) Add BaseTest for DataTypeOps

2021-06-18 Thread Takuya Ueshin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35708?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin resolved SPARK-35708. --- Fix Version/s: 3.2.0 Assignee: Yikun Jiang Resolution: Fixed Issue resolved

[jira] [Commented] (SPARK-35470) Enable disallow_untyped_defs mypy check for pyspark.pandas.base.

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365830#comment-17365830 ] Apache Spark commented on SPARK-35470: -- User 'ueshin' has created a pull request for this issue:

[jira] [Commented] (SPARK-35470) Enable disallow_untyped_defs mypy check for pyspark.pandas.base.

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365829#comment-17365829 ] Apache Spark commented on SPARK-35470: -- User 'ueshin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-35470) Enable disallow_untyped_defs mypy check for pyspark.pandas.base.

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35470: Assignee: (was: Apache Spark) > Enable disallow_untyped_defs mypy check for

[jira] [Assigned] (SPARK-35470) Enable disallow_untyped_defs mypy check for pyspark.pandas.base.

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35470: Assignee: Apache Spark > Enable disallow_untyped_defs mypy check for

[jira] [Updated] (SPARK-35652) Different Behaviour join vs joinWith in self joining

2021-06-18 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-35652: -- Fix Version/s: (was: 3.0.3) 3.0.4 > Different Behaviour join vs

[jira] [Updated] (SPARK-35767) CoalesceExec can execute child plan twice

2021-06-18 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-35767: -- Fix Version/s: (was: 3.0.3) 3.0.4 > CoalesceExec can execute child

[jira] [Updated] (SPARK-35796) UT `handles k8s cluster mode` fails on MacOs >= 10.15

2021-06-18 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-35796: -- Fix Version/s: (was: 3.0.3) 3.0.4 > UT `handles k8s cluster mode`

[jira] [Updated] (SPARK-35796) UT `handles k8s cluster mode` fails on MacOs >= 10.15

2021-06-18 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-35796: -- Issue Type: Bug (was: Improvement) > UT `handles k8s cluster mode` fails on MacOs >= 10.15 >

[jira] [Updated] (SPARK-35796) UT `handles k8s cluster mode` fails on MacOs >= 10.15

2021-06-18 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-35796: -- Affects Version/s: (was: 3.1.3) 3.2.0 3.0.3

[jira] [Updated] (SPARK-35796) UT `handles k8s cluster mode` fails on MacOs >= 10.15

2021-06-18 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-35796: -- Component/s: Spark Core > UT `handles k8s cluster mode` fails on MacOs >= 10.15 >

[jira] [Resolved] (SPARK-35796) UT `handles k8s cluster mode` fails on MacOs >= 10.15

2021-06-18 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-35796. --- Fix Version/s: 3.0.3 3.1.3 3.2.0 Resolution:

[jira] [Assigned] (SPARK-35796) UT `handles k8s cluster mode` fails on MacOs >= 10.15

2021-06-18 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-35796: - Assignee: Yazhi Wang > UT `handles k8s cluster mode` fails on MacOs >= 10.15 >

[jira] [Commented] (SPARK-35593) Support shuffle data recovery on the reused PVCs

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365826#comment-17365826 ] Apache Spark commented on SPARK-35593: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Commented] (SPARK-35593) Support shuffle data recovery on the reused PVCs

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365825#comment-17365825 ] Apache Spark commented on SPARK-35593: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Commented] (SPARK-35810) Remove ps.broadcast API

2021-06-18 Thread Takuya Ueshin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365823#comment-17365823 ] Takuya Ueshin commented on SPARK-35810: --- I think we should deprecate it first and remove later

[jira] [Commented] (SPARK-35808) Always enable the `pandas_metadata` in DataFrame.parquet

2021-06-18 Thread Kevin Su (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365822#comment-17365822 ] Kevin Su commented on SPARK-35808: -- [~ueshin] Thanks for the reply. Got it. > Always enable the

[jira] [Updated] (SPARK-35805) API auditing in Pandas API on Spark

2021-06-18 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35805?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-35805: - Target Version/s: 3.2.0 Priority: Blocker (was: Major) > API auditing in Pandas

[jira] [Updated] (SPARK-35805) API auditing in Pandas API on Spark

2021-06-18 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35805?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-35805: - Summary: API auditing in Pandas API on Spark (was: Pandas API on Spark improvements) > API

[jira] [Resolved] (SPARK-35808) Always enable the `pandas_metadata` in DataFrame.parquet

2021-06-18 Thread Takuya Ueshin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin resolved SPARK-35808. --- Resolution: Won't Do I'd close this. > Always enable the `pandas_metadata` in

[jira] [Resolved] (SPARK-35806) Rename the `mode` argument to avoid confusion with `mode` argument in pandas

2021-06-18 Thread Haejoon Lee (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haejoon Lee resolved SPARK-35806. - Resolution: Invalid Seems like we need more discussion. Let me just close this for now. >

[jira] [Resolved] (SPARK-35807) Rename the `num_files` argument

2021-06-18 Thread Haejoon Lee (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haejoon Lee resolved SPARK-35807. - Resolution: Invalid Seems like we need more discussion. Let me just close this for now. >

[jira] [Commented] (SPARK-35808) Always enable the `pandas_metadata` in DataFrame.parquet

2021-06-18 Thread Takuya Ueshin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365816#comment-17365816 ] Takuya Ueshin commented on SPARK-35808: --- I think we shouldn't do this. The reason we introduced

[jira] [Resolved] (SPARK-35565) Add a config for ignoring metadata directory of file stream sink

2021-06-18 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim resolved SPARK-35565. -- Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 32702

[jira] [Commented] (SPARK-35817) Queries against wide Avro tables can be slow

2021-06-18 Thread Bruce Robbins (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365764#comment-17365764 ] Bruce Robbins commented on SPARK-35817: --- [~xkrogen] Thanks! {quote}I guess we should create a map

[jira] [Comment Edited] (SPARK-35817) Queries against wide Avro tables can be slow

2021-06-18 Thread Erik Krogen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365762#comment-17365762 ] Erik Krogen edited comment on SPARK-35817 at 6/18/21, 10:09 PM: Thanks

[jira] [Commented] (SPARK-35817) Queries against wide Avro tables can be slow

2021-06-18 Thread Erik Krogen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365762#comment-17365762 ] Erik Krogen commented on SPARK-35817: - Thanks for catching this [~bersprockets]! I will be happy to

[jira] [Updated] (SPARK-35818) Upgrade SBT to 1.5.4

2021-06-18 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-35818: -- Parent: SPARK-35781 Issue Type: Sub-task (was: Improvement) > Upgrade SBT to 1.5.4 >

[jira] [Commented] (SPARK-35818) Upgrade SBT to 1.5.4

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35818?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365750#comment-17365750 ] Apache Spark commented on SPARK-35818: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Assigned] (SPARK-35818) Upgrade SBT to 1.5.4

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35818: Assignee: (was: Apache Spark) > Upgrade SBT to 1.5.4 > > >

[jira] [Assigned] (SPARK-35818) Upgrade SBT to 1.5.4

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35818: Assignee: Apache Spark > Upgrade SBT to 1.5.4 > > >

[jira] [Commented] (SPARK-35818) Upgrade SBT to 1.5.4

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35818?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365748#comment-17365748 ] Apache Spark commented on SPARK-35818: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Created] (SPARK-35818) Upgrade SBT to 1.5.4

2021-06-18 Thread Dongjoon Hyun (Jira)
Dongjoon Hyun created SPARK-35818: - Summary: Upgrade SBT to 1.5.4 Key: SPARK-35818 URL: https://issues.apache.org/jira/browse/SPARK-35818 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-35817) Queries against wide Avro tables can be slow

2021-06-18 Thread Bruce Robbins (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365697#comment-17365697 ] Bruce Robbins commented on SPARK-35817: --- The referenced line of code is meant to respect case

[jira] [Created] (SPARK-35817) Queries against wide Avro tables can be slow

2021-06-18 Thread Bruce Robbins (Jira)
Bruce Robbins created SPARK-35817: - Summary: Queries against wide Avro tables can be slow Key: SPARK-35817 URL: https://issues.apache.org/jira/browse/SPARK-35817 Project: Spark Issue Type:

[jira] [Commented] (SPARK-35478) Enable disallow_untyped_defs mypy check for pyspark.pandas.window.

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365696#comment-17365696 ] Apache Spark commented on SPARK-35478: -- User 'ueshin' has created a pull request for this issue:

[jira] [Commented] (SPARK-35478) Enable disallow_untyped_defs mypy check for pyspark.pandas.window.

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365695#comment-17365695 ] Apache Spark commented on SPARK-35478: -- User 'ueshin' has created a pull request for this issue:

[jira] [Resolved] (SPARK-35478) Enable disallow_untyped_defs mypy check for pyspark.pandas.window.

2021-06-18 Thread Takuya Ueshin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin resolved SPARK-35478. --- Fix Version/s: 3.2.0 Assignee: Kevin Su Resolution: Fixed Issue resolved by

[jira] [Resolved] (SPARK-35342) Introduce DecimalOps

2021-06-18 Thread Takuya Ueshin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin resolved SPARK-35342. --- Fix Version/s: 3.2.0 Assignee: Yikun Jiang Resolution: Fixed Issue resolved

[jira] [Updated] (SPARK-35816) Spark read write with multiple Hadoop HA cluster limitation

2021-06-18 Thread Anupam Jain (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anupam Jain updated SPARK-35816: Description: I have two Hadoop HA cluster: h1 and h2. Want to read from h1-HDFS and write to

[jira] [Created] (SPARK-35816) Spark read write with multiple Hadoop HA cluster limitation

2021-06-18 Thread Anupam Jain (Jira)
Anupam Jain created SPARK-35816: --- Summary: Spark read write with multiple Hadoop HA cluster limitation Key: SPARK-35816 URL: https://issues.apache.org/jira/browse/SPARK-35816 Project: Spark

[jira] [Comment Edited] (SPARK-35756) unionByName should support nested struct also

2021-06-18 Thread Wassim Almaaoui (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365579#comment-17365579 ] Wassim Almaaoui edited comment on SPARK-35756 at 6/18/21, 4:26 PM: --- I

[jira] [Commented] (SPARK-35756) unionByName should support nested struct also

2021-06-18 Thread Wassim Almaaoui (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365579#comment-17365579 ] Wassim Almaaoui commented on SPARK-35756: - I was not expecting this to work, we don't have any

[jira] [Commented] (SPARK-35756) unionByName should support nested struct also

2021-06-18 Thread Saurabh Chawla (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365549#comment-17365549 ] Saurabh Chawla commented on SPARK-35756: This will also work for structs if allowMissingColumns is

[jira] [Commented] (SPARK-35815) Allow delayThreshold for watermark to be represented as ANSI day-time/year-month interval literals

2021-06-18 Thread Kousuke Saruta (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365547#comment-17365547 ] Kousuke Saruta commented on SPARK-35815: Waiting for SPARK-35749 and SPARK-35773 to be merged. >

[jira] [Created] (SPARK-35815) Allow delayThreshold for watermark to be represented as ANSI day-time/year-month interval literals

2021-06-18 Thread Kousuke Saruta (Jira)
Kousuke Saruta created SPARK-35815: -- Summary: Allow delayThreshold for watermark to be represented as ANSI day-time/year-month interval literals Key: SPARK-35815 URL:

[jira] [Commented] (SPARK-35811) Deprecate DataFrame.to_spark_io

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365517#comment-17365517 ] Apache Spark commented on SPARK-35811: -- User 'pingsutw' has created a pull request for this issue:

[jira] [Assigned] (SPARK-35811) Deprecate DataFrame.to_spark_io

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35811: Assignee: Apache Spark > Deprecate DataFrame.to_spark_io >

[jira] [Assigned] (SPARK-35811) Deprecate DataFrame.to_spark_io

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35811: Assignee: (was: Apache Spark) > Deprecate DataFrame.to_spark_io >

[jira] [Assigned] (SPARK-34120) Improve the statistics estimation

2021-06-18 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-34120: --- Assignee: Yuming Wang > Improve the statistics estimation >

[jira] [Resolved] (SPARK-34120) Improve the statistics estimation

2021-06-18 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-34120. - Fix Version/s: 3.2.0 Resolution: Fixed > Improve the statistics estimation >

[jira] [Assigned] (SPARK-35185) Improve Distinct statistics estimation

2021-06-18 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-35185: --- Assignee: Yuming Wang > Improve Distinct statistics estimation >

[jira] [Resolved] (SPARK-35185) Improve Distinct statistics estimation

2021-06-18 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-35185. - Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 32291

[jira] [Commented] (SPARK-35808) Always enable the `pandas_metadata` in DataFrame.parquet

2021-06-18 Thread Kevin Su (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365506#comment-17365506 ] Kevin Su commented on SPARK-35808: -- [~itholic] So could we remove argument "pandas_metadata" in 

[jira] [Comment Edited] (SPARK-35700) spark.sql.orc.filterPushdown not working with Spark 3.1.1 for tables with varchar data type

2021-06-18 Thread Saurabh Chawla (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365492#comment-17365492 ] Saurabh Chawla edited comment on SPARK-35700 at 6/18/21, 1:10 PM: --

[jira] [Commented] (SPARK-35795) Cannot resolve column when there is a unrecognized hint in subquery

2021-06-18 Thread HonglunChen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365495#comment-17365495 ] HonglunChen commented on SPARK-35795: - [~saurabhc100] Thanks > Cannot resolve column when there is

[jira] [Commented] (SPARK-35700) spark.sql.orc.filterPushdown not working with Spark 3.1.1 for tables with varchar data type

2021-06-18 Thread Saurabh Chawla (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365492#comment-17365492 ] Saurabh Chawla commented on SPARK-35700: [~arghya18] - I tried to reproduce this scenario, but

[jira] [Comment Edited] (SPARK-35089) non consistent results running count for same dataset after filter and lead window function

2021-06-18 Thread Domagoj (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365468#comment-17365468 ] Domagoj edited comment on SPARK-35089 at 6/18/21, 12:44 PM: [~revans2], tnx

[jira] [Updated] (SPARK-35787) Does anyone has performance issue after upgrade from 3.0 to 3.1?

2021-06-18 Thread Vidmantas Drasutis (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vidmantas Drasutis updated SPARK-35787: --- Description: Hello. We had been using Spark 3.0.2 and the query was executed in ~100

[jira] [Assigned] (SPARK-35608) Support AQE optimizer side transformUpWithPruning

2021-06-18 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang reassigned SPARK-35608: -- Assignee: XiDuo You > Support AQE optimizer side transformUpWithPruning >

[jira] [Resolved] (SPARK-35608) Support AQE optimizer side transformUpWithPruning

2021-06-18 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang resolved SPARK-35608. Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 32742

[jira] [Commented] (SPARK-35795) Cannot resolve column when there is a unrecognized hint in subquery

2021-06-18 Thread Saurabh Chawla (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365471#comment-17365471 ] Saurabh Chawla commented on SPARK-35795: This is fixed as part of SPARK-35673. This

[jira] [Resolved] (SPARK-35469) Enable disallow_untyped_defs mypy check for pyspark.pandas.accessors.

2021-06-18 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-35469. -- Fix Version/s: 3.2.0 Assignee: Takuya Ueshin Resolution: Fixed > Enable

[jira] [Commented] (SPARK-35089) non consistent results running count for same dataset after filter and lead window function

2021-06-18 Thread Domagoj (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365468#comment-17365468 ] Domagoj commented on SPARK-35089: - [~revans2], tnx for detailed explanation. I still have a problem

[jira] [Assigned] (SPARK-35678) add a common softmax function

2021-06-18 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35678?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-35678: Assignee: zhengruifeng > add a common softmax function > - >

[jira] [Assigned] (SPARK-35619) Refactor LinearRegression - make huber support virtual centering

2021-06-18 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-35619: Assignee: zhengruifeng > Refactor LinearRegression - make huber support virtual

[jira] [Assigned] (SPARK-35100) Refactor AFT - support virtual centering

2021-06-18 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-35100: Assignee: zhengruifeng > Refactor AFT - support virtual centering >

[jira] [Assigned] (SPARK-35024) Refactor LinearSVC - support virtual centering

2021-06-18 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-35024: Assignee: zhengruifeng > Refactor LinearSVC - support virtual centering >

[jira] [Updated] (SPARK-35024) Refactor LinearSVC - support virtual centering

2021-06-18 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-35024: - Fix Version/s: 3.2.0 > Refactor LinearSVC - support virtual centering >

[jira] [Updated] (SPARK-35619) Refactor LinearRegression - make huber support virtual centering

2021-06-18 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-35619: - Fix Version/s: 3.2.0 > Refactor LinearRegression - make huber support virtual centering >

[jira] [Updated] (SPARK-35100) Refactor AFT - support virtual centering

2021-06-18 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-35100: - Fix Version/s: 3.2.0 > Refactor AFT - support virtual centering >

[jira] [Updated] (SPARK-35666) add new gemv to skip array shape checking

2021-06-18 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-35666: - Fix Version/s: 3.2.0 > add new gemv to skip array shape checking >

[jira] [Updated] (SPARK-35707) optimize sparse GEMM by skipping bound checking

2021-06-18 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-35707: - Fix Version/s: 3.2.0 > optimize sparse GEMM by skipping bound checking >

[jira] [Commented] (SPARK-35678) add a common softmax function

2021-06-18 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35678?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365454#comment-17365454 ] zhengruifeng commented on SPARK-35678: -- [~hyukjin.kwon] Thanks for reminding me. I just noticed that

[jira] [Created] (SPARK-35814) Mismatched types when creating a new column when using Arrow

2021-06-18 Thread Nic Crane (Jira)
Nic Crane created SPARK-35814: - Summary: Mismatched types when creating a new column when using Arrow Key: SPARK-35814 URL: https://issues.apache.org/jira/browse/SPARK-35814 Project: Spark

[jira] [Commented] (SPARK-35378) Eagerly execute commands in QueryExecution instead of caller sides

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35378?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365415#comment-17365415 ] Apache Spark commented on SPARK-35378: -- User 'beliefer' has created a pull request for this issue:

[jira] [Commented] (SPARK-35378) Eagerly execute commands in QueryExecution instead of caller sides

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35378?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365413#comment-17365413 ] Apache Spark commented on SPARK-35378: -- User 'beliefer' has created a pull request for this issue:

[jira] [Assigned] (SPARK-35769) Truncate java.time.Period by fields of year-month interval type

2021-06-18 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk reassigned SPARK-35769: Assignee: angerszhu > Truncate java.time.Period by fields of year-month interval type >

[jira] [Resolved] (SPARK-35769) Truncate java.time.Period by fields of year-month interval type

2021-06-18 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk resolved SPARK-35769. -- Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 32945

[jira] [Commented] (SPARK-35498) Add an API "inheritable_thread_target" which return a wrapped thread target for pyspark pin thread mode

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365310#comment-17365310 ] Apache Spark commented on SPARK-35498: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-35498) Add an API "inheritable_thread_target" which return a wrapped thread target for pyspark pin thread mode

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365309#comment-17365309 ] Apache Spark commented on SPARK-35498: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-35303) Enable pinned thread mode by default

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365307#comment-17365307 ] Apache Spark commented on SPARK-35303: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-35498) Add an API "inheritable_thread_target" which return a wrapped thread target for pyspark pin thread mode

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365308#comment-17365308 ] Apache Spark commented on SPARK-35498: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-35303) Enable pinned thread mode by default

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365305#comment-17365305 ] Apache Spark commented on SPARK-35303: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-35678) add a common softmax function

2021-06-18 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35678?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365294#comment-17365294 ] Hyukjin Kwon commented on SPARK-35678: -- [~podongfeng] please set the fix version .. > add a common

[jira] [Updated] (SPARK-35678) add a common softmax function

2021-06-18 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35678?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-35678: - Fix Version/s: 3.2.0 > add a common softmax function > - > >

[jira] [Commented] (SPARK-35694) Increase the default JVM stack size of SBT/Maven

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365288#comment-17365288 ] Apache Spark commented on SPARK-35694: -- User 'gengliangwang' has created a pull request for this

[jira] [Commented] (SPARK-35694) Increase the default JVM stack size of SBT/Maven

2021-06-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17365287#comment-17365287 ] Apache Spark commented on SPARK-35694: -- User 'gengliangwang' has created a pull request for this