[jira] [Resolved] (SPARK-36283) Bug when creating dataframe without schema and with Arrow disabled

2021-07-27 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen resolved SPARK-36283. Resolution: Duplicate > Bug when creating dataframe without schema and with Arrow disabled > -

[jira] [Commented] (SPARK-35211) Bug when creating dataframe without schema and with Arrow disabled

2021-07-27 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17387883#comment-17387883 ] Darcy Shen commented on SPARK-35211: Updated > Bug when creating dataframe without

[jira] [Updated] (SPARK-35211) Bug when creating dataframe without schema and with Arrow disabled

2021-07-27 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-35211: --- Summary: Bug when creating dataframe without schema and with Arrow disabled (was: Support UDT for P

[jira] [Updated] (SPARK-35211) Bug when creating dataframe without schema and with Arrow disabled

2021-07-27 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-35211: --- Description: A reproducible small repo can be found here: https://github.com/darcy-shen/spark-36283

[jira] [Updated] (SPARK-36283) Bug when creating dataframe without schema and with Arrow disabled

2021-07-26 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-36283: --- External issue URL: (was: https://issues.apache.org/jira/browse/SPARK-35211) > Bug when creating

[jira] [Updated] (SPARK-36283) Bug when creating dataframe without schema and with Arrow disabled

2021-07-26 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-36283: --- External issue URL: https://issues.apache.org/jira/browse/SPARK-35211 > Bug when creating dataframe

[jira] [Updated] (SPARK-36283) Bug when creating dataframe without schema and with Arrow disabled

2021-07-25 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-36283: --- Description: A reproducible small repo can be found here: https://github.com/darcy-shen/spark-36283

[jira] [Updated] (SPARK-36283) Bug when creating dataframe without schema and with Arrow disabled

2021-07-25 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-36283: --- Description: h2. Case 1: Create PySpark Dataframe using Pandas DataFrame with Arrow disabled and w

[jira] [Updated] (SPARK-36283) Bug when creating dataframe without schema and with Arrow disabled

2021-07-25 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-36283: --- Description: h2. Case 1 Create PySpark Dataframe using Pandas DataFrame with Arrow disabled and w

[jira] [Updated] (SPARK-36283) Bug when creating dataframe without schema and with Arrow disabled

2021-07-25 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-36283: --- Description: h2. Case 1 {code:python} spark = SparkSession.builder.getOrCreate() spark.conf.set("spa

[jira] [Created] (SPARK-36283) Bug when creating dataframe without schema and with Arrow disabled

2021-07-25 Thread Darcy Shen (Jira)
Darcy Shen created SPARK-36283: -- Summary: Bug when creating dataframe without schema and with Arrow disabled Key: SPARK-36283 URL: https://issues.apache.org/jira/browse/SPARK-36283 Project: Spark

[jira] [Updated] (SPARK-35211) Support UDT for Pandas with Arrow Disabled

2021-04-24 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-35211: --- External issue URL: (was: https://github.com/apache/spark/pull/32320) > Support UDT for Pandas wit

[jira] [Updated] (SPARK-35211) Support UDT for Pandas with Arrow Disabled

2021-04-24 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-35211: --- External issue URL: https://github.com/apache/spark/pull/32320 > Support UDT for Pandas with Arrow D

[jira] [Updated] (SPARK-35211) Support UDT for Pandas with Arrow Disabled

2021-04-24 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-35211: --- Labels: correctness (was: ) > Support UDT for Pandas with Arrow Disabled >

[jira] [Commented] (SPARK-35211) Support UDT for Pandas with Arrow Disabled

2021-04-24 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17331175#comment-17331175 ] Darcy Shen commented on SPARK-35211: Provided with schema, strict type check will be

[jira] [Updated] (SPARK-35211) Support UDT for Pandas with Arrow Disabled

2021-04-24 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-35211: --- Description: {code:java} $ pip freeze certifi==2020.12.5 coverage==5.5 flake8==3.9.0 mccabe==0.6.1 m

[jira] [Comment Edited] (SPARK-35211) Support UDT for Pandas with Arrow Disabled

2021-04-24 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17331170#comment-17331170 ] Darcy Shen edited comment on SPARK-35211 at 4/24/21, 8:52 AM:

[jira] [Commented] (SPARK-35211) Support UDT for Pandas with Arrow Disabled

2021-04-24 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17331170#comment-17331170 ] Darcy Shen commented on SPARK-35211: With schema provided, it works fine. {code} (sp

[jira] [Updated] (SPARK-35211) Support UDT for Pandas with Arrow Disabled

2021-04-24 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-35211: --- Description: {code:java} $ pip freeze certifi==2020.12.5 coverage==5.5 flake8==3.9.0 mccabe==0.6.1 m

[jira] [Updated] (SPARK-35211) Support UDT for Pandas with Arrow Disabled

2021-04-24 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-35211: --- Description: {code:java} $ pip freeze certifi==2020.12.5 coverage==5.5 flake8==3.9.0 mccabe==0.6.1 m

[jira] [Updated] (SPARK-35211) Support UDT for Pandas with Arrow Disabled

2021-04-24 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-35211: --- Description: {code:java} pip freeze certifi==2020.12.5 coverage==5.5 flake8==3.9.0 mccabe==0.6.1 my

[jira] [Updated] (SPARK-35211) Support UDT for Pandas with Arrow Disabled

2021-04-24 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-35211: --- Parent: SPARK-34600 Issue Type: Sub-task (was: Bug) > Support UDT for Pandas with Arrow Dis

[jira] [Created] (SPARK-35211) Support UDT for Pandas with Arrow Disabled

2021-04-24 Thread Darcy Shen (Jira)
Darcy Shen created SPARK-35211: -- Summary: Support UDT for Pandas with Arrow Disabled Key: SPARK-35211 URL: https://issues.apache.org/jira/browse/SPARK-35211 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-34799) Return UDT from Pandas UDF: @pandas_udf(UDT)

2021-03-31 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34799: --- Summary: Return UDT from Pandas UDF: @pandas_udf(UDT) (was: Return User-defined types from Pandas U

[jira] [Updated] (SPARK-34799) Return UDT from Pandas UDF: @pandas_udf(UDT)

2021-03-31 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34799: --- Description: Focus on a simpler case first: `@pandas_udf(UDT)`. Because Pandas UDF uses pyarrow

[jira] [Updated] (SPARK-34771) Support UDT for Pandas/Spark convertion with Arrow support Enabled

2021-03-31 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34771: --- Summary: Support UDT for Pandas/Spark convertion with Arrow support Enabled (was: Support UDT for P

[jira] [Updated] (SPARK-34771) Support UDT for Pandas/Spark conversion with Arrow support Enabled

2021-03-31 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34771: --- Summary: Support UDT for Pandas/Spark conversion with Arrow support Enabled (was: Support UDT for P

[jira] [Updated] (SPARK-34771) Support UDT for Pandas/Spark convertion with Arrow Enabled

2021-03-31 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34771: --- Summary: Support UDT for Pandas/Spark convertion with Arrow Enabled (was: Support UDT for Pandas/Sp

[jira] [Updated] (SPARK-34771) Support UDT for Pandas/Spark convertion with Arrow Optimization

2021-03-31 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34771: --- Summary: Support UDT for Pandas/Spark convertion with Arrow Optimization (was: Support UDT for Pand

[jira] [Updated] (SPARK-34835) Support TimestampType in UDT

2021-03-23 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34835: --- Description: For user defined type with TimestampType, conversion from pandas to pyarrow and from p

[jira] [Updated] (SPARK-34835) Support TimestampType in UDT

2021-03-23 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34835: --- Description: For user defined type with TimestampType, conversion from pandas to pyarrow and from p

[jira] [Updated] (SPARK-34835) Support TimestampType in UDT

2021-03-23 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34835: --- Description: For user defined type with TimestampType, conversion from pandas to pyarrow and from p

[jira] [Updated] (SPARK-34835) Support TimestampType in UDT

2021-03-23 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34835: --- Description: For user defined type with TimestampType, conversion from pandas to pyarrow and from p

[jira] [Updated] (SPARK-34835) Support TimestampType in UDT

2021-03-23 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34835: --- Description: For user defined type with TimestampType, conversion from pandas to pyarrow and from p

[jira] [Updated] (SPARK-34835) Support TimestampType in UDT

2021-03-23 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34835: --- Description: For user defined type with TimestampType, conversion from pandas to pyarrow and from p

[jira] [Updated] (SPARK-34835) Support TimestampType in UDT

2021-03-23 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34835: --- Description: For user defined type with TimestampType, conversion from pandas to pyarrow and from py

[jira] [Created] (SPARK-34835) Support TimestampType in UDT

2021-03-23 Thread Darcy Shen (Jira)
Darcy Shen created SPARK-34835: -- Summary: Support TimestampType in UDT Key: SPARK-34835 URL: https://issues.apache.org/jira/browse/SPARK-34835 Project: Spark Issue Type: Sub-task Compo

[jira] [Comment Edited] (SPARK-34600) Support user defined types in Pandas UDF

2021-03-18 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17304606#comment-17304606 ] Darcy Shen edited comment on SPARK-34600 at 3/19/21, 3:14 AM:

[jira] [Commented] (SPARK-34600) Support user defined types in Pandas UDF

2021-03-18 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17304606#comment-17304606 ] Darcy Shen commented on SPARK-34600: [~hyukjin.kwon][~eddyxu] Let me move the PR to

[jira] [Updated] (SPARK-34600) Support user defined types in Pandas UDF

2021-03-18 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34600: --- Description: This is an umbrella ticket. (was: Because Pandas UDF uses pyarrow to passing data, it

[jira] [Updated] (SPARK-34799) Return User-defined types from Pandas UDF

2021-03-18 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34799: --- Description: Because Pandas UDF uses pyarrow to passing data, it does not currently support UserDef

[jira] [Created] (SPARK-34799) Return User-defined types from Pandas UDF

2021-03-18 Thread Darcy Shen (Jira)
Darcy Shen created SPARK-34799: -- Summary: Return User-defined types from Pandas UDF Key: SPARK-34799 URL: https://issues.apache.org/jira/browse/SPARK-34799 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-34771) Support UDT for Pandas with Arrow Optimization

2021-03-18 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34771: --- Description: {code:python} spark.conf.set("spark.sql.execution.arrow.enabled", "true") from pyspark.

[jira] [Updated] (SPARK-34771) Support UDT for Pandas with Arrow Optimization

2021-03-18 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34771: --- Description: {code:python} spark.conf.set("spark.sql.execution.arrow.enabled", "true") from pyspark.

[jira] [Updated] (SPARK-34771) Support UDT for Pandas with Arrow Optimization

2021-03-18 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34771: --- Description: {code:python} spark.conf.set("spark.sql.execution.arrow.enabled", "true") from pyspark.

[jira] [Updated] (SPARK-34771) Support UDT for Pandas with Arrow Optimization

2021-03-17 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34771: --- Description: {code:python} spark.conf.set("spark.sql.execution.arrow.enabled", "true") from pyspark.

[jira] [Updated] (SPARK-34771) Support UDT for Pandas with Arrow Optimization

2021-03-17 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34771: --- Description: {code:python} $ bin/pyspark Python 3.8.8 (default, Feb 24 2021, 13:46:16) [Clang 10.0.0

[jira] [Updated] (SPARK-34771) Support UDT for Pandas

2021-03-17 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34771: --- Description: {code:python} $ bin/pyspark Python 3.8.8 (default, Feb 24 2021, 13:46:16) [Clang 10.

[jira] [Updated] (SPARK-34771) Support UDT for Pandas with Arrow Optimization

2021-03-17 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34771: --- Summary: Support UDT for Pandas with Arrow Optimization (was: Support UDT for Pandas) > Support UD

[jira] [Updated] (SPARK-34771) Support UDT for Pandas

2021-03-17 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34771: --- Description: {code:python} $ bin/pyspark Python 3.8.8 (default, Feb 24 2021, 13:46:16) [Clang 10.0.

[jira] [Updated] (SPARK-34771) Support UDT for Pandas

2021-03-17 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34771: --- Description: (spark) ➜ spark git:(SPARK_34771) ✗ bin/pyspark Python 3.8.8 (default, Feb 24 2021, 13

[jira] [Updated] (SPARK-34771) Support UDT for Pandas

2021-03-17 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34771: --- Description: (spark) ➜ spark git:(SPARK_34771) ✗ bin/pyspark Python 3.8.8 (default, Feb 24 2021, 13

[jira] [Updated] (SPARK-34771) Support UDT for Pandas

2021-03-17 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-34771: --- Description: (spark) ➜ spark git:(SPARK_34771) ✗ bin/pyspark Python 3.8.8 (default, Feb 24 2021, 13

[jira] [Created] (SPARK-34771) Support UDT for Pandas

2021-03-16 Thread Darcy Shen (Jira)
Darcy Shen created SPARK-34771: -- Summary: Support UDT for Pandas Key: SPARK-34771 URL: https://issues.apache.org/jira/browse/SPARK-34771 Project: Spark Issue Type: Sub-task Components:

[jira] [Updated] (SPARK-33894) Word2VecSuite failed for Scala 2.13

2020-12-23 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-33894: --- Description: This may be the first failed build: https://amplab.cs.berkeley.edu/jenkins/job/spark-ma

[jira] [Updated] (SPARK-33894) Word2VecSuite failed for Scala 2.13

2020-12-23 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-33894: --- Description: This may be the first failed build: https://amplab.cs.berkeley.edu/jenkins/job/spark-ma

[jira] [Updated] (SPARK-33894) Word2VecSuite failed for Scala 2.13

2020-12-23 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-33894: --- Description: This may be the first failed build: https://amplab.cs.berkeley.edu/jenkins/job/spark-ma

[jira] [Updated] (SPARK-33894) Word2VecSuite failed for Scala 2.13

2020-12-23 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-33894: --- Description: This may be the first failed build: https://amplab.cs.berkeley.edu/jenkins/job/spark-ma

[jira] [Updated] (SPARK-33894) Word2VecSuite failed for Scala 2.13

2020-12-23 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-33894: --- Description: This may be the first failed build: https://amplab.cs.berkeley.edu/jenkins/job/spark-ma

[jira] [Updated] (SPARK-33894) Word2VecSuite failed for Scala 2.13

2020-12-23 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-33894: --- Description: This may be the first failed build: https://amplab.cs.berkeley.edu/jenkins/job/spark-ma

[jira] [Updated] (SPARK-33894) Word2VecSuite failed for Scala 2.13

2020-12-23 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-33894: --- Description: This may be the first failed build: https://amplab.cs.berkeley.edu/jenkins/job/spark-ma

[jira] [Updated] (SPARK-33894) Word2VecSuite failed for Scala 2.13

2020-12-23 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-33894: --- Description: This may be the first failed build: https://amplab.cs.berkeley.edu/jenkins/job/spark-ma

[jira] [Updated] (SPARK-33894) Word2VecSuite failed for Scala 2.13

2020-12-23 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-33894: --- Description: This may be the first failed build: https://amplab.cs.berkeley.edu/jenkins/job/spark-ma

[jira] [Updated] (SPARK-33894) Word2VecSuite failed for Scala 2.13

2020-12-23 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-33894: --- Parent: SPARK-25075 Issue Type: Sub-task (was: Test) > Word2VecSuite failed for Scala 2.13

[jira] [Created] (SPARK-33894) Word2VecSuite failed for Scala 2.13

2020-12-23 Thread Darcy Shen (Jira)
Darcy Shen created SPARK-33894: -- Summary: Word2VecSuite failed for Scala 2.13 Key: SPARK-33894 URL: https://issues.apache.org/jira/browse/SPARK-33894 Project: Spark Issue Type: Test Co

[jira] [Commented] (SPARK-32526) Let sql/catalyst module tests pass for Scala 2.13

2020-12-11 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17248259#comment-17248259 ] Darcy Shen commented on SPARK-32526: ExpressionEncoderSuite still fails to work: ht

[jira] [Commented] (SPARK-33044) Add a Jenkins build and test job for Scala 2.13

2020-12-10 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17247693#comment-17247693 ] Darcy Shen commented on SPARK-33044: https://amplab.cs.berkeley.edu/jenkins/job/spar

[jira] [Commented] (SPARK-33044) Add a Jenkins build and test job for Scala 2.13

2020-12-10 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17247687#comment-17247687 ] Darcy Shen commented on SPARK-33044: Awesome! > Add a Jenkins build and test job fo

[jira] [Commented] (SPARK-33348) Use scala.jdk.CollectionConverters replace scala.collection.JavaConverters

2020-11-19 Thread Darcy Shen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17235462#comment-17235462 ] Darcy Shen commented on SPARK-33348: Would better keep it until Apache Spark switche

[jira] [Updated] (SPARK-21708) use sbt 1.x

2019-03-25 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21708?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-21708: --- Affects Version/s: 2.4.0 > use sbt 1.x > --- > > Key: SPARK-21708 >

[jira] [Updated] (SPARK-27160) Incorrect Literal Casting of DecimalType in OrcFilters

2019-03-15 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-27160: --- Description: DecimalType Literal should not be cast to Long. E.g., for `df.filter("x < 3.14")`, ass

[jira] [Updated] (SPARK-27160) Incorrect Literal Casting of DecimalType in OrcFilters

2019-03-15 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-27160: --- Priority: Blocker (was: Major) > Incorrect Literal Casting of DecimalType in OrcFilters > -

[jira] [Updated] (SPARK-27160) Incorrect Literal Casting of DecimalType in OrcFilters

2019-03-15 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-27160: --- Target Version/s: (was: 2.4.1) > Incorrect Literal Casting of DecimalType in OrcFilters >

[jira] [Updated] (SPARK-27160) Incorrect Literal Casting of DecimalType in OrcFilters

2019-03-15 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-27160: --- Fix Version/s: 3.0.0 > Incorrect Literal Casting of DecimalType in OrcFilters >

[jira] [Updated] (SPARK-27160) Incorrect Literal Casting of DecimalType in OrcFilters

2019-03-15 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-27160: --- Fix Version/s: 2.4.1 > Incorrect Literal Casting of DecimalType in OrcFilters >

[jira] [Updated] (SPARK-27160) Incorrect Literal Casting of DecimalType in OrcFilters

2019-03-15 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-27160: --- Labels: correctness (was: ) > Incorrect Literal Casting of DecimalType in OrcFilters >

[jira] [Updated] (SPARK-27160) Incorrect Literal Casting of DecimalType in OrcFilters

2019-03-15 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-27160: --- Target Version/s: 2.4.1 > Incorrect Literal Casting of DecimalType in OrcFilters > -

[jira] [Created] (SPARK-27160) Incorrect Literal Casting of DecimalType in OrcFilters

2019-03-14 Thread Darcy Shen (JIRA)
Darcy Shen created SPARK-27160: -- Summary: Incorrect Literal Casting of DecimalType in OrcFilters Key: SPARK-27160 URL: https://issues.apache.org/jira/browse/SPARK-27160 Project: Spark Issue Type

[jira] [Updated] (SPARK-26885) Remove yyyy/yyyy-[d]d format in DataTimeUtils for stringToTimestamp and stringToDate

2019-02-15 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-26885: --- Summary: Remove yyyy/yyyy-[d]d format in DataTimeUtils for stringToTimestamp and stringToDate (was:

[jira] [Created] (SPARK-26885) Remove yyyy format in DataTimeUtils for stringToTimestamp and stringToDate

2019-02-14 Thread Darcy Shen (JIRA)
Darcy Shen created SPARK-26885: -- Summary: Remove yyyy format in DataTimeUtils for stringToTimestamp and stringToDate Key: SPARK-26885 URL: https://issues.apache.org/jira/browse/SPARK-26885 Project: Spark

[jira] [Commented] (SPARK-21708) use sbt 1.0.0

2019-01-22 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16748705#comment-16748705 ] Darcy Shen commented on SPARK-21708: I tried to use sbt 1.x, but failed on the sbt e

[jira] [Commented] (SPARK-26132) Remove support for Scala 2.11 in Spark 3.0.0

2018-12-12 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16718981#comment-16718981 ] Darcy Shen commented on SPARK-26132: [~srowen] I'm adding subtasks for https://issu

[jira] [Updated] (SPARK-26338) Use scala-xml explicitly

2018-12-11 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-26338: --- Issue Type: Sub-task (was: Improvement) Parent: SPARK-25075 > Use scala-xml explicitly > --

[jira] [Updated] (SPARK-25075) Build and test Spark against Scala 2.13

2018-12-11 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-25075: --- Issue Type: Umbrella (was: Bug) > Build and test Spark against Scala 2.13 > ---

[jira] [Updated] (SPARK-25075) Build and test Spark against Scala 2.13

2018-12-11 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-25075: --- Issue Type: Bug (was: Sub-task) Parent: (was: SPARK-26338) > Build and test Spark again

[jira] [Updated] (SPARK-25075) Build and test Spark against Scala 2.13

2018-12-11 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-25075: --- Issue Type: Sub-task (was: Umbrella) Parent: SPARK-26338 > Build and test Spark against Sca

[jira] [Updated] (SPARK-25075) Build and test Spark against Scala 2.13

2018-12-11 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-25075: --- Affects Version/s: (was: 2.1.0) 2.4.0 > Build and test Spark against Scal

[jira] [Created] (SPARK-26338) Use scala-xml explicitly

2018-12-11 Thread Darcy Shen (JIRA)
Darcy Shen created SPARK-26338: -- Summary: Use scala-xml explicitly Key: SPARK-26338 URL: https://issues.apache.org/jira/browse/SPARK-26338 Project: Spark Issue Type: Improvement Compon

[jira] [Commented] (SPARK-25075) Build and test Spark against Scala 2.13

2018-12-10 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16716302#comment-16716302 ] Darcy Shen commented on SPARK-25075: I maintained a list of scala libraries which sp

[jira] [Updated] (SPARK-26321) Split a SQL in a correct way

2018-12-10 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-26321: --- Description: First: ./build/mvn -Phive-thriftserver -DskipTests package Then: $ bin/spark-sql

[jira] [Commented] (SPARK-23974) Do not allocate more containers as expected in dynamic allocation

2018-12-10 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16714611#comment-16714611 ] Darcy Shen commented on SPARK-23974: Well, we are upgrading from Spark 2.1.1 to Spar

[jira] [Created] (SPARK-26321) Split a SQL in a correct way

2018-12-10 Thread Darcy Shen (JIRA)
Darcy Shen created SPARK-26321: -- Summary: Split a SQL in a correct way Key: SPARK-26321 URL: https://issues.apache.org/jira/browse/SPARK-26321 Project: Spark Issue Type: Improvement Co

[jira] [Created] (SPARK-26319) Add appendReadColumns Unit Test for HiveShimSuite

2018-12-10 Thread Darcy Shen (JIRA)
Darcy Shen created SPARK-26319: -- Summary: Add appendReadColumns Unit Test for HiveShimSuite Key: SPARK-26319 URL: https://issues.apache.org/jira/browse/SPARK-26319 Project: Spark Issue Type: Imp

[jira] [Commented] (SPARK-21708) use sbt 1.0.0

2018-12-09 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16714101#comment-16714101 ] Darcy Shen commented on SPARK-21708: Please make the Priority higher. sbt 1.x is be

[jira] [Commented] (SPARK-14220) Build and test Spark against Scala 2.12

2018-09-06 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16605457#comment-16605457 ] Darcy Shen commented on SPARK-14220: Congrats! First green one: https://ampl

[jira] [Commented] (SPARK-25298) spark-tools build failure for Scala 2.12

2018-09-01 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16599651#comment-16599651 ] Darcy Shen commented on SPARK-25298: sbt -Dscala-2.12 -Dscala.version=2.12.6 Thi

[jira] [Created] (SPARK-25304) enable HiveSparkSubmitSuite SPARK-8489 test for Scala 2.12

2018-09-01 Thread Darcy Shen (JIRA)
Darcy Shen created SPARK-25304: -- Summary: enable HiveSparkSubmitSuite SPARK-8489 test for Scala 2.12 Key: SPARK-25304 URL: https://issues.apache.org/jira/browse/SPARK-25304 Project: Spark Issue

[jira] [Commented] (SPARK-25297) Future for Scala 2.12 will block on a already shutdown ExecutionContext

2018-09-01 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16599580#comment-16599580 ] Darcy Shen commented on SPARK-25297: This issue has been fixed by https://github.com

[jira] [Created] (SPARK-25298) spark-tools build failure for Scala 2.12

2018-08-31 Thread Darcy Shen (JIRA)
Darcy Shen created SPARK-25298: -- Summary: spark-tools build failure for Scala 2.12 Key: SPARK-25298 URL: https://issues.apache.org/jira/browse/SPARK-25298 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-25297) Future for Scala 2.12 will block on a already shutdown ExecutionContext

2018-08-31 Thread Darcy Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darcy Shen updated SPARK-25297: --- Description: *+see [https://amplab.cs.berkeley.edu/jenkins/view/Spark%20QA%20Test/job/spark-master-
