date:20180110

[jira] [Created] (SPARK-23039) Fix the bug in alter table set location.

2018-01-10 Thread xubo245 (JIRA)

xubo245 created SPARK-23039: --- Summary: Fix the bug in alter table set location. Key: SPARK-23039 URL: https://issues.apache.org/jira/browse/SPARK-23039 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-21179) Unable to return Hive INT data type into Spark via Hive JDBC driver: Caused by: java.sql.SQLDataException: [Simba][JDBC](10140) Error converting value to int.

2018-01-10 Thread Abhishek Soni (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-21179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16321818#comment-16321818 ] Abhishek Soni commented on SPARK-21179: --- Thanks [~mwalton_mstr], I was only overriding

[jira] [Assigned] (SPARK-23038) Update docker/spark-test (JDK/OS)

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23038: Assignee: Apache Spark > Update docker/spark-test (JDK/OS) >

[jira] [Commented] (SPARK-23038) Update docker/spark-test (JDK/OS)

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16321812#comment-16321812 ] Apache Spark commented on SPARK-23038: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Assigned] (SPARK-23038) Update docker/spark-test (JDK/OS)

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23038: Assignee: (was: Apache Spark) > Update docker/spark-test (JDK/OS) >

[jira] [Updated] (SPARK-23038) Update docker/spark-test (JDK/OS)

2018-01-10 Thread Dongjoon Hyun (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-23038: -- Summary: Update docker/spark-test (JDK/OS) (was: Update docker/spark-test) > Update

[jira] [Updated] (SPARK-23038) Update docker/spark-test

2018-01-10 Thread Dongjoon Hyun (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-23038: -- Description: This issue aims to update the followings in `docker/spark-test`. - JDK7 -> JDK8:

[jira] [Updated] (SPARK-23038) Update docker/spark-test

2018-01-10 Thread Dongjoon Hyun (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-23038: -- Summary: Update docker/spark-test (was: Update docker/spark-test to use JDK8) > Update

[jira] [Created] (SPARK-23038) Update docker/spark-test to use JDK8

2018-01-10 Thread Dongjoon Hyun (JIRA)

Dongjoon Hyun created SPARK-23038: - Summary: Update docker/spark-test to use JDK8 Key: SPARK-23038 URL: https://issues.apache.org/jira/browse/SPARK-23038 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-23037) RFormula should not use deprecated OneHotEncoder and should include VectorSizeHint in pipeline

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23037: Assignee: Apache Spark > RFormula should not use deprecated OneHotEncoder and should

[jira] [Assigned] (SPARK-23037) RFormula should not use deprecated OneHotEncoder and should include VectorSizeHint in pipeline

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23037: Assignee: (was: Apache Spark) > RFormula should not use deprecated OneHotEncoder and

[jira] [Commented] (SPARK-23037) RFormula should not use deprecated OneHotEncoder and should include VectorSizeHint in pipeline

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16321740#comment-16321740 ] Apache Spark commented on SPARK-23037: -- User 'MrBago' has created a pull request for this issue:

[jira] [Created] (SPARK-23037) RFormula should not use deprecated OneHotEncoder and should include VectorSizeHint in pipeline

2018-01-10 Thread Bago Amirbekian (JIRA)

Bago Amirbekian created SPARK-23037: --- Summary: RFormula should not use deprecated OneHotEncoder and should include VectorSizeHint in pipeline Key: SPARK-23037 URL:

[jira] [Commented] (SPARK-22921) Merge script should prompt for assigning jiras

2018-01-10 Thread Saisai Shao (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16321669#comment-16321669 ] Saisai Shao commented on SPARK-22921: - Hi [~irashid], looks like the changes will throw an exception

[jira] [Resolved] (SPARK-22587) Spark job fails if fs.defaultFS and application jar are different url

2018-01-10 Thread Saisai Shao (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao resolved SPARK-22587. - Resolution: Fixed > Spark job fails if fs.defaultFS and application jar are different url >

[jira] [Updated] (SPARK-22587) Spark job fails if fs.defaultFS and application jar are different url

2018-01-10 Thread Saisai Shao (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao updated SPARK-22587: Fix Version/s: 2.3.0 > Spark job fails if fs.defaultFS and application jar are different url >

[jira] [Assigned] (SPARK-22587) Spark job fails if fs.defaultFS and application jar are different url

2018-01-10 Thread Saisai Shao (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao reassigned SPARK-22587: --- Assignee: Mingjie Tang > Spark job fails if fs.defaultFS and application jar are different

[jira] [Commented] (SPARK-23027) optimizer a simple query using a non-existent data is too slow

2018-01-10 Thread wangminfeng (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16321645#comment-16321645 ] wangminfeng commented on SPARK-23027: - i read the doc you gived, it looks like useful，i will try

[jira] [Assigned] (SPARK-23036) Add withGlobalTempView for testing and correct some improper with view related method usage

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23036: Assignee: Apache Spark > Add withGlobalTempView for testing and correct some improper

[jira] [Assigned] (SPARK-23036) Add withGlobalTempView for testing and correct some improper with view related method usage

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23036: Assignee: (was: Apache Spark) > Add withGlobalTempView for testing and correct some

[jira] [Commented] (SPARK-23036) Add withGlobalTempView for testing and correct some improper with view related method usage

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16321638#comment-16321638 ] Apache Spark commented on SPARK-23036: -- User 'xubo245' has created a pull request for this issue:

[jira] [Updated] (SPARK-23036) Add withGlobalTempView for testing and correct some improper with view related method usage

2018-01-10 Thread xubo245 (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xubo245 updated SPARK-23036: Summary: Add withGlobalTempView for testing and correct some improper with view related method usage

[jira] [Created] (SPARK-23036) Add withGlobalTempView for testing

2018-01-10 Thread xubo245 (JIRA)

xubo245 created SPARK-23036: --- Summary: Add withGlobalTempView for testing Key: SPARK-23036 URL: https://issues.apache.org/jira/browse/SPARK-23036 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-23024) Spark ui about the contents of the form need to have hidden and show features, when the table records very much.

2018-01-10 Thread guoxiaolongzte (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] guoxiaolongzte updated SPARK-23024: --- Attachment: 1.png 2.png > Spark ui about the contents of the form need to

[jira] [Assigned] (SPARK-23035) Fix warning: TEMPORARY TABLE ... USING ... is deprecated and use TempViewAlreadyExistsException when create temp view

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23035: Assignee: Apache Spark > Fix warning: TEMPORARY TABLE ... USING ... is deprecated and

[jira] [Commented] (SPARK-23035) Fix warning: TEMPORARY TABLE ... USING ... is deprecated and use TempViewAlreadyExistsException when create temp view

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16321553#comment-16321553 ] Apache Spark commented on SPARK-23035: -- User 'xubo245' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23035) Fix warning: TEMPORARY TABLE ... USING ... is deprecated and use TempViewAlreadyExistsException when create temp view

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23035: Assignee: (was: Apache Spark) > Fix warning: TEMPORARY TABLE ... USING ... is

[jira] [Created] (SPARK-23035) Fix warning: TEMPORARY TABLE ... USING ... is deprecated and use TempViewAlreadyExistsException when create temp view

2018-01-10 Thread xubo245 (JIRA)

xubo245 created SPARK-23035: --- Summary: Fix warning: TEMPORARY TABLE ... USING ... is deprecated and use TempViewAlreadyExistsException when create temp view Key: SPARK-23035 URL:

[jira] [Commented] (SPARK-23034) Display tablename for `HiveTableScan` node in UI

2018-01-10 Thread Tejas Patil (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16321529#comment-16321529 ] Tejas Patil commented on SPARK-23034: - [~dongjoon] recommended that the scope of this JIRA could be

[jira] [Commented] (SPARK-23027) optimizer a simple query using a non-existent data is too slow

2018-01-10 Thread Takeshi Yamamuro (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16321487#comment-16321487 ] Takeshi Yamamuro commented on SPARK-23027: -- You tried v2.1?

[jira] [Assigned] (SPARK-23034) Display tablename for `HiveTableScan` node in UI

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23034: Assignee: (was: Apache Spark) > Display tablename for `HiveTableScan` node in UI >

[jira] [Assigned] (SPARK-23034) Display tablename for `HiveTableScan` node in UI

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23034: Assignee: Apache Spark > Display tablename for `HiveTableScan` node in UI >

[jira] [Commented] (SPARK-23034) Display tablename for `HiveTableScan` node in UI

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16321419#comment-16321419 ] Apache Spark commented on SPARK-23034: -- User 'tejasapatil' has created a pull request for this

[jira] [Created] (SPARK-23034) Display tablename for `HiveTableScan` node in UI

2018-01-10 Thread Tejas Patil (JIRA)

Tejas Patil created SPARK-23034: --- Summary: Display tablename for `HiveTableScan` node in UI Key: SPARK-23034 URL: https://issues.apache.org/jira/browse/SPARK-23034 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-23033) disable task-level retry for continuous execution

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23033: Assignee: Apache Spark > disable task-level retry for continuous execution >

[jira] [Updated] (SPARK-23032) Add a per-query codegenStageId to WholeStageCodegenExec

2018-01-10 Thread Kris Mok (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kris Mok updated SPARK-23032: - Description: Proposing to add a per-query ID to the codegen stages as represented by

[jira] [Assigned] (SPARK-23033) disable task-level retry for continuous execution

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23033: Assignee: (was: Apache Spark) > disable task-level retry for continuous execution >

[jira] [Commented] (SPARK-23033) disable task-level retry for continuous execution

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16321404#comment-16321404 ] Apache Spark commented on SPARK-23033: -- User 'jose-torres' has created a pull request for this

[jira] [Created] (SPARK-23033) disable task-level retry for continuous execution

2018-01-10 Thread Jose Torres (JIRA)

Jose Torres created SPARK-23033: --- Summary: disable task-level retry for continuous execution Key: SPARK-23033 URL: https://issues.apache.org/jira/browse/SPARK-23033 Project: Spark Issue Type:

[jira] [Commented] (SPARK-23032) Add a per-query codegenStageId to WholeStageCodegenExec

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16321398#comment-16321398 ] Apache Spark commented on SPARK-23032: -- User 'rednaxelafx' has created a pull request for this

[jira] [Assigned] (SPARK-23032) Add a per-query codegenStageId to WholeStageCodegenExec

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23032: Assignee: (was: Apache Spark) > Add a per-query codegenStageId to

[jira] [Assigned] (SPARK-23032) Add a per-query codegenStageId to WholeStageCodegenExec

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23032: Assignee: Apache Spark > Add a per-query codegenStageId to WholeStageCodegenExec >

[jira] [Created] (SPARK-23032) Add a per-query codegenStageId to WholeStageCodegenExec

2018-01-10 Thread Kris Mok (JIRA)

Kris Mok created SPARK-23032: Summary: Add a per-query codegenStageId to WholeStageCodegenExec Key: SPARK-23032 URL: https://issues.apache.org/jira/browse/SPARK-23032 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-22989) sparkstreaming ui show 0 records when spark-streaming-kafka application restore from checkpoint

2018-01-10 Thread Shixiong Zhu (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-22989. -- Resolution: Duplicate > sparkstreaming ui show 0 records when spark-streaming-kafka

[jira] [Updated] (SPARK-22991) High read latency with spark streaming 2.2.1 and kafka 0.10.0.1

2018-01-10 Thread Shixiong Zhu (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-22991: - Component/s: (was: Structured Streaming) (was: Spark Core)

[jira] [Updated] (SPARK-22975) MetricsReporter producing NullPointerException when there was no progress reported

2018-01-10 Thread Shixiong Zhu (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-22975: - Component/s: (was: SQL) Structured Streaming > MetricsReporter producing

[jira] [Resolved] (SPARK-22951) count() after dropDuplicates() on emptyDataFrame returns incorrect value

2018-01-10 Thread Cheng Lian (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-22951. Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 20174

[jira] [Comment Edited] (SPARK-17998) Reading Parquet files coalesces parts into too few in-memory partitions

2018-01-10 Thread Fernando Pereira (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-17998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16321234#comment-16321234 ] Fernando Pereira edited comment on SPARK-17998 at 1/10/18 10:20 PM:

[jira] [Commented] (SPARK-17998) Reading Parquet files coalesces parts into too few in-memory partitions

2018-01-10 Thread Fernando Pereira (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-17998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16321234#comment-16321234 ] Fernando Pereira commented on SPARK-17998: -- It says spark.sql.files.maxPartitionBytes in this

[jira] [Created] (SPARK-23031) Merge script should allow arbitrary assignees

2018-01-10 Thread Marcelo Vanzin (JIRA)

Marcelo Vanzin created SPARK-23031: -- Summary: Merge script should allow arbitrary assignees Key: SPARK-23031 URL: https://issues.apache.org/jira/browse/SPARK-23031 Project: Spark Issue

[jira] [Assigned] (SPARK-23020) Flaky Test: org.apache.spark.launcher.SparkLauncherSuite.testInProcessLauncher

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23020: Assignee: Apache Spark > Flaky Test:

[jira] [Commented] (SPARK-23020) Flaky Test: org.apache.spark.launcher.SparkLauncherSuite.testInProcessLauncher

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16321087#comment-16321087 ] Apache Spark commented on SPARK-23020: -- User 'vanzin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23020) Flaky Test: org.apache.spark.launcher.SparkLauncherSuite.testInProcessLauncher

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23020: Assignee: (was: Apache Spark) > Flaky Test:

[jira] [Commented] (SPARK-23030) Decrease memory consumption with toPandas() collection using Arrow

2018-01-10 Thread Bryan Cutler (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320942#comment-16320942 ] Bryan Cutler commented on SPARK-23030: -- I'm looking into this, will submit a WIP PR if I see an

[jira] [Created] (SPARK-23030) Decrease memory consumption with toPandas() collection using Arrow

2018-01-10 Thread Bryan Cutler (JIRA)

Bryan Cutler created SPARK-23030: Summary: Decrease memory consumption with toPandas() collection using Arrow Key: SPARK-23030 URL: https://issues.apache.org/jira/browse/SPARK-23030 Project: Spark

[jira] [Updated] (SPARK-21187) Complete support for remaining Spark data types in Arrow Converters

2018-01-10 Thread Bryan Cutler (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-21187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-21187: - Description: This is to track adding the remaining type support in Arrow Converters.

[jira] [Updated] (SPARK-16060) Vectorized Orc reader

2018-01-10 Thread Dongjoon Hyun (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-16060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-16060: -- Affects Version/s: 1.6.3 2.0.2 2.1.2

[jira] [Assigned] (SPARK-22951) count() after dropDuplicates() on emptyDataFrame returns incorrect value

2018-01-10 Thread Cheng Lian (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian reassigned SPARK-22951: -- Assignee: Feng Liu > count() after dropDuplicates() on emptyDataFrame returns incorrect value

[jira] [Created] (SPARK-23029) Setting spark.shuffle.file.buffer will make the shuffle fail

2018-01-10 Thread Fernando Pereira (JIRA)

Fernando Pereira created SPARK-23029: Summary: Setting spark.shuffle.file.buffer will make the shuffle fail Key: SPARK-23029 URL: https://issues.apache.org/jira/browse/SPARK-23029 Project: Spark

[jira] [Updated] (SPARK-22951) count() after dropDuplicates() on emptyDataFrame returns incorrect value

2018-01-10 Thread Cheng Lian (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-22951: --- Target Version/s: 2.3.0 > count() after dropDuplicates() on emptyDataFrame returns incorrect value >

[jira] [Updated] (SPARK-22951) count() after dropDuplicates() on emptyDataFrame returns incorrect value

2018-01-10 Thread Cheng Lian (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-22951: --- Labels: correctness (was: ) > count() after dropDuplicates() on emptyDataFrame returns incorrect

[jira] [Commented] (SPARK-23020) Flaky Test: org.apache.spark.launcher.SparkLauncherSuite.testInProcessLauncher

2018-01-10 Thread Marcelo Vanzin (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320753#comment-16320753 ] Marcelo Vanzin commented on SPARK-23020: I think I found the race in the code, now need to figure

[jira] [Resolved] (SPARK-23019) Flaky Test: org.apache.spark.JavaJdbcRDDSuite.testJavaJdbcRDD

2018-01-10 Thread Marcelo Vanzin (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-23019. Resolution: Fixed Assignee: Gengliang Wang Fix Version/s: 2.3.0 > Flaky

[jira] [Commented] (SPARK-22982) Remove unsafe asynchronous close() call from FileDownloadChannel

2018-01-10 Thread Sean Owen (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320667#comment-16320667 ] Sean Owen commented on SPARK-22982: --- Agreed. java.nio should be OK as that was introduced in Java 7.

[jira] [Commented] (SPARK-22982) Remove unsafe asynchronous close() call from FileDownloadChannel

2018-01-10 Thread Josh Rosen (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320664#comment-16320664 ] Josh Rosen commented on SPARK-22982: In theory this affects all 1.6.0+ versions. It's going to be

[jira] [Updated] (SPARK-22972) Couldn't find corresponding Hive SerDe for data source provider org.apache.spark.sql.hive.orc.

2018-01-10 Thread Xiao Li (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22972: Fix Version/s: 2.2.2 > Couldn't find corresponding Hive SerDe for data source provider >

[jira] [Commented] (SPARK-21727) Operating on an ArrayType in a SparkR DataFrame throws error

2018-01-10 Thread Neil Alexander McQuarrie (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-21727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320439#comment-16320439 ] Neil Alexander McQuarrie commented on SPARK-21727: -- Okay great. Implemented and tests

[jira] [Issue Comment Deleted] (SPARK-22946) Recursive withColumn calls cause org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificUnsafeProjection" grows beyond 64 KB

2018-01-10 Thread Marco Gaido (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Gaido updated SPARK-22946: Comment: was deleted (was: I am unable to reproduce on master. If I remember correctly, this

[jira] [Comment Edited] (SPARK-22946) Recursive withColumn calls cause org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificUnsafeProjection" grows beyond 64 KB

2018-01-10 Thread Marco Gaido (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320402#comment-16320402 ] Marco Gaido edited comment on SPARK-22946 at 1/10/18 3:13 PM: -- I am unable

[jira] [Commented] (SPARK-22946) Recursive withColumn calls cause org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificUnsafeProjection" grows beyond 64 KB

2018-01-10 Thread Marco Gaido (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320402#comment-16320402 ] Marco Gaido commented on SPARK-22946: - I am unable to reproduce on master. If I remember correctly,

[jira] [Commented] (SPARK-23028) Bump master branch version to 2.4.0-SNAPSHOT

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320393#comment-16320393 ] Apache Spark commented on SPARK-23028: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23028) Bump master branch version to 2.4.0-SNAPSHOT

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23028: Assignee: Apache Spark (was: Xiao Li) > Bump master branch version to 2.4.0-SNAPSHOT >

[jira] [Assigned] (SPARK-23028) Bump master branch version to 2.4.0-SNAPSHOT

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23028: Assignee: Xiao Li (was: Apache Spark) > Bump master branch version to 2.4.0-SNAPSHOT >

[jira] [Updated] (SPARK-23028) Bump master branch version to 2.4.0-SNAPSHOT

2018-01-10 Thread Xiao Li (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-23028: Component/s: (was: SQL) Build > Bump master branch version to 2.4.0-SNAPSHOT >

[jira] [Created] (SPARK-23028) Bump master branch version to 2.4.0-SNAPSHOT

2018-01-10 Thread Xiao Li (JIRA)

Xiao Li created SPARK-23028: --- Summary: Bump master branch version to 2.4.0-SNAPSHOT Key: SPARK-23028 URL: https://issues.apache.org/jira/browse/SPARK-23028 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-22991) High read latency with spark streaming 2.2.1 and kafka 0.10.0.1

2018-01-10 Thread Sean Owen (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320365#comment-16320365 ] Sean Owen commented on SPARK-22991: --- But that would also mean you were using Kafka 0.8 and not 0.10,

[jira] [Updated] (SPARK-23026) Add RegisterUDF to PySpark

2018-01-10 Thread Sean Owen (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-23026: -- Issue Type: Improvement (was: Bug) > Add RegisterUDF to PySpark > -- > >

[jira] [Commented] (SPARK-21157) Report Total Memory Used by Spark Executors

2018-01-10 Thread assia ydroudj (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-21157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320293#comment-16320293 ] assia ydroudj commented on SPARK-21157: --- I m beginner in apache spark and have installed a

[jira] [Assigned] (SPARK-23019) Flaky Test: org.apache.spark.JavaJdbcRDDSuite.testJavaJdbcRDD

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23019: Assignee: Apache Spark > Flaky Test: org.apache.spark.JavaJdbcRDDSuite.testJavaJdbcRDD >

[jira] [Assigned] (SPARK-23019) Flaky Test: org.apache.spark.JavaJdbcRDDSuite.testJavaJdbcRDD

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23019: Assignee: (was: Apache Spark) > Flaky Test:

[jira] [Commented] (SPARK-23019) Flaky Test: org.apache.spark.JavaJdbcRDDSuite.testJavaJdbcRDD

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320225#comment-16320225 ] Apache Spark commented on SPARK-23019: -- User 'gengliangwang' has created a pull request for this

[jira] [Commented] (SPARK-22982) Remove unsafe asynchronous close() call from FileDownloadChannel

2018-01-10 Thread Sean Owen (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320203#comment-16320203 ] Sean Owen commented on SPARK-22982: --- Does this affect earlier branches in the same way? Seems important

[jira] [Commented] (SPARK-21396) Spark Hive Thriftserver doesn't return UDT field

2018-01-10 Thread Ken Tore Tallakstad (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-21396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320201#comment-16320201 ] Ken Tore Tallakstad commented on SPARK-21396: - Does the bug originate from this function

[jira] [Comment Edited] (SPARK-21396) Spark Hive Thriftserver doesn't return UDT field

2018-01-10 Thread Ken Tore Tallakstad (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-21396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320201#comment-16320201 ] Ken Tore Tallakstad edited comment on SPARK-21396 at 1/10/18 1:15 PM:

[jira] [Comment Edited] (SPARK-21396) Spark Hive Thriftserver doesn't return UDT field

2018-01-10 Thread Ken Tore Tallakstad (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-21396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320201#comment-16320201 ] Ken Tore Tallakstad edited comment on SPARK-21396 at 1/10/18 1:15 PM:

[jira] [Commented] (SPARK-18147) Broken Spark SQL Codegen

2018-01-10 Thread Alexander Chermenin (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-18147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320159#comment-16320159 ] Alexander Chermenin commented on SPARK-18147: - Is it resolved for the 2.2.1 version? I have a

[jira] [Assigned] (SPARK-23025) DataSet with scala.Null causes Exception

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23025: Assignee: Apache Spark > DataSet with scala.Null causes Exception >

[jira] [Commented] (SPARK-23025) DataSet with scala.Null causes Exception

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320148#comment-16320148 ] Apache Spark commented on SPARK-23025: -- User 'mgaido91' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23025) DataSet with scala.Null causes Exception

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23025: Assignee: (was: Apache Spark) > DataSet with scala.Null causes Exception >

[jira] [Updated] (SPARK-23027) optimizer a simple query using a non-existent data is too slow

2018-01-10 Thread wangminfeng (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangminfeng updated SPARK-23027: Summary: optimizer a simple query using a non-existent data is too slow (was: optimizer a simple

[jira] [Created] (SPARK-23027) optimizer a simple query

2018-01-10 Thread wangminfeng (JIRA)

wangminfeng created SPARK-23027: --- Summary: optimizer a simple query Key: SPARK-23027 URL: https://issues.apache.org/jira/browse/SPARK-23027 Project: Spark Issue Type: Bug Components:

[jira] [Commented] (SPARK-23000) Flaky test suite DataSourceWithHiveMetastoreCatalogSuite in Spark 2.3

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320110#comment-16320110 ] Apache Spark commented on SPARK-23000: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-21179) Unable to return Hive INT data type into Spark via Hive JDBC driver: Caused by: java.sql.SQLDataException: [Simba][JDBC](10140) Error converting value to int.

2018-01-10 Thread Matthew Walton (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-21179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320100#comment-16320100 ] Matthew Walton edited comment on SPARK-21179 at 1/10/18 11:29 AM: -- I

[jira] [Commented] (SPARK-21183) Unable to return Google BigQuery INTEGER data type into Spark via google BigQuery JDBC driver: java.sql.SQLDataException: [Simba][JDBC](10140) Error converting value t

2018-01-10 Thread Matthew Walton (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-21183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320106#comment-16320106 ] Matthew Walton commented on SPARK-21183: I ended up getting an answer and work around from Simba

[jira] [Comment Edited] (SPARK-21179) Unable to return Hive INT data type into Spark via Hive JDBC driver: Caused by: java.sql.SQLDataException: [Simba][JDBC](10140) Error converting value to int.

2018-01-10 Thread Matthew Walton (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-21179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320100#comment-16320100 ] Matthew Walton edited comment on SPARK-21179 at 1/10/18 11:27 AM: -- I

[jira] [Commented] (SPARK-21179) Unable to return Hive INT data type into Spark via Hive JDBC driver: Caused by: java.sql.SQLDataException: [Simba][JDBC](10140) Error converting value to int.

2018-01-10 Thread Matthew Walton (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-21179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320100#comment-16320100 ] Matthew Walton commented on SPARK-21179: I ended up getting a resolution from Simba: "I

[jira] [Commented] (SPARK-17998) Reading Parquet files coalesces parts into too few in-memory partitions

2018-01-10 Thread sam (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-17998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320041#comment-16320041 ] sam commented on SPARK-17998: - [~srowen] Thanks, no idea where I got that from, cursed weakly typed silently

[jira] [Commented] (SPARK-23026) Add RegisterUDF to PySpark

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320022#comment-16320022 ] Apache Spark commented on SPARK-23026: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23026) Add RegisterUDF to PySpark

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23026: Assignee: Xiao Li (was: Apache Spark) > Add RegisterUDF to PySpark >

[jira] [Assigned] (SPARK-23026) Add RegisterUDF to PySpark

2018-01-10 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-23026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23026: Assignee: Apache Spark (was: Xiao Li) > Add RegisterUDF to PySpark >

1 2 >

1 - 100 of 110 matches

Mail list logo