[jira] [Created] (SPARK-22515) Estimation relation size based on numRows * rowSize

2017-11-13 Thread Zhenhua Wang (JIRA)
Zhenhua Wang created SPARK-22515: Summary: Estimation relation size based on numRows * rowSize Key: SPARK-22515 URL: https://issues.apache.org/jira/browse/SPARK-22515 Project: Spark Issue

[jira] [Updated] (SPARK-22490) PySpark doc has misleading string for SparkSession.builder

2017-11-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-22490: - Affects Version/s: 2.2.2 > PySpark doc has misleading string for SparkSession.builder >

[jira] [Updated] (SPARK-22511) Update maven central repo address

2017-11-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-22511: - Affects Version/s: 2.3.0 > Update maven central repo address > -

[jira] [Updated] (SPARK-21098) Set lineseparator csv multiline and csv write to \n

2017-11-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-21098: - Affects Version/s: (was: 2.2.1) 2.2.2 > Set lineseparator csv

[jira] [Updated] (SPARK-21859) SparkFiles.get failed on driver in yarn-cluster and yarn-client mode

2017-11-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-21859: - Affects Version/s: (was: 2.2.1) > SparkFiles.get failed on driver in yarn-cluster and

[jira] [Updated] (SPARK-22511) Update maven central repo address

2017-11-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-22511: - Affects Version/s: 2.2.2 > Update maven central repo address > -

[jira] [Updated] (SPARK-21245) Resolve code duplication for classification/regression summarizers

2017-11-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-21245: - Affects Version/s: (was: 2.2.1) 2.2.2 > Resolve code duplication for

[jira] [Updated] (SPARK-21259) More rules for scalastyle

2017-11-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-21259: - Affects Version/s: (was: 2.2.1) 2.2.2 > More rules for scalastyle >

[jira] [Commented] (SPARK-22510) Exceptions caused by 64KB JVM bytecode limit

2017-11-13 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250839#comment-16250839 ] Kazuaki Ishizaki commented on SPARK-22510: -- [~smilegator] Thanks, good idea. I will add other

[jira] [Commented] (SPARK-10496) Efficient DataFrame cumulative sum

2017-11-13 Thread Thomas Han (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250802#comment-16250802 ] Thomas Han commented on SPARK-10496: Any updates on this? > Efficient DataFrame cumulative sum >

[jira] [Assigned] (SPARK-22511) Update maven central repo address

2017-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22511: Assignee: Apache Spark > Update maven central repo address >

[jira] [Commented] (SPARK-22511) Update maven central repo address

2017-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250729#comment-16250729 ] Apache Spark commented on SPARK-22511: -- User 'srowen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22511) Update maven central repo address

2017-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22511: Assignee: (was: Apache Spark) > Update maven central repo address >

[jira] [Assigned] (SPARK-14228) Lost executor of RPC disassociated, and occurs exception: Could not find CoarseGrainedScheduler or it has been stopped

2017-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14228: Assignee: (was: Apache Spark) > Lost executor of RPC disassociated, and occurs

[jira] [Assigned] (SPARK-14228) Lost executor of RPC disassociated, and occurs exception: Could not find CoarseGrainedScheduler or it has been stopped

2017-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14228: Assignee: Apache Spark > Lost executor of RPC disassociated, and occurs exception: Could

[jira] [Commented] (SPARK-14228) Lost executor of RPC disassociated, and occurs exception: Could not find CoarseGrainedScheduler or it has been stopped

2017-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250714#comment-16250714 ] Apache Spark commented on SPARK-14228: -- User 'devaraj-kavali' has created a pull request for this

[jira] [Commented] (SPARK-9104) expose network layer memory usage

2017-11-13 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250675#comment-16250675 ] Saisai Shao commented on SPARK-9104: [~vsr] I think SPARK-21934 already exposed Netty shuffle metrics

[jira] [Assigned] (SPARK-12375) VectorIndexer: allow unknown categories

2017-11-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-12375: - Assignee: Weichen Xu (was: yuhao yang) > VectorIndexer: allow unknown

[jira] [Commented] (SPARK-22504) Optimization in overwrite table in case of failure

2017-11-13 Thread xuchuanyin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250621#comment-16250621 ] xuchuanyin commented on SPARK-22504: [~srowen] thanks for your reply. My opinion is as below: how do

[jira] [Commented] (SPARK-22451) Reduce decision tree aggregate size for unordered features from O(2^numCategories) to O(numCategories)

2017-11-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250612#comment-16250612 ] Joseph K. Bradley commented on SPARK-22451: --- Whoops yes I think you're right. Funny we never

[jira] [Commented] (SPARK-22514) move ColumnVector.Array and ColumnarBatch.Row to individual files

2017-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250610#comment-16250610 ] Apache Spark commented on SPARK-22514: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22514) move ColumnVector.Array and ColumnarBatch.Row to individual files

2017-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22514: Assignee: Wenchen Fan (was: Apache Spark) > move ColumnVector.Array and

[jira] [Assigned] (SPARK-22514) move ColumnVector.Array and ColumnarBatch.Row to individual files

2017-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22514: Assignee: Apache Spark (was: Wenchen Fan) > move ColumnVector.Array and

[jira] [Created] (SPARK-22514) move ColumnVector.Array and ColumnarBatch.Row to individual files

2017-11-13 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-22514: --- Summary: move ColumnVector.Array and ColumnarBatch.Row to individual files Key: SPARK-22514 URL: https://issues.apache.org/jira/browse/SPARK-22514 Project: Spark

[jira] [Commented] (SPARK-22471) SQLListener consumes much memory causing OutOfMemoryError

2017-11-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250589#comment-16250589 ] Felix Cheung commented on SPARK-22471: -- yes, RC1 has been cut. > SQLListener consumes much memory

[jira] [Updated] (SPARK-22042) ReorderJoinPredicates can break when child's partitioning is not decided

2017-11-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22042?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-22042: - Target Version/s: 2.2.2 (was: 2.2.1) > ReorderJoinPredicates can break when child's

[jira] [Updated] (SPARK-22471) SQLListener consumes much memory causing OutOfMemoryError

2017-11-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-22471: - Target Version/s: 2.2.2 > SQLListener consumes much memory causing OutOfMemoryError >

[jira] [Updated] (SPARK-22471) SQLListener consumes much memory causing OutOfMemoryError

2017-11-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-22471: - Fix Version/s: (was: 2.2.1) 2.2.2 > SQLListener consumes much memory

[jira] [Resolved] (SPARK-21046) simplify the array offset and length in ColumnVector

2017-11-13 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-21046. - Resolution: Not A Problem > simplify the array offset and length in ColumnVector >

[jira] [Reopened] (SPARK-21046) simplify the array offset and length in ColumnVector

2017-11-13 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reopened SPARK-21046: - > simplify the array offset and length in ColumnVector >

[jira] [Assigned] (SPARK-22513) Provide build profile for hadoop 2.8

2017-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22513: Assignee: (was: Apache Spark) > Provide build profile for hadoop 2.8 >

[jira] [Assigned] (SPARK-22513) Provide build profile for hadoop 2.8

2017-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22513: Assignee: Apache Spark > Provide build profile for hadoop 2.8 >

[jira] [Commented] (SPARK-22513) Provide build profile for hadoop 2.8

2017-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250521#comment-16250521 ] Apache Spark commented on SPARK-22513: -- User 'cko' has created a pull request for this issue:

[jira] [Commented] (SPARK-13127) Upgrade Parquet to 1.9 (Fixes parquet sorting)

2017-11-13 Thread Dong Jiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250512#comment-16250512 ] Dong Jiang commented on SPARK-13127: [~igozali], I think you are referring to this parquet ticket:

[jira] [Comment Edited] (SPARK-13127) Upgrade Parquet to 1.9 (Fixes parquet sorting)

2017-11-13 Thread Dong Jiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250512#comment-16250512 ] Dong Jiang edited comment on SPARK-13127 at 11/13/17 11:56 PM: --- [~igozali],

[jira] [Updated] (SPARK-21646) Add new type coercion rules to compatible with Hive

2017-11-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-21646: Target Version/s: 2.3.0 > Add new type coercion rules to compatible with Hive >

[jira] [Updated] (SPARK-22469) Accuracy problem in comparison with string and numeric

2017-11-13 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-22469: Labels: (was: release-notes) > Accuracy problem in comparison with string and numeric >

[jira] [Commented] (SPARK-22513) Provide build profile for hadoop 2.8

2017-11-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250485#comment-16250485 ] Sean Owen commented on SPARK-22513: --- It's not required. It should work fine with 2.8 as-is if you use

[jira] [Updated] (SPARK-22469) Accuracy problem in comparison with string and numeric

2017-11-13 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-22469: Labels: release-notes (was: ) > Accuracy problem in comparison with string and numeric >

[jira] [Assigned] (SPARK-22377) Maven nightly snapshot jenkins jobs are broken on multiple workers due to lsof

2017-11-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-22377: Assignee: Hyukjin Kwon > Maven nightly snapshot jenkins jobs are broken on multiple

[jira] [Resolved] (SPARK-22377) Maven nightly snapshot jenkins jobs are broken on multiple workers due to lsof

2017-11-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22377. -- Resolution: Fixed Fix Version/s: 2.1.3 2.3.0 2.2.1

[jira] [Created] (SPARK-22513) Provide build profile for hadoop 2.8

2017-11-13 Thread Christine Koppelt (JIRA)
Christine Koppelt created SPARK-22513: - Summary: Provide build profile for hadoop 2.8 Key: SPARK-22513 URL: https://issues.apache.org/jira/browse/SPARK-22513 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-22509) Spark Streaming: jobs with same batch length all start at the same time, permit jobs to be offset

2017-11-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-22509. -- Resolution: Not A Bug I don't think it's worth to do such improvement in Spark Streaming. Even

[jira] [Reopened] (SPARK-22509) Spark Streaming: jobs with same batch length all start at the same time, permit jobs to be offset

2017-11-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu reopened SPARK-22509: -- > Spark Streaming: jobs with same batch length all start at the same time, > permit jobs to be

[jira] [Resolved] (SPARK-22509) Spark Streaming: jobs with same batch length all start at the same time, permit jobs to be offset

2017-11-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-22509. -- Resolution: Duplicate > Spark Streaming: jobs with same batch length all start at the same

[jira] [Updated] (SPARK-22509) Spark Streaming: jobs with same batch length all start at the same time, permit jobs to be offset

2017-11-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-22509: - Component/s: (was: Structured Streaming) DStreams > Spark Streaming: jobs

[jira] [Updated] (SPARK-22509) Spark Streaming: jobs with same batch length all start at the same time, permit jobs to be offset

2017-11-13 Thread Wallace Baggaley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wallace Baggaley updated SPARK-22509: - Description: Using Spark Streaming, a batch with batch length of five for example will

[jira] [Updated] (SPARK-22509) Spark Streaming: jobs with same batch length all start at the same time, permit jobs to be offset

2017-11-13 Thread Wallace Baggaley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wallace Baggaley updated SPARK-22509: - Summary: Spark Streaming: jobs with same batch length all start at the same time, permit

[jira] [Resolved] (SPARK-22512) How do we send UUID in spark dataset (using Java) to postgreSQL

2017-11-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-22512. Resolution: Invalid Please use the mailing lists for questions.

[jira] [Created] (SPARK-22512) How do we send UUID in spark dataset (using Java) to postgreSQL

2017-11-13 Thread Abhijit Parasnis (JIRA)
Abhijit Parasnis created SPARK-22512: Summary: How do we send UUID in spark dataset (using Java) to postgreSQL Key: SPARK-22512 URL: https://issues.apache.org/jira/browse/SPARK-22512 Project:

[jira] [Commented] (SPARK-22471) SQLListener consumes much memory causing OutOfMemoryError

2017-11-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250343#comment-16250343 ] Marcelo Vanzin commented on SPARK-22471: If RC1 has been cut then there probably should be a

[jira] [Commented] (SPARK-22471) SQLListener consumes much memory causing OutOfMemoryError

2017-11-13 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250332#comment-16250332 ] Dongjoon Hyun commented on SPARK-22471: --- Thank you for merging, [~vanzin]. Although this is late

[jira] [Assigned] (SPARK-22471) SQLListener consumes much memory causing OutOfMemoryError

2017-11-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-22471: -- Assignee: Arseniy Tashoyan > SQLListener consumes much memory causing

[jira] [Resolved] (SPARK-22471) SQLListener consumes much memory causing OutOfMemoryError

2017-11-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-22471. Resolution: Fixed Fix Version/s: 2.2.1 Issue resolved by pull request 19711

[jira] [Created] (SPARK-22511) Update maven central repo address

2017-11-13 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-22511: Summary: Update maven central repo address Key: SPARK-22511 URL: https://issues.apache.org/jira/browse/SPARK-22511 Project: Spark Issue Type: Bug

[jira] [Comment Edited] (SPARK-22510) Exceptions caused by 64KB JVM bytecode limit

2017-11-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250292#comment-16250292 ] Xiao Li edited comment on SPARK-22510 at 11/13/17 9:42 PM: --- [~kiszk] Could you

[jira] [Commented] (SPARK-22510) Exceptions caused by 64KB JVM bytecode limit

2017-11-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250292#comment-16250292 ] Xiao Li commented on SPARK-22510: - [~kiszk] Could you just add the new subtasks under this umbrella

[jira] [Updated] (SPARK-21720) Filter predicate with many conditions throw stackoverflow error

2017-11-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-21720: Issue Type: Sub-task (was: Bug) Parent: SPARK-22510 > Filter predicate with many conditions throw

[jira] [Updated] (SPARK-22494) Coalesce and AtLeastNNonNulls can cause 64KB JVM bytecode limit exception

2017-11-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22494: Issue Type: Sub-task (was: Bug) Parent: SPARK-22510 > Coalesce and AtLeastNNonNulls can cause

[jira] [Updated] (SPARK-22498) 64KB JVM bytecode limit problem with concat and concat_ws

2017-11-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22498: Issue Type: Sub-task (was: Bug) Parent: SPARK-22510 > 64KB JVM bytecode limit problem with concat

[jira] [Updated] (SPARK-22510) Exceptions caused by 64KB JVM bytecode limit

2017-11-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22510: Summary: Exceptions caused by 64KB JVM bytecode limit (was: 64KB JVM bytecode limit ) > Exceptions

[jira] [Updated] (SPARK-22499) 64KB JVM bytecode limit problem with least and greatest

2017-11-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22499: Issue Type: Sub-task (was: Bug) Parent: SPARK-22510 > 64KB JVM bytecode limit problem with least

[jira] [Updated] (SPARK-22500) 64KB JVM bytecode limit problem with cast

2017-11-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22500: Issue Type: Sub-task (was: Bug) Parent: SPARK-22510 > 64KB JVM bytecode limit problem with cast >

[jira] [Updated] (SPARK-22508) 64KB JVM bytecode limit problem with GenerateUnsafeRowJoiner.create()

2017-11-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22508: Issue Type: Sub-task (was: Bug) Parent: SPARK-22510 > 64KB JVM bytecode limit problem with

[jira] [Updated] (SPARK-22501) 64KB JVM bytecode limit problem with in

2017-11-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22501: Issue Type: Sub-task (was: Bug) Parent: SPARK-22510 > 64KB JVM bytecode limit problem with in >

[jira] [Created] (SPARK-22510) 64KB JVM bytecode limit

2017-11-13 Thread Xiao Li (JIRA)
Xiao Li created SPARK-22510: --- Summary: 64KB JVM bytecode limit Key: SPARK-22510 URL: https://issues.apache.org/jira/browse/SPARK-22510 Project: Spark Issue Type: Umbrella Components:

[jira] [Created] (SPARK-22509) Spark Streaming: jobs with 5 minute batch length all start at the same time, permit jobs to be offset

2017-11-13 Thread Wallace Baggaley (JIRA)
Wallace Baggaley created SPARK-22509: Summary: Spark Streaming: jobs with 5 minute batch length all start at the same time, permit jobs to be offset Key: SPARK-22509 URL:

[jira] [Assigned] (SPARK-22495) Fix setup of SPARK_HOME variable on Windows

2017-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22495: Assignee: Apache Spark > Fix setup of SPARK_HOME variable on Windows >

[jira] [Assigned] (SPARK-22495) Fix setup of SPARK_HOME variable on Windows

2017-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22495: Assignee: (was: Apache Spark) > Fix setup of SPARK_HOME variable on Windows >

[jira] [Commented] (SPARK-22495) Fix setup of SPARK_HOME variable on Windows

2017-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250137#comment-16250137 ] Apache Spark commented on SPARK-22495: -- User 'jsnowacki' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-22505) toDF() / createDataFrame() type inference doesn't work as expected

2017-11-13 Thread Ruslan Dautkhanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250054#comment-16250054 ] Ruslan Dautkhanov edited comment on SPARK-22505 at 11/13/17 7:47 PM: -

[jira] [Commented] (SPARK-21994) Spark 2.2 can not read Parquet table created by itself

2017-11-13 Thread Srinivasa Reddy Vundela (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250066#comment-16250066 ] Srinivasa Reddy Vundela commented on SPARK-21994: - [~srowen] Thats right, it is not

[jira] [Commented] (SPARK-22505) toDF() / createDataFrame() type inference doesn't work as expected

2017-11-13 Thread Ruslan Dautkhanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250054#comment-16250054 ] Ruslan Dautkhanov commented on SPARK-22505: --- Looks like we already discussed very similar topic

[jira] [Commented] (SPARK-21994) Spark 2.2 can not read Parquet table created by itself

2017-11-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250039#comment-16250039 ] Sean Owen commented on SPARK-21994: --- (Don't think that would be meaningful outside Cloudera at the

[jira] [Commented] (SPARK-21994) Spark 2.2 can not read Parquet table created by itself

2017-11-13 Thread Srinivasa Reddy Vundela (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250032#comment-16250032 ] Srinivasa Reddy Vundela commented on SPARK-21994: - commit

[jira] [Commented] (SPARK-20791) Use Apache Arrow to Improve Spark createDataFrame from Pandas.DataFrame

2017-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250029#comment-16250029 ] Apache Spark commented on SPARK-20791: -- User 'BryanCutler' has created a pull request for this

[jira] [Commented] (SPARK-22490) PySpark doc has misleading string for SparkSession.builder

2017-11-13 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250025#comment-16250025 ] Dongjoon Hyun commented on SPARK-22490: --- Hi, [~smilegator]. Could you review the PR? I assumed that

[jira] [Commented] (SPARK-22508) 64KB JVM bytecode limit problem with GenerateUnsafeRowJoiner.create()

2017-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250001#comment-16250001 ] Apache Spark commented on SPARK-22508: -- User 'kiszk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22508) 64KB JVM bytecode limit problem with GenerateUnsafeRowJoiner.create()

2017-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22508: Assignee: (was: Apache Spark) > 64KB JVM bytecode limit problem with

[jira] [Assigned] (SPARK-22508) 64KB JVM bytecode limit problem with GenerateUnsafeRowJoiner.create()

2017-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22508: Assignee: Apache Spark > 64KB JVM bytecode limit problem with

[jira] [Created] (SPARK-22508) 64KB JVM bytecode limit problem with GenerateUnsafeRowJoiner.create()

2017-11-13 Thread Kazuaki Ishizaki (JIRA)
Kazuaki Ishizaki created SPARK-22508: Summary: 64KB JVM bytecode limit problem with GenerateUnsafeRowJoiner.create() Key: SPARK-22508 URL: https://issues.apache.org/jira/browse/SPARK-22508

[jira] [Comment Edited] (SPARK-9104) expose network layer memory usage

2017-11-13 Thread Srinivasa Reddy Vundela (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16249904#comment-16249904 ] Srinivasa Reddy Vundela edited comment on SPARK-9104 at 11/13/17 6:39 PM:

[jira] [Commented] (SPARK-9104) expose network layer memory usage

2017-11-13 Thread Srinivasa Reddy Vundela (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16249904#comment-16249904 ] Srinivasa Reddy Vundela commented on SPARK-9104: Hi [~jerryshao] Thanks for the PR which

[jira] [Commented] (SPARK-22507) Cannot register inner class with Kryo using SparkConf

2017-11-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16249844#comment-16249844 ] Sean Owen commented on SPARK-22507: --- What if you make it a non-inner class? although that should be OK,

[jira] [Commented] (SPARK-22507) Cannot register inner class with Kryo using SparkConf

2017-11-13 Thread Yu LIU (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16249835#comment-16249835 ] Yu LIU commented on SPARK-22507: I did something like this: {code:java} package

[jira] [Commented] (SPARK-22507) Cannot register inner class with Kryo using SparkConf

2017-11-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16249812#comment-16249812 ] Sean Owen commented on SPARK-22507: --- Are you sure the class is on the classpath? how do you try to

[jira] [Created] (SPARK-22507) Cannot register inner class with Kryo using SparkConf

2017-11-13 Thread Yu LIU (JIRA)
Yu LIU created SPARK-22507: -- Summary: Cannot register inner class with Kryo using SparkConf Key: SPARK-22507 URL: https://issues.apache.org/jira/browse/SPARK-22507 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-22431) Creating Permanent view with illegal type

2017-11-13 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16249756#comment-16249756 ] Herman van Hovell commented on SPARK-22431: --- I look forward to the PR :) > Creating Permanent

[jira] [Commented] (SPARK-22505) toDF() / createDataFrame() type inference doesn't work as expected

2017-11-13 Thread Ruslan Dautkhanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16249702#comment-16249702 ] Ruslan Dautkhanov commented on SPARK-22505: --- In a way, you can think of this as of Pandas'

[jira] [Commented] (SPARK-22505) toDF() / createDataFrame() type inference doesn't work as expected

2017-11-13 Thread Ruslan Dautkhanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16249689#comment-16249689 ] Ruslan Dautkhanov commented on SPARK-22505: --- [~hyukjin.kwon] Yep, '1' is of type 'str'. This

[jira] [Commented] (SPARK-20387) Permissive mode is not replacing corrupt record with null

2017-11-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16249563#comment-16249563 ] Hyukjin Kwon commented on SPARK-20387: -- BTW, the example and input indeed has a problem. I had to

[jira] [Comment Edited] (SPARK-20387) Permissive mode is not replacing corrupt record with null

2017-11-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16249553#comment-16249553 ] Hyukjin Kwon edited comment on SPARK-20387 at 11/13/17 1:38 PM: Sounds it

[jira] [Resolved] (SPARK-20387) Permissive mode is not replacing corrupt record with null

2017-11-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20387. -- Resolution: Duplicate > Permissive mode is not replacing corrupt record with null >

[jira] [Commented] (SPARK-20387) Permissive mode is not replacing corrupt record with null

2017-11-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16249553#comment-16249553 ] Hyukjin Kwon commented on SPARK-20387: -- Yup, all sound correct ^ and sounds it is related with

[jira] [Commented] (SPARK-22431) Creating Permanent view with illegal type

2017-11-13 Thread Sunitha Kambhampati (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16249517#comment-16249517 ] Sunitha Kambhampati commented on SPARK-22431: - Thanks for the response. Option 1 sounds

[jira] [Commented] (SPARK-22431) Creating Permanent view with illegal type

2017-11-13 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16249511#comment-16249511 ] Herman van Hovell commented on SPARK-22431: --- [~ksunitha] Thanks for the thorough analysis! I

[jira] [Commented] (SPARK-20387) Permissive mode is not replacing corrupt record with null

2017-11-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16249508#comment-16249508 ] Sean Owen commented on SPARK-20387: --- That's not the same example. I believe the underlying number

[jira] [Commented] (SPARK-22431) Creating Permanent view with illegal type

2017-11-13 Thread Sunitha Kambhampati (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16249505#comment-16249505 ] Sunitha Kambhampati commented on SPARK-22431: - *Observations:* I ran a few tests with the

[jira] [Commented] (SPARK-22504) Optimization in overwrite table in case of failure

2017-11-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16249502#comment-16249502 ] Sean Owen commented on SPARK-22504: --- Removing the original table isn't a problem; the user asked for

[jira] [Closed] (SPARK-22439) Not able to get numeric columns for the file having decimal values

2017-11-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen closed SPARK-22439. - > Not able to get numeric columns for the file having decimal values >

  1   2   >