date: 2017-11-13

[jira] [Created] (SPARK-22515) Estimation relation size based on numRows * rowSize

2017-11-13 Thread Zhenhua Wang (JIRA)

Zhenhua Wang created SPARK-22515: Summary: Estimation relation size based on numRows * rowSize Key: SPARK-22515 URL: https://issues.apache.org/jira/browse/SPARK-22515 Project: Spark Issue Typ

[jira] [Updated] (SPARK-22490) PySpark doc has misleading string for SparkSession.builder

2017-11-13 Thread Felix Cheung (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-22490: - Affects Version/s: 2.2.2 > PySpark doc has misleading string for SparkSession.builder > -

[jira] [Updated] (SPARK-22511) Update maven central repo address

2017-11-13 Thread Felix Cheung (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-22511: - Affects Version/s: 2.3.0 > Update maven central repo address > -

[jira] [Updated] (SPARK-21098) Set lineseparator csv multiline and csv write to \n

2017-11-13 Thread Felix Cheung (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-21098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-21098: - Affects Version/s: (was: 2.2.1) 2.2.2 > Set lineseparator csv multilin

[jira] [Updated] (SPARK-21859) SparkFiles.get failed on driver in yarn-cluster and yarn-client mode

2017-11-13 Thread Felix Cheung (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-21859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-21859: - Affects Version/s: (was: 2.2.1) > SparkFiles.get failed on driver in yarn-cluster and yarn-cl

[jira] [Updated] (SPARK-22511) Update maven central repo address

2017-11-13 Thread Felix Cheung (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-22511: - Affects Version/s: 2.2.2 > Update maven central repo address > -

[jira] [Updated] (SPARK-21245) Resolve code duplication for classification/regression summarizers

2017-11-13 Thread Felix Cheung (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-21245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-21245: - Affects Version/s: (was: 2.2.1) 2.2.2 > Resolve code duplication for c

[jira] [Updated] (SPARK-21259) More rules for scalastyle

2017-11-13 Thread Felix Cheung (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-21259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-21259: - Affects Version/s: (was: 2.2.1) 2.2.2 > More rules for scalastyle > --

[jira] [Commented] (SPARK-22510) Exceptions caused by 64KB JVM bytecode limit

2017-11-13 Thread Kazuaki Ishizaki (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250839#comment-16250839 ] Kazuaki Ishizaki commented on SPARK-22510: -- [~smilegator] Thanks, good idea. I w

[jira] [Commented] (SPARK-10496) Efficient DataFrame cumulative sum

2017-11-13 Thread Thomas Han (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-10496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250802#comment-16250802 ] Thomas Han commented on SPARK-10496: Any updates on this? > Efficient DataFrame cum

[jira] [Assigned] (SPARK-22511) Update maven central repo address

2017-11-13 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22511: Assignee: Apache Spark > Update maven central repo address > -

[jira] [Commented] (SPARK-22511) Update maven central repo address

2017-11-13 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250729#comment-16250729 ] Apache Spark commented on SPARK-22511: -- User 'srowen' has created a pull request for

[jira] [Assigned] (SPARK-22511) Update maven central repo address

2017-11-13 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22511: Assignee: (was: Apache Spark) > Update maven central repo address > --

[jira] [Assigned] (SPARK-14228) Lost executor of RPC disassociated, and occurs exception: Could not find CoarseGrainedScheduler or it has been stopped

2017-11-13 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-14228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14228: Assignee: (was: Apache Spark) > Lost executor of RPC disassociated, and occurs excepti

[jira] [Assigned] (SPARK-14228) Lost executor of RPC disassociated, and occurs exception: Could not find CoarseGrainedScheduler or it has been stopped

2017-11-13 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-14228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14228: Assignee: Apache Spark > Lost executor of RPC disassociated, and occurs exception: Could n

[jira] [Commented] (SPARK-14228) Lost executor of RPC disassociated, and occurs exception: Could not find CoarseGrainedScheduler or it has been stopped

2017-11-13 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-14228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250714#comment-16250714 ] Apache Spark commented on SPARK-14228: -- User 'devaraj-kavali' has created a pull req

[jira] [Commented] (SPARK-9104) expose network layer memory usage

2017-11-13 Thread Saisai Shao (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-9104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250675#comment-16250675 ] Saisai Shao commented on SPARK-9104: [~vsr] I think SPARK-21934 already exposed Netty

[jira] [Assigned] (SPARK-12375) VectorIndexer: allow unknown categories

2017-11-13 Thread Joseph K. Bradley (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-12375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-12375: - Assignee: Weichen Xu (was: yuhao yang) > VectorIndexer: allow unknown categorie

[jira] [Commented] (SPARK-22504) Optimization in overwrite table in case of failure

2017-11-13 Thread xuchuanyin (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250621#comment-16250621 ] xuchuanyin commented on SPARK-22504: [~srowen] thanks for your reply. My opinion is a

[jira] [Commented] (SPARK-22451) Reduce decision tree aggregate size for unordered features from O(2^numCategories) to O(numCategories)

2017-11-13 Thread Joseph K. Bradley (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250612#comment-16250612 ] Joseph K. Bradley commented on SPARK-22451: --- Whoops yes I think you're right.

[jira] [Commented] (SPARK-22514) move ColumnVector.Array and ColumnarBatch.Row to individual files

2017-11-13 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250610#comment-16250610 ] Apache Spark commented on SPARK-22514: -- User 'cloud-fan' has created a pull request

[jira] [Assigned] (SPARK-22514) move ColumnVector.Array and ColumnarBatch.Row to individual files

2017-11-13 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22514: Assignee: Wenchen Fan (was: Apache Spark) > move ColumnVector.Array and ColumnarBatch.Row

[jira] [Assigned] (SPARK-22514) move ColumnVector.Array and ColumnarBatch.Row to individual files

2017-11-13 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22514: Assignee: Apache Spark (was: Wenchen Fan) > move ColumnVector.Array and ColumnarBatch.Row

[jira] [Created] (SPARK-22514) move ColumnVector.Array and ColumnarBatch.Row to individual files

2017-11-13 Thread Wenchen Fan (JIRA)

Wenchen Fan created SPARK-22514: --- Summary: move ColumnVector.Array and ColumnarBatch.Row to individual files Key: SPARK-22514 URL: https://issues.apache.org/jira/browse/SPARK-22514 Project: Spark

[jira] [Commented] (SPARK-22471) SQLListener consumes much memory causing OutOfMemoryError

2017-11-13 Thread Felix Cheung (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250589#comment-16250589 ] Felix Cheung commented on SPARK-22471: -- yes, RC1 has been cut. > SQLListener consum

[jira] [Updated] (SPARK-22042) ReorderJoinPredicates can break when child's partitioning is not decided

2017-11-13 Thread Felix Cheung (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22042?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-22042: - Target Version/s: 2.2.2 (was: 2.2.1) > ReorderJoinPredicates can break when child's partitioning

[jira] [Updated] (SPARK-22471) SQLListener consumes much memory causing OutOfMemoryError

2017-11-13 Thread Felix Cheung (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-22471: - Target Version/s: 2.2.2 > SQLListener consumes much memory causing OutOfMemoryError > ---

[jira] [Updated] (SPARK-22471) SQLListener consumes much memory causing OutOfMemoryError

2017-11-13 Thread Felix Cheung (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-22471: - Fix Version/s: (was: 2.2.1) 2.2.2 > SQLListener consumes much memory causi

[jira] [Resolved] (SPARK-21046) simplify the array offset and length in ColumnVector

2017-11-13 Thread Wenchen Fan (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-21046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-21046. - Resolution: Not A Problem > simplify the array offset and length in ColumnVector > --

[jira] [Reopened] (SPARK-21046) simplify the array offset and length in ColumnVector

2017-11-13 Thread Wenchen Fan (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-21046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reopened SPARK-21046: - > simplify the array offset and length in ColumnVector >

[jira] [Assigned] (SPARK-22513) Provide build profile for hadoop 2.8

2017-11-13 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22513: Assignee: (was: Apache Spark) > Provide build profile for hadoop 2.8 > ---

[jira] [Assigned] (SPARK-22513) Provide build profile for hadoop 2.8

2017-11-13 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22513: Assignee: Apache Spark > Provide build profile for hadoop 2.8 > --

[jira] [Commented] (SPARK-22513) Provide build profile for hadoop 2.8

2017-11-13 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250521#comment-16250521 ] Apache Spark commented on SPARK-22513: -- User 'cko' has created a pull request for th

[jira] [Commented] (SPARK-13127) Upgrade Parquet to 1.9 (Fixes parquet sorting)

2017-11-13 Thread Dong Jiang (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-13127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250512#comment-16250512 ] Dong Jiang commented on SPARK-13127: [~igozali], I think you are referring to this pa

[jira] [Comment Edited] (SPARK-13127) Upgrade Parquet to 1.9 (Fixes parquet sorting)

2017-11-13 Thread Dong Jiang (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-13127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250512#comment-16250512 ] Dong Jiang edited comment on SPARK-13127 at 11/13/17 11:56 PM:

[jira] [Updated] (SPARK-21646) Add new type coercion rules to compatible with Hive

2017-11-13 Thread Xiao Li (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-21646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-21646: Target Version/s: 2.3.0 > Add new type coercion rules to compatible with Hive > ---

[jira] [Updated] (SPARK-22469) Accuracy problem in comparison with string and numeric

2017-11-13 Thread Wenchen Fan (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-22469: Labels: (was: release-notes) > Accuracy problem in comparison with string and numeric >

[jira] [Commented] (SPARK-22513) Provide build profile for hadoop 2.8

2017-11-13 Thread Sean Owen (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250485#comment-16250485 ] Sean Owen commented on SPARK-22513: --- It's not required. It should work fine with 2.8 as

[jira] [Updated] (SPARK-22469) Accuracy problem in comparison with string and numeric

2017-11-13 Thread Wenchen Fan (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-22469: Labels: release-notes (was: ) > Accuracy problem in comparison with string and numeric >

[jira] [Assigned] (SPARK-22377) Maven nightly snapshot jenkins jobs are broken on multiple workers due to lsof

2017-11-13 Thread Hyukjin Kwon (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-22377: Assignee: Hyukjin Kwon > Maven nightly snapshot jenkins jobs are broken on multiple worker

[jira] [Resolved] (SPARK-22377) Maven nightly snapshot jenkins jobs are broken on multiple workers due to lsof

2017-11-13 Thread Hyukjin Kwon (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22377. -- Resolution: Fixed Fix Version/s: 2.1.3 2.3.0 2.2.1

[jira] [Created] (SPARK-22513) Provide build profile for hadoop 2.8

2017-11-13 Thread Christine Koppelt (JIRA)

Christine Koppelt created SPARK-22513: - Summary: Provide build profile for hadoop 2.8 Key: SPARK-22513 URL: https://issues.apache.org/jira/browse/SPARK-22513 Project: Spark Issue Type: Im

[jira] [Resolved] (SPARK-22509) Spark Streaming: jobs with same batch length all start at the same time, permit jobs to be offset

2017-11-13 Thread Shixiong Zhu (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-22509. -- Resolution: Not A Bug I don't think it's worth making such an improvement in Spark Streaming. Even

[jira] [Reopened] (SPARK-22509) Spark Streaming: jobs with same batch length all start at the same time, permit jobs to be offset

2017-11-13 Thread Shixiong Zhu (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu reopened SPARK-22509: -- > Spark Streaming: jobs with same batch length all start at the same time, > permit jobs to be off

[jira] [Resolved] (SPARK-22509) Spark Streaming: jobs with same batch length all start at the same time, permit jobs to be offset

2017-11-13 Thread Shixiong Zhu (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-22509. -- Resolution: Duplicate > Spark Streaming: jobs with same batch length all start at the same time

[jira] [Updated] (SPARK-22509) Spark Streaming: jobs with same batch length all start at the same time, permit jobs to be offset

2017-11-13 Thread Shixiong Zhu (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-22509: - Component/s: (was: Structured Streaming) DStreams > Spark Streaming: jobs wi

[jira] [Updated] (SPARK-22509) Spark Streaming: jobs with same batch length all start at the same time, permit jobs to be offset

2017-11-13 Thread Wallace Baggaley (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wallace Baggaley updated SPARK-22509: - Description: Using Spark Streaming, a batch with batch length of five for example will ru

[jira] [Updated] (SPARK-22509) Spark Streaming: jobs with same batch length all start at the same time, permit jobs to be offset

2017-11-13 Thread Wallace Baggaley (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wallace Baggaley updated SPARK-22509: - Summary: Spark Streaming: jobs with same batch length all start at the same time, permit

[jira] [Resolved] (SPARK-22512) How do we send UUID in spark dataset (using Java) to postgreSQL

2017-11-13 Thread Marcelo Vanzin (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-22512. Resolution: Invalid Please use the mailing lists for questions. http://spark.apache.org/com

[jira] [Created] (SPARK-22512) How do we send UUID in spark dataset (using Java) to postgreSQL

2017-11-13 Thread Abhijit Parasnis (JIRA)

Abhijit Parasnis created SPARK-22512: Summary: How do we send UUID in spark dataset (using Java) to postgreSQL Key: SPARK-22512 URL: https://issues.apache.org/jira/browse/SPARK-22512 Project: Spar

[jira] [Commented] (SPARK-22471) SQLListener consumes much memory causing OutOfMemoryError

2017-11-13 Thread Marcelo Vanzin (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250343#comment-16250343 ] Marcelo Vanzin commented on SPARK-22471: If RC1 has been cut then there probably

[jira] [Commented] (SPARK-22471) SQLListener consumes much memory causing OutOfMemoryError

2017-11-13 Thread Dongjoon Hyun (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250332#comment-16250332 ] Dongjoon Hyun commented on SPARK-22471: --- Thank you for merging, [~vanzin]. Althoug

[jira] [Assigned] (SPARK-22471) SQLListener consumes much memory causing OutOfMemoryError

2017-11-13 Thread Marcelo Vanzin (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-22471: -- Assignee: Arseniy Tashoyan > SQLListener consumes much memory causing OutOfMemoryError

[jira] [Resolved] (SPARK-22471) SQLListener consumes much memory causing OutOfMemoryError

2017-11-13 Thread Marcelo Vanzin (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-22471. Resolution: Fixed Fix Version/s: 2.2.1 Issue resolved by pull request 19711 [https:/

[jira] [Created] (SPARK-22511) Update maven central repo address

2017-11-13 Thread Felix Cheung (JIRA)

Felix Cheung created SPARK-22511: Summary: Update maven central repo address Key: SPARK-22511 URL: https://issues.apache.org/jira/browse/SPARK-22511 Project: Spark Issue Type: Bug C

[jira] [Comment Edited] (SPARK-22510) Exceptions caused by 64KB JVM bytecode limit

2017-11-13 Thread Xiao Li (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250292#comment-16250292 ] Xiao Li edited comment on SPARK-22510 at 11/13/17 9:42 PM: --- [~k

[jira] [Commented] (SPARK-22510) Exceptions caused by 64KB JVM bytecode limit

2017-11-13 Thread Xiao Li (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250292#comment-16250292 ] Xiao Li commented on SPARK-22510: - [~kiszk] Could you just add the new subtasks under thi

[jira] [Updated] (SPARK-21720) Filter predicate with many conditions throw stackoverflow error

2017-11-13 Thread Xiao Li (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-21720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-21720: Issue Type: Sub-task (was: Bug) Parent: SPARK-22510 > Filter predicate with many conditions throw

[jira] [Updated] (SPARK-22494) Coalesce and AtLeastNNonNulls can cause 64KB JVM bytecode limit exception

2017-11-13 Thread Xiao Li (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22494: Issue Type: Sub-task (was: Bug) Parent: SPARK-22510 > Coalesce and AtLeastNNonNulls can cause 64KB

[jira] [Updated] (SPARK-22498) 64KB JVM bytecode limit problem with concat and concat_ws

2017-11-13 Thread Xiao Li (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22498: Issue Type: Sub-task (was: Bug) Parent: SPARK-22510 > 64KB JVM bytecode limit problem with concat

[jira] [Updated] (SPARK-22510) Exceptions caused by 64KB JVM bytecode limit

2017-11-13 Thread Xiao Li (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22510: Summary: Exceptions caused by 64KB JVM bytecode limit (was: 64KB JVM bytecode limit ) > Exceptions cause

[jira] [Updated] (SPARK-22499) 64KB JVM bytecode limit problem with least and greatest

2017-11-13 Thread Xiao Li (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22499: Issue Type: Sub-task (was: Bug) Parent: SPARK-22510 > 64KB JVM bytecode limit problem with least a

[jira] [Updated] (SPARK-22500) 64KB JVM bytecode limit problem with cast

2017-11-13 Thread Xiao Li (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22500: Issue Type: Sub-task (was: Bug) Parent: SPARK-22510 > 64KB JVM bytecode limit problem with cast >

[jira] [Updated] (SPARK-22508) 64KB JVM bytecode limit problem with GenerateUnsafeRowJoiner.create()

2017-11-13 Thread Xiao Li (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22508: Issue Type: Sub-task (was: Bug) Parent: SPARK-22510 > 64KB JVM bytecode limit problem with Generat

[jira] [Updated] (SPARK-22501) 64KB JVM bytecode limit problem with in

2017-11-13 Thread Xiao Li (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22501: Issue Type: Sub-task (was: Bug) Parent: SPARK-22510 > 64KB JVM bytecode limit problem with in > --

[jira] [Created] (SPARK-22510) 64KB JVM bytecode limit

2017-11-13 Thread Xiao Li (JIRA)

Xiao Li created SPARK-22510: --- Summary: 64KB JVM bytecode limit Key: SPARK-22510 URL: https://issues.apache.org/jira/browse/SPARK-22510 Project: Spark Issue Type: Umbrella Components: SQL

[jira] [Created] (SPARK-22509) Spark Streaming: jobs with 5 minute batch length all start at the same time, permit jobs to be offset

2017-11-13 Thread Wallace Baggaley (JIRA)

Wallace Baggaley created SPARK-22509: Summary: Spark Streaming: jobs with 5 minute batch length all start at the same time, permit jobs to be offset Key: SPARK-22509 URL: https://issues.apache.org/jira/browse/

[jira] [Assigned] (SPARK-22495) Fix setup of SPARK_HOME variable on Windows

2017-11-13 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22495: Assignee: Apache Spark > Fix setup of SPARK_HOME variable on Windows > ---

[jira] [Assigned] (SPARK-22495) Fix setup of SPARK_HOME variable on Windows

2017-11-13 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22495: Assignee: (was: Apache Spark) > Fix setup of SPARK_HOME variable on Windows >

[jira] [Commented] (SPARK-22495) Fix setup of SPARK_HOME variable on Windows

2017-11-13 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250137#comment-16250137 ] Apache Spark commented on SPARK-22495: -- User 'jsnowacki' has created a pull request

[jira] [Comment Edited] (SPARK-22505) toDF() / createDataFrame() type inference doesn't work as expected

2017-11-13 Thread Ruslan Dautkhanov (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250054#comment-16250054 ] Ruslan Dautkhanov edited comment on SPARK-22505 at 11/13/17 7:47 PM: --

[jira] [Commented] (SPARK-21994) Spark 2.2 can not read Parquet table created by itself

2017-11-13 Thread Srinivasa Reddy Vundela (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-21994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250066#comment-16250066 ] Srinivasa Reddy Vundela commented on SPARK-21994: - [~srowen] That's right,

[jira] [Commented] (SPARK-22505) toDF() / createDataFrame() type inference doesn't work as expected

2017-11-13 Thread Ruslan Dautkhanov (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250054#comment-16250054 ] Ruslan Dautkhanov commented on SPARK-22505: --- Looks like we already discussed ve

[jira] [Commented] (SPARK-21994) Spark 2.2 can not read Parquet table created by itself

2017-11-13 Thread Sean Owen (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-21994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250039#comment-16250039 ] Sean Owen commented on SPARK-21994: --- (Don't think that would be meaningful outside Clou

[jira] [Commented] (SPARK-21994) Spark 2.2 can not read Parquet table created by itself

2017-11-13 Thread Srinivasa Reddy Vundela (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-21994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250032#comment-16250032 ] Srinivasa Reddy Vundela commented on SPARK-21994: - commit d5e3ba3e970c724

[jira] [Commented] (SPARK-20791) Use Apache Arrow to Improve Spark createDataFrame from Pandas.DataFrame

2017-11-13 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-20791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250029#comment-16250029 ] Apache Spark commented on SPARK-20791: -- User 'BryanCutler' has created a pull reques

[jira] [Commented] (SPARK-22490) PySpark doc has misleading string for SparkSession.builder

2017-11-13 Thread Dongjoon Hyun (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250025#comment-16250025 ] Dongjoon Hyun commented on SPARK-22490: --- Hi, [~smilegator]. Could you review the PR

[jira] [Commented] (SPARK-22508) 64KB JVM bytecode limit problem with GenerateUnsafeRowJoiner.create()

2017-11-13 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250001#comment-16250001 ] Apache Spark commented on SPARK-22508: -- User 'kiszk' has created a pull request for

[jira] [Assigned] (SPARK-22508) 64KB JVM bytecode limit problem with GenerateUnsafeRowJoiner.create()

2017-11-13 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22508: Assignee: (was: Apache Spark) > 64KB JVM bytecode limit problem with GenerateUnsafeRow

[jira] [Assigned] (SPARK-22508) 64KB JVM bytecode limit problem with GenerateUnsafeRowJoiner.create()

2017-11-13 Thread Apache Spark (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22508: Assignee: Apache Spark > 64KB JVM bytecode limit problem with GenerateUnsafeRowJoiner.crea

[jira] [Created] (SPARK-22508) 64KB JVM bytecode limit problem with GenerateUnsafeRowJoiner.create()

2017-11-13 Thread Kazuaki Ishizaki (JIRA)

Kazuaki Ishizaki created SPARK-22508: Summary: 64KB JVM bytecode limit problem with GenerateUnsafeRowJoiner.create() Key: SPARK-22508 URL: https://issues.apache.org/jira/browse/SPARK-22508 Project

[jira] [Comment Edited] (SPARK-9104) expose network layer memory usage

2017-11-13 Thread Srinivasa Reddy Vundela (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-9104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249904#comment-16249904 ] Srinivasa Reddy Vundela edited comment on SPARK-9104 at 11/13/17 6:39 PM: --

[jira] [Commented] (SPARK-9104) expose network layer memory usage

2017-11-13 Thread Srinivasa Reddy Vundela (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-9104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249904#comment-16249904 ] Srinivasa Reddy Vundela commented on SPARK-9104: Hi [~jerryshao] Thanks fo

[jira] [Commented] (SPARK-22507) Cannot register inner class with Kryo using SparkConf

2017-11-13 Thread Sean Owen (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249844#comment-16249844 ] Sean Owen commented on SPARK-22507: --- What if you make it a non-inner class? although th

[jira] [Commented] (SPARK-22507) Cannot register inner class with Kryo using SparkConf

2017-11-13 Thread Yu LIU (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249835#comment-16249835 ] Yu LIU commented on SPARK-22507: I did something like this: {code:java} package local_pr

[jira] [Commented] (SPARK-22507) Cannot register inner class with Kryo using SparkConf

2017-11-13 Thread Sean Owen (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249812#comment-16249812 ] Sean Owen commented on SPARK-22507: --- Are you sure the class is on the classpath? how do

[jira] [Created] (SPARK-22507) Cannot register inner class with Kryo using SparkConf

2017-11-13 Thread Yu LIU (JIRA)

Yu LIU created SPARK-22507: -- Summary: Cannot register inner class with Kryo using SparkConf Key: SPARK-22507 URL: https://issues.apache.org/jira/browse/SPARK-22507 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-22431) Creating Permanent view with illegal type

2017-11-13 Thread Herman van Hovell (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249756#comment-16249756 ] Herman van Hovell commented on SPARK-22431: --- I look forward to the PR :) > Cre

[jira] [Commented] (SPARK-22505) toDF() / createDataFrame() type inference doesn't work as expected

2017-11-13 Thread Ruslan Dautkhanov (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249702#comment-16249702 ] Ruslan Dautkhanov commented on SPARK-22505: --- In a way, you can think of this as

[jira] [Commented] (SPARK-22505) toDF() / createDataFrame() type inference doesn't work as expected

2017-11-13 Thread Ruslan Dautkhanov (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249689#comment-16249689 ] Ruslan Dautkhanov commented on SPARK-22505: --- [~hyukjin.kwon] Yep, '1' is of typ

[jira] [Commented] (SPARK-20387) Permissive mode is not replacing corrupt record with null

2017-11-13 Thread Hyukjin Kwon (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-20387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249563#comment-16249563 ] Hyukjin Kwon commented on SPARK-20387: -- BTW, the example and input indeed has a prob

[jira] [Comment Edited] (SPARK-20387) Permissive mode is not replacing corrupt record with null

2017-11-13 Thread Hyukjin Kwon (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-20387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249553#comment-16249553 ] Hyukjin Kwon edited comment on SPARK-20387 at 11/13/17 1:38 PM: ---

[jira] [Resolved] (SPARK-20387) Permissive mode is not replacing corrupt record with null

2017-11-13 Thread Hyukjin Kwon (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-20387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20387. -- Resolution: Duplicate > Permissive mode is not replacing corrupt record with null > ---

[jira] [Commented] (SPARK-20387) Permissive mode is not replacing corrupt record with null

2017-11-13 Thread Hyukjin Kwon (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-20387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249553#comment-16249553 ] Hyukjin Kwon commented on SPARK-20387: -- Yup, all sound correct ^ and sounds it is re

[jira] [Commented] (SPARK-22431) Creating Permanent view with illegal type

2017-11-13 Thread Sunitha Kambhampati (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249517#comment-16249517 ] Sunitha Kambhampati commented on SPARK-22431: - Thanks for the response. Opt

[jira] [Commented] (SPARK-22431) Creating Permanent view with illegal type

2017-11-13 Thread Herman van Hovell (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249511#comment-16249511 ] Herman van Hovell commented on SPARK-22431: --- [~ksunitha] Thanks for the thoroug

[jira] [Commented] (SPARK-20387) Permissive mode is not replacing corrupt record with null

2017-11-13 Thread Sean Owen (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-20387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249508#comment-16249508 ] Sean Owen commented on SPARK-20387: --- That's not the same example. I believe the underly

[jira] [Commented] (SPARK-22431) Creating Permanent view with illegal type

2017-11-13 Thread Sunitha Kambhampati (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249505#comment-16249505 ] Sunitha Kambhampati commented on SPARK-22431: - *Observations:* I ran a few te

[jira] [Commented] (SPARK-22504) Optimization in overwrite table in case of failure

2017-11-13 Thread Sean Owen (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249502#comment-16249502 ] Sean Owen commented on SPARK-22504: --- Removing the original table isn't a problem; the u

[jira] [Closed] (SPARK-22439) Not able to get numeric columns for the file having decimal values

2017-11-13 Thread Sean Owen (JIRA)

[ https://issues.apache.org/jira/browse/SPARK-22439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen closed SPARK-22439. - > Not able to get numeric columns for the file having decimal values > --


