[jira] [Created] (SPARK-15937) Spark declares a succeeding job to be failed in yarn-cluster mode if the job takes very small time (~ < 10 seconds) to finish

2016-06-13 Thread Subroto Sanyal (JIRA)
Subroto Sanyal created SPARK-15937: -- Summary: Spark declares a succeeding job to be failed in yarn-cluster mode if the job takes very small time (~ < 10 seconds) to finish Key: SPARK-15937 URL:

[jira] [Commented] (SPARK-15908) Add varargs-type dropDuplicates() function in SparkR

2016-06-13 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328949#comment-15328949 ] Dongjoon Hyun commented on SPARK-15908: --- Hi, [~sunrui]. I did SPARK-15807. If you didn't start yet,

[jira] [Commented] (SPARK-14351) Optimize ImpurityAggregator for decision trees

2016-06-13 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328941#comment-15328941 ] Manoj Kumar commented on SPARK-14351: - I can try working on this. > Optimize ImpurityAggregator for

[jira] [Resolved] (SPARK-15932) document the contract of encoder serializer expressions

2016-06-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-15932. - Resolution: Fixed Fix Version/s: 2.0.0 > document the contract of encoder serializer

[jira] [Comment Edited] (SPARK-3155) Support DecisionTree pruning

2016-06-13 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328939#comment-15328939 ] Manoj Kumar edited comment on SPARK-3155 at 6/14/16 5:01 AM: - 1. I agree that

[jira] [Commented] (SPARK-3155) Support DecisionTree pruning

2016-06-13 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328939#comment-15328939 ] Manoj Kumar commented on SPARK-3155: 1. I agree that the use cases are limited to single trees. You

[jira] [Commented] (SPARK-12177) Update KafkaDStreams to new Kafka 0.10 Consumer API

2016-06-13 Thread Sean McKibben (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328894#comment-15328894 ] Sean McKibben commented on SPARK-12177: --- Unfortunately I can't contribute what I would like to, but

[jira] [Updated] (SPARK-14351) Optimize ImpurityAggregator for decision trees

2016-06-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14351: -- Priority: Major (was: Minor) > Optimize ImpurityAggregator for decision trees >

[jira] [Commented] (SPARK-10835) Change Output of NGram to Array(String, True)

2016-06-13 Thread Hansa Nanayakkara (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328883#comment-15328883 ] Hansa Nanayakkara commented on SPARK-10835: --- Although problem is solved for the Tokenizer it

[jira] [Commented] (SPARK-3155) Support DecisionTree pruning

2016-06-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328880#comment-15328880 ] Joseph K. Bradley commented on SPARK-3155: -- A few thoughts: (1) I'm less sure about the priority

[jira] [Updated] (SPARK-15364) Implement Python picklers for ml.Vector and ml.Matrix under spark.ml.python

2016-06-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15364: -- Assignee: Liang-Chi Hsieh > Implement Python picklers for ml.Vector and ml.Matrix under

[jira] [Updated] (SPARK-15364) Implement Python picklers for ml.Vector and ml.Matrix under spark.ml.python

2016-06-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15364: -- Target Version/s: 2.0.0 (was: 2.1.0) > Implement Python picklers for ml.Vector and ml.Matrix

[jira] [Resolved] (SPARK-15364) Implement Python picklers for ml.Vector and ml.Matrix under spark.ml.python

2016-06-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-15364. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13219

[jira] [Commented] (SPARK-15757) Error occurs when using Spark sql "select" statement on orc file after hive sql "insert overwrite tb1 select * from sourcTb" has been executed on this orc file

2016-06-13 Thread marymwu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328860#comment-15328860 ] marymwu commented on SPARK-15757: - Any update? > Error occurs when using Spark sql "select" statement on

[jira] [Commented] (SPARK-15918) unionAll returns wrong result when two dataframes has schema in different order

2016-06-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328851#comment-15328851 ] Hyukjin Kwon commented on SPARK-15918: -- Actually, I met this case before and was thinking it might

[jira] [Comment Edited] (SPARK-15930) Add Row count property to FPGrowth model

2016-06-13 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328802#comment-15328802 ] yuhao yang edited comment on SPARK-15930 at 6/14/16 2:46 AM: - That looks

[jira] [Updated] (SPARK-15808) Wrong Results or Strange Errors In Append-mode DataFrame Writing Due to Mismatched File Formats

2016-06-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-15808: - Assignee: Xiao Li > Wrong Results or Strange Errors In Append-mode DataFrame Writing Due to >

[jira] [Resolved] (SPARK-15808) Wrong Results or Strange Errors In Append-mode DataFrame Writing Due to Mismatched File Formats

2016-06-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-15808. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13546

[jira] [Commented] (SPARK-15690) Fast single-node (single-process) in-memory shuffle

2016-06-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328815#comment-15328815 ] Reynold Xin commented on SPARK-15690: - Definitely no serialization/deserialization. > Fast

[jira] [Commented] (SPARK-15934) Return binary mode in ThriftServer

2016-06-13 Thread Egor Pahomov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328814#comment-15328814 ] Egor Pahomov commented on SPARK-15934: -- Sure, let me create a pull request tomorrow. I would test,

[jira] [Commented] (SPARK-15930) Add Row count property to FPGrowth model

2016-06-13 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328802#comment-15328802 ] yuhao yang commented on SPARK-15930: That looks reasonable. +1. [~John Aherne] I would wait for one

[jira] [Comment Edited] (SPARK-15930) Add Row count property to FPGrowth model

2016-06-13 Thread John Aherne (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328786#comment-15328786 ] John Aherne edited comment on SPARK-15930 at 6/14/16 2:04 AM: -- The row count

[jira] [Commented] (SPARK-15930) Add Row count property to FPGrowth model

2016-06-13 Thread John Aherne (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328786#comment-15328786 ] John Aherne commented on SPARK-15930: - In your example, the row count would be 4. > Add Row count

[jira] [Comment Edited] (SPARK-15930) Add Row count property to FPGrowth model

2016-06-13 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328759#comment-15328759 ] yuhao yang edited comment on SPARK-15930 at 6/14/16 1:30 AM: - ||

[jira] [Commented] (SPARK-15930) Add Row count property to FPGrowth model

2016-06-13 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328759#comment-15328759 ] yuhao yang commented on SPARK-15930: || items|| freq|| |[27]|5| |

[jira] [Commented] (SPARK-15934) Return binary mode in ThriftServer

2016-06-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328756#comment-15328756 ] Reynold Xin commented on SPARK-15934: - [~epahomov] do you want to create a pr to revert the change?

[jira] [Assigned] (SPARK-15935) Enable test for sql/streaming.py and fix these tests

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15935: Assignee: Apache Spark (was: Shixiong Zhu) > Enable test for sql/streaming.py and fix

[jira] [Assigned] (SPARK-15935) Enable test for sql/streaming.py and fix these tests

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15935: Assignee: Shixiong Zhu (was: Apache Spark) > Enable test for sql/streaming.py and fix

[jira] [Commented] (SPARK-15935) Enable test for sql/streaming.py and fix these tests

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328736#comment-15328736 ] Apache Spark commented on SPARK-15935: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Created] (SPARK-15936) CLONE - Add class weights to Random Forest

2016-06-13 Thread Yuewei Na (JIRA)
Yuewei Na created SPARK-15936: - Summary: CLONE - Add class weights to Random Forest Key: SPARK-15936 URL: https://issues.apache.org/jira/browse/SPARK-15936 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-15868) Executors table in Executors tab should sort Executor IDs in numerical order (not alphabetical order)

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328723#comment-15328723 ] Apache Spark commented on SPARK-15868: -- User 'ajbozarth' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15868) Executors table in Executors tab should sort Executor IDs in numerical order (not alphabetical order)

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15868: Assignee: (was: Apache Spark) > Executors table in Executors tab should sort Executor

[jira] [Assigned] (SPARK-15868) Executors table in Executors tab should sort Executor IDs in numerical order (not alphabetical order)

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15868: Assignee: Apache Spark > Executors table in Executors tab should sort Executor IDs in

[jira] [Updated] (SPARK-15910) Schema is not checked when converting DataFrame to Dataset using Kryo encoder

2016-06-13 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-15910: Assignee: Sean Owen > Schema is not checked when converting DataFrame to Dataset using Kryo

[jira] [Updated] (SPARK-15910) Schema is not checked when converting DataFrame to Dataset using Kryo encoder

2016-06-13 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-15910: Assignee: Sean Zhong (was: Sean Owen) > Schema is not checked when converting DataFrame to

[jira] [Resolved] (SPARK-15910) Schema is not checked when converting DataFrame to Dataset using Kryo encoder

2016-06-13 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-15910. - Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13632

[jira] [Created] (SPARK-15935) Enable test for sql/streaming.py and fix these tests

2016-06-13 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-15935: Summary: Enable test for sql/streaming.py and fix these tests Key: SPARK-15935 URL: https://issues.apache.org/jira/browse/SPARK-15935 Project: Spark Issue

[jira] [Commented] (SPARK-15690) Fast single-node (single-process) in-memory shuffle

2016-06-13 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328717#comment-15328717 ] Shivaram Venkataraman commented on SPARK-15690: --- Yeah I dont think you'll see much

[jira] [Commented] (SPARK-12177) Update KafkaDStreams to new Kafka 0.10 Consumer API

2016-06-13 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328707#comment-15328707 ] Cody Koeninger commented on SPARK-12177: I don't think waiting for 0.11 makes sense. > Update

[jira] [Created] (SPARK-15934) Return binary mode in ThriftServer

2016-06-13 Thread Egor Pahomov (JIRA)
Egor Pahomov created SPARK-15934: Summary: Return binary mode in ThriftServer Key: SPARK-15934 URL: https://issues.apache.org/jira/browse/SPARK-15934 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-12177) Update KafkaDStreams to new Kafka 0.10 Consumer API

2016-06-13 Thread Mark Grover (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328699#comment-15328699 ] Mark Grover commented on SPARK-12177: - Hi Ismael and Cody, My personal opinion was to hold off

[jira] [Resolved] (SPARK-15929) DataFrameSuite path globbing error message tests are not fully portable

2016-06-13 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-15929. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13649

[jira] [Commented] (SPARK-15905) Driver hung while writing to console progress bar

2016-06-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328648#comment-15328648 ] Shixiong Zhu commented on SPARK-15905: -- The last time I encounter FileOutputStream.writeBytes hangs

[jira] [Commented] (SPARK-15905) Driver hung while writing to console progress bar

2016-06-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328640#comment-15328640 ] Shixiong Zhu commented on SPARK-15905: -- By the way, how did you use Spark? Did you just run it or

[jira] [Commented] (SPARK-15905) Driver hung while writing to console progress bar

2016-06-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328638#comment-15328638 ] Shixiong Zhu commented on SPARK-15905: -- Oh, the thread state is `RUNNABLE`. So not a deadlock. Could

[jira] [Comment Edited] (SPARK-15905) Driver hung while writing to console progress bar

2016-06-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328627#comment-15328627 ] Shixiong Zhu edited comment on SPARK-15905 at 6/13/16 11:42 PM: Do you

[jira] [Commented] (SPARK-15905) Driver hung while writing to console progress bar

2016-06-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328627#comment-15328627 ] Shixiong Zhu commented on SPARK-15905: -- Do you have the whole jstack output? I guess some places

[jira] [Assigned] (SPARK-15933) Refactor reader-writer interface for streaming DFs to use DataStreamReader/Writer

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15933: Assignee: Tathagata Das (was: Apache Spark) > Refactor reader-writer interface for

[jira] [Assigned] (SPARK-15933) Refactor reader-writer interface for streaming DFs to use DataStreamReader/Writer

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15933: Assignee: Apache Spark (was: Tathagata Das) > Refactor reader-writer interface for

[jira] [Commented] (SPARK-15933) Refactor reader-writer interface for streaming DFs to use DataStreamReader/Writer

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328609#comment-15328609 ] Apache Spark commented on SPARK-15933: -- User 'tdas' has created a pull request for this issue:

[jira] [Created] (SPARK-15933) Refactor reader-writer interface for streaming DFs to use DataStreamReader/Writer

2016-06-13 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-15933: - Summary: Refactor reader-writer interface for streaming DFs to use DataStreamReader/Writer Key: SPARK-15933 URL: https://issues.apache.org/jira/browse/SPARK-15933

[jira] [Commented] (SPARK-15905) Driver hung while writing to console progress bar

2016-06-13 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328608#comment-15328608 ] Tejas Patil commented on SPARK-15905: - Another instance but this time not via console progress bar.

[jira] [Commented] (SPARK-15905) Driver hung while writing to console progress bar

2016-06-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328597#comment-15328597 ] Shixiong Zhu commented on SPARK-15905: -- [~tejasp] Probably some deadlock in Spark. It would be great

[jira] [Commented] (SPARK-3155) Support DecisionTree pruning

2016-06-13 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328592#comment-15328592 ] Manoj Kumar commented on SPARK-3155: I would like to add support for pruning DecisionTrees as part of

[jira] [Issue Comment Deleted] (SPARK-3155) Support DecisionTree pruning

2016-06-13 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-3155: --- Comment: was deleted (was: I would like to add support for pruning DecisionTrees as part of my

[jira] [Commented] (SPARK-15905) Driver hung while writing to console progress bar

2016-06-13 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328591#comment-15328591 ] Tejas Patil commented on SPARK-15905: - [~zsxwing] : This does not repro consistently but happens one

[jira] [Assigned] (SPARK-15932) document the contract of encoder serializer expressions

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15932: Assignee: Wenchen Fan (was: Apache Spark) > document the contract of encoder serializer

[jira] [Assigned] (SPARK-15932) document the contract of encoder serializer expressions

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15932: Assignee: Apache Spark (was: Wenchen Fan) > document the contract of encoder serializer

[jira] [Commented] (SPARK-15932) document the contract of encoder serializer expressions

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328586#comment-15328586 ] Apache Spark commented on SPARK-15932: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Updated] (SPARK-15914) Add deprecated method back to SQLContext for source code backward compatiblity

2016-06-13 Thread Sean Zhong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Zhong updated SPARK-15914: --- Description: We removed some deprecated method in SQLContext in branch Spark 2.0. For example:

[jira] [Created] (SPARK-15932) document the contract of encoder serializer expressions

2016-06-13 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-15932: --- Summary: document the contract of encoder serializer expressions Key: SPARK-15932 URL: https://issues.apache.org/jira/browse/SPARK-15932 Project: Spark Issue

[jira] [Updated] (SPARK-15914) Add deprecated method back to SQLContext for source code backward compatiblity

2016-06-13 Thread Sean Zhong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Zhong updated SPARK-15914: --- Summary: Add deprecated method back to SQLContext for source code backward compatiblity (was: Add

[jira] [Commented] (SPARK-3155) Support DecisionTree pruning

2016-06-13 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328487#comment-15328487 ] Manoj Kumar commented on SPARK-3155: I would like to add support for pruning DecisionTrees as part of

[jira] [Resolved] (SPARK-15925) Replaces registerTempTable with createOrReplaceTempView in SparkR

2016-06-13 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-15925. --- Resolution: Fixed Issue resolved by pull request 13644

[jira] [Commented] (SPARK-15176) Job Scheduling Within Application Suffers from Priority Inversion

2016-06-13 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328466#comment-15328466 ] Kay Ousterhout commented on SPARK-15176: I thought about this a little more and I think I'm in

[jira] [Commented] (SPARK-15931) SparkR tests failing on R 3.3.0

2016-06-13 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328432#comment-15328432 ] Shivaram Venkataraman commented on SPARK-15931: --- cc [~felixcheung] We should print out what

[jira] [Commented] (SPARK-15776) Type coercion incorrect

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328420#comment-15328420 ] Apache Spark commented on SPARK-15776: -- User 'clockfly' has created a pull request for this issue:

[jira] [Commented] (SPARK-15861) pyspark mapPartitions with none generator functions / functors

2016-06-13 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328417#comment-15328417 ] Bryan Cutler commented on SPARK-15861: -- {{mapPartitions}} will expect the function to return a

[jira] [Commented] (SPARK-15918) unionAll returns wrong result when two dataframes has schema in different order

2016-06-13 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328405#comment-15328405 ] Dongjoon Hyun commented on SPARK-15918: --- Hi, [~Prabhu Joseph]. Instead of changing one of the

[jira] [Commented] (SPARK-15931) SparkR tests failing on R 3.3.0

2016-06-13 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328403#comment-15328403 ] Cheng Lian commented on SPARK-15931: cc [~mengxr] > SparkR tests failing on R 3.3.0 >

[jira] [Created] (SPARK-15931) SparkR tests failing on R 3.3.0

2016-06-13 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-15931: -- Summary: SparkR tests failing on R 3.3.0 Key: SPARK-15931 URL: https://issues.apache.org/jira/browse/SPARK-15931 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-15690) Fast single-node (single-process) in-memory shuffle

2016-06-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328390#comment-15328390 ] Reynold Xin commented on SPARK-15690: - Yes there is definitely no reason to go through network for a

[jira] [Resolved] (SPARK-15887) Bring back the hive-site.xml support for Spark 2.0

2016-06-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-15887. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13611

[jira] [Commented] (SPARK-15753) Move some Analyzer stuff to Analyzer from DataFrameWriter

2016-06-13 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328381#comment-15328381 ] Wenchen Fan commented on SPARK-15753: - this is reverted, see discussion

[jira] [Commented] (SPARK-15861) pyspark mapPartitions with none generator functions / functors

2016-06-13 Thread Greg Bowyer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328380#comment-15328380 ] Greg Bowyer commented on SPARK-15861: - ... Hum from my end-users testing it does not seem to fail if

[jira] [Commented] (SPARK-15690) Fast single-node (single-process) in-memory shuffle

2016-06-13 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328378#comment-15328378 ] Saisai Shao commented on SPARK-15690: - I see. Since everything is in a single process, looks like

[jira] [Assigned] (SPARK-9623) RandomForestRegressor: provide variance of predictions

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9623: --- Assignee: Apache Spark > RandomForestRegressor: provide variance of predictions >

[jira] [Created] (SPARK-15930) Add Row count property to FPGrowth model

2016-06-13 Thread John Aherne (JIRA)
John Aherne created SPARK-15930: --- Summary: Add Row count property to FPGrowth model Key: SPARK-15930 URL: https://issues.apache.org/jira/browse/SPARK-15930 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-9623) RandomForestRegressor: provide variance of predictions

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9623: --- Assignee: (was: Apache Spark) > RandomForestRegressor: provide variance of predictions >

[jira] [Commented] (SPARK-9623) RandomForestRegressor: provide variance of predictions

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328343#comment-15328343 ] Apache Spark commented on SPARK-9623: - User 'MechCoder' has created a pull request for this issue:

[jira] [Commented] (SPARK-15929) DataFrameSuite path globbing error message tests are not fully portable

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15929?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328329#comment-15328329 ] Apache Spark commented on SPARK-15929: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Created] (SPARK-15929) DataFrameSuite path globbing error message tests are not fully portable

2016-06-13 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-15929: -- Summary: DataFrameSuite path globbing error message tests are not fully portable Key: SPARK-15929 URL: https://issues.apache.org/jira/browse/SPARK-15929 Project: Spark

[jira] [Deleted] (SPARK-15928) Eliminate redundant code in DAGScheduler's getParentStages and getAncestorShuffleDependencies methods.

2016-06-13 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout deleted SPARK-15928: --- > Eliminate redundant code in DAGScheduler's getParentStages and >

[jira] [Commented] (SPARK-15905) Driver hung while writing to console progress bar

2016-06-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328266#comment-15328266 ] Shixiong Zhu commented on SPARK-15905: -- Do you have a reproducer? What does your code look like? >

[jira] [Commented] (SPARK-15690) Fast single-node (single-process) in-memory shuffle

2016-06-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328264#comment-15328264 ] Reynold Xin commented on SPARK-15690: - Yup. Eventually we can also generalize this to multiple

[jira] [Comment Edited] (SPARK-15861) pyspark mapPartitions with none generator functions / functors

2016-06-13 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328245#comment-15328245 ] Bryan Cutler edited comment on SPARK-15861 at 6/13/16 9:05 PM: ---

[jira] [Closed] (SPARK-5374) abstract RDD's DAG graph iteration in DAGScheduler

2016-06-13 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout closed SPARK-5374. - Resolution: Duplicate Closing this because it duplicates the more narrowly-scoped JIRAs linked

[jira] [Commented] (SPARK-15690) Fast single-node (single-process) in-memory shuffle

2016-06-13 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328255#comment-15328255 ] Saisai Shao commented on SPARK-15690: - Hi [~rxin], what's the meaning of "single-process", is that

[jira] [Commented] (SPARK-15861) pyspark mapPartitions with none generator functions / functors

2016-06-13 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328245#comment-15328245 ] Bryan Cutler commented on SPARK-15861: -- [~gbow...@fastmail.co.uk] {{mapPartitions}} expects a

[jira] [Resolved] (SPARK-15889) Add a unique id to ContinuousQuery

2016-06-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-15889. -- Resolution: Fixed > Add a unique id to ContinuousQuery > -- >

[jira] [Updated] (SPARK-15889) Add a unique id to ContinuousQuery

2016-06-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-15889: - Fix Version/s: 2.0.0 > Add a unique id to ContinuousQuery > -- >

[jira] [Updated] (SPARK-15530) Partitioning discovery logic HadoopFsRelation should use a higher setting of parallelism

2016-06-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-15530: - Assignee: Takeshi Yamamuro > Partitioning discovery logic HadoopFsRelation should use a higher setting

[jira] [Resolved] (SPARK-15530) Partitioning discovery logic HadoopFsRelation should use a higher setting of parallelism

2016-06-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-15530. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13444

[jira] [Commented] (SPARK-15924) SparkR parser bug with backslash in comments

2016-06-13 Thread Xuan Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328205#comment-15328205 ] Xuan Wang commented on SPARK-15924: --- I then realized that this is not a problem with SparkR, so I

[jira] [Commented] (SPARK-15822) segmentation violation in o.a.s.unsafe.types.UTF8String

2016-06-13 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328200#comment-15328200 ] Herman van Hovell commented on SPARK-15822: --- [~robbinspg] Could you try this without caching?

[jira] [Resolved] (SPARK-15676) Disallow Column Names as Partition Columns For Hive Tables

2016-06-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-15676. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13415

[jira] [Updated] (SPARK-15676) Disallow Column Names as Partition Columns For Hive Tables

2016-06-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-15676: - Assignee: Xiao Li > Disallow Column Names as Partition Columns For Hive Tables >

[jira] [Commented] (SPARK-15923) Spark Application rest api returns "no such app: "

2016-06-13 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328181#comment-15328181 ] Thomas Graves commented on SPARK-15923: --- can you give some more details? Did you have your

[jira] [Commented] (SPARK-15784) Add Power Iteration Clustering to spark.ml

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328061#comment-15328061 ] Apache Spark commented on SPARK-15784: -- User 'wangmiao1981' has created a pull request for this

[jira] [Assigned] (SPARK-15784) Add Power Iteration Clustering to spark.ml

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15784: Assignee: (was: Apache Spark) > Add Power Iteration Clustering to spark.ml >

  1   2   3   >