[jira] [Commented] (SPARK-14077) Support weighted instances in naive Bayes

2016-05-01 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15265652#comment-15265652 ] zhengruifeng commented on SPARK-14077: -- OK, I will have a try > Support weighted in

[jira] [Assigned] (SPARK-14781) Support subquery in nested predicates

2016-05-01 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-14781: -- Assignee: Davies Liu > Support subquery in nested predicates > ---

[jira] [Commented] (SPARK-15043) Fix and re-enable flaky test: mllib.stat.JavaStatisticsSuite.testCorr

2016-05-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15265655#comment-15265655 ] Sean Owen commented on SPARK-15043: --- I'll take a look and try to fix it. I didn't see t

[jira] [Created] (SPARK-15044) spark-sql will throw "input path does not exist" exception if it handles a partition which exists in hive table, but the path is removed manually

2016-05-01 Thread huangyu (JIRA)
huangyu created SPARK-15044: --- Summary: spark-sql will throw "input path does not exist" exception if it handles a partition which exists in hive table, but the path is removed manually Key: SPARK-15044 URL: https://iss

[jira] [Updated] (SPARK-15044) spark-sql will throw "input path does not exist" exception if it handles a partition which exists in hive table, but the path is removed manually

2016-05-01 Thread huangyu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] huangyu updated SPARK-15044: Description: spark-sql will throw "input path not exist" exception if it handles a partition which exists

[jira] [Assigned] (SPARK-14781) Support subquery in nested predicates

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14781: Assignee: Davies Liu (was: Apache Spark) > Support subquery in nested predicates > --

[jira] [Commented] (SPARK-14781) Support subquery in nested predicates

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15265658#comment-15265658 ] Apache Spark commented on SPARK-14781: -- User 'davies' has created a pull request for

[jira] [Assigned] (SPARK-14781) Support subquery in nested predicates

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14781: Assignee: Apache Spark (was: Davies Liu) > Support subquery in nested predicates > --

[jira] [Updated] (SPARK-15044) spark-sql will throw "input path does not exist" exception if it handles a partition which exists in hive table, but the path is removed manually

2016-05-01 Thread huangyu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] huangyu updated SPARK-15044: Description: spark-sql will throw "input path not exist" exception if it handles a partition which exists

[jira] [Assigned] (SPARK-15043) Fix and re-enable flaky test: mllib.stat.JavaStatisticsSuite.testCorr

2016-05-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-15043: - Assignee: Sean Owen > Fix and re-enable flaky test: mllib.stat.JavaStatisticsSuite.testCorr > --

[jira] [Assigned] (SPARK-15043) Fix and re-enable flaky test: mllib.stat.JavaStatisticsSuite.testCorr

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15043: Assignee: Sean Owen (was: Apache Spark) > Fix and re-enable flaky test: mllib.stat.JavaSt

[jira] [Commented] (SPARK-15043) Fix and re-enable flaky test: mllib.stat.JavaStatisticsSuite.testCorr

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15265681#comment-15265681 ] Apache Spark commented on SPARK-15043: -- User 'srowen' has created a pull request for

[jira] [Assigned] (SPARK-15043) Fix and re-enable flaky test: mllib.stat.JavaStatisticsSuite.testCorr

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15043: Assignee: Apache Spark (was: Sean Owen) > Fix and re-enable flaky test: mllib.stat.JavaSt

[jira] [Created] (SPARK-15045) Remove dead code in TaskMemoryManager.cleanUpAllAllocatedMemory for pageTable

2016-05-01 Thread Jacek Laskowski (JIRA)
Jacek Laskowski created SPARK-15045: --- Summary: Remove dead code in TaskMemoryManager.cleanUpAllAllocatedMemory for pageTable Key: SPARK-15045 URL: https://issues.apache.org/jira/browse/SPARK-15045 P

[jira] [Commented] (SPARK-14785) Support correlated scalar subquery

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15265731#comment-15265731 ] Apache Spark commented on SPARK-14785: -- User 'hvanhovell' has created a pull request

[jira] [Commented] (SPARK-1239) Improve fetching of map output statuses

2016-05-01 Thread Chris Bannister (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15265756#comment-15265756 ] Chris Bannister commented on SPARK-1239: Im seeing frequent job failures where the

[jira] [Issue Comment Deleted] (SPARK-1239) Improve fetching of map output statuses

2016-05-01 Thread Chris Bannister (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Bannister updated SPARK-1239: --- Comment: was deleted (was: Im seeing frequent job failures where the executors are unable to

[jira] [Resolved] (SPARK-14505) Creating two SparkContext Object in the same jvm, the first one will can not run any tasks!

2016-05-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-14505. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12273 [https://github.co

[jira] [Updated] (SPARK-14505) Creating two SparkContext Object in the same jvm, the first one will can not run any tasks!

2016-05-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-14505: -- Assignee: The sea Issue Type: Bug (was: Improvement) > Creating two SparkContext Object in the s

[jira] [Commented] (SPARK-14993) Inconsistent behavior of partitioning discovery

2016-05-01 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15265794#comment-15265794 ] Xiao Li commented on SPARK-14993: - Doing it now. Thanks! > Inconsistent behavior of part

[jira] [Updated] (SPARK-13289) Word2Vec generate infinite distances when numIterations>5

2016-05-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-13289: -- Assignee: Junyang Shen (was: Nick Pentreath) > Word2Vec generate infinite distances when numIterations

[jira] [Assigned] (SPARK-14985) Update LinearRegression, LogisticRegression summary internals to handle model copy

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14985: Assignee: (was: Apache Spark) > Update LinearRegression, LogisticRegression summary in

[jira] [Commented] (SPARK-14985) Update LinearRegression, LogisticRegression summary internals to handle model copy

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15265803#comment-15265803 ] Apache Spark commented on SPARK-14985: -- User 'BenFradet' has created a pull request

[jira] [Assigned] (SPARK-14985) Update LinearRegression, LogisticRegression summary internals to handle model copy

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14985: Assignee: Apache Spark > Update LinearRegression, LogisticRegression summary internals to

[jira] [Commented] (SPARK-928) Add support for Unsafe-based serializer in Kryo 2.22

2016-05-01 Thread Sandeep Singh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15265805#comment-15265805 ] Sandeep Singh commented on SPARK-928: - [~joshrosen] I've started working on it. I trie

[jira] [Commented] (SPARK-7898) pyspark merges stderr into stdout

2016-05-01 Thread Sam Steingold (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15265863#comment-15265863 ] Sam Steingold commented on SPARK-7898: -- No, this is _NOT_ what I am talking about! I

[jira] [Created] (SPARK-15046) When running hive-thriftserver with yarn on a secure cluster the workers fail with java.lang.NumberFormatException

2016-05-01 Thread Trystan Leftwich (JIRA)
Trystan Leftwich created SPARK-15046: Summary: When running hive-thriftserver with yarn on a secure cluster the workers fail with java.lang.NumberFormatException Key: SPARK-15046 URL: https://issues.apache.org

[jira] [Resolved] (SPARK-14931) Mismatched default Param values between pipelines in Spark and PySpark

2016-05-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-14931. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12816 [h

[jira] [Commented] (SPARK-15046) When running hive-thriftserver with yarn on a secure cluster the workers fail with java.lang.NumberFormatException

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15265902#comment-15265902 ] Apache Spark commented on SPARK-15046: -- User 'trystanleftwich' has created a pull re

[jira] [Assigned] (SPARK-15046) When running hive-thriftserver with yarn on a secure cluster the workers fail with java.lang.NumberFormatException

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15046: Assignee: (was: Apache Spark) > When running hive-thriftserver with yarn on a secure c

[jira] [Assigned] (SPARK-15046) When running hive-thriftserver with yarn on a secure cluster the workers fail with java.lang.NumberFormatException

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15046: Assignee: Apache Spark > When running hive-thriftserver with yarn on a secure cluster the

[jira] [Created] (SPARK-15047) Cleanup SQLParser

2016-05-01 Thread Herman van Hovell (JIRA)
Herman van Hovell created SPARK-15047: - Summary: Cleanup SQLParser Key: SPARK-15047 URL: https://issues.apache.org/jira/browse/SPARK-15047 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-14973) The CrossValidator and TrainValidationSplit miss the seed when saving and loading

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15265958#comment-15265958 ] Apache Spark commented on SPARK-14973: -- User 'yinxusen' has created a pull request f

[jira] [Created] (SPARK-15048) when running Thriftserver with yarn on a secure cluster it will pass the wrong keytab location.

2016-05-01 Thread Trystan Leftwich (JIRA)
Trystan Leftwich created SPARK-15048: Summary: when running Thriftserver with yarn on a secure cluster it will pass the wrong keytab location. Key: SPARK-15048 URL: https://issues.apache.org/jira/browse/SPARK-

[jira] [Updated] (SPARK-15043) Fix and re-enable flaky test: mllib.stat.JavaStatisticsSuite.testCorr

2016-05-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15043: Priority: Critical (was: Blocker) > Fix and re-enable flaky test: mllib.stat.JavaStatisticsSuite.t

[jira] [Resolved] (SPARK-14060) Move StringToColumn implicit class into SQLImplicits

2016-05-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14060. - Resolution: Fixed Fix Version/s: 2.0.0 > Move StringToColumn implicit class into SQLImplic

[jira] [Resolved] (SPARK-13830) Fetch large directly result from executor is very slow

2016-05-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-13830. - Resolution: Fixed Assignee: Davies Liu Fix Version/s: 2.0.0 > Fetch large directl

[jira] [Closed] (SPARK-7025) Create a Java-friendly input source API

2016-05-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-7025. -- Resolution: Later > Create a Java-friendly input source API > --- >

[jira] [Commented] (SPARK-15047) Cleanup SQLParser

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266001#comment-15266001 ] Apache Spark commented on SPARK-15047: -- User 'hvanhovell' has created a pull request

[jira] [Assigned] (SPARK-15047) Cleanup SQLParser

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15047: Assignee: Herman van Hovell (was: Apache Spark) > Cleanup SQLParser > - >

[jira] [Assigned] (SPARK-15047) Cleanup SQLParser

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15047: Assignee: Apache Spark (was: Herman van Hovell) > Cleanup SQLParser > - >

[jira] [Created] (SPARK-15049) Rename NewAccumulator to AccumulatorV2

2016-05-01 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-15049: --- Summary: Rename NewAccumulator to AccumulatorV2 Key: SPARK-15049 URL: https://issues.apache.org/jira/browse/SPARK-15049 Project: Spark Issue Type: Sub-task

[jira] [Assigned] (SPARK-15049) Rename NewAccumulator to AccumulatorV2

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15049: Assignee: Apache Spark (was: Reynold Xin) > Rename NewAccumulator to AccumulatorV2 >

[jira] [Assigned] (SPARK-15049) Rename NewAccumulator to AccumulatorV2

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15049: Assignee: Reynold Xin (was: Apache Spark) > Rename NewAccumulator to AccumulatorV2 >

[jira] [Commented] (SPARK-15049) Rename NewAccumulator to AccumulatorV2

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266004#comment-15266004 ] Apache Spark commented on SPARK-15049: -- User 'rxin' has created a pull request for t

[jira] [Commented] (SPARK-14302) Python examples code merge and clean up

2016-05-01 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266006#comment-15266006 ] Xusen Yin commented on SPARK-14302: --- [~kanjilal] Thanks for working on this. However, I

[jira] [Commented] (SPARK-14864) [MLLIB] Implement Doc2Vec

2016-05-01 Thread Peter Mountanos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266024#comment-15266024 ] Peter Mountanos commented on SPARK-14864: - I will try to work out this issue if n

[jira] [Comment Edited] (SPARK-14864) [MLLIB] Implement Doc2Vec

2016-05-01 Thread Peter Mountanos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266024#comment-15266024 ] Peter Mountanos edited comment on SPARK-14864 at 5/2/16 12:45 AM: -

[jira] [Commented] (SPARK-14302) Python examples code merge and clean up

2016-05-01 Thread Saikat Kanjilal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266029#comment-15266029 ] Saikat Kanjilal commented on SPARK-14302: - Works for me, so what else can I help

[jira] [Comment Edited] (SPARK-14995) Add "since" tag in Roxygen documentation for SparkR API methods

2016-05-01 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266031#comment-15266031 ] Sun Rui edited comment on SPARK-14995 at 5/2/16 1:07 AM: - [~felix

[jira] [Commented] (SPARK-14995) Add "since" tag in Roxygen documentation for SparkR API methods

2016-05-01 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266031#comment-15266031 ] Sun Rui commented on SPARK-14995: - [~felixcheung] I think no need to add "spark". Just s

[jira] [Commented] (SPARK-15044) spark-sql will throw "input path does not exist" exception if it handles a partition which exists in hive table, but the path is removed manually

2016-05-01 Thread Niranjan Molkeri` (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266042#comment-15266042 ] Niranjan Molkeri` commented on SPARK-15044: --- I tried to reproduce the error in

[jira] [Resolved] (SPARK-13425) Documentation for CSV datasource options

2016-05-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-13425. - Resolution: Fixed Assignee: Hyukjin Kwon Fix Version/s: 2.0.0 > Documentation for

[jira] [Created] (SPARK-15050) Put CSV options as Python csv function parameters

2016-05-01 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-15050: --- Summary: Put CSV options as Python csv function parameters Key: SPARK-15050 URL: https://issues.apache.org/jira/browse/SPARK-15050 Project: Spark Issue Type: S

[jira] [Commented] (SPARK-15045) Remove dead code in TaskMemoryManager.cleanUpAllAllocatedMemory for pageTable

2016-05-01 Thread Abhinav Gupta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266057#comment-15266057 ] Abhinav Gupta commented on SPARK-15045: --- I would like to work on this issue. Any s

[jira] [Commented] (SPARK-14495) Distinct aggregation cannot be used in the having clause

2016-05-01 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266069#comment-15266069 ] Xin Wu commented on SPARK-14495: I can recreated it on branch-1.6. and another workaround

[jira] [Comment Edited] (SPARK-14495) Distinct aggregation cannot be used in the having clause

2016-05-01 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266069#comment-15266069 ] Xin Wu edited comment on SPARK-14495 at 5/2/16 2:21 AM: I can rec

[jira] [Assigned] (SPARK-14993) Inconsistent behavior of partitioning discovery

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14993: Assignee: (was: Apache Spark) > Inconsistent behavior of partitioning discovery >

[jira] [Assigned] (SPARK-14993) Inconsistent behavior of partitioning discovery

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14993: Assignee: Apache Spark > Inconsistent behavior of partitioning discovery > ---

[jira] [Commented] (SPARK-14993) Inconsistent behavior of partitioning discovery

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266073#comment-15266073 ] Apache Spark commented on SPARK-14993: -- User 'gatorsmile' has created a pull request

[jira] [Created] (SPARK-15051) Aggregator with DataFrame does not allow Alias

2016-05-01 Thread koert kuipers (JIRA)
koert kuipers created SPARK-15051: - Summary: Aggregator with DataFrame does not allow Alias Key: SPARK-15051 URL: https://issues.apache.org/jira/browse/SPARK-15051 Project: Spark Issue Type:

[jira] [Updated] (SPARK-15051) Aggregator with DataFrame does not allow Alias

2016-05-01 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] koert kuipers updated SPARK-15051: -- Description: this works: {noformat} object SimpleSum extends Aggregator[Row, Int, Int] { def

[jira] [Assigned] (SPARK-15045) Remove dead code in TaskMemoryManager.cleanUpAllAllocatedMemory for pageTable

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15045: Assignee: Apache Spark > Remove dead code in TaskMemoryManager.cleanUpAllAllocatedMemory f

[jira] [Commented] (SPARK-15045) Remove dead code in TaskMemoryManager.cleanUpAllAllocatedMemory for pageTable

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266078#comment-15266078 ] Apache Spark commented on SPARK-15045: -- User 'abhi951990' has created a pull request

[jira] [Commented] (SPARK-14974) spark sql job create too many files in HDFS when doing insert overwrite hive table

2016-05-01 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266077#comment-15266077 ] Xiao Li commented on SPARK-14974: - 200w is 200 万. 万 is a Chinese unit. It means 10,000. :

[jira] [Assigned] (SPARK-15045) Remove dead code in TaskMemoryManager.cleanUpAllAllocatedMemory for pageTable

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15045: Assignee: (was: Apache Spark) > Remove dead code in TaskMemoryManager.cleanUpAllAlloca

[jira] [Resolved] (SPARK-15049) Rename NewAccumulator to AccumulatorV2

2016-05-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-15049. - Resolution: Fixed > Rename NewAccumulator to AccumulatorV2 >

[jira] [Updated] (SPARK-14931) Mismatched default Param values between pipelines in Spark and PySpark

2016-05-01 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-14931: Component/s: PySpark ML > Mismatched default Param values between pipelines in Spark and P

[jira] [Created] (SPARK-15052) Add ways to create SparkSession without requiring explicitly creating SparkContext first

2016-05-01 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-15052: --- Summary: Add ways to create SparkSession without requiring explicitly creating SparkContext first Key: SPARK-15052 URL: https://issues.apache.org/jira/browse/SPARK-15052

[jira] [Updated] (SPARK-15052) Use builder pattern to create SparkSession

2016-05-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15052: Summary: Use builder pattern to create SparkSession (was: Add ways to create SparkSession without

[jira] [Assigned] (SPARK-15052) Use builder pattern to create SparkSession

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15052: Assignee: Reynold Xin (was: Apache Spark) > Use builder pattern to create SparkSession >

[jira] [Commented] (SPARK-15052) Use builder pattern to create SparkSession

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266088#comment-15266088 ] Apache Spark commented on SPARK-15052: -- User 'rxin' has created a pull request for t

[jira] [Assigned] (SPARK-15052) Use builder pattern to create SparkSession

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15052: Assignee: Apache Spark (was: Reynold Xin) > Use builder pattern to create SparkSession >

[jira] [Created] (SPARK-15053) Fix Java Lint errors on Hive-Thriftserver module

2016-05-01 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-15053: - Summary: Fix Java Lint errors on Hive-Thriftserver module Key: SPARK-15053 URL: https://issues.apache.org/jira/browse/SPARK-15053 Project: Spark Issue Type

[jira] [Created] (SPARK-15054) Deprecate old accumulator API

2016-05-01 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-15054: --- Summary: Deprecate old accumulator API Key: SPARK-15054 URL: https://issues.apache.org/jira/browse/SPARK-15054 Project: Spark Issue Type: Sub-task Co

[jira] [Assigned] (SPARK-15053) Fix Java Lint errors on Hive-Thriftserver module

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15053: Assignee: (was: Apache Spark) > Fix Java Lint errors on Hive-Thriftserver module > ---

[jira] [Commented] (SPARK-15053) Fix Java Lint errors on Hive-Thriftserver module

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266089#comment-15266089 ] Apache Spark commented on SPARK-15053: -- User 'dongjoon-hyun' has created a pull requ

[jira] [Assigned] (SPARK-15054) Deprecate old accumulator API

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15054: Assignee: Apache Spark (was: Reynold Xin) > Deprecate old accumulator API > -

[jira] [Assigned] (SPARK-15053) Fix Java Lint errors on Hive-Thriftserver module

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15053: Assignee: Apache Spark > Fix Java Lint errors on Hive-Thriftserver module > --

[jira] [Assigned] (SPARK-15054) Deprecate old accumulator API

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15054: Assignee: Reynold Xin (was: Apache Spark) > Deprecate old accumulator API > -

[jira] [Commented] (SPARK-15054) Deprecate old accumulator API

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266090#comment-15266090 ] Apache Spark commented on SPARK-15054: -- User 'rxin' has created a pull request for t

[jira] [Commented] (SPARK-14302) Python examples code merge and clean up

2016-05-01 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266093#comment-15266093 ] Xusen Yin commented on SPARK-14302: --- I'll close it, anything else I'll let you know. Th

[jira] [Resolved] (SPARK-14302) Python examples code merge and clean up

2016-05-01 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xusen Yin resolved SPARK-14302. --- Resolution: Won't Fix > Python examples code merge and clean up > ---

[jira] [Commented] (SPARK-15044) spark-sql will throw "input path does not exist" exception if it handles a partition which exists in hive table, but the path is removed manually

2016-05-01 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266116#comment-15266116 ] Xin Wu commented on SPARK-15044: I tried {code}alter table test drop partition (p=1){code

[jira] [Updated] (SPARK-15053) Fix Java Lint errors on Hive-Thriftserver module

2016-05-01 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-15053: -- Component/s: Build > Fix Java Lint errors on Hive-Thriftserver module > ---

[jira] [Commented] (SPARK-3190) Creation of large graph(> 2.15 B nodes) seems to be broken:possible overflow somewhere

2016-05-01 Thread Yuance Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266145#comment-15266145 ] Yuance Li commented on SPARK-3190: -- The PR{2106,7923} can not fix the problem completely,

[jira] [Assigned] (SPARK-15050) Put CSV options as Python csv function parameters

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15050: Assignee: Apache Spark (was: Hyukjin Kwon) > Put CSV options as Python csv function param

[jira] [Assigned] (SPARK-15050) Put CSV options as Python csv function parameters

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15050: Assignee: Hyukjin Kwon (was: Apache Spark) > Put CSV options as Python csv function param

[jira] [Commented] (SPARK-15050) Put CSV options as Python csv function parameters

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15050?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266147#comment-15266147 ] Apache Spark commented on SPARK-15050: -- User 'HyukjinKwon' has created a pull reques

[jira] [Commented] (SPARK-14986) Spark SQL returns incorrect results for LATERAL VIEW OUTER queries if all inner columns are projected out

2016-05-01 Thread Vijay Parmar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266148#comment-15266148 ] Vijay Parmar commented on SPARK-14986: -- Hi Andrey, I am new to Spark but would like

[jira] [Commented] (SPARK-3190) Creation of large graph(> 2.15 B nodes) seems to be broken:possible overflow somewhere

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266154#comment-15266154 ] Apache Spark commented on SPARK-3190: - User 'liyuance' has created a pull request for

[jira] [Comment Edited] (SPARK-928) Add support for Unsafe-based serializer in Kryo 2.22

2016-05-01 Thread Sandeep Singh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15265805#comment-15265805 ] Sandeep Singh edited comment on SPARK-928 at 5/2/16 6:35 AM: - [

[jira] [Assigned] (SPARK-12922) Implement gapply() on DataFrame in SparkR

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12922: Assignee: Apache Spark > Implement gapply() on DataFrame in SparkR > -

[jira] [Commented] (SPARK-12922) Implement gapply() on DataFrame in SparkR

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266161#comment-15266161 ] Apache Spark commented on SPARK-12922: -- User 'NarineK' has created a pull request fo

[jira] [Assigned] (SPARK-12922) Implement gapply() on DataFrame in SparkR

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12922: Assignee: (was: Apache Spark) > Implement gapply() on DataFrame in SparkR > --

[jira] [Created] (SPARK-15055) Remove setValue method on accumulators

2016-05-01 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-15055: --- Summary: Remove setValue method on accumulators Key: SPARK-15055 URL: https://issues.apache.org/jira/browse/SPARK-15055 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-15055) Remove setValue method on accumulators

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266162#comment-15266162 ] Apache Spark commented on SPARK-15055: -- User 'rxin' has created a pull request for t

[jira] [Assigned] (SPARK-15055) Remove setValue method on accumulators

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15055: Assignee: Reynold Xin (was: Apache Spark) > Remove setValue method on accumulators >

[jira] [Assigned] (SPARK-15055) Remove setValue method on accumulators

2016-05-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15055: Assignee: Apache Spark (was: Reynold Xin) > Remove setValue method on accumulators >