[jira] [Commented] (SPARK-11441) HadoopFsRelation is not scalable in number of files read/written

2015-11-12 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15002751#comment-15002751 ] koert kuipers commented on SPARK-11441: --- going over the code base it seems that there are 2

[jira] [Assigned] (SPARK-11699) TrackStateRDDSuite fails on Jenkins builds

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11699: Assignee: Apache Spark > TrackStateRDDSuite fails on Jenkins builds >

[jira] [Created] (SPARK-11713) Initial RDD for updateStateByKey for pyspark

2015-11-12 Thread David Watson (JIRA)
David Watson created SPARK-11713: Summary: Initial RDD for updateStateByKey for pyspark Key: SPARK-11713 URL: https://issues.apache.org/jira/browse/SPARK-11713 Project: Spark Issue Type: New

[jira] [Assigned] (SPARK-11612) Model export/import for spark.ml: Pipeline and PipelineModel

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11612: Assignee: Apache Spark (was: Joseph K. Bradley) > Model export/import for spark.ml:

[jira] [Resolved] (SPARK-11191) [1.5] Can't create UDF's using hive thrift service

2015-11-12 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-11191. -- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9664

[jira] [Assigned] (SPARK-11710) Document new memory management model

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11710: Assignee: Andrew Or (was: Apache Spark) > Document new memory management model >

[jira] [Commented] (SPARK-11710) Document new memory management model

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15002867#comment-15002867 ] Apache Spark commented on SPARK-11710: -- User 'andrewor14' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11710) Document new memory management model

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11710: Assignee: Apache Spark (was: Andrew Or) > Document new memory management model >

[jira] [Assigned] (SPARK-11703) Improve the docker-mesos image

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11703: Assignee: (was: Apache Spark) > Improve the docker-mesos image >

[jira] [Commented] (SPARK-11637) Alias do not work with udf with * parameter

2015-11-12 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11637?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15002986#comment-15002986 ] Xiao Li commented on SPARK-11637: - The fix is ready. Will submit a PR soon. > Alias do not work with

[jira] [Commented] (SPARK-11699) TrackStateRDDSuite fails on Jenkins builds

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15002794#comment-15002794 ] Apache Spark commented on SPARK-11699: -- User 'tedyu' has created a pull request for this issue:

[jira] [Commented] (SPARK-11654) add reduce to GroupedDataset

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15002795#comment-15002795 ] Apache Spark commented on SPARK-11654: -- User 'marmbrus' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11699) TrackStateRDDSuite fails on Jenkins builds

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11699: Assignee: (was: Apache Spark) > TrackStateRDDSuite fails on Jenkins builds >

[jira] [Commented] (SPARK-11709) Include call site info in SparkContext.assertNotStopped

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15002841#comment-15002841 ] Apache Spark commented on SPARK-11709: -- User 'mengxr' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11709) Include call site info in SparkContext.assertNotStopped

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11709: Assignee: Apache Spark (was: Xiangrui Meng) > Include call site info in

[jira] [Assigned] (SPARK-11712) Refactor spark.ml LDAModel to be abstract

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11712: Assignee: Apache Spark (was: Joseph K. Bradley) > Refactor spark.ml LDAModel to be

[jira] [Commented] (SPARK-11712) Refactor spark.ml LDAModel to be abstract

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15002940#comment-15002940 ] Apache Spark commented on SPARK-11712: -- User 'jkbradley' has created a pull request for this issue:

[jira] [Commented] (SPARK-11246) [1.5] Table cache for Parquet broken in 1.5

2015-11-12 Thread Gurgen Tumanyan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15002954#comment-15002954 ] Gurgen Tumanyan commented on SPARK-11246: - Hi [~yhuai] I am running into an issue that might be

[jira] [Commented] (SPARK-11617) MEMORY LEAK: ByteBuf.release() was not called before it's garbage-collected

2015-11-12 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15002988#comment-15002988 ] Marcelo Vanzin commented on SPARK-11617: [~raynow] I don't know your github alias, but I updated

[jira] [Updated] (SPARK-11707) StreamCorruptedException if authentication is enabled

2015-11-12 Thread Jacek Lewandowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacek Lewandowski updated SPARK-11707: -- Description: When authentication (and encryption) is enabled (at least in standalone

[jira] [Resolved] (SPARK-11699) TrackStateRDDSuite fails on Jenkins builds

2015-11-12 Thread Ted Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu resolved SPARK-11699. Resolution: Duplicate Same as SPARK-11290 > TrackStateRDDSuite fails on Jenkins builds >

[jira] [Updated] (SPARK-11707) StreamCorruptedException if authentication is enabled

2015-11-12 Thread Jacek Lewandowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacek Lewandowski updated SPARK-11707: -- Description: When authentication is enabled (at least in standalone mode), the

[jira] [Created] (SPARK-11711) Finalizer memory leak is pyspark

2015-11-12 Thread David Watson (JIRA)
David Watson created SPARK-11711: Summary: Finalizer memory leak is pyspark Key: SPARK-11711 URL: https://issues.apache.org/jira/browse/SPARK-11711 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-11712) Refactor spark.ml LDAModel to be abstract

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11712: Assignee: Joseph K. Bradley (was: Apache Spark) > Refactor spark.ml LDAModel to be

[jira] [Commented] (SPARK-11246) [1.5] Table cache for Parquet broken in 1.5

2015-11-12 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15002959#comment-15002959 ] Michael Armbrust commented on SPARK-11246: -- When building the cache, we are going to read all of

[jira] [Created] (SPARK-11707) StreamCorruptedException if authentication is enabled

2015-11-12 Thread Jacek Lewandowski (JIRA)
Jacek Lewandowski created SPARK-11707: - Summary: StreamCorruptedException if authentication is enabled Key: SPARK-11707 URL: https://issues.apache.org/jira/browse/SPARK-11707 Project: Spark

[jira] [Commented] (SPARK-11672) Flaky test: ml.JavaDefaultReadWriteSuite

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15002918#comment-15002918 ] Apache Spark commented on SPARK-11672: -- User 'mengxr' has created a pull request for this issue:

[jira] [Commented] (SPARK-8459) Add import/export to spark.mllib bisecting k-means

2015-11-12 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003015#comment-15003015 ] Yu Ishikawa commented on SPARK-8459: I'm working on this issue. > Add import/export to spark.mllib

[jira] [Assigned] (SPARK-11612) Model export/import for spark.ml: Pipeline and PipelineModel

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11612: Assignee: Joseph K. Bradley (was: Apache Spark) > Model export/import for spark.ml:

[jira] [Commented] (SPARK-11612) Model export/import for spark.ml: Pipeline and PipelineModel

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11612?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15002834#comment-15002834 ] Apache Spark commented on SPARK-11612: -- User 'jkbradley' has created a pull request for this issue:

[jira] [Created] (SPARK-11710) Document new memory management model

2015-11-12 Thread Andrew Or (JIRA)
Andrew Or created SPARK-11710: - Summary: Document new memory management model Key: SPARK-11710 URL: https://issues.apache.org/jira/browse/SPARK-11710 Project: Spark Issue Type: Sub-task

[jira] [Resolved] (SPARK-11702) Guava ClassLoading Issue When Using Different Hive Metastore Version

2015-11-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-11702. --- Resolution: Not A Problem Yes, you've described the situation as it stands and it's on-purpose.

[jira] [Updated] (SPARK-11396) datetime function: to_unix_timestamp

2015-11-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11396: -- Assignee: Adrian Wang > datetime function: to_unix_timestamp > >

[jira] [Updated] (SPARK-10113) Support for unsigned Parquet logical types

2015-11-12 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-10113: - Assignee: Hyukjin Kwon > Support for unsigned Parquet logical types >

[jira] [Resolved] (SPARK-10113) Support for unsigned Parquet logical types

2015-11-12 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-10113. -- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9646

[jira] [Created] (SPARK-11712) Refactor spark.ml LDAModel to be abstract

2015-11-12 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-11712: - Summary: Refactor spark.ml LDAModel to be abstract Key: SPARK-11712 URL: https://issues.apache.org/jira/browse/SPARK-11712 Project: Spark Issue

[jira] [Commented] (SPARK-11703) Improve the docker-mesos image

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15002985#comment-15002985 ] Apache Spark commented on SPARK-11703: -- User 'lmtjalves' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11703) Improve the docker-mesos image

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11703: Assignee: Apache Spark > Improve the docker-mesos image > --

[jira] [Resolved] (SPARK-11687) Mixed usage of fold and foldLeft, reduce and reduceLeft and reduceOption and reduceLeftOption

2015-11-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-11687. --- Resolution: Won't Fix > Mixed usage of fold and foldLeft, reduce and reduceLeft and reduceOption and

[jira] [Commented] (SPARK-11617) MEMORY LEAK: ByteBuf.release() was not called before it's garbage-collected

2015-11-12 Thread Naden Franciscus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15001815#comment-15001815 ] Naden Franciscus commented on SPARK-11617: -- Can confirm both of these issues. Could it be

[jira] [Assigned] (SPARK-11692) Support for Parquet logical types, JSON and BSON (embedded types)

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11692: Assignee: Apache Spark > Support for Parquet logical types, JSON and BSON (embedded

[jira] [Commented] (SPARK-11692) Support for Parquet logical types, JSON and BSON (embedded types)

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15001870#comment-15001870 ] Apache Spark commented on SPARK-11692: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-11692) Support for Parquet logical types, JSON and BSON (embedded types)

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11692: Assignee: (was: Apache Spark) > Support for Parquet logical types, JSON and BSON

[jira] [Commented] (SPARK-5968) Parquet warning in spark-shell

2015-11-12 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15001885#comment-15001885 ] Cheng Lian commented on SPARK-5968: --- As explained in the JIRA description, this issue shouldn't affect

[jira] [Commented] (SPARK-11676) Parquet filter tests all pass if filters are not really pushed down

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15001913#comment-15001913 ] Apache Spark commented on SPARK-11676: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-11676) Parquet filter tests all pass if filters are not really pushed down

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11676: Assignee: Apache Spark > Parquet filter tests all pass if filters are not really pushed

[jira] [Assigned] (SPARK-11676) Parquet filter tests all pass if filters are not really pushed down

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11676: Assignee: (was: Apache Spark) > Parquet filter tests all pass if filters are not

[jira] [Created] (SPARK-11694) Parquet logical types are not being tested properly

2015-11-12 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-11694: Summary: Parquet logical types are not being tested properly Key: SPARK-11694 URL: https://issues.apache.org/jira/browse/SPARK-11694 Project: Spark Issue

[jira] [Commented] (SPARK-9435) Java UDFs don't work with GROUP BY expressions

2015-11-12 Thread DOAN DuyHai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15001866#comment-15001866 ] DOAN DuyHai commented on SPARK-9435: Same error for me: {code:java} // Register

[jira] [Comment Edited] (SPARK-11693) spark kafka direct streaming exception

2015-11-12 Thread xiaoxiaoluo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15001877#comment-15001877 ] xiaoxiaoluo edited comment on SPARK-11693 at 11/12/15 9:48 AM: --- Should we

[jira] [Resolved] (SPARK-11661) We should still pushdown filters returned by a data source's unhandledFilters

2015-11-12 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-11661. Resolution: Fixed Fix Version/s: 1.6.0 1.7.0 Issue resolved by pull

[jira] [Commented] (SPARK-11692) Support for Parquet logical types, JSON and BISON (embedded types)

2015-11-12 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15001855#comment-15001855 ] Hyukjin Kwon commented on SPARK-11692: -- I will work on this. > Support for Parquet logical types,

[jira] [Updated] (SPARK-11692) Support for Parquet logical types, JSON and BSON (embedded types)

2015-11-12 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-11692: - Summary: Support for Parquet logical types, JSON and BSON (embedded types) (was: Support for

[jira] [Commented] (SPARK-11693) spark kafka direct streaming exception

2015-11-12 Thread xiaoxiaoluo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15001877#comment-15001877 ] xiaoxiaoluo commented on SPARK-11693: - Should we catch this exception from this file

[jira] [Assigned] (SPARK-11691) Allow to specify compression codec in HadoopFsRelation when saving

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11691: Assignee: (was: Apache Spark) > Allow to specify compression codec in

[jira] [Commented] (SPARK-11691) Allow to specify compression codec in HadoopFsRelation when saving

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15001834#comment-15001834 ] Apache Spark commented on SPARK-11691: -- User 'zjffdu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11691) Allow to specify compression codec in HadoopFsRelation when saving

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11691: Assignee: Apache Spark > Allow to specify compression codec in HadoopFsRelation when

[jira] [Created] (SPARK-11692) Support for Parquet logical types, JSON and BISON (embedded types)

2015-11-12 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-11692: Summary: Support for Parquet logical types, JSON and BISON (embedded types) Key: SPARK-11692 URL: https://issues.apache.org/jira/browse/SPARK-11692 Project: Spark

[jira] [Created] (SPARK-11693) spark kafka direct streaming exception

2015-11-12 Thread xiaoxiaoluo (JIRA)
xiaoxiaoluo created SPARK-11693: --- Summary: spark kafka direct streaming exception Key: SPARK-11693 URL: https://issues.apache.org/jira/browse/SPARK-11693 Project: Spark Issue Type: Question

[jira] [Resolved] (SPARK-11658) simplify documentation for PySpark combineByKey

2015-11-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-11658. --- Resolution: Fixed Assignee: chris snow Fix Version/s: 1.7.0 Target

[jira] [Updated] (SPARK-2533) Show summary of locality level of completed tasks in the each stage page of web UI

2015-11-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2533: - Assignee: Jean-Baptiste Onofré > Show summary of locality level of completed tasks in the each stage page

[jira] [Resolved] (SPARK-11709) Include call site info in SparkContext.assertNotStopped

2015-11-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-11709. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9675

[jira] [Commented] (SPARK-11390) Query plan with/without filterPushdown indistinguishable

2015-11-12 Thread Zee Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003170#comment-15003170 ] Zee Chen commented on SPARK-11390: -- Test output: {code}

[jira] [Resolved] (SPARK-10384) Univariate statistics as UDAFs

2015-11-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10384. --- Resolution: Fixed Fix Version/s: 1.6.0 > Univariate statistics as UDAFs >

[jira] [Commented] (SPARK-10384) Univariate statistics as UDAFs

2015-11-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003190#comment-15003190 ] Xiangrui Meng commented on SPARK-10384: --- I marked this JIRA as resolved. Approximate

[jira] [Resolved] (SPARK-11420) Updating Stddev support with Imperative Aggregate

2015-11-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-11420. --- Resolution: Fixed Fix Version/s: 1.6.0 > Updating Stddev support with Imperative

[jira] [Comment Edited] (SPARK-11439) Optimization of creating sparse feature without dense one

2015-11-12 Thread Kai Sasaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003384#comment-15003384 ] Kai Sasaki edited comment on SPARK-11439 at 11/13/15 1:48 AM: -- [~nakul02] It

[jira] [Commented] (SPARK-11439) Optimization of creating sparse feature without dense one

2015-11-12 Thread Kai Sasaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003384#comment-15003384 ] Kai Sasaki commented on SPARK-11439: [~nakul02] It seems to indicate the model in SparkR here.

[jira] [Comment Edited] (SPARK-11439) Optimization of creating sparse feature without dense one

2015-11-12 Thread Kai Sasaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003384#comment-15003384 ] Kai Sasaki edited comment on SPARK-11439 at 11/13/15 1:47 AM: -- [~nakul02] It

[jira] [Created] (SPARK-11708) 20-25% performance regression in TeraSort

2015-11-12 Thread Nishkam Ravi (JIRA)
Nishkam Ravi created SPARK-11708: Summary: 20-25% performance regression in TeraSort Key: SPARK-11708 URL: https://issues.apache.org/jira/browse/SPARK-11708 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-11709) Include call site info in SparkContext.assertNotStopped

2015-11-12 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-11709: - Summary: Include call site info in SparkContext.assertNotStopped Key: SPARK-11709 URL: https://issues.apache.org/jira/browse/SPARK-11709 Project: Spark

[jira] [Assigned] (SPARK-11709) Include call site info in SparkContext.assertNotStopped

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11709: Assignee: Xiangrui Meng (was: Apache Spark) > Include call site info in

[jira] [Updated] (SPARK-11191) [1.5] Can't create UDF's using hive thrift service

2015-11-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11191: -- Assignee: Cheng Lian > [1.5] Can't create UDF's using hive thrift service >

[jira] [Updated] (SPARK-11707) StreamCorruptedException if authentication is enabled

2015-11-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11707: -- Component/s: Spark Core > StreamCorruptedException if authentication is enabled >

[jira] [Commented] (SPARK-11707) StreamCorruptedException if authentication is enabled

2015-11-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11707?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003073#comment-15003073 ] Sean Owen commented on SPARK-11707: --- Isn't this likely some kind of misconfiguration locally? you have

[jira] [Updated] (SPARK-11664) Add methods to get bisecting k-means cluster structure

2015-11-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11664: -- Fix Version/s: (was: 1.6.0) > Add methods to get bisecting k-means cluster structure >

[jira] [Updated] (SPARK-11665) Support other distance metrics for bisecting k-means

2015-11-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11665: -- Fix Version/s: (was: 1.6.0) > Support other distance metrics for bisecting k-means >

[jira] [Updated] (SPARK-11669) Python interface to SparkR GLM module

2015-11-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11669: -- Target Version/s: 1.5.1, 1.5.0 (was: 1.5.0, 1.5.1) Priority: Minor (was: Major)

[jira] [Updated] (SPARK-11666) Find the best `k` by cutting bisecting k-means cluster tree without recomputation

2015-11-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11666: -- Fix Version/s: (was: 1.6.0) > Find the best `k` by cutting bisecting k-means cluster tree without

[jira] [Commented] (SPARK-11392) GroupedIterator's hasNext is not idempotent

2015-11-12 Thread Nakul Jindal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003184#comment-15003184 ] Nakul Jindal commented on SPARK-11392: -- Sorry, it's been a while since I last worked on this.

[jira] [Commented] (SPARK-11583) Make MapStatus use less memory uage

2015-11-12 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11583?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003185#comment-15003185 ] Imran Rashid commented on SPARK-11583: -- [~lemire] after reading a couple of comments in the old prs,

[jira] [Commented] (SPARK-11665) Support other distance metrics for bisecting k-means

2015-11-12 Thread Jun Zheng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003186#comment-15003186 ] Jun Zheng commented on SPARK-11665: --- In bisecting k-means and regular k-means, the distance metric is

[jira] [Updated] (SPARK-11671) Example for sqlContext.createDataDrame from pandas.DataFrame has a typo

2015-11-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-11671: -- Assignee: chris snow > Example for sqlContext.createDataDrame from pandas.DataFrame has a typo >

[jira] [Updated] (SPARK-11671) Example for sqlContext.createDataDrame from pandas.DataFrame has a typo

2015-11-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-11671: -- Fix Version/s: (was: 1.7.0) 1.6.0 > Example for sqlContext.createDataDrame from

[jira] [Updated] (SPARK-11658) simplify documentation for PySpark combineByKey

2015-11-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-11658: -- Fix Version/s: (was: 1.7.0) 1.6.0 > simplify documentation for PySpark

[jira] [Updated] (SPARK-11658) simplify documentation for PySpark combineByKey

2015-11-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-11658: -- Target Version/s: 1.6.0 (was: 1.7.0) > simplify documentation for PySpark combineByKey >

[jira] [Updated] (SPARK-11671) Example for sqlContext.createDataDrame from pandas.DataFrame has a typo

2015-11-12 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-11671: -- Target Version/s: 1.6.0 (was: 1.7.0) > Example for sqlContext.createDataDrame from pandas.DataFrame

[jira] [Created] (SPARK-11716) UDFRegistration Drops Input Type Information

2015-11-12 Thread Artjom Metro (JIRA)
Artjom Metro created SPARK-11716: Summary: UDFRegistration Drops Input Type Information Key: SPARK-11716 URL: https://issues.apache.org/jira/browse/SPARK-11716 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-11654) add reduce to GroupedDataset

2015-11-12 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-11654: - Assignee: Wenchen Fan > add reduce to GroupedDataset > > >

[jira] [Resolved] (SPARK-11654) add reduce to GroupedDataset

2015-11-12 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-11654. -- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9673

[jira] [Updated] (SPARK-11668) R style summary stats in GLM package SparkR

2015-11-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11668: -- Priority: Minor (was: Major) Fix Version/s: (was: 1.5.1) Please don't set fix version;

[jira] [Updated] (SPARK-11712) Refactor spark.ml LDAModel to be abstract

2015-11-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11712: -- Fix Version/s: (was: 1.6.0) > Refactor spark.ml LDAModel to be abstract >

[jira] [Resolved] (SPARK-11655) SparkLauncherBackendSuite leaks child processes

2015-11-12 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-11655. Resolution: Fixed Assignee: Marcelo Vanzin Fix Version/s: 1.6.0 >

[jira] [Commented] (SPARK-11583) Make MapStatus use less memory uage

2015-11-12 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11583?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003084#comment-15003084 ] Imran Rashid commented on SPARK-11583: -- [~lemire] I thought Kent Yao's analysis was pretty

[jira] [Commented] (SPARK-11664) Add methods to get bisecting k-means cluster structure

2015-11-12 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003085#comment-15003085 ] Yu Ishikawa commented on SPARK-11664: - [~srowen] thank you for letting me know. I intended to set it

[jira] [Created] (SPARK-11714) Make Spark on Mesos honor port restrictions

2015-11-12 Thread Charles Allen (JIRA)
Charles Allen created SPARK-11714: - Summary: Make Spark on Mesos honor port restrictions Key: SPARK-11714 URL: https://issues.apache.org/jira/browse/SPARK-11714 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-11390) Query plan with/without filterPushdown indistinguishable

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11390: Assignee: Apache Spark > Query plan with/without filterPushdown indistinguishable >

[jira] [Assigned] (SPARK-11390) Query plan with/without filterPushdown indistinguishable

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11390: Assignee: (was: Apache Spark) > Query plan with/without filterPushdown

[jira] [Commented] (SPARK-11390) Query plan with/without filterPushdown indistinguishable

2015-11-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003110#comment-15003110 ] Apache Spark commented on SPARK-11390: -- User 'zeocio' has created a pull request for this issue:
