[jira] [Commented] (SPARK-5594) SparkException: Failed to get broadcast (TorrentBroadcast)

2015-02-04 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305396#comment-14305396 ] Lianhui Wang commented on SPARK-5594: - @John Sandiford, now you provide logs of

[jira] [Resolved] (SPARK-5585) Flaky test: Python regression

2015-02-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5585. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4358

[jira] [Created] (SPARK-5595) In memory data cache should be invalidated after insert into/overwrite

2015-02-04 Thread Yin Huai (JIRA)
Yin Huai created SPARK-5595: --- Summary: In memory data cache should be invalidated after insert into/overwrite Key: SPARK-5595 URL: https://issues.apache.org/jira/browse/SPARK-5595 Project: Spark

[jira] [Commented] (SPARK-5021) GaussianMixtureEM should be faster for SparseVector input

2015-02-04 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305605#comment-14305605 ] Manoj Kumar commented on SPARK-5021: Thanks for the comment. That also seems to fail,

[jira] [Commented] (SPARK-5566) Tokenizer for mllib package

2015-02-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305588#comment-14305588 ] Joseph K. Bradley commented on SPARK-5566: -- Do you mean to share the underlying

[jira] [Assigned] (SPARK-5596) Model import/export for GLMs and Naive Bayes

2015-02-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-5596: Assignee: Joseph K. Bradley Model import/export for GLMs and Naive Bayes

[jira] [Created] (SPARK-5596) Model import/export for GLMs and Naive Bayes

2015-02-04 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-5596: Summary: Model import/export for GLMs and Naive Bayes Key: SPARK-5596 URL: https://issues.apache.org/jira/browse/SPARK-5596 Project: Spark Issue

[jira] [Created] (SPARK-5598) Model import/export for ALS

2015-02-04 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-5598: Summary: Model import/export for ALS Key: SPARK-5598 URL: https://issues.apache.org/jira/browse/SPARK-5598 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-5598) Model import/export for ALS

2015-02-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5598: - Assignee: Xiangrui Meng Model import/export for ALS ---

[jira] [Commented] (SPARK-5596) Model import/export for GLMs and Naive Bayes

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305598#comment-14305598 ] Apache Spark commented on SPARK-5596: - User 'jkbradley' has created a pull request for

[jira] [Assigned] (SPARK-5597) Model import/export for DecisionTree and ensembles

2015-02-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-5597: Assignee: Joseph K. Bradley Model import/export for DecisionTree and ensembles

[jira] [Created] (SPARK-5597) Model import/export for DecisionTree and ensembles

2015-02-04 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-5597: Summary: Model import/export for DecisionTree and ensembles Key: SPARK-5597 URL: https://issues.apache.org/jira/browse/SPARK-5597 Project: Spark

[jira] [Created] (SPARK-5589) Split pyspark/sql.py into multiple files

2015-02-04 Thread Davies Liu (JIRA)
Davies Liu created SPARK-5589: - Summary: Split pyspark/sql.py into multiple files Key: SPARK-5589 URL: https://issues.apache.org/jira/browse/SPARK-5589 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-5529) Executor is still hold while BlockManager has been removed

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14304849#comment-14304849 ] Apache Spark commented on SPARK-5529: - User 'shenh062326' has created a pull request

[jira] [Created] (SPARK-5588) support select/filter by SQL expression for Python DataFrame

2015-02-04 Thread Davies Liu (JIRA)
Davies Liu created SPARK-5588: - Summary: support select/filter by SQL expression for Python DataFrame Key: SPARK-5588 URL: https://issues.apache.org/jira/browse/SPARK-5588 Project: Spark Issue

[jira] [Resolved] (SPARK-5379) Add awaitTerminationOrTimeout

2015-02-04 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-5379. -- Resolution: Fixed Target Version/s: 1.3.0 Add awaitTerminationOrTimeout

[jira] [Commented] (SPARK-5529) Executor is still hold while BlockManager has been removed

2015-02-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14304803#comment-14304803 ] Reynold Xin commented on SPARK-5529: [~lianhuiwang] yes - I think they should fate

[jira] [Commented] (SPARK-4550) In sort-based shuffle, store map outputs in serialized form

2015-02-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14304817#comment-14304817 ] Reynold Xin commented on SPARK-4550: Sandy, The proposal seems to assume that objects

[jira] [Commented] (SPARK-5588) support select/filter by SQL expression for Python DataFrame

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14304775#comment-14304775 ] Apache Spark commented on SPARK-5588: - User 'davies' has created a pull request for

[jira] [Commented] (SPARK-3619) Upgrade to Mesos 0.21 to work around MESOS-1688

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14304795#comment-14304795 ] Apache Spark commented on SPARK-3619: - User 'jongyoul' has created a pull request for

[jira] [Issue Comment Deleted] (SPARK-5529) Executor is still hold while BlockManager has been removed

2015-02-04 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lianhui Wang updated SPARK-5529: Comment: was deleted (was: OK, I will address a PR later and then please help to review code.)

[jira] [Commented] (SPARK-4550) In sort-based shuffle, store map outputs in serialized form

2015-02-04 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14304864#comment-14304864 ] Sandy Ryza commented on SPARK-4550: --- I had heard rumors to that effect, so I ran some

[jira] [Commented] (SPARK-4550) In sort-based shuffle, store map outputs in serialized form

2015-02-04 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14304816#comment-14304816 ] Sandy Ryza commented on SPARK-4550: --- WIP branch:

[jira] [Commented] (SPARK-5529) Executor is still hold while BlockManager has been removed

2015-02-04 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14304819#comment-14304819 ] Hong Shen commented on SPARK-5529: -- I had changed it in our own branch. Executor is

[jira] [Commented] (SPARK-5529) Executor is still hold while BlockManager has been removed

2015-02-04 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14304810#comment-14304810 ] Lianhui Wang commented on SPARK-5529: - OK, I will address a PR later and then please

[jira] [Commented] (SPARK-5529) Executor is still hold while BlockManager has been removed

2015-02-04 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14304811#comment-14304811 ] Lianhui Wang commented on SPARK-5529: - OK, I will address a PR later and then please

[jira] [Resolved] (SPARK-5574) Utils.createDirectory ignores namePrefix

2015-02-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5574. Resolution: Fixed Fix Version/s: 1.3.0 Utils.createDirectory ignores namePrefix

[jira] [Assigned] (SPARK-4550) In sort-based shuffle, store map outputs in serialized form

2015-02-04 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza reassigned SPARK-4550: - Assignee: Sandy Ryza In sort-based shuffle, store map outputs in serialized form

[jira] [Updated] (SPARK-5529) Executor is still hold while BlockManager has been removed

2015-02-04 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong Shen updated SPARK-5529: - Attachment: SPARK-5529.patch Executor is still hold while BlockManager has been removed

[jira] [Commented] (SPARK-5021) GaussianMixtureEM should be faster for SparseVector input

2015-02-04 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305703#comment-14305703 ] Manoj Kumar commented on SPARK-5021: Oops, I was thinking along the completely wrong

[jira] [Commented] (SPARK-5021) GaussianMixtureEM should be faster for SparseVector input

2015-02-04 Thread Travis Galoppo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305658#comment-14305658 ] Travis Galoppo commented on SPARK-5021: --- Why not something like: {code} private def

[jira] [Commented] (SPARK-5021) GaussianMixtureEM should be faster for SparseVector input

2015-02-04 Thread Travis Galoppo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305775#comment-14305775 ] Travis Galoppo commented on SPARK-5021: --- For the vectorMean function, the resulting

[jira] [Comment Edited] (SPARK-5021) GaussianMixtureEM should be faster for SparseVector input

2015-02-04 Thread Travis Galoppo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305775#comment-14305775 ] Travis Galoppo edited comment on SPARK-5021 at 2/4/15 7:23 PM:

[jira] [Updated] (SPARK-5588) support select/filter by SQL expression for Python DataFrame

2015-02-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5588: --- Issue Type: Sub-task (was: New Feature) Parent: SPARK-5166 support select/filter by SQL

[jira] [Resolved] (SPARK-5588) support select/filter by SQL expression for Python DataFrame

2015-02-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5588. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Davies Liu support select/filter

[jira] [Updated] (SPARK-5594) SparkException: Failed to get broadcast (TorrentBroadcast)

2015-02-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5594: --- Priority: Critical (was: Major) SparkException: Failed to get broadcast (TorrentBroadcast)

[jira] [Commented] (SPARK-5594) SparkException: Failed to get broadcast (TorrentBroadcast)

2015-02-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305657#comment-14305657 ] Patrick Wendell commented on SPARK-5594: I've seen this occasionally in unit tests

[jira] [Created] (SPARK-5600) Sort order of unfinished apps can be wrong in History Server

2015-02-04 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-5600: - Summary: Sort order of unfinished apps can be wrong in History Server Key: SPARK-5600 URL: https://issues.apache.org/jira/browse/SPARK-5600 Project: Spark

[jira] [Commented] (SPARK-5529) Executor is still hold while BlockManager has been removed

2015-02-04 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305706#comment-14305706 ] Sandy Ryza commented on SPARK-5529: --- [~shenhong] [~lianhuiwang] both of these patches

[jira] [Commented] (SPARK-5600) Sort order of unfinished apps can be wrong in History Server

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305767#comment-14305767 ] Apache Spark commented on SPARK-5600: - User 'vanzin' has created a pull request for

[jira] [Resolved] (SPARK-5579) Provide support for project using SQL expression

2015-02-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5579. Resolution: Fixed Fix Version/s: 1.3.0 Provide support for project using SQL expression

[jira] [Resolved] (SPARK-5235) Determine serializability of SQLContext

2015-02-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5235. Resolution: Fixed Assignee: Reynold Xin Determine serializability of SQLContext

[jira] [Commented] (SPARK-4705) Driver retries in yarn-cluster mode always fail if event logging is enabled

2015-02-04 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305635#comment-14305635 ] Marcelo Vanzin commented on SPARK-4705: --- Hi [~twinkle], bq. Please note that as of

[jira] [Created] (SPARK-5599) Audit MLlib public APIs for 1.3

2015-02-04 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5599: Summary: Audit MLlib public APIs for 1.3 Key: SPARK-5599 URL: https://issues.apache.org/jira/browse/SPARK-5599 Project: Spark Issue Type: Task

[jira] [Commented] (SPARK-4877) userClassPathFirst doesn't handle user classes inheriting from parent

2015-02-04 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305810#comment-14305810 ] holdenk commented on SPARK-4877: Hi Matt, I don't believe we need to override loadClass,

[jira] [Commented] (SPARK-5235) Determine serializability of SQLContext

2015-02-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305827#comment-14305827 ] Reynold Xin commented on SPARK-5235: OK I talked with [~marmbrus] more. It seems like

[jira] [Resolved] (SPARK-4964) Exactly-once + WAL-free Kafka Support in Spark Streaming

2015-02-04 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-4964. -- Resolution: Fixed Fix Version/s: 1.3.0 Exactly-once + WAL-free Kafka Support in Spark

[jira] [Commented] (SPARK-4587) Model export/import

2015-02-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305973#comment-14305973 ] Sean Owen commented on SPARK-4587: -- Coming late to the discussion, with a few comments on

[jira] [Commented] (SPARK-5597) Model import/export for DecisionTree and ensembles

2015-02-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305982#comment-14305982 ] Sean Owen commented on SPARK-5597: -- Here's some code to export the MLlib RDF models to

[jira] [Commented] (SPARK-5597) Model import/export for DecisionTree and ensembles

2015-02-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305993#comment-14305993 ] Joseph K. Bradley commented on SPARK-5597: -- I'm partway done with a PR for this

[jira] [Commented] (SPARK-4877) userClassPathFirst doesn't handle user classes inheriting from parent

2015-02-04 Thread Stephen Haberman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306104#comment-14306104 ] Stephen Haberman commented on SPARK-4877: - Hi Matt, I know about the

[jira] [Commented] (SPARK-4587) Model export/import

2015-02-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305991#comment-14305991 ] Joseph K. Bradley commented on SPARK-4587: -- Thanks for the correction about

[jira] [Resolved] (SPARK-4707) Reliable Kafka Receiver can lose data if the block generator fails to store data

2015-02-04 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-4707. -- Resolution: Fixed Fix Version/s: 1.3.0 Reliable Kafka Receiver can lose data if the

[jira] [Commented] (SPARK-4587) Model export/import

2015-02-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306022#comment-14306022 ] Joseph K. Bradley commented on SPARK-4587: -- You may be right about compression;

[jira] [Commented] (SPARK-4587) Model export/import

2015-02-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306032#comment-14306032 ] Sean Owen commented on SPARK-4587: -- True; you could also store N separate PMML models! At

[jira] [Commented] (SPARK-4877) userClassPathFirst doesn't handle user classes inheriting from parent

2015-02-04 Thread Matt Whelan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306082#comment-14306082 ] Matt Whelan commented on SPARK-4877: Overriding only findClass ignores caching, which

[jira] [Commented] (SPARK-4587) Model export/import

2015-02-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306115#comment-14306115 ] Sean Owen commented on SPARK-4587: -- OK, an internal-only format makes sense. So the idea

[jira] [Created] (SPARK-5602) Better support for creating DataFrame from local data collection

2015-02-04 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-5602: -- Summary: Better support for creating DataFrame from local data collection Key: SPARK-5602 URL: https://issues.apache.org/jira/browse/SPARK-5602 Project: Spark

[jira] [Updated] (SPARK-4939) Python updateStateByKey example hang in local mode

2015-02-04 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout updated SPARK-4939: -- Affects Version/s: (was: 1.3.0) 1.2.1 Fix Version/s: 1.2.2

[jira] [Resolved] (SPARK-4939) Python updateStateByKey example hang in local mode

2015-02-04 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout resolved SPARK-4939. --- Resolution: Fixed Fixed by

[jira] [Commented] (SPARK-1302) httpd doesn't start in spark-ec2 (cc2.8xlarge)

2015-02-04 Thread Mikhail Strebkov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305879#comment-14305879 ] Mikhail Strebkov commented on SPARK-1302: - The PR fixed the issue for me, also the

[jira] [Commented] (SPARK-4905) Flaky FlumeStreamSuite test: org.apache.spark.streaming.flume.FlumeStreamSuite.flume input stream

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305955#comment-14305955 ] Apache Spark commented on SPARK-4905: - User 'harishreedharan' has created a pull

[jira] [Commented] (SPARK-5598) Model import/export for ALS

2015-02-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305978#comment-14305978 ] Sean Owen commented on SPARK-5598: -- For what it's worth, I completely made up a

[jira] [Commented] (SPARK-4587) Model export/import

2015-02-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306002#comment-14306002 ] Sean Owen commented on SPARK-4587: -- You can get hundreds of megabytes of XML, yeah. I had

[jira] [Commented] (SPARK-4587) Model export/import

2015-02-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306088#comment-14306088 ] Xiangrui Meng commented on SPARK-4587: -- [~srowen] Parquet is just an implementation

[jira] [Created] (SPARK-5601) Make streaming algorithms Java-friendly

2015-02-04 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5601: Summary: Make streaming algorithms Java-friendly Key: SPARK-5601 URL: https://issues.apache.org/jira/browse/SPARK-5601 Project: Spark Issue Type:

[jira] [Commented] (SPARK-4144) Support incremental model training of Naive Bayes classifier

2015-02-04 Thread Chris Fregly (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305351#comment-14305351 ] Chris Fregly commented on SPARK-4144: - Hi there! Any update on this? I was thinking

[jira] [Commented] (SPARK-5593) Replace BlockManager listener with Executor listener in ExecutorAllocationListener

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305375#comment-14305375 ] Apache Spark commented on SPARK-5593: - User 'lianhuiwang' has created a pull request

[jira] [Updated] (SPARK-5593) Replace BlockManager listener with Executor listener in ExecutorAllocationListener

2015-02-04 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lianhui Wang updated SPARK-5593: Description: More strictly, in ExecutorAllocationListener, we need to replace onBlockManagerAdded,

[jira] [Commented] (SPARK-5604) Remove setCheckpointDir from LDA and tree Strategy

2015-02-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306737#comment-14306737 ] Xiangrui Meng commented on SPARK-5604: -- Yes. I talked to [~tdas] about this and he

[jira] [Created] (SPARK-5610) Generate Java docs without package private classes and methods

2015-02-04 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5610: Summary: Generate Java docs without package private classes and methods Key: SPARK-5610 URL: https://issues.apache.org/jira/browse/SPARK-5610 Project: Spark

[jira] [Closed] (SPARK-5308) MD5 / SHA1 hash format doesn't match standard Maven output

2015-02-04 Thread Kuldeep (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kuldeep closed SPARK-5308. -- MD5 / SHA1 hash format doesn't match standard Maven output

[jira] [Commented] (SPARK-5609) PythonMLlibAPI trainGaussianMixture seed should use Java type

2015-02-04 Thread Meethu Mathew (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306593#comment-14306593 ] Meethu Mathew commented on SPARK-5609: -- Please assign the ticket to me.

[jira] [Commented] (SPARK-5604) Remove setCheckpointDir from LDA

2015-02-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306594#comment-14306594 ] Joseph K. Bradley commented on SPARK-5604: -- If we're doing this, then we should

[jira] [Comment Edited] (SPARK-5609) PythonMLlibAPI trainGaussianMixture seed should use Java type

2015-02-04 Thread Meethu Mathew (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306593#comment-14306593 ] Meethu Mathew edited comment on SPARK-5609 at 2/5/15 4:03 AM: --

[jira] [Updated] (SPARK-5586) Automatically provide sqlContext in Spark shell

2015-02-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5586: --- Priority: Blocker (was: Critical) Automatically provide sqlContext in Spark shell

[jira] [Updated] (SPARK-5586) Automatically provide sqlContext in Spark shell

2015-02-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5586: --- Assignee: (was: Patrick Wendell) Automatically provide sqlContext in Spark shell

[jira] [Created] (SPARK-5604) Remove setCheckpointDir from LDA

2015-02-04 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5604: Summary: Remove setCheckpointDir from LDA Key: SPARK-5604 URL: https://issues.apache.org/jira/browse/SPARK-5604 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-4587) Model export/import

2015-02-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-4587: - Description: This is an umbrella JIRA for one of the most requested features on the user

[jira] [Commented] (SPARK-5605) Allow using String to specify colum name in DSL aggregate functions

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306326#comment-14306326 ] Apache Spark commented on SPARK-5605: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-5599) Audit MLlib public APIs for 1.3

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306325#comment-14306325 ] Apache Spark commented on SPARK-5599: - User 'mengxr' has created a pull request for

[jira] [Commented] (SPARK-5602) Better support for creating DataFrame from local data collection

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306128#comment-14306128 ] Apache Spark commented on SPARK-5602: - User 'rxin' has created a pull request for this

[jira] [Resolved] (SPARK-5591) NoSuchObjectException for CTAS

2015-02-04 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5591. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4365

[jira] [Resolved] (SPARK-5587) Support change database owner

2015-02-04 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5587. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4357

[jira] [Updated] (SPARK-5605) Allow using String to specify colum name in DSL aggregate functions

2015-02-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5605: --- Summary: Allow using String to specify colum name in DSL aggregate functions (was: Allow passing in

[jira] [Created] (SPARK-5605) Allow passing in String's directly into DSL aggregate functions

2015-02-04 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-5605: -- Summary: Allow passing in String's directly into DSL aggregate functions Key: SPARK-5605 URL: https://issues.apache.org/jira/browse/SPARK-5605 Project: Spark

[jira] [Commented] (SPARK-4550) In sort-based shuffle, store map outputs in serialized form

2015-02-04 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306321#comment-14306321 ] Sandy Ryza commented on SPARK-4550: --- I also just tried this out using an object that's

[jira] [Updated] (SPARK-5602) Better support for creating DataFrame from local data collection

2015-02-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5602: --- Issue Type: Sub-task (was: Improvement) Parent: SPARK-5166 Better support for creating

[jira] [Commented] (SPARK-4587) Model export/import

2015-02-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306147#comment-14306147 ] Joseph K. Bradley commented on SPARK-4587: -- It sounds like we're converging!

[jira] [Created] (SPARK-5603) Preinsert casting and renaming rule is needed in the Analyzer

2015-02-04 Thread Yin Huai (JIRA)
Yin Huai created SPARK-5603: --- Summary: Preinsert casting and renaming rule is needed in the Analyzer Key: SPARK-5603 URL: https://issues.apache.org/jira/browse/SPARK-5603 Project: Spark Issue

[jira] [Resolved] (SPARK-5426) SQL Java API helper methods

2015-02-04 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5426. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4243

[jira] [Commented] (SPARK-5603) Preinsert casting and renaming rule is needed in the Analyzer

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5603?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14306205#comment-14306205 ] Apache Spark commented on SPARK-5603: - User 'yhuai' has created a pull request for

[jira] [Commented] (SPARK-5595) In memory data cache should be invalidated after insert into/overwrite

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14306204#comment-14306204 ] Apache Spark commented on SPARK-5595: - User 'yhuai' has created a pull request for

[jira] [Resolved] (SPARK-5118) Create table test stored as parquet as select ... report error

2015-02-04 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5118. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3921

[jira] [Resolved] (SPARK-5577) Create a convenient way for Python users to register SQL UDFs

2015-02-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5577. Resolution: Fixed Fix Version/s: 1.3.0 Create a convenient way for Python users to register

[jira] [Comment Edited] (SPARK-4550) In sort-based shuffle, store map outputs in serialized form

2015-02-04 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14304864#comment-14304864 ] Sandy Ryza edited comment on SPARK-4550 at 2/5/15 12:36 AM: I

[jira] [Commented] (SPARK-5021) GaussianMixtureEM should be faster for SparseVector input

2015-02-04 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14306538#comment-14306538 ] Manoj Kumar commented on SPARK-5021: Can you please explain, what do you mean by soft

[jira] [Commented] (SPARK-5388) Provide a stable application submission gateway in standalone cluster mode

2015-02-04 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14306552#comment-14306552 ] Andrew Or commented on SPARK-5388: -- Hi [~vanzin], thank you for all of your comments. I

[jira] [Resolved] (SPARK-5411) Allow SparkListeners to be specified in SparkConf and loaded when creating SparkContext

2015-02-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-5411. Resolution: Fixed Fix Version/s: 1.3.0 Allow SparkListeners to be specified in

[jira] [Resolved] (SPARK-5605) Allow using String to specify colum name in DSL aggregate functions

2015-02-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5605. Resolution: Fixed Fix Version/s: 1.3.0 Allow using String to specify colum name in DSL
