[jira] [Updated] (SPARK-8390) Update DirectKafkaWordCount examples to show how offset ranges can be used

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8390: - Target Version/s: 1.5.0 (was: 1.4.1, 1.5.0) Update DirectKafkaWordCount examples to show how offset

[jira] [Commented] (SPARK-7050) Kafka-assembly should keep consistent behavior under sbt and maven compilation

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7050?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614694#comment-14614694 ] Sean Owen commented on SPARK-7050: -- Yeah so [~jerryshao] given the PR I'm suggesting you

[jira] [Updated] (SPARK-7402) JSON serialization of params

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7402: - Target Version/s: 1.5.0 (was: 1.4.1, 1.5.0) JSON serialization of params

[jira] [Updated] (SPARK-7410) Add option to avoid broadcasting configuration with newAPIHadoopFile

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7410: - Target Version/s: (was: 1.4.1) Add option to avoid broadcasting configuration with newAPIHadoopFile

[jira] [Commented] (SPARK-7050) Fix Python Kafka test assembly jar not found issue under Maven build

2015-07-06 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7050?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614698#comment-14614698 ] Saisai Shao commented on SPARK-7050: Thanks [~srowen], how about this title? Fix

[jira] [Commented] (SPARK-5133) Feature Importance for Decision Tree (Ensembles)

2015-07-06 Thread Peter Prettenhofer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614799#comment-14614799 ] Peter Prettenhofer commented on SPARK-5133: --- [~yalamart] For some reason i

[jira] [Commented] (SPARK-8837) support using keyword in column name

2015-07-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614865#comment-14614865 ] Apache Spark commented on SPARK-8837: - User 'cloud-fan' has created a pull request for

[jira] [Assigned] (SPARK-8837) support using keyword in column name

2015-07-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8837: --- Assignee: (was: Apache Spark) support using keyword in column name

[jira] [Assigned] (SPARK-7114) parse error for DataFrame.filter after aggregate

2015-07-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7114: --- Assignee: Apache Spark parse error for DataFrame.filter after aggregate

[jira] [Assigned] (SPARK-7114) parse error for DataFrame.filter after aggregate

2015-07-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7114: --- Assignee: (was: Apache Spark) parse error for DataFrame.filter after aggregate

[jira] [Commented] (SPARK-8518) Log-linear models for survival analysis

2015-07-06 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614795#comment-14614795 ] Yanbo Liang commented on SPARK-8518: OK, I will first finish the design documents and

[jira] [Commented] (SPARK-5133) Feature Importance for Decision Tree (Ensembles)

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614812#comment-14614812 ] Sean Owen commented on SPARK-5133: -- We don't generally Assign JIRAs while they're being

[jira] [Created] (SPARK-8836) Sorted join

2015-07-06 Thread Daniel Darabos (JIRA)
Daniel Darabos created SPARK-8836: - Summary: Sorted join Key: SPARK-8836 URL: https://issues.apache.org/jira/browse/SPARK-8836 Project: Spark Issue Type: Improvement Components:

[jira] [Commented] (SPARK-8646) PySpark does not run on YARN

2015-07-06 Thread Juliet Hougland (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614856#comment-14614856 ] Juliet Hougland commented on SPARK-8646: [~sowen] The pandas error came when I

[jira] [Created] (SPARK-8837) support using keyword in column name

2015-07-06 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-8837: -- Summary: support using keyword in column name Key: SPARK-8837 URL: https://issues.apache.org/jira/browse/SPARK-8837 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-7114) parse error for DataFrame.filter after aggregate

2015-07-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614866#comment-14614866 ] Apache Spark commented on SPARK-7114: - User 'cloud-fan' has created a pull request for

[jira] [Assigned] (SPARK-8837) support using keyword in column name

2015-07-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8837: --- Assignee: Apache Spark support using keyword in column name

[jira] [Commented] (SPARK-8646) PySpark does not run on YARN

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614885#comment-14614885 ] Sean Owen commented on SPARK-8646: -- Right, none of this uses pandas directly. As

[jira] [Comment Edited] (SPARK-8834) Throttle DStreams dynamically through back-pressure information

2015-07-06 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-8834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614777#comment-14614777 ] François Garillot edited comment on SPARK-8834 at 7/6/15 11:15 AM:

[jira] [Created] (SPARK-8832) insertInto() throws error in sparkR

2015-07-06 Thread Amar Gondaliya (JIRA)
Amar Gondaliya created SPARK-8832: - Summary: insertInto() throws error in sparkR Key: SPARK-8832 URL: https://issues.apache.org/jira/browse/SPARK-8832 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-8646) PySpark does not run on YARN

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614673#comment-14614673 ] Sean Owen commented on SPARK-8646: -- [~j_houg] is the resolution here just that pandas has

[jira] [Updated] (SPARK-8788) Java unit test for PCA transformer

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8788: - Affects Version/s: (was: 1.5.0) Priority: Minor (was: Major) [~yanboliang] please read

[jira] [Commented] (SPARK-8743) Deregister Codahale metrics for streaming when StreamingContext is closed

2015-07-06 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614681#comment-14614681 ] Tathagata Das commented on SPARK-8743: -- [~neelesh77] Any ETA on this? Deregister

[jira] [Commented] (SPARK-5133) Feature Importance for Decision Tree (Ensembles)

2015-07-06 Thread Peter Prettenhofer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614682#comment-14614682 ] Peter Prettenhofer commented on SPARK-5133: --- [~yalamart] I'm already working on

[jira] [Updated] (SPARK-8828) Revert the change of SPARK-5680

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8828: - Component/s: SQL Revert the change of SPARK-5680 --- Key:

[jira] [Updated] (SPARK-8593) History Server doesn't show complete application when one attempt inprogress

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8593: - Target Version/s: (was: 1.4.1) History Server doesn't show complete application when one attempt

[jira] [Updated] (SPARK-8414) Ensure ClosureCleaner actually triggers clean ups

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8414: - Target Version/s: 1.5.0 (was: 1.4.1, 1.5.0) Ensure ClosureCleaner actually triggers clean ups

[jira] [Updated] (SPARK-6266) PySpark SparseVector missing doc for size, indices, values

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6266: - Target Version/s: 1.5.0 (was: 1.4.1, 1.5.0) PySpark SparseVector missing doc for size, indices, values

[jira] [Updated] (SPARK-6129) Add a section in user guide for model evaluation

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6129: - Target Version/s: 1.5.0 (was: 1.4.1, 1.5.0) Add a section in user guide for model evaluation

[jira] [Updated] (SPARK-5905) Improve RowMatrix user guide and doc.

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5905: - Target Version/s: 1.5.0 (was: 1.4.1, 1.5.0) Improve RowMatrix user guide and doc.

[jira] [Updated] (SPARK-6174) Improve doc: Python ALS, MatrixFactorizationModel

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6174: - Target Version/s: 1.5.0 (was: 1.4.1, 1.5.0) Improve doc: Python ALS, MatrixFactorizationModel

[jira] [Updated] (SPARK-8016) YARN cluster / client modes have different app names for python

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8016: - Target Version/s: 1.5.0 (was: 1.4.1, 1.5.0) YARN cluster / client modes have different app names for

[jira] [Comment Edited] (SPARK-8834) Throttle DStreams dynamically through back-pressure information

2015-07-06 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-8834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614777#comment-14614777 ] François Garillot edited comment on SPARK-8834 at 7/6/15 9:33 AM:

[jira] [Commented] (SPARK-8833) Kafka Direct API support offset in zookeeper

2015-07-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614677#comment-14614677 ] Apache Spark commented on SPARK-8833: - User 'guowei2' has created a pull request for

[jira] [Updated] (SPARK-8400) ml.ALS doesn't handle -1 block size

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8400: - Target Version/s: 1.3.2, 1.4.2, 1.5.0 (was: 1.3.2, 1.4.1, 1.5.0) ml.ALS doesn't handle -1 block size

[jira] [Commented] (SPARK-8807) Add between operator in SparkR

2015-07-06 Thread Venkata Vineel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614688#comment-14614688 ] Venkata Vineel commented on SPARK-8807: --- [~yu_ishikawa] Can you please add more

[jira] [Updated] (SPARK-8050) Make Savable and Loader Java-friendly.

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8050: - Target Version/s: 1.5.0 (was: 1.4.1, 1.5.0) Make Savable and Loader Java-friendly.

[jira] [Updated] (SPARK-7808) Package doc for spark.ml.feature

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7808: - Target Version/s: 1.5.0 (was: 1.4.1, 1.5.0) Package doc for spark.ml.feature

[jira] [Commented] (SPARK-8547) xgboost exploration

2015-07-06 Thread Venkata Vineel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8547?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614690#comment-14614690 ] Venkata Vineel commented on SPARK-8547: --- [~josephkb] Can I get started on this. Is

[jira] [Updated] (SPARK-7050) Fix Python Kafka test assembly jar not found issue under Maven build

2015-07-06 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao updated SPARK-7050: --- Summary: Fix Python Kafka test assembly jar not found issue under Maven build (was: Kafka-assembly

[jira] [Commented] (SPARK-5133) Feature Importance for Decision Tree (Ensembles)

2015-07-06 Thread Venkata Vineel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614715#comment-14614715 ] Venkata Vineel commented on SPARK-5133: --- [~peter.prettenhofer] Please assign it to

[jira] [Assigned] (SPARK-8830) levenshtein directly on top of UTF8String

2015-07-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8830: --- Assignee: Apache Spark levenshtein directly on top of UTF8String

[jira] [Assigned] (SPARK-8830) levenshtein directly on top of UTF8String

2015-07-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8830: --- Assignee: (was: Apache Spark) levenshtein directly on top of UTF8String

[jira] [Commented] (SPARK-8830) levenshtein directly on top of UTF8String

2015-07-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614726#comment-14614726 ] Apache Spark commented on SPARK-8830: - User 'tarekauel' has created a pull request for

[jira] [Updated] (SPARK-7398) Add back-pressure to Spark Streaming

2015-07-06 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-7398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] François Garillot updated SPARK-7398: - Description: Spark Streaming has trouble dealing with situations where batch processing

[jira] [Created] (SPARK-8835) Provide pluggable Congestion Strategies to deal with Streaming load

2015-07-06 Thread JIRA
François Garillot created SPARK-8835: Summary: Provide pluggable Congestion Strategies to deal with Streaming load Key: SPARK-8835 URL: https://issues.apache.org/jira/browse/SPARK-8835 Project:

[jira] [Created] (SPARK-8834) Throttle DStreams dynamically through back-pressure information

2015-07-06 Thread JIRA
François Garillot created SPARK-8834: Summary: Throttle DStreams dynamically through back-pressure information Key: SPARK-8834 URL: https://issues.apache.org/jira/browse/SPARK-8834 Project: Spark

[jira] [Commented] (SPARK-6981) [SQL] SparkPlanner and QueryExecution should be factored out from SQLContext

2015-07-06 Thread Santiago M. Mola (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614645#comment-14614645 ] Santiago M. Mola commented on SPARK-6981: - Any progress on this? [SQL]

[jira] [Created] (SPARK-8833) Kafka Direct API support offset in zookeeper

2015-07-06 Thread guowei (JIRA)
guowei created SPARK-8833: - Summary: Kafka Direct API support offset in zookeeper Key: SPARK-8833 URL: https://issues.apache.org/jira/browse/SPARK-8833 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-8747) fix EqualNullSafe for binary type

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8747: - Assignee: Wenchen Fan fix EqualNullSafe for binary type -

[jira] [Updated] (SPARK-7401) Dot product and squared_distances should be vectorized in Vectors

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7401: - Assignee: Manoj Kumar Dot product and squared_distances should be vectorized in Vectors

[jira] [Commented] (SPARK-6001) K-Means clusterer should return the assignments of input points to clusters

2015-07-06 Thread Venkata Vineel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614720#comment-14614720 ] Venkata Vineel commented on SPARK-6001: --- [~derrickburns] Can you please assign this

[jira] [Updated] (SPARK-7398) Add back-pressure to Spark Streaming

2015-07-06 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-7398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] François Garillot updated SPARK-7398: - Description: Spark Streaming has trouble dealing with situations where batch processing

[jira] [Updated] (SPARK-8835) Provide pluggable Congestion Strategies to deal with Streaming load

2015-07-06 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-8835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] François Garillot updated SPARK-8835: - Shepherd: Tathagata Das Provide pluggable Congestion Strategies to deal with Streaming

[jira] [Commented] (SPARK-8833) Kafka Direct API support offset in zookeeper

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614670#comment-14614670 ] Sean Owen commented on SPARK-8833: -- No, you actually pass the offsets you want to begin

[jira] [Assigned] (SPARK-8833) Kafka Direct API support offset in zookeeper

2015-07-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8833: --- Assignee: Apache Spark Kafka Direct API support offset in zookeeper

[jira] [Assigned] (SPARK-8833) Kafka Direct API support offset in zookeeper

2015-07-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8833: --- Assignee: (was: Apache Spark) Kafka Direct API support offset in zookeeper

[jira] [Updated] (SPARK-8834) Throttle DStreams dynamically through back-pressure information

2015-07-06 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-8834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] François Garillot updated SPARK-8834: - Description: This aims to have Spark Streaming be more resilient to high-throughput

[jira] [Commented] (SPARK-8834) Throttle DStreams dynamically through back-pressure information

2015-07-06 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-8834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614777#comment-14614777 ] François Garillot commented on SPARK-8834: -- More feature-rich, in that (see

[jira] [Updated] (SPARK-8834) Throttle DStreams dynamically through back-pressure information

2015-07-06 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-8834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] François Garillot updated SPARK-8834: - Target Version/s: 1.5.0 Throttle DStreams dynamically through back-pressure information

[jira] [Updated] (SPARK-8835) Provide pluggable Congestion Strategies to deal with Streaming load

2015-07-06 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-8835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] François Garillot updated SPARK-8835: - Description: Second part of [SPARK-7398|https://issues.apache.org/jira/browse/SPARK-7398]

[jira] [Commented] (SPARK-6724) Model import/export for FPGrowth

2015-07-06 Thread Hrishikesh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614920#comment-14614920 ] Hrishikesh commented on SPARK-6724: --- I am facing some issues in my code:

[jira] [Assigned] (SPARK-8839) Thrift Sever will throw `java.util.NoSuchElementException: key not found` exception when many clients connect it

2015-07-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8839: --- Assignee: Apache Spark Thrift Sever will throw `java.util.NoSuchElementException: key not

[jira] [Commented] (SPARK-8838) Add config to enable/disable merging part-files when merging parquet schema

2015-07-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614893#comment-14614893 ] Apache Spark commented on SPARK-8838: - User 'viirya' has created a pull request for

[jira] [Assigned] (SPARK-8838) Add config to enable/disable merging part-files when merging parquet schema

2015-07-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8838: --- Assignee: Apache Spark Add config to enable/disable merging part-files when merging parquet

[jira] [Created] (SPARK-8838) Add config to enable/disable merging part-files when merging parquet schema

2015-07-06 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-8838: -- Summary: Add config to enable/disable merging part-files when merging parquet schema Key: SPARK-8838 URL: https://issues.apache.org/jira/browse/SPARK-8838

[jira] [Assigned] (SPARK-8838) Add config to enable/disable merging part-files when merging parquet schema

2015-07-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8838: --- Assignee: (was: Apache Spark) Add config to enable/disable merging part-files when

[jira] [Created] (SPARK-8839) Thrift Sever will throw `java.util.NoSuchElementException: key not found` exception when many clients connect it

2015-07-06 Thread SaintBacchus (JIRA)
SaintBacchus created SPARK-8839: --- Summary: Thrift Sever will throw `java.util.NoSuchElementException: key not found` exception when many clients connect it Key: SPARK-8839 URL:

[jira] [Assigned] (SPARK-8839) Thrift Sever will throw `java.util.NoSuchElementException: key not found` exception when many clients connect it

2015-07-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8839: --- Assignee: (was: Apache Spark) Thrift Sever will throw

[jira] [Commented] (SPARK-8839) Thrift Sever will throw `java.util.NoSuchElementException: key not found` exception when many clients connect it

2015-07-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614955#comment-14614955 ] Apache Spark commented on SPARK-8839: - User 'SaintBacchus' has created a pull request

[jira] [Commented] (SPARK-8596) Install and configure RStudio server on Spark EC2

2015-07-06 Thread Vincent Warmerdam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614968#comment-14614968 ] Vincent Warmerdam commented on SPARK-8596: -- done and done. this task feels like

[jira] [Commented] (SPARK-8684) Update R version in Spark EC2 AMI

2015-07-06 Thread Vincent Warmerdam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614990#comment-14614990 ] Vincent Warmerdam commented on SPARK-8684: -- just tried it all, and it just seems

[jira] [Comment Edited] (SPARK-8684) Update R version in Spark EC2 AMI

2015-07-06 Thread Vincent Warmerdam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614990#comment-14614990 ] Vincent Warmerdam edited comment on SPARK-8684 at 7/6/15 12:50 PM:

[jira] [Commented] (SPARK-8724) Need documentation on how to deploy or use SparkR in Spark 1.4.0+

2015-07-06 Thread Vincent Warmerdam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14615007#comment-14615007 ] Vincent Warmerdam commented on SPARK-8724: -- ``` spark_link - system('cat

[jira] [Updated] (SPARK-6197) handle json parse exception for eventlog file not finished writing

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6197: - Target Version/s: 1.3.2, 1.4.2 (was: 1.3.2, 1.4.1) handle json parse exception for eventlog file not

[jira] [Updated] (SPARK-4231) Add RankingMetrics to examples.MovieLensALS

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4231: - Target Version/s: (was: 1.3.0) Priority: Minor (was: Major) Add RankingMetrics to

[jira] [Updated] (SPARK-3828) Spark returns inconsistent results when building with different Hadoop version

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-3828: - Target Version/s: (was: 1.2.0) Spark returns inconsistent results when building with different Hadoop

[jira] [Commented] (SPARK-8841) Fix partition pruning percentage log message

2015-07-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14615013#comment-14615013 ] Apache Spark commented on SPARK-8841: - User 'srlindemann' has created a pull request

[jira] [Updated] (SPARK-2750) Add Https support for Web UI

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-2750: - Target Version/s: (was: 1.0.2) Add Https support for Web UI

[jira] [Updated] (SPARK-4123) Show dependency changes in pull requests

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4123: - Target Version/s: (was: 1.2.0) Show dependency changes in pull requests

[jira] [Assigned] (SPARK-8841) Fix partition pruning percentage log message

2015-07-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8841: --- Assignee: Apache Spark Fix partition pruning percentage log message

[jira] [Created] (SPARK-8840) Float type coercion with hiveContext

2015-07-06 Thread Evgeny SInelnikov (JIRA)
Evgeny SInelnikov created SPARK-8840: Summary: Float type coercion with hiveContext Key: SPARK-8840 URL: https://issues.apache.org/jira/browse/SPARK-8840 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-8833) Kafka Direct API support offset in zookeeper

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8833: - Due Date: (was: 6/Jul/15) Priority: Minor (was: Major) Issue Type: Improvement (was:

[jira] [Created] (SPARK-8841) Fix partition pruning percentage log message

2015-07-06 Thread Steve Lindemann (JIRA)
Steve Lindemann created SPARK-8841: -- Summary: Fix partition pruning percentage log message Key: SPARK-8841 URL: https://issues.apache.org/jira/browse/SPARK-8841 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-8841) Fix partition pruning percentage log message

2015-07-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8841: --- Assignee: (was: Apache Spark) Fix partition pruning percentage log message

[jira] [Updated] (SPARK-4454) Race condition in DAGScheduler

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4454: - Target Version/s: (was: 1.2.2, 1.3.0) Race condition in DAGScheduler --

[jira] [Updated] (SPARK-2063) Creating a SchemaRDD via sql() does not correctly resolve nested types

2015-07-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-2063: - Target Version/s: (was: 1.2.0) Creating a SchemaRDD via sql() does not correctly resolve nested types

[jira] [Commented] (SPARK-4729) Add time series subsampling to MLlib

2015-07-06 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4729?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14615016#comment-14615016 ] RJ Nowling commented on SPARK-4729: --- Hi [~yalamart], I haven't looked at this in quite

[jira] [Commented] (SPARK-8833) Kafka Direct API support offset in zookeeper

2015-07-06 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14615056#comment-14615056 ] Cody Koeninger commented on SPARK-8833: --- this basic idea has been discussed before

[jira] [Updated] (SPARK-5016) GaussianMixtureEM should distribute matrix inverse for large numFeatures, k

2015-07-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5016: - Target Version/s: 1.5.0 GaussianMixtureEM should distribute matrix inverse for large

[jira] [Updated] (SPARK-8703) Add CountVectorizer as a ml transformer to convert document to words count vector

2015-07-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-8703: - Target Version/s: 1.5.0 Add CountVectorizer as a ml transformer to convert document to

[jira] [Updated] (SPARK-8092) OneVsRest doesn't allow flexibility in label/ feature column renaming

2015-07-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-8092: - Target Version/s: 1.5.0 OneVsRest doesn't allow flexibility in label/ feature column

[jira] [Updated] (SPARK-8178) date/time function: quarter

2015-07-06 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-8178: --- Description: quarter(timestamp): int Returns the quarter of the year for a date, timestamp, or

[jira] [Commented] (SPARK-8818) In should not take Any not Column

2015-07-06 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8818?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14615659#comment-14615659 ] Yu Ishikawa commented on SPARK-8818: I'm happy to hear that. Thank you for closing

[jira] [Comment Edited] (SPARK-8743) Deregister Codahale metrics for streaming when StreamingContext is closed

2015-07-06 Thread Neelesh Srinivas Salian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14615553#comment-14615553 ] Neelesh Srinivas Salian edited comment on SPARK-8743 at 7/6/15 8:06 PM:

[jira] [Updated] (SPARK-8092) OneVsRest doesn't allow flexibility in label/ feature column renaming

2015-07-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-8092: - Shepherd: Joseph K. Bradley OneVsRest doesn't allow flexibility in label/ feature column

[jira] [Resolved] (SPARK-7114) parse error for DataFrame.filter after aggregate

2015-07-06 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7114. Resolution: Fixed Assignee: Wenchen Fan Fix Version/s: 1.5.0 parse error for

[jira] [Resolved] (SPARK-8837) support using keyword in column name

2015-07-06 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-8837. Resolution: Fixed Assignee: Wenchen Fan Fix Version/s: 1.5.0 support using keyword

[jira] [Comment Edited] (SPARK-8743) Deregister Codahale metrics for streaming when StreamingContext is closed

2015-07-06 Thread Neelesh Srinivas Salian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14615553#comment-14615553 ] Neelesh Srinivas Salian edited comment on SPARK-8743 at 7/6/15 8:36 PM:

[jira] [Commented] (SPARK-5556) Latent Dirichlet Allocation (LDA) using Gibbs sampler

2015-07-06 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14615616#comment-14615616 ] Pedro Rodriguez commented on SPARK-5556: I am still interested, but was unsure of

  1   2   3   4   >