[jira] [Assigned] (SPARK-5972) Cache residuals for GradientBoostedTrees during training

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5972: --- Assignee: (was: Apache Spark) > Cache residuals for GradientBoostedTrees during training

[jira] [Assigned] (SPARK-5972) Cache residuals for GradientBoostedTrees during training

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5972: --- Assignee: Apache Spark > Cache residuals for GradientBoostedTrees during training > -

[jira] [Created] (SPARK-6676) Add hadoop 2.4+ for profiles in POM.xml

2015-04-02 Thread Zhang, Liye (JIRA)
Zhang, Liye created SPARK-6676: -- Summary: Add hadoop 2.4+ for profiles in POM.xml Key: SPARK-6676 URL: https://issues.apache.org/jira/browse/SPARK-6676 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-6676) Add hadoop 2.4+ for profiles in POM.xml

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392466#comment-14392466 ] Apache Spark commented on SPARK-6676: - User 'liyezhang556520' has created a pull reque

[jira] [Assigned] (SPARK-6676) Add hadoop 2.4+ for profiles in POM.xml

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6676: --- Assignee: Apache Spark > Add hadoop 2.4+ for profiles in POM.xml > --

[jira] [Assigned] (SPARK-6676) Add hadoop 2.4+ for profiles in POM.xml

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6676: --- Assignee: (was: Apache Spark) > Add hadoop 2.4+ for profiles in POM.xml > ---

[jira] [Commented] (SPARK-2243) Support multiple SparkContexts in the same JVM

2015-04-02 Thread Sudharma Puranik (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392474#comment-14392474 ] Sudharma Puranik commented on SPARK-2243: - [~jahubba] : Running on the seperate JV

[jira] [Commented] (SPARK-2243) Support multiple SparkContexts in the same JVM

2015-04-02 Thread Sudharma Puranik (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392475#comment-14392475 ] Sudharma Puranik commented on SPARK-2243: - [~jahubba] : Running on the seperate JV

[jira] [Issue Comment Deleted] (SPARK-2243) Support multiple SparkContexts in the same JVM

2015-04-02 Thread Sudharma Puranik (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sudharma Puranik updated SPARK-2243: Comment: was deleted (was: [~jahubba] : Running on the seperate JVMs is not a workaround but

[jira] [Commented] (SPARK-2243) Support multiple SparkContexts in the same JVM

2015-04-02 Thread Sudharma Puranik (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392519#comment-14392519 ] Sudharma Puranik commented on SPARK-2243: - [~sowen] : My reply was for Jason wher

[jira] [Resolved] (SPARK-6676) Add hadoop 2.4+ for profiles in POM.xml

2015-04-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6676. -- Resolution: Won't Fix This is already supported by the hadoop-2.4 profile, which mean "2.4+" > Add hado

[jira] [Created] (SPARK-6677) pyspark.sql nondeterministic issue with row fields

2015-04-02 Thread Stefano Parmesan (JIRA)
Stefano Parmesan created SPARK-6677: --- Summary: pyspark.sql nondeterministic issue with row fields Key: SPARK-6677 URL: https://issues.apache.org/jira/browse/SPARK-6677 Project: Spark Issue

[jira] [Commented] (SPARK-2243) Support multiple SparkContexts in the same JVM

2015-04-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392499#comment-14392499 ] Sean Owen commented on SPARK-2243: -- The best thing to do is state your use case. Not a wo

[jira] [Resolved] (SPARK-6672) createDataFrame from RDD[Row] with UDTs cannot be saved

2015-04-02 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-6672. --- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5329 [https://github.com/

[jira] [Comment Edited] (SPARK-2243) Support multiple SparkContexts in the same JVM

2015-04-02 Thread Sudharma Puranik (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392519#comment-14392519 ] Sudharma Puranik edited comment on SPARK-2243 at 4/2/15 10:36 AM: --

[jira] [Created] (SPARK-6678) select count(DISTINCT C_UID) from parquetdir may be can optimize

2015-04-02 Thread Littlestar (JIRA)
Littlestar created SPARK-6678: - Summary: select count(DISTINCT C_UID) from parquetdir may be can optimize Key: SPARK-6678 URL: https://issues.apache.org/jira/browse/SPARK-6678 Project: Spark Iss

[jira] [Updated] (SPARK-6679) java.lang.ClassNotFoundException on Mesos fine grained mode and input replication

2015-04-02 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-6679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ondřej Smola updated SPARK-6679: Description: Spark Streaming 1.3.0, Mesos 0.21.1 - Only when using fine grained mode and receiver i

[jira] [Commented] (SPARK-2243) Support multiple SparkContexts in the same JVM

2015-04-02 Thread sam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392616#comment-14392616 ] sam commented on SPARK-2243: [~srowen] The real issue here and use case, is to be able to ch

[jira] [Assigned] (SPARK-4449) specify port range in spark

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4449: --- Assignee: Apache Spark > specify port range in spark > --- > >

[jira] [Commented] (SPARK-6407) Streaming ALS for Collaborative Filtering

2015-04-02 Thread Chris Fregly (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392646#comment-14392646 ] Chris Fregly commented on SPARK-6407: - from [~mengxr] "The online update should be i

[jira] [Created] (SPARK-6679) java.lang.ClassNotFoundException on Mesos fine grained mode and input replication

2015-04-02 Thread JIRA
Ondřej Smola created SPARK-6679: --- Summary: java.lang.ClassNotFoundException on Mesos fine grained mode and input replication Key: SPARK-6679 URL: https://issues.apache.org/jira/browse/SPARK-6679 Project

[jira] [Assigned] (SPARK-4449) specify port range in spark

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4449: --- Assignee: (was: Apache Spark) > specify port range in spark > ---

[jira] [Commented] (SPARK-2243) Support multiple SparkContexts in the same JVM

2015-04-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392721#comment-14392721 ] Sean Owen commented on SPARK-2243: -- [~sams] in this particular case, can you simply set b

[jira] [Commented] (SPARK-6407) Streaming ALS for Collaborative Filtering

2015-04-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392742#comment-14392742 ] Sean Owen commented on SPARK-6407: -- ALS doesn't use gradient descent, at least not enough

[jira] [Created] (SPARK-6680) Be able to specifie IP for spark-shell(spark driver) blocker for Docker integration

2015-04-02 Thread Egor Pakhomov (JIRA)
Egor Pakhomov created SPARK-6680: Summary: Be able to specifie IP for spark-shell(spark driver) blocker for Docker integration Key: SPARK-6680 URL: https://issues.apache.org/jira/browse/SPARK-6680 Pro

[jira] [Commented] (SPARK-2243) Support multiple SparkContexts in the same JVM

2015-04-02 Thread sam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392780#comment-14392780 ] sam commented on SPARK-2243: // sam in this particular case, can you simply set both of these

[jira] [Commented] (SPARK-2243) Support multiple SparkContexts in the same JVM

2015-04-02 Thread sam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392798#comment-14392798 ] sam commented on SPARK-2243: What I would suggest though, is putting an `assert`/`requires` so

[jira] [Commented] (SPARK-2243) Support multiple SparkContexts in the same JVM

2015-04-02 Thread Jason Hubbard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392804#comment-14392804 ] Jason Hubbard commented on SPARK-2243: -- I don't think programmatically spinning up JV

[jira] [Commented] (SPARK-2243) Support multiple SparkContexts in the same JVM

2015-04-02 Thread sam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392848#comment-14392848 ] sam commented on SPARK-2243: Yup, a singleton would make sense, it's creation is side effectin

[jira] [Comment Edited] (SPARK-2243) Support multiple SparkContexts in the same JVM

2015-04-02 Thread sam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392848#comment-14392848 ] sam edited comment on SPARK-2243 at 4/2/15 3:37 PM: Yup, a singleton w

[jira] [Commented] (SPARK-2243) Support multiple SparkContexts in the same JVM

2015-04-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392860#comment-14392860 ] Sean Owen commented on SPARK-2243: -- [~sams] the {{SparkContext}} constructor will throw a

[jira] [Commented] (SPARK-5452) We are migrating Tera Data SQL to Spark SQL. Query is taking long time. Please have a look on this issue

2015-04-02 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392901#comment-14392901 ] Nicholas Chammas commented on SPARK-5452: - Yeah, as Sean said, this post as it sta

[jira] [Commented] (SPARK-6569) Kafka directInputStream logs what appear to be incorrect warnings

2015-04-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392934#comment-14392934 ] Sean Owen commented on SPARK-6569: -- [~minisaw] are you going to submit a PR to reduce the

[jira] [Assigned] (SPARK-6209) ExecutorClassLoader can leak connections after failing to load classes from the REPL class server

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6209: --- Assignee: Apache Spark (was: Josh Rosen) > ExecutorClassLoader can leak connections after fa

[jira] [Assigned] (SPARK-6209) ExecutorClassLoader can leak connections after failing to load classes from the REPL class server

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6209: --- Assignee: Josh Rosen (was: Apache Spark) > ExecutorClassLoader can leak connections after fa

[jira] [Commented] (SPARK-6662) Allow variable substitution in spark.yarn.historyServer.address

2015-04-02 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392963#comment-14392963 ] Cheolsoo Park commented on SPARK-6662: -- [~srowen], thank you for your comment. {quote

[jira] [Commented] (SPARK-6506) python support yarn cluster mode requires SPARK_HOME to be set

2015-04-02 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392971#comment-14392971 ] Marcelo Vanzin commented on SPARK-6506: --- Maybe you're running into SPARK-5808? > py

[jira] [Commented] (SPARK-765) Test suite should run Spark example programs

2015-04-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392975#comment-14392975 ] Josh Rosen commented on SPARK-765: -- Hi [~yuu.ishik...@gmail.com], It should be fined to a

[jira] [Commented] (SPARK-6506) python support yarn cluster mode requires SPARK_HOME to be set

2015-04-02 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392978#comment-14392978 ] Thomas Graves commented on SPARK-6506: -- No it was built with maven and the pyspark ar

[jira] [Commented] (SPARK-6569) Kafka directInputStream logs what appear to be incorrect warnings

2015-04-02 Thread Platon Potapov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392992#comment-14392992 ] Platon Potapov commented on SPARK-6569: --- If we talk about log level reduction only,

[jira] [Commented] (SPARK-6618) HiveMetastoreCatalog.lookupRelation should use fine-grained lock

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392991#comment-14392991 ] Apache Spark commented on SPARK-6618: - User 'yhuai' has created a pull request for thi

[jira] [Commented] (SPARK-5992) Locality Sensitive Hashing (LSH) for MLlib

2015-04-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14393017#comment-14393017 ] Joseph K. Bradley commented on SPARK-5992: -- That sounds good; I'll try to take a

[jira] [Updated] (SPARK-5972) Cache residuals for GradientBoostedTrees during training

2015-04-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5972: - Assignee: Manoj Kumar > Cache residuals for GradientBoostedTrees during training > ---

[jira] [Commented] (SPARK-6407) Streaming ALS for Collaborative Filtering

2015-04-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14393021#comment-14393021 ] Joseph K. Bradley commented on SPARK-6407: -- I'm not too familiar with the area, b

[jira] [Assigned] (SPARK-6194) collect() in PySpark will cause memory leak in JVM

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6194: --- Assignee: Apache Spark (was: Davies Liu) > collect() in PySpark will cause memory leak in JV

[jira] [Assigned] (SPARK-6194) collect() in PySpark will cause memory leak in JVM

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6194: --- Assignee: Davies Liu (was: Apache Spark) > collect() in PySpark will cause memory leak in JV

[jira] [Issue Comment Deleted] (SPARK-6667) hang while collect in PySpark

2015-04-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-6667: -- Comment: was deleted (was: This patch introduced a rare bug that can cause a hang while calling {{colle

[jira] [Assigned] (SPARK-677) PySpark should not collect results through local filesystem

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-677: -- Assignee: Davies Liu (was: Apache Spark) > PySpark should not collect results through local fil

[jira] [Commented] (SPARK-6667) hang while collect in PySpark

2015-04-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14393023#comment-14393023 ] Josh Rosen commented on SPARK-6667: --- This patch introduced a rare bug that can cause a h

[jira] [Assigned] (SPARK-677) PySpark should not collect results through local filesystem

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-677: -- Assignee: Apache Spark (was: Davies Liu) > PySpark should not collect results through local fil

[jira] [Commented] (SPARK-6194) collect() in PySpark will cause memory leak in JVM

2015-04-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14393026#comment-14393026 ] Josh Rosen commented on SPARK-6194: --- This patch introduced a rare bug that can cause a h

[jira] [Resolved] (SPARK-6194) collect() in PySpark will cause memory leak in JVM

2015-04-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-6194. --- Resolution: Fixed Target Version/s: 1.3.0, 1.2.2 (was: 1.0.3, 1.1.2, 1.2.2, 1.3.0) Going to

[jira] [Created] (SPARK-6681) JAVA_HOME error with upgrade to Spark 1.3.0

2015-04-02 Thread Ken Williams (JIRA)
Ken Williams created SPARK-6681: --- Summary: JAVA_HOME error with upgrade to Spark 1.3.0 Key: SPARK-6681 URL: https://issues.apache.org/jira/browse/SPARK-6681 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-6667) hang while collect in PySpark

2015-04-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-6667: -- Priority: Blocker (was: Critical) Target Version/s: 1.2.2, 1.3.1, 1.4.0 (was: 1.3.1, 1.4.

[jira] [Created] (SPARK-6682) Deprecate static train and use builder instead for Scala/Java

2015-04-02 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6682: Summary: Deprecate static train and use builder instead for Scala/Java Key: SPARK-6682 URL: https://issues.apache.org/jira/browse/SPARK-6682 Project: Spark

[jira] [Created] (SPARK-6683) GLMs with GradientDescent could scale step size instead of features

2015-04-02 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6683: Summary: GLMs with GradientDescent could scale step size instead of features Key: SPARK-6683 URL: https://issues.apache.org/jira/browse/SPARK-6683 Project: Sp

[jira] [Commented] (SPARK-2883) Spark Support for ORCFile format

2015-04-02 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14393141#comment-14393141 ] Zhan Zhang commented on SPARK-2883: --- Following code demonstrate the usage of the orc sup

[jira] [Commented] (SPARK-3720) support ORC in spark sql

2015-04-02 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14393146#comment-14393146 ] Zhan Zhang commented on SPARK-3720: --- [~iward] I have update the patch with new api suppo

[jira] [Created] (SPARK-6684) Add checkpointing to GradientBoostedTrees

2015-04-02 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6684: Summary: Add checkpointing to GradientBoostedTrees Key: SPARK-6684 URL: https://issues.apache.org/jira/browse/SPARK-6684 Project: Spark Issue Type: I

[jira] [Assigned] (SPARK-6671) Add status command for spark daemons

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6671: --- Assignee: (was: Apache Spark) > Add status command for spark daemons > --

[jira] [Commented] (SPARK-6671) Add status command for spark daemons

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6671?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14393182#comment-14393182 ] Apache Spark commented on SPARK-6671: - User 'pchanumolu' has created a pull request fo

[jira] [Assigned] (SPARK-6671) Add status command for spark daemons

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6671: --- Assignee: Apache Spark > Add status command for spark daemons > -

[jira] [Resolved] (SPARK-6667) hang while collect in PySpark

2015-04-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-6667. --- Resolution: Fixed Fix Version/s: 1.4.0 1.3.1 1.2.2 > hang

[jira] [Assigned] (SPARK-4194) Exceptions thrown during SparkContext or SparkEnv construction might lead to resource leaks or corrupted global state

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4194: --- Assignee: Apache Spark > Exceptions thrown during SparkContext or SparkEnv construction might

[jira] [Commented] (SPARK-4194) Exceptions thrown during SparkContext or SparkEnv construction might lead to resource leaks or corrupted global state

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14393249#comment-14393249 ] Apache Spark commented on SPARK-4194: - User 'vanzin' has created a pull request for th

[jira] [Assigned] (SPARK-4194) Exceptions thrown during SparkContext or SparkEnv construction might lead to resource leaks or corrupted global state

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4194: --- Assignee: (was: Apache Spark) > Exceptions thrown during SparkContext or SparkEnv constru

[jira] [Comment Edited] (SPARK-2883) Spark Support for ORCFile format

2015-04-02 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14393141#comment-14393141 ] Zhan Zhang edited comment on SPARK-2883 at 4/2/15 7:54 PM: --- Foll

[jira] [Comment Edited] (SPARK-2883) Spark Support for ORCFile format

2015-04-02 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14393141#comment-14393141 ] Zhan Zhang edited comment on SPARK-2883 at 4/2/15 7:54 PM: --- Foll

[jira] [Updated] (SPARK-6479) Create off-heap block storage API (internal)

2015-04-02 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhan Zhang updated SPARK-6479: -- Attachment: SPARK-6479.pdf This is the updated version for offheap store internal api design. > Create

[jira] [Commented] (SPARK-6578) Outbound channel in network library is not thread-safe, can lead to fetch failures

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14393284#comment-14393284 ] Apache Spark commented on SPARK-6578: - User 'vanzin' has created a pull request for th

[jira] [Commented] (SPARK-6112) Provide OffHeap support through HDFS RAM_DISK

2015-04-02 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14393287#comment-14393287 ] Zhan Zhang commented on SPARK-6112: --- Design spec for API attached to SPARK-6479 and wait

[jira] [Updated] (SPARK-6079) Use index to speed up StatusTracker.getJobIdsForGroup()

2015-04-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-6079: -- Fix Version/s: 1.3.1 > Use index to speed up StatusTracker.getJobIdsForGroup() > ---

[jira] [Updated] (SPARK-6685) Use DSYRK to compute AtA in ALS

2015-04-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6685: - Description: Now we use DSPR to compute AtA in ALS, which is a Level 2 BLAS routine. We should swi

[jira] [Created] (SPARK-6685) Use DSYRK to compute AtA in ALS

2015-04-02 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-6685: Summary: Use DSYRK to compute AtA in ALS Key: SPARK-6685 URL: https://issues.apache.org/jira/browse/SPARK-6685 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-6671) Add status command for spark daemons

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6671: --- Assignee: Apache Spark > Add status command for spark daemons > -

[jira] [Assigned] (SPARK-6671) Add status command for spark daemons

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6671: --- Assignee: (was: Apache Spark) > Add status command for spark daemons > --

[jira] [Commented] (SPARK-6479) Create off-heap block storage API (internal)

2015-04-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14393331#comment-14393331 ] Reynold Xin commented on SPARK-6479: Looks good overall. I have some comments but thos

[jira] [Updated] (SPARK-6414) Spark driver failed with NPE on job cancelation

2015-04-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-6414: -- Assignee: Hung Lin > Spark driver failed with NPE on job cancelation > -

[jira] [Resolved] (SPARK-6414) Spark driver failed with NPE on job cancelation

2015-04-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-6414. --- Resolution: Fixed Fix Version/s: 1.4.0 1.3.1 1.2.2 Fixed

[jira] [Updated] (SPARK-4346) YarnClientSchedulerBack.asyncMonitorApplication should be common with Client.monitorApplication

2015-04-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4346: - Affects Version/s: 1.0.0 > YarnClientSchedulerBack.asyncMonitorApplication should be common with > Client

[jira] [Updated] (SPARK-6578) Outbound channel in network library is not thread-safe, can lead to fetch failures

2015-04-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6578: --- Fix Version/s: 1.2.2 > Outbound channel in network library is not thread-safe, can lead to fetch > fa

[jira] [Updated] (SPARK-6578) Outbound channel in network library is not thread-safe, can lead to fetch failures

2015-04-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6578: --- Target Version/s: 1.2.2, 1.3.1, 1.4.0 (was: 1.3.1, 1.4.0) > Outbound channel in network library is no

[jira] [Updated] (SPARK-6443) Could not submit app in standalone cluster mode when HA is enabled

2015-04-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6443: - Affects Version/s: 1.0.0 > Could not submit app in standalone cluster mode when HA is enabled > --

[jira] [Updated] (SPARK-6675) HiveContext setConf is not stable

2015-04-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6675: Labels: (was: patch) > HiveContext setConf is not stable > ---

[jira] [Updated] (SPARK-6675) HiveContext setConf is not stable

2015-04-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6675: Target Version/s: 1.4.0 (was: 1.3.0) > HiveContext setConf is not stable >

[jira] [Updated] (SPARK-6443) Support HA in standalone cluster modehen HA is enabled

2015-04-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6443: - Summary: Support HA in standalone cluster modehen HA is enabled (was: Could not submit app in standalone

[jira] [Updated] (SPARK-6443) Support HA in standalone cluster mode

2015-04-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6443: - Priority: Major (was: Critical) > Support HA in standalone cluster mode > ---

[jira] [Updated] (SPARK-6443) Support HA in standalone cluster mode

2015-04-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6443: - Summary: Support HA in standalone cluster mode (was: Support HA in standalone cluster modehen HA is enabl

[jira] [Assigned] (SPARK-6669) Lock metastore client in analyzeTable

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6669: --- Assignee: Apache Spark (was: Michael Armbrust) > Lock metastore client in analyzeTable > ---

[jira] [Commented] (SPARK-6669) Lock metastore client in analyzeTable

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14393582#comment-14393582 ] Apache Spark commented on SPARK-6669: - User 'yhuai' has created a pull request for thi

[jira] [Assigned] (SPARK-6669) Lock metastore client in analyzeTable

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6669: --- Assignee: Michael Armbrust (was: Apache Spark) > Lock metastore client in analyzeTable > ---

[jira] [Updated] (SPARK-6479) Create off-heap block storage API (internal)

2015-04-02 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhan Zhang updated SPARK-6479: -- Attachment: SPARK-6479OffheapAPIdesign.pdf Add failure case handling overall design and example. > Crea

[jira] [Updated] (SPARK-6443) Support HA in standalone cluster mode

2015-04-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6443: - Description: After digging some codes, I found user could not submit app in standalone cluster mode when

[jira] [Comment Edited] (SPARK-6479) Create off-heap block storage API (internal)

2015-04-02 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14393590#comment-14393590 ] Zhan Zhang edited comment on SPARK-6479 at 4/2/15 10:23 PM: [~

[jira] [Commented] (SPARK-6479) Create off-heap block storage API (internal)

2015-04-02 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14393590#comment-14393590 ] Zhan Zhang commented on SPARK-6479: --- [~rxin] Thanks for the feedback. I updated the docu

[jira] [Comment Edited] (SPARK-6479) Create off-heap block storage API (internal)

2015-04-02 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14393590#comment-14393590 ] Zhan Zhang edited comment on SPARK-6479 at 4/2/15 10:24 PM: [~

[jira] [Updated] (SPARK-6443) Support HA in standalone cluster mode

2015-04-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6443: - Description: == EDIT by Andrew == >From a quick survey in the code I can confirm that cli

[jira] [Updated] (SPARK-6443) Support HA in standalone cluster mode

2015-04-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6443: - Description: == EDIT by Andrew == >From a quick survey in the code I can confirm that cli

[jira] [Assigned] (SPARK-2669) Hadoop configuration is not localised when submitting job in yarn-cluster mode

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-2669: --- Assignee: (was: Apache Spark) > Hadoop configuration is not localised when submitting job

[jira] [Assigned] (SPARK-2669) Hadoop configuration is not localised when submitting job in yarn-cluster mode

2015-04-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-2669: --- Assignee: Apache Spark > Hadoop configuration is not localised when submitting job in yarn-cl

  1   2   3   >