[jira] [Assigned] (SPARK-7466) DAG visualization: orphaned nodes are not rendered correctly

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7466: --- Assignee: Andrew Or (was: Apache Spark) DAG visualization: orphaned nodes are not rendered

[jira] [Assigned] (SPARK-7466) DAG visualization: orphaned nodes are not rendered correctly

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7466: --- Assignee: Apache Spark (was: Andrew Or) DAG visualization: orphaned nodes are not rendered

[jira] [Commented] (SPARK-7466) DAG visualization: orphaned nodes are not rendered correctly

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534013#comment-14534013 ] Apache Spark commented on SPARK-7466: - User 'andrewor14' has created a pull request

[jira] [Commented] (SPARK-6770) DirectKafkaInputDStream has not been initialized when recovery from checkpoint

2015-05-08 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534078#comment-14534078 ] Tathagata Das commented on SPARK-6770: -- Was this problem solved? I think I discuss

[jira] [Created] (SPARK-7481) Add Hadoop 2.6+ profile to pull in object store FS accessors

2015-05-08 Thread Steve Loughran (JIRA)
Steve Loughran created SPARK-7481: - Summary: Add Hadoop 2.6+ profile to pull in object store FS accessors Key: SPARK-7481 URL: https://issues.apache.org/jira/browse/SPARK-7481 Project: Spark

[jira] [Updated] (SPARK-6091) Add MulticlassMetrics in PySpark/MLlib

2015-05-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6091: - Target Version/s: 1.4.0 Add MulticlassMetrics in PySpark/MLlib

[jira] [Updated] (SPARK-6091) Add MulticlassMetrics in PySpark/MLlib

2015-05-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6091: - Assignee: Yanbo Liang Add MulticlassMetrics in PySpark/MLlib

[jira] [Updated] (SPARK-6092) Add RankingMetrics in PySpark/MLlib

2015-05-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6092: - Assignee: Yanbo Liang Add RankingMetrics in PySpark/MLlib

[jira] [Updated] (SPARK-6092) Add RankingMetrics in PySpark/MLlib

2015-05-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6092: - Target Version/s: 1.4.0 Add RankingMetrics in PySpark/MLlib

[jira] [Updated] (SPARK-7478) Add a SQLContext.getOrCreate to maintain a singleton instance of SQLContext

2015-05-08 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-7478: - Description: Having a SQLContext singleton would make it easier for applications to use a lazily

[jira] [Assigned] (SPARK-7478) Add a SQLContext.getOrCreate to maintain a singleton instance of SQLContext

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7478: --- Assignee: Tathagata Das (was: Apache Spark) Add a SQLContext.getOrCreate to maintain a

[jira] [Resolved] (SPARK-6889) Streamline contribution process with update to Contribution wiki, JIRA rules

2015-05-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6889. -- Resolution: Fixed Fix Version/s: 1.4.0 Streamline contribution process with update to

[jira] [Commented] (SPARK-7478) Add a SQLContext.getOrCreate to maintain a singleton instance of SQLContext

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534099#comment-14534099 ] Apache Spark commented on SPARK-7478: - User 'tdas' has created a pull request for this

[jira] [Created] (SPARK-7479) SparkR can not work

2015-05-08 Thread Weizhong (JIRA)
Weizhong created SPARK-7479: --- Summary: SparkR can not work Key: SPARK-7479 URL: https://issues.apache.org/jira/browse/SPARK-7479 Project: Spark Issue Type: Bug Components: SparkR

[jira] [Resolved] (SPARK-7479) SparkR can not work

2015-05-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7479. -- Resolution: Invalid This kind of thing should begin as a question at user@ as I suspect it is a basic

[jira] [Commented] (SPARK-7481) Add Hadoop 2.6+ profile to pull in object store FS accessors

2015-05-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534168#comment-14534168 ] Sean Owen commented on SPARK-7481: -- Yikes, that seems like a load of stuff to pull in.

[jira] [Updated] (SPARK-7478) Add a SQLContext.getOrCreate to maintain a singleton instance of SQLContext

2015-05-08 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-7478: - Description: Having a SQLContext singleton would make it easier for applications to use a lazily

[jira] [Resolved] (SPARK-5034) Spark on Yarn launch failure on HDInsight on Windows

2015-05-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5034. -- Resolution: Cannot Reproduce I don't know what to make of this without more info. I don't think it is a

[jira] [Updated] (SPARK-7478) Add a SQLContext.getOrCreate to maintain a singleton instance of SQLContext

2015-05-08 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-7478: - Description: Having a SQLContext singleton would make it easier for applications to use a lazily

[jira] [Updated] (SPARK-7478) Add a SQLContext.getOrCreate to maintain a singleton instance of SQLContext

2015-05-08 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-7478: - Description: Having a SQLContext singleton would make it easier for applications to use a lazily

[jira] [Assigned] (SPARK-7478) Add a SQLContext.getOrCreate to maintain a singleton instance of SQLContext

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7478: --- Assignee: Apache Spark (was: Tathagata Das) Add a SQLContext.getOrCreate to maintain a

[jira] [Commented] (SPARK-6876) DataFrame.na.replace value support for Python

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534026#comment-14534026 ] Apache Spark commented on SPARK-6876: - User 'adrian-wang' has created a pull request

[jira] [Assigned] (SPARK-7467) DAG visualization: handle checkpoint correctly

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7467: --- Assignee: Apache Spark (was: Andrew Or) DAG visualization: handle checkpoint correctly

[jira] [Assigned] (SPARK-7467) DAG visualization: handle checkpoint correctly

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7467: --- Assignee: Andrew Or (was: Apache Spark) DAG visualization: handle checkpoint correctly

[jira] [Commented] (SPARK-7467) DAG visualization: handle checkpoint correctly

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534032#comment-14534032 ] Apache Spark commented on SPARK-7467: - User 'andrewor14' has created a pull request

[jira] [Comment Edited] (SPARK-7478) Add a SQLContext.getOrCreate to maintain a singleton instance of SQLContext

2015-05-08 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534094#comment-14534094 ] Tathagata Das edited comment on SPARK-7478 at 5/8/15 8:24 AM: --

[jira] [Comment Edited] (SPARK-7478) Add a SQLContext.getOrCreate to maintain a singleton instance of SQLContext

2015-05-08 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534094#comment-14534094 ] Tathagata Das edited comment on SPARK-7478 at 5/8/15 8:24 AM: --

[jira] [Commented] (SPARK-7478) Add a SQLContext.getOrCreate to maintain a singleton instance of SQLContext

2015-05-08 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534094#comment-14534094 ] Tathagata Das commented on SPARK-7478: -- [~rxin] Thoughts? Add a

[jira] [Assigned] (SPARK-6876) DataFrame.na.replace value support for Python

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6876: --- Assignee: Apache Spark DataFrame.na.replace value support for Python

[jira] [Assigned] (SPARK-6876) DataFrame.na.replace value support for Python

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6876: --- Assignee: (was: Apache Spark) DataFrame.na.replace value support for Python

[jira] [Assigned] (SPARK-7231) Make SparkR DataFrame API more dplyr friendly

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7231: --- Assignee: Apache Spark (was: Shivaram Venkataraman) Make SparkR DataFrame API more dplyr

[jira] [Commented] (SPARK-7231) Make SparkR DataFrame API more dplyr friendly

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534034#comment-14534034 ] Apache Spark commented on SPARK-7231: - User 'shivaram' has created a pull request for

[jira] [Resolved] (SPARK-1423) Add scripts for launching Spark on Windows Azure

2015-05-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1423. -- Resolution: Won't Fix Given the lack of activity and resolution of

[jira] [Resolved] (SPARK-7392) Kryo buffer size can not be larger than 2M

2015-05-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7392. -- Resolution: Fixed Fix Version/s: 1.4.0 Assignee: Zhang, Liye Resolved by

[jira] [Created] (SPARK-7478) Add a SQLContext.getOrCreate to maintain a singleton instance of SQLContext

2015-05-08 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-7478: Summary: Add a SQLContext.getOrCreate to maintain a singleton instance of SQLContext Key: SPARK-7478 URL: https://issues.apache.org/jira/browse/SPARK-7478 Project:

[jira] [Updated] (SPARK-7478) Add a SQLContext.getOrCreate to maintain a singleton instance of SQLContext

2015-05-08 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-7478: - Description: Having a SQLContext singleton would make it easier for applications to use a lazily

[jira] [Created] (SPARK-7480) Get exception when DataFrame saveAsTable and run sql on the same table at the same time

2015-05-08 Thread pin_zhang (JIRA)
pin_zhang created SPARK-7480: Summary: Get exception when DataFrame saveAsTable and run sql on the same table at the same time Key: SPARK-7480 URL: https://issues.apache.org/jira/browse/SPARK-7480

[jira] [Commented] (SPARK-7481) Add Hadoop 2.6+ profile to pull in object store FS accessors

2015-05-08 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534174#comment-14534174 ] Steve Loughran commented on SPARK-7481: --- This doesn't contain any endorsement of the

[jira] [Commented] (SPARK-6770) DirectKafkaInputDStream has not been initialized when recovery from checkpoint

2015-05-08 Thread yangping wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534209#comment-14534209 ] yangping wu commented on SPARK-6770: Hi [~tdas], I use the code you mentioned, It was

[jira] [Commented] (SPARK-7481) Add Hadoop 2.6+ profile to pull in object store FS accessors

2015-05-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534210#comment-14534210 ] Sean Owen commented on SPARK-7481: -- Maybe I'd be less frightened if I knew the size of

[jira] [Commented] (SPARK-6770) DirectKafkaInputDStream has not been initialized when recovery from checkpoint

2015-05-08 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534240#comment-14534240 ] Tathagata Das commented on SPARK-6770: -- Awesome! I am closing this JIRA then!

[jira] [Commented] (SPARK-7459) Add Java example for ElementwiseProduct in programming guide

2015-05-08 Thread Octavian Geagla (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534276#comment-14534276 ] Octavian Geagla commented on SPARK-7459: Can do! Add Java example for

[jira] [Closed] (SPARK-6770) DirectKafkaInputDStream has not been initialized when recovery from checkpoint

2015-05-08 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das closed SPARK-6770. Resolution: Not A Problem DirectKafkaInputDStream has not been initialized when recovery from

[jira] [Assigned] (SPARK-7482) Rename some DataFrame API methods in SparkR to match their counterparts in Scala

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7482: --- Assignee: (was: Apache Spark) Rename some DataFrame API methods in SparkR to match

[jira] [Commented] (SPARK-7482) Rename some DataFrame API methods in SparkR to match their counterparts in Scala

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7482?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534255#comment-14534255 ] Apache Spark commented on SPARK-7482: - User 'sun-rui' has created a pull request for

[jira] [Assigned] (SPARK-7482) Rename some DataFrame API methods in SparkR to match their counterparts in Scala

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7482: --- Assignee: Apache Spark Rename some DataFrame API methods in SparkR to match their

[jira] [Comment Edited] (SPARK-7459) Add Java example for ElementwiseProduct in programming guide

2015-05-08 Thread Octavian Geagla (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534276#comment-14534276 ] Octavian Geagla edited comment on SPARK-7459 at 5/8/15 10:26 AM:

[jira] [Commented] (SPARK-7459) Add Java example for ElementwiseProduct in programming guide

2015-05-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534303#comment-14534303 ] Sean Owen commented on SPARK-7459: -- You don't need to be assigned, just go ahead. Add

[jira] [Commented] (SPARK-6154) Support Kafka, JDBC in Scala 2.11

2015-05-08 Thread Jianshi Huang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534344#comment-14534344 ] Jianshi Huang commented on SPARK-6154: -- Do you mean we need to upgrade the jline

[jira] [Created] (SPARK-7482) Rename some DataFrame API methods in SparkR to match their counterparts in Scala

2015-05-08 Thread Sun Rui (JIRA)
Sun Rui created SPARK-7482: -- Summary: Rename some DataFrame API methods in SparkR to match their counterparts in Scala Key: SPARK-7482 URL: https://issues.apache.org/jira/browse/SPARK-7482 Project: Spark

[jira] [Updated] (SPARK-7459) Add Java example for ElementwiseProduct in programming guide

2015-05-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7459: - Assignee: Octavian Geagla Add Java example for ElementwiseProduct in programming guide

[jira] [Updated] (SPARK-6869) Add pyspark archives path to PYTHONPATH

2015-05-08 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves updated SPARK-6869: - Priority: Blocker (was: Minor) Add pyspark archives path to PYTHONPATH

[jira] [Updated] (SPARK-7449) createPhysicalRDD should use RDD output as schema instead of relation.schema

2015-05-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7449: - Component/s: SQL createPhysicalRDD should use RDD output as schema instead of relation.schema

[jira] [Updated] (SPARK-7483) [MLLib] Using Kryo with FPGrowth fails with an exception

2015-05-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7483: - Component/s: MLlib Priority: Minor (was: Major) [MLLib] Using Kryo with FPGrowth fails with an

[jira] [Updated] (SPARK-7484) Support passing jdbc connection properties for dataframe.createJDBCTable and insertIntoJDBC

2015-05-08 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7484: --- Issue Type: Improvement (was: Bug) Support passing jdbc connection properties for

[jira] [Updated] (SPARK-7435) Make DataFrame.show() consistent with that of Scala and pySpark

2015-05-08 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7435: --- Priority: Critical (was: Blocker) Make DataFrame.show() consistent with that of Scala and

[jira] [Created] (SPARK-7486) Add the streaming implementation for estimating quantiles and median

2015-05-08 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-7486: -- Summary: Add the streaming implementation for estimating quantiles and median Key: SPARK-7486 URL: https://issues.apache.org/jira/browse/SPARK-7486 Project:

[jira] [Commented] (SPARK-7110) when use saveAsNewAPIHadoopFile, sometimes it throws Delegation Token can be issued only with kerberos or web authentication

2015-05-08 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534494#comment-14534494 ] Thomas Graves commented on SPARK-7110: -- [~gu chi] is there some of the stack trace

[jira] [Resolved] (SPARK-7393) How to improve Spark SQL performance?

2015-05-08 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-7393. Resolution: Invalid Hi - thanks for giving feedback on your use of Spark SQL. This type of

[jira] [Created] (SPARK-7485) Remove python artifacts from the assembly jar

2015-05-08 Thread Thomas Graves (JIRA)
Thomas Graves created SPARK-7485: Summary: Remove python artifacts from the assembly jar Key: SPARK-7485 URL: https://issues.apache.org/jira/browse/SPARK-7485 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-3928) Support wildcard matches on Parquet files

2015-05-08 Thread Thu Kyaw (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534582#comment-14534582 ] Thu Kyaw commented on SPARK-3928: - Hello [~lian cheng] please let me know if you want me

[jira] [Commented] (SPARK-5928) Remote Shuffle Blocks cannot be more than 2 GB

2015-05-08 Thread Rangarajan Sreenivasan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534629#comment-14534629 ] Rangarajan Sreenivasan commented on SPARK-5928: --- We are hitting a very

[jira] [Resolved] (SPARK-1920) Spark JAR compiled with Java 7 leads to PySpark not working in YARN

2015-05-08 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-1920. -- Resolution: Duplicate Spark JAR compiled with Java 7 leads to PySpark not working in YARN

[jira] [Updated] (SPARK-6869) Add pyspark archives path to PYTHONPATH

2015-05-08 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves updated SPARK-6869: - Assignee: Lianhui Wang Add pyspark archives path to PYTHONPATH

[jira] [Resolved] (SPARK-6869) Add pyspark archives path to PYTHONPATH

2015-05-08 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-6869. -- Resolution: Fixed Fix Version/s: 1.4.0 Add pyspark archives path to PYTHONPATH

[jira] [Updated] (SPARK-6869) Add pyspark archives path to PYTHONPATH

2015-05-08 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves updated SPARK-6869: - Target Version/s: 1.4.0 Add pyspark archives path to PYTHONPATH

[jira] [Updated] (SPARK-6961) Cannot save data to parquet files when executing from Windows from a Maven Project

2015-05-08 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6961: --- Priority: Critical (was: Blocker) Cannot save data to parquet files when executing from

[jira] [Updated] (SPARK-6869) Add pyspark archives path to PYTHONPATH

2015-05-08 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves updated SPARK-6869: - Issue Type: Bug (was: Improvement) Add pyspark archives path to PYTHONPATH

[jira] [Commented] (SPARK-7481) Add Hadoop 2.6+ profile to pull in object store FS accessors

2015-05-08 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534622#comment-14534622 ] Steve Loughran commented on SPARK-7481: --- hadoop openstack 100K +httpclient (400K)

[jira] [Updated] (SPARK-7381) Missing Python API for o.a.s.ml

2015-05-08 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Burak Yavuz updated SPARK-7381: --- Summary: Missing Python API for o.a.s.ml (was: Python API for Transformers) Missing Python API for

[jira] [Assigned] (SPARK-7447) Large Job submission lag when using Parquet w/ Schema Merging

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7447: --- Assignee: (was: Apache Spark) Large Job submission lag when using Parquet w/ Schema

[jira] [Assigned] (SPARK-7447) Large Job submission lag when using Parquet w/ Schema Merging

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7447: --- Assignee: Apache Spark Large Job submission lag when using Parquet w/ Schema Merging

[jira] [Commented] (SPARK-7447) Large Job submission lag when using Parquet w/ Schema Merging

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535018#comment-14535018 ] Apache Spark commented on SPARK-7447: - User 'viirya' has created a pull request for

[jira] [Commented] (SPARK-7477) TachyonBlockManager Store Block in TRY_CACHE mode which gives BlockNotFoundException when blocks are evicted from cache

2015-05-08 Thread Dibyendu Bhattacharya (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7477?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534805#comment-14534805 ] Dibyendu Bhattacharya commented on SPARK-7477: -- I tried Hierarchical Storage

[jira] [Resolved] (SPARK-3454) Expose JSON representation of data shown in WebUI

2015-05-08 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3454. Resolution: Fixed Expose JSON representation of data shown in WebUI

[jira] [Created] (SPARK-7488) Python API for ml.recommendation

2015-05-08 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-7488: -- Summary: Python API for ml.recommendation Key: SPARK-7488 URL: https://issues.apache.org/jira/browse/SPARK-7488 Project: Spark Issue Type: Sub-task

[jira] [Assigned] (SPARK-7489) Spark shell crashes when compiled with scala 2.11 and SPARK_PREPEND_CLASSES=true

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7489: --- Assignee: (was: Apache Spark) Spark shell crashes when compiled with scala 2.11 and

[jira] [Assigned] (SPARK-7489) Spark shell crashes when compiled with scala 2.11 and SPARK_PREPEND_CLASSES=true

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7489: --- Assignee: Apache Spark Spark shell crashes when compiled with scala 2.11 and

[jira] [Commented] (SPARK-7489) Spark shell crashes when compiled with scala 2.11 and SPARK_PREPEND_CLASSES=true

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535053#comment-14535053 ] Apache Spark commented on SPARK-7489: - User 'vinodkc' has created a pull request for

[jira] [Created] (SPARK-7489) Spark shell crashes when compiled with scala 2.11 and SPARK_PREPEND_CLASSES=true

2015-05-08 Thread Vinod KC (JIRA)
Vinod KC created SPARK-7489: --- Summary: Spark shell crashes when compiled with scala 2.11 and SPARK_PREPEND_CLASSES=true Key: SPARK-7489 URL: https://issues.apache.org/jira/browse/SPARK-7489 Project: Spark

[jira] [Assigned] (SPARK-6091) Add MulticlassMetrics in PySpark/MLlib

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6091: --- Assignee: Apache Spark (was: Yanbo Liang) Add MulticlassMetrics in PySpark/MLlib

[jira] [Created] (SPARK-7487) Python API for ml.regression

2015-05-08 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-7487: -- Summary: Python API for ml.regression Key: SPARK-7487 URL: https://issues.apache.org/jira/browse/SPARK-7487 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-7448) Implement custom byte array serializer for use in PySpark shuffle

2015-05-08 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535316#comment-14535316 ] Josh Rosen commented on SPARK-7448: --- This is a change that would be nice to performance

[jira] [Created] (SPARK-7490) MapOutputTracker: close input streams to free native memory

2015-05-08 Thread Evan Jones (JIRA)
Evan Jones created SPARK-7490: - Summary: MapOutputTracker: close input streams to free native memory Key: SPARK-7490 URL: https://issues.apache.org/jira/browse/SPARK-7490 Project: Spark Issue

[jira] [Commented] (SPARK-7487) Python API for ml.regression

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535324#comment-14535324 ] Apache Spark commented on SPARK-7487: - User 'brkyvz' has created a pull request for

[jira] [Assigned] (SPARK-7487) Python API for ml.regression

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7487: --- Assignee: (was: Apache Spark) Python API for ml.regression

[jira] [Assigned] (SPARK-7487) Python API for ml.regression

2015-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7487: --- Assignee: Apache Spark Python API for ml.regression

[jira] [Created] (SPARK-7491) Handle drivers for Metastore JDBC

2015-05-08 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-7491: --- Summary: Handle drivers for Metastore JDBC Key: SPARK-7491 URL: https://issues.apache.org/jira/browse/SPARK-7491 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-7410) Add option to avoid broadcasting configuration with newAPIHadoopFile

2015-05-08 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14535386#comment-14535386 ] Josh Rosen commented on SPARK-7410: --- We should confirm this, but if I recall the reason

[jira] [Resolved] (SPARK-7383) Python API for ml.feature

2015-05-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-7383. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5991

[jira] [Updated] (SPARK-7436) Cannot implement nor use custom StandaloneRecoveryModeFactory implementations

2015-05-08 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-7436: -- Assignee: Jacek Lewandowski Cannot implement nor use custom StandaloneRecoveryModeFactory

[jira] [Created] (SPARK-7493) ALTER TABLE statement

2015-05-08 Thread Sergey Semichev (JIRA)
Sergey Semichev created SPARK-7493: -- Summary: ALTER TABLE statement Key: SPARK-7493 URL: https://issues.apache.org/jira/browse/SPARK-7493 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-6824) Fill the docs for DataFrame API in SparkR

2015-05-08 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-6824. -- Resolution: Fixed Fix Version/s: 1.4.0 1.5.0 Issue

[jira] [Resolved] (SPARK-7298) Harmonize style of new UI visualizations

2015-05-08 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-7298. -- Resolution: Fixed Fix Version/s: 1.4.0 Harmonize style of new UI visualizations

[jira] [Resolved] (SPARK-7133) Implement struct, array, and map field accessor using apply in Scala and __getitem__ in Python

2015-05-08 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-7133. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5744

[jira] [Updated] (SPARK-7448) Implement custom bye array serializer for use in PySpark shuffle

2015-05-08 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-7448: -- Priority: Minor (was: Major) Implement custom bye array serializer for use in PySpark shuffle

[jira] [Comment Edited] (SPARK-5928) Remote Shuffle Blocks cannot be more than 2 GB

2015-05-08 Thread Rangarajan Sreenivasan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14534629#comment-14534629 ] Rangarajan Sreenivasan edited comment on SPARK-5928 at 5/8/15 5:51 PM:

[jira] [Resolved] (SPARK-7474) ParamGridBuilder's doctest doesn't show up correctly in the generated doc

2015-05-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-7474. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6001

[jira] [Resolved] (SPARK-7436) Cannot implement nor use custom StandaloneRecoveryModeFactory implementations

2015-05-08 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-7436. --- Resolution: Fixed Fix Version/s: 1.4.0 1.3.2 Issue resolved by pull request

[jira] [Commented] (SPARK-7447) Large Job submission lag when using Parquet w/ Schema Merging

2015-05-08 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14535208#comment-14535208 ] Brad Willard commented on SPARK-7447: - Thanks, you are a hero. Large Job submission
