[jira] [Updated] (SPARK-5614) Predicate pushdown through Generate

2015-02-04 Thread Lu Yan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lu Yan updated SPARK-5614: -- Description: Now in Catalyst's rules, predicates can not be pushed through "Generate" nodes. Further more, part

[jira] [Created] (SPARK-5614) Predicate pushdown through Generate

2015-02-04 Thread Lu Yan (JIRA)
Lu Yan created SPARK-5614: - Summary: Predicate pushdown through Generate Key: SPARK-5614 URL: https://issues.apache.org/jira/browse/SPARK-5614 Project: Spark Issue Type: Improvement Compone

[jira] [Resolved] (SPARK-5612) Move DataFrame implicit functions into SQLContext.implicits

2015-02-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5612. Resolution: Fixed Fix Version/s: 1.3.0 > Move DataFrame implicit functions into SQLContext.im

[jira] [Updated] (SPARK-5264) Support `drop temporary table [if exists]` DDL command

2015-02-04 Thread shengli (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shengli updated SPARK-5264: --- Issue Type: New Feature (was: Bug) > Support `drop temporary table [if exists]` DDL command > --

[jira] [Resolved] (SPARK-5606) Support plus sign in HiveContext

2015-02-04 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5606. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4378 [https:/

[jira] [Issue Comment Deleted] (SPARK-3365) Failure to save Lists to Parquet

2015-02-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-3365: - Comment: was deleted (was: A fix for this would be helpful for ML model import/export (for

[jira] [Updated] (SPARK-3365) Failure to save Lists to Parquet

2015-02-04 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3365: Target Version/s: 1.2.0, 1.3.0 (was: 1.2.0) > Failure to save Lists to Parquet > --

[jira] [Updated] (SPARK-3365) Failure to save Lists to Parquet

2015-02-04 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3365: Assignee: Cheng Lian > Failure to save Lists to Parquet > >

[jira] [Updated] (SPARK-3365) Failure to save Lists to Parquet

2015-02-04 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3365: Priority: Blocker (was: Major) > Failure to save Lists to Parquet > ---

[jira] [Commented] (SPARK-3365) Failure to save Lists to Parquet

2015-02-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306774#comment-14306774 ] Joseph K. Bradley commented on SPARK-3365: -- I found this as follows: {code} impor

[jira] [Commented] (SPARK-3365) Failure to save Lists to Parquet

2015-02-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306773#comment-14306773 ] Joseph K. Bradley commented on SPARK-3365: -- A fix for this would be helpful for M

[jira] [Resolved] (SPARK-5599) Audit MLlib public APIs for 1.3

2015-02-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5599. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4377 [https://githu

[jira] [Updated] (SPARK-5604) Remove setCheckpointDir from LDA and tree Strategy

2015-02-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5604: - Assignee: Xiangrui Meng > Remove setCheckpointDir from LDA and tree Strategy > ---

[jira] [Resolved] (SPARK-5596) Model import/export for GLMs and Naive Bayes

2015-02-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5596. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4233 [https://githu

[jira] [Resolved] (SPARK-5607) NullPointerException in objenesis

2015-02-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-5607. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Patrick Wendell

[jira] [Commented] (SPARK-5604) Remove setCheckpointDir from LDA and tree Strategy

2015-02-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306737#comment-14306737 ] Xiangrui Meng commented on SPARK-5604: -- Yes. I talked to [~tdas] about this and he al

[jira] [Updated] (SPARK-5604) Remove setCheckpointDir from LDA and tree Strategy

2015-02-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5604: - Summary: Remove setCheckpointDir from LDA and tree Strategy (was: Remove setCheckpointDir from LD

[jira] [Updated] (SPARK-5586) Automatically provide sqlContext in Spark shell

2015-02-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5586: --- Assignee: shengli > Automatically provide sqlContext in Spark shell >

[jira] [Commented] (SPARK-5557) spark-shell failed to start

2015-02-04 Thread Nathan McCarthy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306714#comment-14306714 ] Nathan McCarthy commented on SPARK-5557: Any ideas on what needs to be done to fix

[jira] [Updated] (SPARK-5586) Automatically provide sqlContext in Spark shell

2015-02-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5586: --- Priority: Blocker (was: Critical) > Automatically provide sqlContext in Spark shell > ---

[jira] [Updated] (SPARK-5586) Automatically provide sqlContext in Spark shell

2015-02-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5586: --- Assignee: (was: Patrick Wendell) > Automatically provide sqlContext in Spark shell > -

[jira] [Updated] (SPARK-5613) YarnClientSchedulerBackend fails to get application report when yarn restarts

2015-02-04 Thread Kashish Jain (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kashish Jain updated SPARK-5613: Description: Steps to Reproduce 1) Run any spark job 2) Stop yarn while the spark job is running (an

[jira] [Updated] (SPARK-5613) YarnClientSchedulerBackend fails to get application report when yarn restarts

2015-02-04 Thread Kashish Jain (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kashish Jain updated SPARK-5613: Description: Steps to Reproduce 1) Run any spark job 2) Stop yarn while the spark job is running (an

[jira] [Created] (SPARK-5613) YarnClientSchedulerBackend fails to get application report when yarn restarts

2015-02-04 Thread Kashish Jain (JIRA)
Kashish Jain created SPARK-5613: --- Summary: YarnClientSchedulerBackend fails to get application report when yarn restarts Key: SPARK-5613 URL: https://issues.apache.org/jira/browse/SPARK-5613 Project: Sp

[jira] [Commented] (SPARK-5021) GaussianMixtureEM should be faster for SparseVector input

2015-02-04 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306691#comment-14306691 ] Manoj Kumar commented on SPARK-5021: [~tgaloppo] Is there are any way we could have a

[jira] [Commented] (SPARK-5016) GaussianMixtureEM should distribute matrix inverse for large numFeatures, k

2015-02-04 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306688#comment-14306688 ] Manoj Kumar commented on SPARK-5016: Hi, I would like to fix this (since I'm familiar

[jira] [Commented] (SPARK-5586) Automatically provide sqlContext in Spark shell

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5586?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306680#comment-14306680 ] Apache Spark commented on SPARK-5586: - User 'OopsOutOfMemory' has created a pull reque

[jira] [Resolved] (SPARK-5609) PythonMLlibAPI trainGaussianMixture seed should use Java type

2015-02-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-5609. -- Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Xiangrui Meng Fixed by

[jira] [Commented] (SPARK-5612) Move DataFrame implicit functions into SQLContext.implicits

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5612?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306676#comment-14306676 ] Apache Spark commented on SPARK-5612: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-5609) PythonMLlibAPI trainGaussianMixture seed should use Java type

2015-02-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306668#comment-14306668 ] Joseph K. Bradley commented on SPARK-5609: -- Oh, thanks! I must have an outdated

[jira] [Created] (SPARK-5612) Move DataFrame implicit functions into SQLContext.implicits

2015-02-04 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-5612: -- Summary: Move DataFrame implicit functions into SQLContext.implicits Key: SPARK-5612 URL: https://issues.apache.org/jira/browse/SPARK-5612 Project: Spark Issue

[jira] [Commented] (SPARK-5611) Allow spark-ec2 repo to be specified in CLI of spark_ec2.py

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306658#comment-14306658 ] Apache Spark commented on SPARK-5611: - User 'florianverhein' has created a pull reques

[jira] [Created] (SPARK-5611) Allow spark-ec2 repo to be specified in CLI of spark_ec2.py

2015-02-04 Thread Florian Verhein (JIRA)
Florian Verhein created SPARK-5611: -- Summary: Allow spark-ec2 repo to be specified in CLI of spark_ec2.py Key: SPARK-5611 URL: https://issues.apache.org/jira/browse/SPARK-5611 Project: Spark

[jira] [Commented] (SPARK-5609) PythonMLlibAPI trainGaussianMixture seed should use Java type

2015-02-04 Thread Meethu Mathew (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306618#comment-14306618 ] Meethu Mathew commented on SPARK-5609: -- I think this is solved with this https://git

[jira] [Commented] (SPARK-4964) Exactly-once + WAL-free Kafka Support in Spark Streaming

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306596#comment-14306596 ] Apache Spark commented on SPARK-4964: - User 'tdas' has created a pull request for this

[jira] [Comment Edited] (SPARK-5021) GaussianMixtureEM should be faster for SparseVector input

2015-02-04 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306549#comment-14306549 ] Manoj Kumar edited comment on SPARK-5021 at 2/5/15 4:05 AM: Ah

[jira] [Commented] (SPARK-5604) Remove setCheckpointDir from LDA

2015-02-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306594#comment-14306594 ] Joseph K. Bradley commented on SPARK-5604: -- If we're doing this, then we should r

[jira] [Comment Edited] (SPARK-5609) PythonMLlibAPI trainGaussianMixture seed should use Java type

2015-02-04 Thread Meethu Mathew (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306593#comment-14306593 ] Meethu Mathew edited comment on SPARK-5609 at 2/5/15 4:03 AM: --

[jira] [Commented] (SPARK-5609) PythonMLlibAPI trainGaussianMixture seed should use Java type

2015-02-04 Thread Meethu Mathew (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306593#comment-14306593 ] Meethu Mathew commented on SPARK-5609: -- Please assign the ticket to me. > PythonMLl

[jira] [Resolved] (SPARK-5602) Better support for creating DataFrame from local data collection

2015-02-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5602. Resolution: Fixed Fix Version/s: 1.3.0 > Better support for creating DataFrame from local dat

[jira] [Closed] (SPARK-5308) MD5 / SHA1 hash format doesn't match standard Maven output

2015-02-04 Thread Kuldeep (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kuldeep closed SPARK-5308. -- > MD5 / SHA1 hash format doesn't match standard Maven output > -

[jira] [Closed] (SPARK-5426) SQL Java API helper methods

2015-02-04 Thread Kuldeep (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kuldeep closed SPARK-5426. -- > SQL Java API helper methods > --- > > Key: SPARK-5426 > UR

[jira] [Commented] (SPARK-5538) CachedTableSuite failure due to unpersisting RDDs in a non-blocking way

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306589#comment-14306589 ] Apache Spark commented on SPARK-5538: - User 'rxin' has created a pull request for this

[jira] [Resolved] (SPARK-5538) CachedTableSuite failure due to unpersisting RDDs in a non-blocking way

2015-02-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5538. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Reynold Xin > CachedTableSuite fail

[jira] [Updated] (SPARK-5538) CachedTableSuite failure due to unpersisting RDDs in a non-blocking way

2015-02-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5538: --- Priority: Critical (was: Minor) > CachedTableSuite failure due to unpersisting RDDs in a non-blocking

[jira] [Commented] (SPARK-5607) NullPointerException in objenesis

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306572#comment-14306572 ] Apache Spark commented on SPARK-5607: - User 'pwendell' has created a pull request for

[jira] [Created] (SPARK-5610) Generate Java docs without package private classes and methods

2015-02-04 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5610: Summary: Generate Java docs without package private classes and methods Key: SPARK-5610 URL: https://issues.apache.org/jira/browse/SPARK-5610 Project: Spark

[jira] [Commented] (SPARK-5609) PythonMLlibAPI trainGaussianMixture seed should use Java type

2015-02-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306561#comment-14306561 ] Joseph K. Bradley commented on SPARK-5609: -- Ping [~MeethuMathew] [~mengxr] > Pyt

[jira] [Commented] (SPARK-2087) Clean Multi-user semantics for thrift JDBC/ODBC server.

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306560#comment-14306560 ] Apache Spark commented on SPARK-2087: - User 'chenghao-intel' has created a pull reques

[jira] [Created] (SPARK-5609) PythonMLlibAPI trainGaussianMixture seed should use Java type

2015-02-04 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-5609: Summary: PythonMLlibAPI trainGaussianMixture seed should use Java type Key: SPARK-5609 URL: https://issues.apache.org/jira/browse/SPARK-5609 Project: Spark

[jira] [Updated] (SPARK-5388) Provide a stable application submission gateway in standalone cluster mode

2015-02-04 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5388: - Description: The existing submission gateway in standalone mode is not compatible across Spark versions.

[jira] [Updated] (SPARK-5388) Provide a stable application submission gateway in standalone cluster mode

2015-02-04 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5388: - Description: The existing submission gateway in standalone mode is not compatible across Spark versions. I

[jira] [Commented] (SPARK-5388) Provide a stable application submission gateway in standalone cluster mode

2015-02-04 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306552#comment-14306552 ] Andrew Or commented on SPARK-5388: -- Hi [~vanzin], thank you for all of your comments. I a

[jira] [Commented] (SPARK-5021) GaussianMixtureEM should be faster for SparseVector input

2015-02-04 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306549#comment-14306549 ] Manoj Kumar commented on SPARK-5021: Ah, I see what you mean (Google helped me), I nev

[jira] [Updated] (SPARK-3185) SPARK launch on Hadoop 2 in EC2 throws Tachyon exception when Formatting JOURNAL_FOLDER

2015-02-04 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-3185: Component/s: EC2 > SPARK launch on Hadoop 2 in EC2 throws Tachyon exception when Formatting

[jira] [Commented] (SPARK-5608) Improve SEO of Spark documentation site to let Google find latest docs

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306544#comment-14306544 ] Apache Spark commented on SPARK-5608: - User 'mateiz' has created a pull request for th

[jira] [Commented] (SPARK-4131) Support "Writing data into the filesystem from queries"

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306541#comment-14306541 ] Apache Spark commented on SPARK-4131: - User 'scwf' has created a pull request for this

[jira] [Commented] (SPARK-5021) GaussianMixtureEM should be faster for SparseVector input

2015-02-04 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306538#comment-14306538 ] Manoj Kumar commented on SPARK-5021: Can you please explain, what do you mean by "soft

[jira] [Created] (SPARK-5608) Improve SEO of Spark documentation site to let Google find latest docs

2015-02-04 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-5608: Summary: Improve SEO of Spark documentation site to let Google find latest docs Key: SPARK-5608 URL: https://issues.apache.org/jira/browse/SPARK-5608 Project: Spark

[jira] [Updated] (SPARK-5607) NullPointerException in objenesis

2015-02-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5607: --- Description: Tests are sometimes failing with the following exception. The problem might be that Kryo

[jira] [Commented] (SPARK-5552) Automated data science AMI creation and data science cluster deployment on EC2

2015-02-04 Thread Florian Verhein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306530#comment-14306530 ] Florian Verhein commented on SPARK-5552: Just updated the two READMEs to match the

[jira] [Created] (SPARK-5607) NullPointerException in objenesis

2015-02-04 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-5607: -- Summary: NullPointerException in objenesis Key: SPARK-5607 URL: https://issues.apache.org/jira/browse/SPARK-5607 Project: Spark Issue Type: Bug Repor

[jira] [Resolved] (SPARK-5605) Allow using String to specify colum name in DSL aggregate functions

2015-02-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5605. Resolution: Fixed Fix Version/s: 1.3.0 > Allow using String to specify colum name in DSL aggr

[jira] [Updated] (SPARK-5388) Provide a stable application submission gateway in standalone cluster mode

2015-02-04 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5388: - Attachment: (was: Stable Spark Standalone Submission.pdf) > Provide a stable application submission ga

[jira] [Updated] (SPARK-5388) Provide a stable application submission gateway in standalone cluster mode

2015-02-04 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5388: - Attachment: stable-spark-submit-in-standalone-mode-2-4-15.pdf > Provide a stable application submission ga

[jira] [Commented] (SPARK-5606) Support plus sign in HiveContext

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306418#comment-14306418 ] Apache Spark commented on SPARK-5606: - User 'watermen' has created a pull request for

[jira] [Updated] (SPARK-5606) Support plus sign in HiveContext

2015-02-04 Thread Yadong Qi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yadong Qi updated SPARK-5606: - Description: Now spark version is only support ```SELECT -key FROM DECIMAL_UDF;``` in HiveContext. This p

[jira] [Created] (SPARK-5606) Support plus sign in HiveContext

2015-02-04 Thread Yadong Qi (JIRA)
Yadong Qi created SPARK-5606: Summary: Support plus sign in HiveContext Key: SPARK-5606 URL: https://issues.apache.org/jira/browse/SPARK-5606 Project: Spark Issue Type: Bug Components:

[jira] [Commented] (SPARK-5529) Executor is still hold while BlockManager has been removed

2015-02-04 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306378#comment-14306378 ] Lianhui Wang commented on SPARK-5529: - OK, [~sandyr] thanks. > Executor is still hold

[jira] [Resolved] (SPARK-5411) Allow SparkListeners to be specified in SparkConf and loaded when creating SparkContext

2015-02-04 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-5411. Resolution: Fixed Fix Version/s: 1.3.0 > Allow SparkListeners to be specified in Spar

[jira] [Commented] (SPARK-5605) Allow using String to specify colum name in DSL aggregate functions

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306326#comment-14306326 ] Apache Spark commented on SPARK-5605: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-5599) Audit MLlib public APIs for 1.3

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306325#comment-14306325 ] Apache Spark commented on SPARK-5599: - User 'mengxr' has created a pull request for th

[jira] [Commented] (SPARK-4550) In sort-based shuffle, store map outputs in serialized form

2015-02-04 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306321#comment-14306321 ] Sandy Ryza commented on SPARK-4550: --- I also just tried this out using an object that's n

[jira] [Created] (SPARK-5605) Allow passing in String's directly into DSL aggregate functions

2015-02-04 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-5605: -- Summary: Allow passing in String's directly into DSL aggregate functions Key: SPARK-5605 URL: https://issues.apache.org/jira/browse/SPARK-5605 Project: Spark Is

[jira] [Updated] (SPARK-5605) Allow using String to specify colum name in DSL aggregate functions

2015-02-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5605: --- Summary: Allow using String to specify colum name in DSL aggregate functions (was: Allow passing in S

[jira] [Comment Edited] (SPARK-4550) In sort-based shuffle, store map outputs in serialized form

2015-02-04 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14304864#comment-14304864 ] Sandy Ryza edited comment on SPARK-4550 at 2/5/15 12:36 AM: I

[jira] [Resolved] (SPARK-5577) Create a convenient way for Python users to register SQL UDFs

2015-02-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5577. Resolution: Fixed Fix Version/s: 1.3.0 > Create a convenient way for Python users to register

[jira] [Commented] (SPARK-5595) In memory data cache should be invalidated after insert into/overwrite

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306204#comment-14306204 ] Apache Spark commented on SPARK-5595: - User 'yhuai' has created a pull request for thi

[jira] [Resolved] (SPARK-5118) "Create table test stored as parquet as select ..." report error

2015-02-04 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5118. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3921 [https:/

[jira] [Commented] (SPARK-5603) Preinsert casting and renaming rule is needed in the Analyzer

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5603?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306205#comment-14306205 ] Apache Spark commented on SPARK-5603: - User 'yhuai' has created a pull request for thi

[jira] [Updated] (SPARK-4587) Model export/import

2015-02-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-4587: - Description: This is an umbrella JIRA for one of the most requested features on the user

[jira] [Resolved] (SPARK-5426) SQL Java API helper methods

2015-02-04 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5426. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4243 [https:/

[jira] [Created] (SPARK-5604) Remove setCheckpointDir from LDA

2015-02-04 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5604: Summary: Remove setCheckpointDir from LDA Key: SPARK-5604 URL: https://issues.apache.org/jira/browse/SPARK-5604 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-5587) Support change database owner

2015-02-04 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5587. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4357 [https:/

[jira] [Created] (SPARK-5603) Preinsert casting and renaming rule is needed in the Analyzer

2015-02-04 Thread Yin Huai (JIRA)
Yin Huai created SPARK-5603: --- Summary: Preinsert casting and renaming rule is needed in the Analyzer Key: SPARK-5603 URL: https://issues.apache.org/jira/browse/SPARK-5603 Project: Spark Issue Type

[jira] [Commented] (SPARK-4587) Model export/import

2015-02-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306147#comment-14306147 ] Joseph K. Bradley commented on SPARK-4587: -- It sounds like we're converging! I'l

[jira] [Updated] (SPARK-5602) Better support for creating DataFrame from local data collection

2015-02-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5602: --- Issue Type: Sub-task (was: Improvement) Parent: SPARK-5166 > Better support for creating Data

[jira] [Commented] (SPARK-5602) Better support for creating DataFrame from local data collection

2015-02-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306128#comment-14306128 ] Apache Spark commented on SPARK-5602: - User 'rxin' has created a pull request for this

[jira] [Resolved] (SPARK-5591) NoSuchObjectException for CTAS

2015-02-04 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5591. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4365 [https:/

[jira] [Commented] (SPARK-4587) Model export/import

2015-02-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306115#comment-14306115 ] Sean Owen commented on SPARK-4587: -- OK, an internal-only format makes sense. So the idea

[jira] [Created] (SPARK-5602) Better support for creating DataFrame from local data collection

2015-02-04 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-5602: -- Summary: Better support for creating DataFrame from local data collection Key: SPARK-5602 URL: https://issues.apache.org/jira/browse/SPARK-5602 Project: Spark I

[jira] [Resolved] (SPARK-4939) Python updateStateByKey example hang in local mode

2015-02-04 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout resolved SPARK-4939. --- Resolution: Fixed Fixed by https://github.com/apache/spark/commit/0a89b156850fc5ba93160987927

[jira] [Updated] (SPARK-4939) Python updateStateByKey example hang in local mode

2015-02-04 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout updated SPARK-4939: -- Affects Version/s: (was: 1.3.0) 1.2.1 Fix Version/s: 1.2.2

[jira] [Commented] (SPARK-4877) userClassPathFirst doesn't handle user classes inheriting from parent

2015-02-04 Thread Stephen Haberman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306104#comment-14306104 ] Stephen Haberman commented on SPARK-4877: - Hi Matt, I know about the caching/Link

[jira] [Created] (SPARK-5601) Make streaming algorithms Java-friendly

2015-02-04 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5601: Summary: Make streaming algorithms Java-friendly Key: SPARK-5601 URL: https://issues.apache.org/jira/browse/SPARK-5601 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-4707) Reliable Kafka Receiver can lose data if the block generator fails to store data

2015-02-04 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-4707. -- Resolution: Fixed Fix Version/s: 1.3.0 > Reliable Kafka Receiver can lose data if the blo

[jira] [Commented] (SPARK-4587) Model export/import

2015-02-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306088#comment-14306088 ] Xiangrui Meng commented on SPARK-4587: -- [~srowen] Parquet is just an implementation d

[jira] [Commented] (SPARK-4877) userClassPathFirst doesn't handle user classes inheriting from parent

2015-02-04 Thread Matt Whelan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306082#comment-14306082 ] Matt Whelan commented on SPARK-4877: Overriding only findClass ignores caching, which

[jira] [Commented] (SPARK-4587) Model export/import

2015-02-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306032#comment-14306032 ] Sean Owen commented on SPARK-4587: -- True; you could also store N separate PMML models! At

[jira] [Commented] (SPARK-4587) Model export/import

2015-02-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306022#comment-14306022 ] Joseph K. Bradley commented on SPARK-4587: -- You may be right about compression; I

  1   2   >