[jira] [Created] (SPARK-10607) Scheduler should include defensive measures against infinite loops due to task commit denial

2015-09-14 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-10607: -- Summary: Scheduler should include defensive measures against infinite loops due to task commit denial Key: SPARK-10607 URL: https://issues.apache.org/jira/browse/SPARK-10607

[jira] [Resolved] (SPARK-10273) Add @since annotation to pyspark.mllib.feature

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10273. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8633

[jira] [Resolved] (SPARK-10275) Add @since annotation to pyspark.mllib.random

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10275. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8666

[jira] [Comment Edited] (SPARK-10399) Off Heap Memory Access for non-JVM libraries (C++)

2015-09-14 Thread Paul Wais (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744634#comment-14744634 ] Paul Wais edited comment on SPARK-10399 at 9/15/15 1:16 AM: After

[jira] [Commented] (SPARK-10590) Spark with YARN build is broken

2015-09-14 Thread Kevin Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744668#comment-14744668 ] Kevin Tsai commented on SPARK-10590: Hi Reynold, The issue still there to me. I've setup 2 Linux

[jira] [Resolved] (SPARK-10542) The PySpark 1.5 closure serializer can't serialize a namedtuple instance.

2015-09-14 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-10542. Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 Target

[jira] [Commented] (SPARK-10590) Spark with YARN build is broken

2015-09-14 Thread Kevin Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744660#comment-14744660 ] Kevin Tsai commented on SPARK-10590: Yes, I've ran ./dev/change-scala-version.sh 2.11 before build

[jira] [Commented] (SPARK-9313) Enable a "docker run" invocation in place of PYSPARK_PYTHON

2015-09-14 Thread Justin Uang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744731#comment-14744731 ] Justin Uang commented on SPARK-9313: This would be hugely helpful. I'm working on a platform that

[jira] [Commented] (SPARK-8939) YARN EC2 default setting fails with IllegalArgumentException

2015-09-14 Thread Sen Fang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744792#comment-14744792 ] Sen Fang commented on SPARK-8939: - The issue seems to lie in:

[jira] [Created] (SPARK-10605) collect_list() and collect_set() should accept struct types as argument

2015-09-14 Thread Mike Fang (JIRA)
Mike Fang created SPARK-10605: - Summary: collect_list() and collect_set() should accept struct types as argument Key: SPARK-10605 URL: https://issues.apache.org/jira/browse/SPARK-10605 Project: Spark

[jira] [Commented] (SPARK-6235) Address various 2G limits

2015-09-14 Thread Sean McKibben (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744746#comment-14744746 ] Sean McKibben commented on SPARK-6235: -- When reading from HBase into spark, the regions seem to

[jira] [Updated] (SPARK-10601) Spark SQL - Support for MINUS

2015-09-14 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-10601: Component/s: SQL > Spark SQL - Support for MINUS > - > >

[jira] [Commented] (SPARK-10486) Spark intermittently fails to recover from a worker failure (in standalone mode)

2015-09-14 Thread Madhusudanan Kandasamy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744858#comment-14744858 ] Madhusudanan Kandasamy commented on SPARK-10486: Can you share a simplified testcase

[jira] [Commented] (SPARK-9844) File appender race condition during SparkWorker shutdown

2015-09-14 Thread Gareth Lewin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744590#comment-14744590 ] Gareth Lewin commented on SPARK-9844: - I believe this still exists in 1.5.0, I am getting this on my

[jira] [Commented] (SPARK-10399) Off Heap Memory Access for non-JVM libraries (C++)

2015-09-14 Thread Paul Wais (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744634#comment-14744634 ] Paul Wais commented on SPARK-10399: --- After investigating this issue a bit further, it might be feasible

[jira] [Resolved] (SPARK-9851) Support submitting map stages individually in DAGScheduler

2015-09-14 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-9851. -- Resolution: Fixed Fix Version/s: 1.6.0 > Support submitting map stages individually in

[jira] [Commented] (SPARK-10194) SGD algorithms need convergenceTol parameter in Python

2015-09-14 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744782#comment-14744782 ] Yanbo Liang commented on SPARK-10194: - [~mengxr] OK. > SGD algorithms need convergenceTol parameter

[jira] [Commented] (SPARK-6235) Address various 2G limits

2015-09-14 Thread Ram Gande (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744605#comment-14744605 ] Ram Gande commented on SPARK-6235: -- Any progress on this. We are seeing this issue constantly in our

[jira] [Updated] (SPARK-9962) Decision Tree training: prevNodeIdsForInstances.unpersist() at end of training

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9962: - Shepherd: Joseph K. Bradley Target Version/s: 1.6.0 > Decision Tree training:

[jira] [Updated] (SPARK-9962) Decision Tree training: prevNodeIdsForInstances.unpersist() at end of training

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9962: - Assignee: holdenk > Decision Tree training: prevNodeIdsForInstances.unpersist() at end of

[jira] [Created] (SPARK-10606) Cube/Rollup/GrpSet doesn't create the correct plan when group by is on something other than an AttributeReference

2015-09-14 Thread Harish Butani (JIRA)
Harish Butani created SPARK-10606: - Summary: Cube/Rollup/GrpSet doesn't create the correct plan when group by is on something other than an AttributeReference Key: SPARK-10606 URL:

[jira] [Comment Edited] (SPARK-8939) YARN EC2 default setting fails with IllegalArgumentException

2015-09-14 Thread Heji Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14739578#comment-14739578 ] Heji Kim edited comment on SPARK-8939 at 9/15/15 3:14 AM: -- I was trying to

[jira] [Resolved] (SPARK-9793) PySpark DenseVector, SparseVector should override __eq__ and __hash__

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-9793. -- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8166

[jira] [Updated] (SPARK-10600) SparkSQL - Support for Not Exists in a Correlated Subquery

2015-09-14 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-10600: Component/s: SQL > SparkSQL - Support for Not Exists in a Correlated Subquery >

[jira] [Updated] (SPARK-10094) Mark ML PySpark feature transformers as Experimental to match Scala

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10094: -- Target Version/s: 1.6.0 (was: 1.5.0) > Mark ML PySpark feature transformers as Experimental

[jira] [Commented] (SPARK-7169) Allow to specify metrics configuration more flexibly

2015-09-14 Thread Ryan Williams (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744200#comment-14744200 ] Ryan Williams commented on SPARK-7169: -- [~jlewandowski] I assume [~jerryshao] is referring to the

[jira] [Resolved] (SPARK-10576) Move .java files out of src/main/scala

2015-09-14 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-10576. --- Resolution: Fixed Assignee: Sean Owen Fix Version/s: 1.6.0 Target

[jira] [Commented] (SPARK-10577) [PySpark, SQL] DataFrame hint for broadcast join

2015-09-14 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743022#comment-14743022 ] Maciej Bryński commented on SPARK-10577: And what about pyspark ? I did my own "broadcast join",

[jira] [Commented] (SPARK-10578) pyspark.ml.classification.RandomForestClassifer does not return `rawPrediction` column

2015-09-14 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743029#comment-14743029 ] Liang-Chi Hsieh commented on SPARK-10578: - Hi Karen, I think these columns are added to

[jira] [Commented] (SPARK-10577) [PySpark] DataFrame hint for broadcast join

2015-09-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743033#comment-14743033 ] Reynold Xin commented on SPARK-10577: - We already have the Java API in functions.scala - just need to

[jira] [Updated] (SPARK-10577) [PySpark] DataFrame hint for broadcast join

2015-09-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10577: Summary: [PySpark] DataFrame hint for broadcast join (was: [PySpark, SQL] DataFrame hint for

[jira] [Updated] (SPARK-10577) [PySpark] DataFrame hint for broadcast join

2015-09-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10577: Target Version/s: 1.6.0 > [PySpark] DataFrame hint for broadcast join >

[jira] [Created] (SPARK-10586) BlockManager ca't be removed when it is re-registered, then disassociats

2015-09-14 Thread meiyoula (JIRA)
meiyoula created SPARK-10586: Summary: BlockManager ca't be removed when it is re-registered, then disassociats Key: SPARK-10586 URL: https://issues.apache.org/jira/browse/SPARK-10586 Project: Spark

[jira] [Commented] (SPARK-10577) [PySpark, SQL] DataFrame hint for broadcast join

2015-09-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743027#comment-14743027 ] Reynold Xin commented on SPARK-10577: - We can easily add it to pyspark by adding it to

[jira] [Commented] (SPARK-10577) [PySpark, SQL] DataFrame hint for broadcast join

2015-09-14 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743030#comment-14743030 ] Maciej Bryński commented on SPARK-10577: Unfortunatelly I'm rather poweruser than programmer. And

[jira] [Updated] (SPARK-10577) [PySpark, SQL] DataFrame hint for broadcast join

2015-09-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10577: Labels: starter (was: ) > [PySpark, SQL] DataFrame hint for broadcast join >

[jira] [Updated] (SPARK-10577) [PySpark] DataFrame hint for broadcast join

2015-09-14 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-10577: --- Description: As in https://issues.apache.org/jira/browse/SPARK-8300 there should by

[jira] [Commented] (SPARK-1103) Garbage collect RDD information inside of Spark

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743060#comment-14743060 ] Apache Spark commented on SPARK-1103: - User 'tdas' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10573) IndexToString transformSchema adds output field as DoubleType

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10573: Assignee: (was: Apache Spark) > IndexToString transformSchema adds output field as

[jira] [Commented] (SPARK-10573) IndexToString transformSchema adds output field as DoubleType

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743958#comment-14743958 ] Apache Spark commented on SPARK-10573: -- User 'pnpritchard' has created a pull request for this

[jira] [Commented] (SPARK-10574) HashingTF should use MurmurHash3

2015-09-14 Thread Simeon Simeonov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743922#comment-14743922 ] Simeon Simeonov commented on SPARK-10574: - [~josephkb] this makes sense. There are a few

[jira] [Updated] (SPARK-6548) stddev_pop and stddev_samp aggregate functions

2015-09-14 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6548: Assignee: Jihong MA > stddev_pop and stddev_samp aggregate functions >

[jira] [Updated] (SPARK-10573) IndexToString transformSchema adds output field as DoubleType

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10573: -- Assignee: Nick Pritchard > IndexToString transformSchema adds output field as DoubleType >

[jira] [Updated] (SPARK-10573) IndexToString transformSchema adds output field as DoubleType

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10573: -- Shepherd: Xiangrui Meng > IndexToString transformSchema adds output field as DoubleType >

[jira] [Updated] (SPARK-10077) Java package doc for spark.ml.feature

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10077: -- Assignee: holdenk > Java package doc for spark.ml.feature >

[jira] [Updated] (SPARK-10077) Java package doc for spark.ml.feature

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10077: -- Shepherd: Xiangrui Meng Target Version/s: 1.6.0 > Java package doc for

[jira] [Commented] (SPARK-10590) Spark with YARN build is broken

2015-09-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744000#comment-14744000 ] Sean Owen commented on SPARK-10590: --- No, I'm talking about {{./dev/change-scala-version.sh 2.11}}. If

[jira] [Comment Edited] (SPARK-10574) HashingTF should use MurmurHash3

2015-09-14 Thread Simeon Simeonov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743922#comment-14743922 ] Simeon Simeonov edited comment on SPARK-10574 at 9/14/15 5:55 PM: --

[jira] [Updated] (SPARK-10573) IndexToString transformSchema adds output field as DoubleType

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10573: -- Target Version/s: 1.6.0, 1.5.1 > IndexToString transformSchema adds output field as DoubleType

[jira] [Updated] (SPARK-9769) Add Python API for ml.feature.CountVectorizer

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9769: - Shepherd: Xiangrui Meng > Add Python API for ml.feature.CountVectorizer >

[jira] [Created] (SPARK-10592) deprecate weights and use coefficients instead in ML models

2015-09-14 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-10592: - Summary: deprecate weights and use coefficients instead in ML models Key: SPARK-10592 URL: https://issues.apache.org/jira/browse/SPARK-10592 Project: Spark

[jira] [Comment Edited] (SPARK-10594) ApplicationMaster "--help" references the removed "--num-executors" option

2015-09-14 Thread Erick Tryzelaar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744117#comment-14744117 ] Erick Tryzelaar edited comment on SPARK-10594 at 9/14/15 7:49 PM: --

[jira] [Updated] (SPARK-10594) ApplicationMaster "--help" references the removed "--num-executors" option

2015-09-14 Thread Erick Tryzelaar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Erick Tryzelaar updated SPARK-10594: Attachment: 0002-SPARK-10594-YARN-Document-ApplicationMaster-properti.patch

[jira] [Commented] (SPARK-10594) ApplicationMaster "--help" references the removed "--num-executors" option

2015-09-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744127#comment-14744127 ] Sean Owen commented on SPARK-10594: --- Can you make a PR instead? we use github rather than patches.

[jira] [Commented] (SPARK-9325) Support `collect` on DataFrame columns

2015-09-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744136#comment-14744136 ] Reynold Xin commented on SPARK-9325: Are you only going to add collect, or are you going to add

[jira] [Closed] (SPARK-10590) Spark with YARN build is broken

2015-09-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-10590. --- Resolution: Cannot Reproduce Closing this one for now. [~kevintsai] please continue to comment if

[jira] [Updated] (SPARK-10590) Spark with YARN build is broken

2015-09-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10590: Target Version/s: (was: 1.5.0) > Spark with YARN build is broken >

[jira] [Commented] (SPARK-10588) Saving a DataFrame containing only nulls to JSON doesn't work

2015-09-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744149#comment-14744149 ] Reynold Xin commented on SPARK-10588: - I think a more proper fix is to write the schema out as a

[jira] [Commented] (SPARK-10588) Saving a DataFrame containing only nulls to JSON doesn't work

2015-09-14 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744157#comment-14744157 ] Yin Huai commented on SPARK-10588: -- Right, that is better. > Saving a DataFrame containing only nulls

[jira] [Updated] (SPARK-10539) Intersection Optimization is Wrong

2015-09-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10539: Assignee: Yijie Shen > Intersection Optimization is Wrong > -- > >

[jira] [Updated] (SPARK-9774) Add Python API for ml.regression.IsotonicRegression

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9774: - Shepherd: Yanbo Liang Target Version/s: 1.6.0 Priority: Major (was:

[jira] [Updated] (SPARK-10266) Add @Since annotation to ml.tuning

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10266: -- Assignee: Ehsan Mohyedin Kermani > Add @Since annotation to ml.tuning >

[jira] [Updated] (SPARK-10266) Add @Since annotation to ml.tuning

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10266: -- Shepherd: Yu Ishikawa Target Version/s: 1.6.0 > Add @Since annotation to ml.tuning

[jira] [Updated] (SPARK-10271) Add @since annotation to pyspark.mllib.clustering

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10271: -- Shepherd: Yu Ishikawa > Add @since annotation to pyspark.mllib.clustering >

[jira] [Created] (SPARK-10594) ApplicationMaster "--help" references the removed "--num-executors" option

2015-09-14 Thread Erick Tryzelaar (JIRA)
Erick Tryzelaar created SPARK-10594: --- Summary: ApplicationMaster "--help" references the removed "--num-executors" option Key: SPARK-10594 URL: https://issues.apache.org/jira/browse/SPARK-10594

[jira] [Updated] (SPARK-10272) Add @since annotation to pyspark.mllib.evaluation

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10272: -- Assignee: Noel Smith > Add @since annotation to pyspark.mllib.evaluation >

[jira] [Updated] (SPARK-10273) Add @since annotation to pyspark.mllib.feature

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10273: -- Assignee: Noel Smith > Add @since annotation to pyspark.mllib.feature >

[jira] [Updated] (SPARK-10269) Add @since annotation to pyspark.mllib.classification

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10269: -- Assignee: Noel Smith > Add @since annotation to pyspark.mllib.classification >

[jira] [Updated] (SPARK-10271) Add @since annotation to pyspark.mllib.clustering

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10271: -- Assignee: Noel Smith > Add @since annotation to pyspark.mllib.clustering >

[jira] [Updated] (SPARK-10275) Add @since annotation to pyspark.mllib.random

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10275: -- Shepherd: Noel Smith > Add @since annotation to pyspark.mllib.random >

[jira] [Updated] (SPARK-10276) Add @since annotation to pyspark.mllib.recommendation

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10276: -- Assignee: Yu Ishikawa > Add @since annotation to pyspark.mllib.recommendation >

[jira] [Updated] (SPARK-10274) Add @since annotation to pyspark.mllib.fpm

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10274: -- Shepherd: Noel Smith > Add @since annotation to pyspark.mllib.fpm >

[jira] [Updated] (SPARK-10273) Add @since annotation to pyspark.mllib.feature

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10273: -- Shepherd: Yu Ishikawa > Add @since annotation to pyspark.mllib.feature >

[jira] [Created] (SPARK-10595) Various ML programming guide cleanups post 1.5

2015-09-14 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-10595: - Summary: Various ML programming guide cleanups post 1.5 Key: SPARK-10595 URL: https://issues.apache.org/jira/browse/SPARK-10595 Project: Spark

[jira] [Updated] (SPARK-10269) Add @since annotation to pyspark.mllib.classification

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10269: -- Shepherd: Yu Ishikawa > Add @since annotation to pyspark.mllib.classification >

[jira] [Updated] (SPARK-10272) Add @since annotation to pyspark.mllib.evaluation

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10272: -- Shepherd: Yu Ishikawa > Add @since annotation to pyspark.mllib.evaluation >

[jira] [Updated] (SPARK-10283) Add @since annotation to pyspark.ml.regression

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10283: -- Assignee: Yu Ishikawa > Add @since annotation to pyspark.ml.regression >

[jira] [Updated] (SPARK-10282) Add @since annotation to pyspark.ml.recommendation

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10282: -- Shepherd: Noel Smith Assignee: Yu Ishikawa Target Version/s: 1.6.0 >

[jira] [Updated] (SPARK-10281) Add @since annotation to pyspark.ml.clustering

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10281: -- Target Version/s: 1.6.0 > Add @since annotation to pyspark.ml.clustering >

[jira] [Updated] (SPARK-10284) Add @since annotation to pyspark.ml.tuning

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10284: -- Target Version/s: 1.6.0 > Add @since annotation to pyspark.ml.tuning >

[jira] [Updated] (SPARK-10284) Add @since annotation to pyspark.ml.tuning

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10284: -- Shepherd: Noel Smith Assignee: Yu Ishikawa > Add @since annotation to pyspark.ml.tuning >

[jira] [Updated] (SPARK-10283) Add @since annotation to pyspark.ml.regression

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10283: -- Target Version/s: 1.6.0 > Add @since annotation to pyspark.ml.regression >

[jira] [Updated] (SPARK-10285) Add @since annotation to pyspark.ml.util

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10285: -- Shepherd: Noel Smith Assignee: Yu Ishikawa Target Version/s: 1.6.0 >

[jira] [Updated] (SPARK-10283) Add @since annotation to pyspark.ml.regression

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10283: -- Shepherd: Noel Smith > Add @since annotation to pyspark.ml.regression >

[jira] [Updated] (SPARK-10594) ApplicationMaster "--help" references the removed "--num-executors" option

2015-09-14 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-10594: --- Priority: Trivial (was: Major) > ApplicationMaster "--help" references the removed

[jira] [Commented] (SPARK-10269) Add @since annotation to pyspark.mllib.classification

2015-09-14 Thread Noel Smith (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744045#comment-14744045 ] Noel Smith commented on SPARK-10269: OK - no problem. I'll add one next time. > Add @since

[jira] [Updated] (SPARK-10595) Various ML programming guide cleanups post 1.5

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10595: -- Shepherd: Feynman Liang > Various ML programming guide cleanups post 1.5 >

[jira] [Updated] (SPARK-10394) Make GBTParams use shared "stepSize"

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10394: -- Shepherd: Joseph K. Bradley > Make GBTParams use shared "stepSize" >

[jira] [Updated] (SPARK-10516) Add values as a property to DenseVector in PySpark

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10516: -- Shepherd: Xiangrui Meng > Add values as a property to DenseVector in PySpark >

[jira] [Resolved] (SPARK-10194) SGD algorithms need convergenceTol parameter in Python

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10194. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8457

[jira] [Updated] (SPARK-10194) SGD algorithms need convergenceTol parameter in Python

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10194: -- Assignee: Yanbo Liang > SGD algorithms need convergenceTol parameter in Python >

[jira] [Commented] (SPARK-10577) [PySpark] DataFrame hint for broadcast join

2015-09-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744080#comment-14744080 ] Reynold Xin commented on SPARK-10577: - Please go ahead. Thanks. > [PySpark] DataFrame hint for

[jira] [Updated] (SPARK-9793) PySpark DenseVector, SparseVector should override __eq__ and __hash__

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9793: - Shepherd: Xiangrui Meng Assignee: Yanbo Liang > PySpark DenseVector, SparseVector should

[jira] [Created] (SPARK-10593) sql lateral view same name gives wrong value

2015-09-14 Thread Davies Liu (JIRA)
Davies Liu created SPARK-10593: -- Summary: sql lateral view same name gives wrong value Key: SPARK-10593 URL: https://issues.apache.org/jira/browse/SPARK-10593 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-9774) Add Python API for ml.regression.IsotonicRegression

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9774: - Assignee: holdenk > Add Python API for ml.regression.IsotonicRegression >

[jira] [Updated] (SPARK-10265) Add @Since annotation to ml.regression

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10265: -- Target Version/s: 1.6.0 > Add @Since annotation to ml.regression >

[jira] [Updated] (SPARK-10265) Add @Since annotation to ml.regression

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10265: -- Assignee: Ehsan Mohyedin Kermani > Add @Since annotation to ml.regression >

[jira] [Updated] (SPARK-10593) sql lateral view same name gives wrong value

2015-09-14 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-10593: --- Description: This query will return wrong result: {code} select insideLayer1.json as

[jira] [Commented] (SPARK-10266) Add @Since annotation to ml.tuning

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744024#comment-14744024 ] Xiangrui Meng commented on SPARK-10266: --- [~Ehsan Mohyedin Kermani] Next time please leave a message

[jira] [Assigned] (SPARK-10595) Various ML programming guide cleanups post 1.5

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10595: Assignee: Apache Spark (was: Joseph K. Bradley) > Various ML programming guide cleanups

<    1   2   3   >