[jira] [Comment Edited] (SPARK-10141) Number of tasks on executors still become negative after failures

2015-09-20 Thread Ohad Zadok (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14877454#comment-14877454 ] Ohad Zadok edited comment on SPARK-10141 at 9/20/15 7:56 AM: - happens to me

[jira] [Comment Edited] (SPARK-10141) Number of tasks on executors still become negative after failures

2015-09-20 Thread Ohad Zadok (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14877454#comment-14877454 ] Ohad Zadok edited comment on SPARK-10141 at 9/20/15 7:57 AM: - happens to me

[jira] [Resolved] (SPARK-4503) The history server is not compatible with HDFS HA

2015-09-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4503. -- Resolution: Cannot Reproduce > The history server is not compatible with HDFS HA >

[jira] [Commented] (SPARK-3246) Support weighted SVMWithSGD for classification of unbalanced dataset

2015-09-20 Thread Masaki Rikitoku (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14877444#comment-14877444 ] Masaki Rikitoku commented on SPARK-3246: In the libsvm package, we can set the different C

[jira] [Commented] (SPARK-10141) Number of tasks on executors still become negative after failures

2015-09-20 Thread Ohad Zadok (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14877454#comment-14877454 ] Ohad Zadok commented on SPARK-10141: happens to me as well on 1.5.0 When running LDA on ~1.5 Million

[jira] [Commented] (SPARK-10718) Check License should not verify conf files for license

2015-09-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1490#comment-1490 ] Sean Owen commented on SPARK-10718: --- There's no such file in the repo or release though:

[jira] [Commented] (SPARK-6484) Ganglia metrics xml reporter doesn't escape correctly

2015-09-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14900031#comment-14900031 ] Reynold Xin commented on SPARK-6484: Is this still a problem? > Ganglia metrics xml reporter doesn't

[jira] [Updated] (SPARK-10588) Saving a DataFrame containing only nulls to JSON doesn't work

2015-09-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10588: Target Version/s: 1.6.0 (was: 1.6.0, 1.5.1) > Saving a DataFrame containing only nulls to JSON

[jira] [Updated] (SPARK-10544) Serialization of Python namedtuple subclasses in functions / closures is broken

2015-09-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10544: Fix Version/s: 1.5.1 1.6.0 > Serialization of Python namedtuple subclasses in

[jira] [Updated] (SPARK-10337) Views are broken

2015-09-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10337: Target Version/s: 1.6.0 (was: 1.5.1) > Views are broken > > >

[jira] [Created] (SPARK-10721) Log warning when file deletion fails

2015-09-20 Thread Ted Yu (JIRA)
Ted Yu created SPARK-10721: -- Summary: Log warning when file deletion fails Key: SPARK-10721 URL: https://issues.apache.org/jira/browse/SPARK-10721 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-10721) Log warning when file deletion fails

2015-09-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10721: Assignee: Apache Spark > Log warning when file deletion fails >

[jira] [Assigned] (SPARK-10721) Log warning when file deletion fails

2015-09-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10721: Assignee: (was: Apache Spark) > Log warning when file deletion fails >

[jira] [Commented] (SPARK-10721) Log warning when file deletion fails

2015-09-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14900098#comment-14900098 ] Apache Spark commented on SPARK-10721: -- User 'tedyu' has created a pull request for this issue:

[jira] [Resolved] (SPARK-5905) Note requirements for certain RowMatrix methods in docs

2015-09-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5905. -- Resolution: Fixed > Note requirements for certain RowMatrix methods in docs >

[jira] [Updated] (SPARK-5905) Note requirements for certain RowMatrix methods in docs

2015-09-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5905: - Assignee: Sean Owen > Note requirements for certain RowMatrix methods in docs >

[jira] [Updated] (SPARK-5905) Note requirements for certain RowMatrix methods in docs

2015-09-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5905: - Target Version/s: 1.6.0 Fix Version/s: 1.6.0 > Note requirements for certain RowMatrix

[jira] [Updated] (SPARK-10715) Duplicate initialzation flag in WeightedLeastSquare

2015-09-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10715: -- Assignee: Kai Sasaki > Duplicate initialzation flag in WeightedLeastSquare >

[jira] [Updated] (SPARK-10715) Duplicate initialzation flag in WeightedLeastSquare

2015-09-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10715: -- Target Version/s: 1.6.0 > Duplicate initialzation flag in WeightedLeastSquare >

[jira] [Updated] (SPARK-10686) Add quantileCol to AFTSurvivalRegression

2015-09-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10686: -- Shepherd: Xiangrui Meng > Add quantileCol to AFTSurvivalRegression >

[jira] [Updated] (SPARK-9681) Support R feature interactions in RFormula

2015-09-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9681: - Shepherd: Xiangrui Meng > Support R feature interactions in RFormula >

[jira] [Resolved] (SPARK-10715) Duplicate initialzation flag in WeightedLeastSquare

2015-09-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10715. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8837

[jira] [Updated] (SPARK-9681) Support R feature interactions in RFormula

2015-09-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9681: - Assignee: Eric Liang > Support R feature interactions in RFormula >

[jira] [Updated] (SPARK-10631) Add missing API doc in pyspark.mllib.linalg.Vector

2015-09-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10631: -- Shepherd: Xiangrui Meng > Add missing API doc in pyspark.mllib.linalg.Vector >

[jira] [Commented] (SPARK-3255) Faster algorithms for logistic regression

2015-09-20 Thread Henry Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14900114#comment-14900114 ] Henry Lin commented on SPARK-3255: -- This paper

[jira] [Comment Edited] (SPARK-3255) Faster algorithms for logistic regression

2015-09-20 Thread Henry Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14900114#comment-14900114 ] Henry Lin edited comment on SPARK-3255 at 9/20/15 11:41 PM: This paper here...

[jira] [Commented] (SPARK-10559) DataFrame schema ArrayType should accept ResultIterable

2015-09-20 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14899952#comment-14899952 ] Maciej Bryński commented on SPARK-10559: I did some performance tests with both solutions and I

[jira] [Commented] (SPARK-10719) SQLImplicits.rddToDataFrameHolder is not thread safe when using Scala 2.10

2015-09-20 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1481#comment-1481 ] Shixiong Zhu commented on SPARK-10719: -- Actually, other places that use `TypeTag` as context bound

[jira] [Created] (SPARK-10719) SQLImplicits.rddToDataFrameHolder is not thread safe when using Scala 2.10

2015-09-20 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-10719: Summary: SQLImplicits.rddToDataFrameHolder is not thread safe when using Scala 2.10 Key: SPARK-10719 URL: https://issues.apache.org/jira/browse/SPARK-10719 Project:

[jira] [Created] (SPARK-10723) Add RDD.reduceOption method

2015-09-20 Thread Tatsuya Atsumi (JIRA)
Tatsuya Atsumi created SPARK-10723: -- Summary: Add RDD.reduceOption method Key: SPARK-10723 URL: https://issues.apache.org/jira/browse/SPARK-10723 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-10694) Prevent Data Loss in Spark Streaming when used with OFF_HEAP ExternalBlockStore (Tachyon)

2015-09-20 Thread Dibyendu Bhattacharya (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dibyendu Bhattacharya updated SPARK-10694: -- Component/s: Block Manager > Prevent Data Loss in Spark Streaming when used

[jira] [Created] (SPARK-10724) SQL's floor() returns DOUBLE

2015-09-20 Thread Simeon Simeonov (JIRA)
Simeon Simeonov created SPARK-10724: --- Summary: SQL's floor() returns DOUBLE Key: SPARK-10724 URL: https://issues.apache.org/jira/browse/SPARK-10724 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-10722) Uncaught exception: RDDBlockId not found in driver-heartbeater

2015-09-20 Thread Simeon Simeonov (JIRA)
Simeon Simeonov created SPARK-10722: --- Summary: Uncaught exception: RDDBlockId not found in driver-heartbeater Key: SPARK-10722 URL: https://issues.apache.org/jira/browse/SPARK-10722 Project: Spark

[jira] [Commented] (SPARK-8000) SQLContext.read.load() should be able to auto-detect input data

2015-09-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14900204#comment-14900204 ] Reynold Xin commented on SPARK-8000: Yes - sounds good. > SQLContext.read.load() should be able to

[jira] [Assigned] (SPARK-10724) SQL's floor() returns DOUBLE

2015-09-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-10724: - Assignee: Xiangrui Meng > SQL's floor() returns DOUBLE > >

[jira] [Updated] (SPARK-10724) SQL's floor() returns DOUBLE

2015-09-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10724: -- Assignee: (was: Xiangrui Meng) > SQL's floor() returns DOUBLE >

[jira] [Resolved] (SPARK-10631) Add missing API doc in pyspark.mllib.linalg.Vector

2015-09-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10631. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8834

[jira] [Commented] (SPARK-10602) Univariate statistics as UDAFs: single-pass continuous stats

2015-09-20 Thread Sabyasachi Nayak (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14900166#comment-14900166 ] Sabyasachi Nayak commented on SPARK-10602: -- Hey Seth, can you pls share the working versions of

[jira] [Commented] (SPARK-8000) SQLContext.read.load() should be able to auto-detect input data

2015-09-20 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14900172#comment-14900172 ] Yanbo Liang commented on SPARK-8000: [~rxin] I will work on it. I agree to make Spark SQL write an

[jira] [Commented] (SPARK-10723) Add RDD.reduceOption method

2015-09-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14900184#comment-14900184 ] Apache Spark commented on SPARK-10723: -- User 'Attsun1031' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10723) Add RDD.reduceOption method

2015-09-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10723: Assignee: Apache Spark > Add RDD.reduceOption method > --- > >

[jira] [Assigned] (SPARK-10723) Add RDD.reduceOption method

2015-09-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10723: Assignee: (was: Apache Spark) > Add RDD.reduceOption method >

[jira] [Updated] (SPARK-9852) Let reduce tasks fetch multiple map output partitions

2015-09-20 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-9852: - Summary: Let reduce tasks fetch multiple map output partitions (was: Let HashShuffleFetcher

[jira] [Assigned] (SPARK-10630) createDataFrame from a Java List

2015-09-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10630: Assignee: Apache Spark > createDataFrame from a Java List >

[jira] [Assigned] (SPARK-10630) createDataFrame from a Java List

2015-09-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10630: Assignee: (was: Apache Spark) > createDataFrame from a Java List >

[jira] [Commented] (SPARK-10630) createDataFrame from a Java List

2015-09-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14900017#comment-14900017 ] Apache Spark commented on SPARK-10630: -- User 'holdenk' has created a pull request for this issue:

[jira] [Created] (SPARK-10720) Add a java wrapper to create dataframe from a local list of Java Beans.

2015-09-20 Thread holdenk (JIRA)
holdenk created SPARK-10720: --- Summary: Add a java wrapper to create dataframe from a local list of Java Beans. Key: SPARK-10720 URL: https://issues.apache.org/jira/browse/SPARK-10720 Project: Spark

[jira] [Updated] (SPARK-10685) Misaligned data with RDD.zip and DataFrame.withColumn after repartition

2015-09-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10685: Target Version/s: 1.6.0, 1.5.1 (was: 1.5.1) > Misaligned data with RDD.zip and

[jira] [Updated] (SPARK-10640) Spark history server fails to parse taskEndReasonFromJson TaskCommitDenied

2015-09-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10640: Description: I'm seeing an exception from the spark history server trying to read a history file:

[jira] [Commented] (SPARK-10517) Console "Output" field is empty when using DataFrameWriter.json

2015-09-20 Thread Hans van den Bogert (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14900038#comment-14900038 ] Hans van den Bogert commented on SPARK-10517: - I never see output size. I've tried local disk

[jira] [Updated] (SPARK-10297) When save data to a data source table, we should bound the size of a saved file

2015-09-20 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10297: - Target Version/s: 1.6.0 (was: 1.6.0, 1.5.1) > When save data to a data source table, we should bound

[jira] [Updated] (SPARK-10297) When save data to a data source table, we should bound the size of a saved file

2015-09-20 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10297: - Target Version/s: 1.6.0, 1.5.1 (was: 1.6.0) > When save data to a data source table, we should bound

[jira] [Commented] (SPARK-10720) Add a java wrapper to create dataframe from a local list of Java Beans.

2015-09-20 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14900019#comment-14900019 ] holdenk commented on SPARK-10720: - While it's not blocked on this I'm going to wait for SPARK-10630 to go

[jira] [Updated] (SPARK-10718) Check License should not verify conf files for license

2015-09-20 Thread Rekha Joshi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rekha Joshi updated SPARK-10718: Priority: Minor (was: Major) > Check License should not verify conf files for license >