[jira] [Updated] (SPARK-10329) Cost RDD in k-means|| initialization is not storage-efficient

2015-08-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10329: -- Assignee: hujiayin Cost RDD in k-means|| initialization is not storage-efficient

[jira] [Assigned] (SPARK-10290) Spark can register temp table and hive table with the same table name

2015-08-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10290: Assignee: Apache Spark Spark can register temp table and hive table with the same table

[jira] [Commented] (SPARK-10290) Spark can register temp table and hive table with the same table name

2015-08-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721493#comment-14721493 ] Apache Spark commented on SPARK-10290: -- User 'mzorro' has created a pull request for

[jira] [Assigned] (SPARK-10290) Spark can register temp table and hive table with the same table name

2015-08-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10290: Assignee: (was: Apache Spark) Spark can register temp table and hive table with the

[jira] [Comment Edited] (SPARK-10329) Cost RDD in k-means|| initialization is not storage-efficient

2015-08-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721419#comment-14721419 ] Xiangrui Meng edited comment on SPARK-10329 at 8/30/15 6:53 AM:

[jira] [Updated] (SPARK-10184) Optimization for bounds determination in RangePartitioner

2015-08-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10184: -- Assignee: Jigao Fu Optimization for bounds determination in RangePartitioner

[jira] [Commented] (SPARK-10199) Avoid using reflections for parquet model save

2015-08-30 Thread Vinod KC (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721491#comment-14721491 ] Vinod KC commented on SPARK-10199: -- [~fliang] , As you suggested, 1) I've made

[jira] [Created] (SPARK-10356) MLlib: Normalization should use absolute values

2015-08-30 Thread Carsten Schnober (JIRA)
Carsten Schnober created SPARK-10356: Summary: MLlib: Normalization should use absolute values Key: SPARK-10356 URL: https://issues.apache.org/jira/browse/SPARK-10356 Project: Spark

[jira] [Updated] (SPARK-10356) MLlib: Normalization should use absolute values

2015-08-30 Thread Carsten Schnober (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carsten Schnober updated SPARK-10356: - Description: The normalizer does not handle vectors with negative values properly. It

[jira] [Updated] (SPARK-10356) MLlib: Normalization should use absolute values

2015-08-30 Thread Carsten Schnober (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carsten Schnober updated SPARK-10356: - Description: The normalizer does not handle vectors with negative values properly. It

[jira] [Resolved] (SPARK-10184) Optimization for bounds determination in RangePartitioner

2015-08-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-10184. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8397

[jira] [Updated] (SPARK-10226) Error occured in SparkSQL when using !=

2015-08-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10226: -- Assignee: wangwei Error occured in SparkSQL when using !=

[jira] [Updated] (SPARK-10350) Fix SQL Programming Guide

2015-08-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10350: -- Assignee: Guoqiang Li Fix SQL Programming Guide - Key:

[jira] [Created] (SPARK-10355) Add Python API for SQLTransformer

2015-08-30 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-10355: --- Summary: Add Python API for SQLTransformer Key: SPARK-10355 URL: https://issues.apache.org/jira/browse/SPARK-10355 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-10301) For struct type, if parquet's global schema has less fields than a file's schema, data reading will fail

2015-08-30 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-10301: --- Description: We hit this issue when reading a complex Parquet dateset without turning on schema

[jira] [Updated] (SPARK-10023) Unified DecisionTreeParams checkpointInterval between Scala and Python API.

2015-08-30 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-10023: Description: checkpointInterval is member of DecisionTreeParams in Scala API which is

[jira] [Commented] (SPARK-10023) Unified DecisionTreeParams checkpointInterval between Scala and Python API.

2015-08-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721477#comment-14721477 ] Apache Spark commented on SPARK-10023: -- User 'yanboliang' has created a pull request

[jira] [Assigned] (SPARK-10023) Unified DecisionTreeParams checkpointInterval between Scala and Python API.

2015-08-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10023: Assignee: (was: Apache Spark) Unified DecisionTreeParams checkpointInterval between

[jira] [Commented] (SPARK-9926) Parallelize file listing for partitioned Hive table

2015-08-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721483#comment-14721483 ] Apache Spark commented on SPARK-9926: - User 'piaozhexiu' has created a pull request

[jira] [Comment Edited] (SPARK-10356) MLlib: Normalization should use absolute values

2015-08-30 Thread Carsten Schnober (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721502#comment-14721502 ] Carsten Schnober edited comment on SPARK-10356 at 8/30/15 12:00 PM:

[jira] [Resolved] (SPARK-10331) Update user guide to address minor comments during code review

2015-08-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10331. --- Resolution: Fixed Fix Version/s: 1.5.1 Update user guide to address minor comments

[jira] [Updated] (SPARK-10346) SparkR mutate and transform should replace column with same name to match R data.frame behavior

2015-08-30 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-10346: - Component/s: SparkR SparkR mutate and transform should replace column with same name to match R

[jira] [Updated] (SPARK-10301) For struct type, if parquet's global schema has less fields than a file's schema, data reading will fail

2015-08-30 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-10301: --- Description: We hit this issue when reading a complex Parquet dateset without turning on schema

[jira] [Created] (SPARK-10354) First cost RDD shouldn't be cached in k-means|| and the following cost RDD should use MEMORY_AND_DISK

2015-08-30 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-10354: - Summary: First cost RDD shouldn't be cached in k-means|| and the following cost RDD should use MEMORY_AND_DISK Key: SPARK-10354 URL:

[jira] [Commented] (SPARK-10329) Cost RDD in k-means|| initialization is not storage-efficient

2015-08-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721419#comment-14721419 ] Xiangrui Meng commented on SPARK-10329: --- Assigned. I will send a small PR to fix

[jira] [Comment Edited] (SPARK-10329) Cost RDD in k-means|| initialization is not storage-efficient

2015-08-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721419#comment-14721419 ] Xiangrui Meng edited comment on SPARK-10329 at 8/30/15 6:36 AM:

[jira] [Commented] (SPARK-10301) For struct type, if parquet's global schema has less fields than a file's schema, data reading will fail

2015-08-30 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721447#comment-14721447 ] Cheng Lian commented on SPARK-10301: Updated ticket description to provide a more

[jira] [Resolved] (SPARK-10348) Improve Spark ML user guide

2015-08-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10348. --- Resolution: Fixed Fix Version/s: 1.5.1 Issue resolved by pull request 8517

[jira] [Updated] (SPARK-10331) Update user guide to address minor comments during code review

2015-08-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10331: -- Description: Clean-up user guides to address some minor comments in:

[jira] [Updated] (SPARK-10023) Unified DecisionTreeParams checkpointInterval between Scala and Python API.

2015-08-30 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-10023: Description: checkpointInterval is member of DecisionTreeParams in Scala API which is

[jira] [Commented] (SPARK-10356) MLlib: Normalization should use absolute values

2015-08-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721494#comment-14721494 ] Sean Owen commented on SPARK-10356: --- It's not true that the sum of the elements will be

[jira] [Commented] (SPARK-10264) Add @Since annotation to ml.recoomendation

2015-08-30 Thread Tijo Thomas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721682#comment-14721682 ] Tijo Thomas commented on SPARK-10264: - I am working on this. Thanks Add @Since

[jira] [Updated] (SPARK-10353) MLlib BLAS gemm outputs wrong result when beta = 0.0 for transpose transpose matrix multiplication

2015-08-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10353: -- Affects Version/s: 1.3.1 1.4.1 MLlib BLAS gemm outputs wrong result

[jira] [Updated] (SPARK-10353) MLlib BLAS gemm outputs wrong result when beta = 0.0 for transpose transpose matrix multiplication

2015-08-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10353: -- Fix Version/s: 1.5.1 1.4.2 MLlib BLAS gemm outputs wrong result when beta

[jira] [Updated] (SPARK-10353) MLlib BLAS gemm outputs wrong result when beta = 0.0 for transpose transpose matrix multiplication

2015-08-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10353: -- Assignee: Burak Yavuz MLlib BLAS gemm outputs wrong result when beta = 0.0 for transpose

[jira] [Commented] (SPARK-10353) MLlib BLAS gemm outputs wrong result when beta = 0.0 for transpose transpose matrix multiplication

2015-08-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10353?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721689#comment-14721689 ] Xiangrui Meng commented on SPARK-10353: --- Leave the JIRA open for 1.3. MLlib BLAS

[jira] [Updated] (SPARK-8684) Update R version in Spark EC2 AMI

2015-08-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8684: - Fix Version/s: (was: 1.5.0) Update R version in Spark EC2 AMI -

[jira] [Updated] (SPARK-10189) python rdd socket connection problem

2015-08-30 Thread ABHISHEK CHOUDHARY (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ABHISHEK CHOUDHARY updated SPARK-10189: --- Description: I am trying to use wholeTextFiles with pyspark , and now I am getting

[jira] [Updated] (SPARK-9663) ML Python API coverage issues found during 1.5 QA

2015-08-30 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-9663: --- Description: This umbrella is for a list of Python API coverage issues which we should fix for the

[jira] [Assigned] (SPARK-10355) Add Python API for SQLTransformer

2015-08-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10355: Assignee: (was: Apache Spark) Add Python API for SQLTransformer

[jira] [Commented] (SPARK-10355) Add Python API for SQLTransformer

2015-08-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721467#comment-14721467 ] Apache Spark commented on SPARK-10355: -- User 'yanboliang' has created a pull request

[jira] [Assigned] (SPARK-10355) Add Python API for SQLTransformer

2015-08-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10355: Assignee: Apache Spark Add Python API for SQLTransformer

[jira] [Updated] (SPARK-10023) Unified DecisionTreeParams checkpointInterval between Scala and Python API.

2015-08-30 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-10023: Description: checkpointInterval is member of DecisionTreeParams in Scala API which is

[jira] [Updated] (SPARK-10023) Unified DecisionTreeParams checkpointInterval between Scala and Python API.

2015-08-30 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-10023: Description: checkpointInterval is member of DecisionTreeParams in Scala API which is

[jira] [Assigned] (SPARK-10023) Unified DecisionTreeParams checkpointInterval between Scala and Python API.

2015-08-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10023: Assignee: Apache Spark Unified DecisionTreeParams checkpointInterval between Scala and

[jira] [Commented] (SPARK-10356) MLlib: Normalization should use absolute values

2015-08-30 Thread Carsten Schnober (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721502#comment-14721502 ] Carsten Schnober commented on SPARK-10356: -- According to

[jira] [Updated] (SPARK-10189) python rdd socket connection problem

2015-08-30 Thread ABHISHEK CHOUDHARY (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ABHISHEK CHOUDHARY updated SPARK-10189: --- Description: I am trying to use wholeTextFiles with pyspark , and now I am getting

[jira] [Updated] (SPARK-10189) python rdd socket connection problem

2015-08-30 Thread ABHISHEK CHOUDHARY (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ABHISHEK CHOUDHARY updated SPARK-10189: --- Description: I am trying to use wholeTextFiles with pyspark , and now I am getting

[jira] [Comment Edited] (SPARK-8292) ShortestPaths run with error result

2015-08-30 Thread Anita Tailor (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721554#comment-14721554 ] Anita Tailor edited comment on SPARK-8292 at 8/30/15 3:13 PM: --

[jira] [Commented] (SPARK-8684) Update R version in Spark EC2 AMI

2015-08-30 Thread Vincent Warmerdam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721566#comment-14721566 ] Vincent Warmerdam commented on SPARK-8684: -- upgrading the spark ec2 ami would

[jira] [Resolved] (SPARK-8292) ShortestPaths run with error result

2015-08-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-8292. -- Resolution: Not A Problem ShortestPaths run with error result ---

[jira] [Resolved] (SPARK-10356) MLlib: Normalization should use absolute values

2015-08-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-10356. --- Resolution: Not A Problem MLlib: Normalization should use absolute values

[jira] [Commented] (SPARK-10356) MLlib: Normalization should use absolute values

2015-08-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721511#comment-14721511 ] Sean Owen commented on SPARK-10356: --- Exactly. Your code does not compute a 1 norm

[jira] [Issue Comment Deleted] (SPARK-8684) Update R version in Spark EC2 AMI

2015-08-30 Thread Vincent Warmerdam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vincent Warmerdam updated SPARK-8684: - Comment: was deleted (was: closed due to github confusion. reopened due to sanity.)

[jira] [Commented] (SPARK-8292) ShortestPaths run with error result

2015-08-30 Thread Anita Tailor (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721554#comment-14721554 ] Anita Tailor commented on SPARK-8292: -

[jira] [Commented] (SPARK-8684) Update R version in Spark EC2 AMI

2015-08-30 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721585#comment-14721585 ] Yin Huai commented on SPARK-8684: - ah, I see. I thought the jira of this task had been all

[jira] [Commented] (SPARK-9642) LinearRegression should supported weighted data

2015-08-30 Thread Meihua Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721801#comment-14721801 ] Meihua Wu commented on SPARK-9642: -- [~sethah] Thank you for your help. I worked on this

[jira] [Issue Comment Deleted] (SPARK-10329) Cost RDD in k-means|| initialization is not storage-efficient

2015-08-30 Thread hujiayin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] hujiayin updated SPARK-10329: - Comment: was deleted (was: ok, I will try to fix it today) Cost RDD in k-means|| initialization is not

[jira] [Commented] (SPARK-10329) Cost RDD in k-means|| initialization is not storage-efficient

2015-08-30 Thread hujiayin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721799#comment-14721799 ] hujiayin commented on SPARK-10329: -- ok, I will try to fix it today Cost RDD in

[jira] [Created] (SPARK-10357) DataFrames unable to drop unwanted columns

2015-08-30 Thread Randy Gelhausen (JIRA)
Randy Gelhausen created SPARK-10357: --- Summary: DataFrames unable to drop unwanted columns Key: SPARK-10357 URL: https://issues.apache.org/jira/browse/SPARK-10357 Project: Spark Issue Type:

[jira] [Created] (SPARK-10358) Spark-sql throws IOException on exit when using HDFS to store event log.

2015-08-30 Thread Sioa Song (JIRA)
Sioa Song created SPARK-10358: - Summary: Spark-sql throws IOException on exit when using HDFS to store event log. Key: SPARK-10358 URL: https://issues.apache.org/jira/browse/SPARK-10358 Project: Spark

[jira] [Commented] (SPARK-9666) ML 1.5 QA: model save/load audit

2015-08-30 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721906#comment-14721906 ] yuhao yang commented on SPARK-9666: --- models have no change in 1.5:

[jira] [Commented] (SPARK-10199) Avoid using reflections for parquet model save

2015-08-30 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721907#comment-14721907 ] Feynman Liang commented on SPARK-10199: --- [~vinodkc] Thanks! I think these results

[jira] [Commented] (SPARK-9642) LinearRegression should supported weighted data

2015-08-30 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721791#comment-14721791 ] Seth Hendrickson commented on SPARK-9642: - I'd like to take this one if no one

[jira] [Commented] (SPARK-10264) Add @Since annotation to ml.recoomendation

2015-08-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14722969#comment-14722969 ] Apache Spark commented on SPARK-10264: -- User 'tijoparacka' has created a pull request for this

[jira] [Assigned] (SPARK-10264) Add @Since annotation to ml.recoomendation

2015-08-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10264: Assignee: Apache Spark > Add @Since annotation to ml.recoomendation >

[jira] [Assigned] (SPARK-10264) Add @Since annotation to ml.recoomendation

2015-08-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10264: Assignee: (was: Apache Spark) > Add @Since annotation to ml.recoomendation >

[jira] [Commented] (SPARK-6724) Model import/export for FPGrowth

2015-08-30 Thread Meethu Mathew (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14722999#comment-14722999 ] Meethu Mathew commented on SPARK-6724: -- [~josephkb] Could you plz give your opinion on this ? >

[jira] [Updated] (SPARK-9545) Run Maven tests in pull request builder if title has [test-maven] in it

2015-08-30 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-9545: --- Summary: Run Maven tests in pull request builder if title has [test-maven] in it (was: Run

[jira] [Resolved] (SPARK-9547) Allow testing pull requests with different Hadoop versions

2015-08-30 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-9547. Resolution: Fixed Fix Version/s: 1.6.0 Allow testing pull requests with different

[jira] [Resolved] (SPARK-9545) Run Maven tests in pull request builder if title has [maven-test] in it

2015-08-30 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-9545. Resolution: Fixed Fix Version/s: 1.6.0 Run Maven tests in pull request builder if

[jira] [Assigned] (SPARK-10359) Enumerate Spark's dependencies in a file and diff against it for new pull requests

2015-08-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10359: Assignee: Apache Spark (was: Patrick Wendell) Enumerate Spark's dependencies in a file

[jira] [Assigned] (SPARK-10359) Enumerate Spark's dependencies in a file and diff against it for new pull requests

2015-08-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10359: Assignee: Patrick Wendell (was: Apache Spark) Enumerate Spark's dependencies in a file

[jira] [Commented] (SPARK-10359) Enumerate Spark's dependencies in a file and diff against it for new pull requests

2015-08-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14722716#comment-14722716 ] Apache Spark commented on SPARK-10359: -- User 'pwendell' has created a pull request

[jira] [Created] (SPARK-10359) Enumerate Spark's dependencies in a file and diff against it for new pull requests

2015-08-30 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-10359: --- Summary: Enumerate Spark's dependencies in a file and diff against it for new pull requests Key: SPARK-10359 URL: https://issues.apache.org/jira/browse/SPARK-10359