[jira] [Created] (SPARK-10483) spark-submit can not support symbol link

2015-09-08 Thread xuqing (JIRA)
xuqing created SPARK-10483: -- Summary: spark-submit can not support symbol link Key: SPARK-10483 URL: https://issues.apache.org/jira/browse/SPARK-10483 Project: Spark Issue Type: Bug Compon

[jira] [Updated] (SPARK-10483) spark-submit can not support symbol link

2015-09-08 Thread xuqing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xuqing updated SPARK-10483: --- Description: Create a symbol link for spark-submit {quote} [root@xqwin03 bin]# ll spark-submit lrwxrwxrwx 1 r

[jira] [Updated] (SPARK-10483) spark-submit can not support symbol link

2015-09-08 Thread xuqing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xuqing updated SPARK-10483: --- Environment: Red Hat Enterprise Linux Server release 6.4 (Santiago) (was: [root@xqwin03 bin]# cat /etc/redha

[jira] [Created] (SPARK-10484) [Spark SQL] Come across lost task(timeout) or GC OOM error when cross join happen

2015-09-08 Thread Yi Zhou (JIRA)
Yi Zhou created SPARK-10484: --- Summary: [Spark SQL] Come across lost task(timeout) or GC OOM error when cross join happen Key: SPARK-10484 URL: https://issues.apache.org/jira/browse/SPARK-10484 Project: Spa

[jira] [Updated] (SPARK-10484) [Spark SQL] Come across lost task(timeout) or GC OOM error when cross join happen

2015-09-08 Thread Yi Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yi Zhou updated SPARK-10484: Description: Found that it lost task or GC OOM when below cross join happen. The left big table is ~1.2G i

[jira] [Updated] (SPARK-10484) [Spark SQL] Come across lost task(timeout) or GC OOM error when cross join happen

2015-09-08 Thread Yi Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yi Zhou updated SPARK-10484: Description: Found that it lost task or GC OOM when below cross join happen. The left big table is ~1.2G i

[jira] [Updated] (SPARK-10484) [Spark SQL] Come across lost task(timeout) or GC OOM error when cross join happen

2015-09-08 Thread Yi Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yi Zhou updated SPARK-10484: Description: Found that it lost task or GC OOM when below cross join happen. The left big table is ~1.2G i

[jira] [Updated] (SPARK-10484) [Spark SQL] Come across lost task(timeout) or GC OOM error when cross join happen

2015-09-08 Thread Yi Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yi Zhou updated SPARK-10484: Issue Type: Improvement (was: Bug) > [Spark SQL] Come across lost task(timeout) or GC OOM error when cros

[jira] [Updated] (SPARK-10484) [Spark SQL] Come across lost task(timeout) or GC OOM error when cross join happen

2015-09-08 Thread Yi Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yi Zhou updated SPARK-10484: Issue Type: Bug (was: Improvement) > [Spark SQL] Come across lost task(timeout) or GC OOM error when cros

[jira] [Commented] (SPARK-10484) [Spark SQL] Come across lost task(timeout) or GC OOM error when two tables do cross join

2015-09-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14734389#comment-14734389 ] Apache Spark commented on SPARK-10484: -- User 'chenghao-intel' has created a pull req

[jira] [Updated] (SPARK-10484) [Spark SQL] Come across lost task(timeout) or GC OOM error when two tables do cross join

2015-09-08 Thread Yi Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yi Zhou updated SPARK-10484: Summary: [Spark SQL] Come across lost task(timeout) or GC OOM error when two tables do cross join (was: [

[jira] [Updated] (SPARK-10484) [Spark SQL] Come across lost task(timeout) or GC OOM error when two table do cross join

2015-09-08 Thread Yi Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yi Zhou updated SPARK-10484: Summary: [Spark SQL] Come across lost task(timeout) or GC OOM error when two table do cross join (was: [S

[jira] [Assigned] (SPARK-10484) [Spark SQL] Come across lost task(timeout) or GC OOM error when two tables do cross join

2015-09-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10484: Assignee: Apache Spark > [Spark SQL] Come across lost task(timeout) or GC OOM error when

[jira] [Assigned] (SPARK-10484) [Spark SQL] Come across lost task(timeout) or GC OOM error when two tables do cross join

2015-09-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10484: Assignee: (was: Apache Spark) > [Spark SQL] Come across lost task(timeout) or GC OOM

[jira] [Commented] (SPARK-10484) [Spark SQL] Come across lost task(timeout) or GC OOM error when two tables do cross join

2015-09-08 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14734395#comment-14734395 ] Cheng Hao commented on SPARK-10484: --- In cartesian produce implementation, there is 2 le

[jira] [Created] (SPARK-10485) IF expression is not correctly resolved when one of the options have NullType

2015-09-08 Thread Antonio Jesus Navarro (JIRA)
Antonio Jesus Navarro created SPARK-10485: - Summary: IF expression is not correctly resolved when one of the options have NullType Key: SPARK-10485 URL: https://issues.apache.org/jira/browse/SPARK-10485

[jira] [Resolved] (SPARK-10483) spark-submit can not support symbol link

2015-09-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-10483. --- Resolution: Duplicate Please have a look at https://cwiki.apache.org/confluence/display/SPARK/Contri

[jira] [Updated] (SPARK-10481) SPARK_PREPEND_CLASSES make spark-yarn related jar could not be found

2015-09-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10481: -- Priority: Minor (was: Major) > SPARK_PREPEND_CLASSES make spark-yarn related jar could not be found >

[jira] [Commented] (SPARK-3369) Java mapPartitions Iterator->Iterable is inconsistent with Scala's Iterator->Iterator

2015-09-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14734495#comment-14734495 ] Sean Owen commented on SPARK-3369: -- I don't think there's a "why" -- just hasn't been don

[jira] [Commented] (SPARK-10479) LogisticRegression copy should copy model summary if available

2015-09-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14734504#comment-14734504 ] Sean Owen commented on SPARK-10479: --- Seems OK, but this seems so logically related to S

[jira] [Commented] (SPARK-9610) Class and instance weighting for ML

2015-09-08 Thread Nickolay Yakushev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14734556#comment-14734556 ] Nickolay Yakushev commented on SPARK-9610: -- 1. Is basic statistics a good candida

[jira] [Commented] (SPARK-5421) SparkSql throw OOM at shuffle

2015-09-08 Thread Romi Kuntsman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14734561#comment-14734561 ] Romi Kuntsman commented on SPARK-5421: -- does this still happen on the latest version?

[jira] [Reopened] (SPARK-6350) Make mesosExecutorCores configurable in mesos "fine-grained" mode

2015-09-08 Thread Iulian Dragos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Iulian Dragos reopened SPARK-6350: -- I'm re-opening this, since in the meantime this regressed. See changes in d86bbb, which regressed i

[jira] [Comment Edited] (SPARK-6350) Make mesosExecutorCores configurable in mesos "fine-grained" mode

2015-09-08 Thread Iulian Dragos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14734575#comment-14734575 ] Iulian Dragos edited comment on SPARK-6350 at 9/8/15 10:26 AM: -

[jira] [Commented] (SPARK-10479) LogisticRegression copy should copy model summary if available

2015-09-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14734606#comment-14734606 ] Sean Owen commented on SPARK-10479: --- This is already being fixed in https://github.com/

[jira] [Updated] (SPARK-10480) ML.LinearRegressionModel.copy() can not use argument "extra"

2015-09-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10480: -- Assignee: Yanbo Liang > ML.LinearRegressionModel.copy() can not use argument "extra" >

[jira] [Updated] (SPARK-10479) LogisticRegression copy should copy model summary if available

2015-09-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10479: -- Assignee: Yanbo Liang > LogisticRegression copy should copy model summary if available > --

[jira] [Commented] (SPARK-10288) Add a rest client for Spark on Yarn

2015-09-08 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14734637#comment-14734637 ] Steve Loughran commented on SPARK-10288: The long-haul filesystem communications

[jira] [Commented] (SPARK-6350) Make mesosExecutorCores configurable in mesos "fine-grained" mode

2015-09-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14734706#comment-14734706 ] Apache Spark commented on SPARK-6350: - User 'dragos' has created a pull request for th

[jira] [Updated] (SPARK-10484) [Spark SQL] Come across lost task(timeout) or GC OOM error when two tables do cross join

2015-09-08 Thread Yi Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yi Zhou updated SPARK-10484: Description: Found that it lost task or GC OOM when below cross join happen. The left big table is ~1.2G i

[jira] [Updated] (SPARK-10484) [Spark SQL] Come across lost task(timeout) or GC OOM error when two tables do cross join

2015-09-08 Thread Yi Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yi Zhou updated SPARK-10484: Description: Found that it lost task or GC OOM when below cross join happen. The left big table is ~1.2G i

[jira] [Commented] (SPARK-5791) [Spark SQL] show poor performance when multiple table do join operation

2015-09-08 Thread Yi Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14734721#comment-14734721 ] Yi Zhou commented on SPARK-5791: [~yhuai], Yes. Thank you ! > [Spark SQL] show poor perfo

[jira] [Created] (SPARK-10486) Spark intermittently fails to recover from a worker failure (in standalone mode)

2015-09-08 Thread Cheuk Lam (JIRA)
Cheuk Lam created SPARK-10486: - Summary: Spark intermittently fails to recover from a worker failure (in standalone mode) Key: SPARK-10486 URL: https://issues.apache.org/jira/browse/SPARK-10486 Project: S

[jira] [Updated] (SPARK-10486) Spark intermittently fails to recover from a worker failure (in standalone mode)

2015-09-08 Thread Cheuk Lam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheuk Lam updated SPARK-10486: -- Description: We have run into a problem where some Spark job is aborted after one worker is killed in

[jira] [Updated] (SPARK-10486) Spark intermittently fails to recover from a worker failure (in standalone mode)

2015-09-08 Thread Cheuk Lam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheuk Lam updated SPARK-10486: -- Description: We have run into a problem where some Spark job is aborted after one worker is killed in

[jira] [Commented] (SPARK-10467) Vector is converted to tuple when extracted from Row using __getitem__

2015-09-08 Thread Alexey Grishchenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14734780#comment-14734780 ] Alexey Grishchenko commented on SPARK-10467: Issue is not reproduced on maste

[jira] [Updated] (SPARK-10486) Spark intermittently fails to recover from a worker failure (in standalone mode)

2015-09-08 Thread Cheuk Lam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheuk Lam updated SPARK-10486: -- Description: We have run into a problem where some Spark job is aborted after one worker is killed in

[jira] [Commented] (SPARK-4940) Support more evenly distributing cores for Mesos mode

2015-09-08 Thread Martin Tapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14734792#comment-14734792 ] Martin Tapp commented on SPARK-4940: I see your point and thinking about it, round-rob

[jira] [Commented] (SPARK-6101) Create a SparkSQL DataSource API implementation for DynamoDB

2015-09-08 Thread Rustam Aliyev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14734830#comment-14734830 ] Rustam Aliyev commented on SPARK-6101: -- What's the status of this? GH repo has not be

[jira] [Comment Edited] (SPARK-6101) Create a SparkSQL DataSource API implementation for DynamoDB

2015-09-08 Thread Rustam Aliyev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14734830#comment-14734830 ] Rustam Aliyev edited comment on SPARK-6101 at 9/8/15 1:58 PM: --

[jira] [Resolved] (SPARK-9170) ORC data source creates a schema with lowercase table names

2015-09-08 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-9170. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 7520 [https://github.com/

[jira] [Comment Edited] (SPARK-9834) Normal equation solver for ordinary least squares

2015-09-08 Thread Debasish Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14734170#comment-14734170 ] Debasish Das edited comment on SPARK-9834 at 9/8/15 3:18 PM: -

[jira] [Created] (SPARK-10487) MLlib model fitting causes DataFrame write to break with OutOfMemory exception

2015-09-08 Thread Zoltan Toth (JIRA)
Zoltan Toth created SPARK-10487: --- Summary: MLlib model fitting causes DataFrame write to break with OutOfMemory exception Key: SPARK-10487 URL: https://issues.apache.org/jira/browse/SPARK-10487 Project:

[jira] [Commented] (SPARK-9708) Spark should create local temporary directories in Mesos sandbox when launched with Mesos

2015-09-08 Thread Iulian Dragos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735011#comment-14735011 ] Iulian Dragos commented on SPARK-9708: -- This won't work when the external shuffle ser

[jira] [Commented] (SPARK-10479) LogisticRegression copy should copy model summary if available

2015-09-08 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735053#comment-14735053 ] Feynman Liang commented on SPARK-10479: --- [~lravindr] Sorry, I didn't know that some

[jira] [Commented] (SPARK-9715) Store numFeatures in all ML PredictionModel types

2015-09-08 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735065#comment-14735065 ] Seth Hendrickson commented on SPARK-9715: - I can take this one if no one else has

[jira] [Created] (SPARK-10488) No longer possible to create SparkConf in pyspark application

2015-09-08 Thread Brad Willard (JIRA)
Brad Willard created SPARK-10488: Summary: No longer possible to create SparkConf in pyspark application Key: SPARK-10488 URL: https://issues.apache.org/jira/browse/SPARK-10488 Project: Spark

[jira] [Commented] (SPARK-10488) No longer possible to create SparkConf in pyspark application

2015-09-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735131#comment-14735131 ] Sean Owen commented on SPARK-10488: --- FWIW I have been using ipython + pyspark successfu

[jira] [Commented] (SPARK-10488) No longer possible to create SparkConf in pyspark application

2015-09-08 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735141#comment-14735141 ] Brad Willard commented on SPARK-10488: -- [~srowen] I have it working via that method

[jira] [Comment Edited] (SPARK-10488) No longer possible to create SparkConf in pyspark application

2015-09-08 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735141#comment-14735141 ] Brad Willard edited comment on SPARK-10488 at 9/8/15 4:54 PM: -

[jira] [Commented] (SPARK-10488) No longer possible to create SparkConf in pyspark application

2015-09-08 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735159#comment-14735159 ] Brad Willard commented on SPARK-10488: -- So I have a comical workaround now. I can le

[jira] [Comment Edited] (SPARK-10488) No longer possible to create SparkConf in pyspark application

2015-09-08 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735159#comment-14735159 ] Brad Willard edited comment on SPARK-10488 at 9/8/15 5:03 PM: -

[jira] [Commented] (SPARK-10488) No longer possible to create SparkConf in pyspark application

2015-09-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735197#comment-14735197 ] Sean Owen commented on SPARK-10488: --- How about using YARN with dynamic allocation? at l

[jira] [Created] (SPARK-10489) GraphX dataframe wrapper

2015-09-08 Thread Feynman Liang (JIRA)
Feynman Liang created SPARK-10489: - Summary: GraphX dataframe wrapper Key: SPARK-10489 URL: https://issues.apache.org/jira/browse/SPARK-10489 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-10489) GraphX dataframe wrapper

2015-09-08 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735202#comment-14735202 ] Feynman Liang commented on SPARK-10489: --- Working on this > GraphX dataframe wrappe

[jira] [Closed] (SPARK-10489) GraphX dataframe wrapper

2015-09-08 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Feynman Liang closed SPARK-10489. - Resolution: Won't Fix Doing this in a separate spark package (https://github.com/databricks/spar

[jira] [Created] (SPARK-10490) Consolidate the Cholesky solvers in WeightedLeastSquares and ALS

2015-09-08 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-10490: - Summary: Consolidate the Cholesky solvers in WeightedLeastSquares and ALS Key: SPARK-10490 URL: https://issues.apache.org/jira/browse/SPARK-10490 Project: Spark

[jira] [Created] (SPARK-10491) move RowMatrix.dspr to BLAS

2015-09-08 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-10491: - Summary: move RowMatrix.dspr to BLAS Key: SPARK-10491 URL: https://issues.apache.org/jira/browse/SPARK-10491 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-9690) Add random seed Param to PySpark CrossValidator

2015-09-08 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9690: - Shepherd: Joseph K. Bradley > Add random seed Param to PySpark CrossValidator > --

[jira] [Resolved] (SPARK-10479) LogisticRegression copy should copy model summary if available

2015-09-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10479. --- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 Target Ver

[jira] [Resolved] (SPARK-10480) ML.LinearRegressionModel.copy() can not use argument "extra"

2015-09-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10480. --- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 Target Ver

[jira] [Updated] (SPARK-9690) Add random seed Param to CrossValidator

2015-09-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9690: - Summary: Add random seed Param to CrossValidator (was: Add random seed Param to PySpark CrossVali

[jira] [Commented] (SPARK-3369) Java mapPartitions Iterator->Iterable is inconsistent with Scala's Iterator->Iterator

2015-09-08 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735323#comment-14735323 ] Nicholas Chammas commented on SPARK-3369: - Sean said: {quote} I don't think there

[jira] [Updated] (SPARK-9694) Add random seed Param to Scala CrossValidator

2015-09-08 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9694: - Summary: Add random seed Param to Scala CrossValidator (was: Add random seed Param to Cro

[jira] [Updated] (SPARK-9690) Add random seed Param to PySpark CrossValidator

2015-09-08 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9690: - Summary: Add random seed Param to PySpark CrossValidator (was: Add random seed Param to C

[jira] [Updated] (SPARK-9690) Add random seed Param to CrossValidator

2015-09-08 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9690: - Component/s: PySpark > Add random seed Param to CrossValidator > -

[jira] [Closed] (SPARK-4752) Classifier based on artificial neural network

2015-09-08 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexander Ulanov closed SPARK-4752. --- Resolution: Fixed Fix Version/s: 1.5.0 > Classifier based on artificial neural network

[jira] [Commented] (SPARK-8632) Poor Python UDF performance because of RDD caching

2015-09-08 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735392#comment-14735392 ] Davies Liu commented on SPARK-8632: --- [~rxin] As [~justin.uang] suggested before, the bat

[jira] [Commented] (SPARK-10309) Some tasks failed with Unable to acquire memory

2015-09-08 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735401#comment-14735401 ] Davies Liu commented on SPARK-10309: [~nadenf] In my case, the job finally finished (

[jira] [Commented] (SPARK-8632) Poor Python UDF performance because of RDD caching

2015-09-08 Thread Justin Uang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735406#comment-14735406 ] Justin Uang commented on SPARK-8632: Davies, what do you mean by upstream? I didn't qu

[jira] [Commented] (SPARK-9435) Java UDFs don't work with GROUP BY expressions

2015-09-08 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735416#comment-14735416 ] Michael Armbrust commented on SPARK-9435: - >From a quick glance, the problem is li

[jira] [Resolved] (SPARK-10316) respect non-deterministic expressions in PhysicalOperation

2015-09-08 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-10316. -- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8486 [http

[jira] [Commented] (SPARK-10441) Cannot write timestamp to JSON

2015-09-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735454#comment-14735454 ] Apache Spark commented on SPARK-10441: -- User 'yhuai' has created a pull request for

[jira] [Resolved] (SPARK-10470) ml.IsotonicRegressionModel.copy did not set parent

2015-09-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10470. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8637 [https://gi

[jira] [Updated] (SPARK-10470) ml.IsotonicRegressionModel.copy did not set parent

2015-09-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10470: -- Target Version/s: 1.6.0, 1.5.1 > ml.IsotonicRegressionModel.copy did not set parent > -

[jira] [Updated] (SPARK-10470) ml.IsotonicRegressionModel.copy did not set parent

2015-09-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10470: -- Assignee: Yanbo Liang > ml.IsotonicRegressionModel.copy did not set parent > --

[jira] [Created] (SPARK-10492) Update Streaming documentation about rate limiting and backpressure

2015-09-08 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-10492: - Summary: Update Streaming documentation about rate limiting and backpressure Key: SPARK-10492 URL: https://issues.apache.org/jira/browse/SPARK-10492 Project: Spark

[jira] [Assigned] (SPARK-10492) Update Streaming documentation about rate limiting and backpressure

2015-09-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10492: Assignee: Apache Spark (was: Tathagata Das) > Update Streaming documentation about rate l

[jira] [Assigned] (SPARK-10492) Update Streaming documentation about rate limiting and backpressure

2015-09-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10492: Assignee: Tathagata Das (was: Apache Spark) > Update Streaming documentation about rate l

[jira] [Commented] (SPARK-10492) Update Streaming documentation about rate limiting and backpressure

2015-09-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735510#comment-14735510 ] Apache Spark commented on SPARK-10492: -- User 'tdas' has created a pull request for t

[jira] [Updated] (SPARK-10470) ml.IsotonicRegressionModel.copy did not set parent

2015-09-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10470: -- Fix Version/s: 1.5.1 > ml.IsotonicRegressionModel.copy did not set parent > ---

[jira] [Commented] (SPARK-10373) Move @since annotator to pyspark to be shared by all components

2015-09-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735520#comment-14735520 ] Xiangrui Meng commented on SPARK-10373: --- No, this is for 1.6. > Move @since annota

[jira] [Created] (SPARK-10493) reduceByKey not returning distinct results

2015-09-08 Thread Glenn Strycker (JIRA)
Glenn Strycker created SPARK-10493: -- Summary: reduceByKey not returning distinct results Key: SPARK-10493 URL: https://issues.apache.org/jira/browse/SPARK-10493 Project: Spark Issue Type: Bu

[jira] [Commented] (SPARK-2620) case class cannot be used as key for reduce

2015-09-08 Thread Glenn Strycker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735557#comment-14735557 ] Glenn Strycker commented on SPARK-2620: --- I am finding similar behavior for a non-cas

[jira] [Closed] (SPARK-6101) Create a SparkSQL DataSource API implementation for DynamoDB

2015-09-08 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-6101. -- Resolution: Won't Fix Assignee: (was: Chris Fregly) Fix Version/s: (was: 1.6.0)

[jira] [Updated] (SPARK-6101) Create a SparkSQL DataSource API implementation for DynamoDB

2015-09-08 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6101: --- Affects Version/s: (was: 1.2.0) > Create a SparkSQL DataSource API implementation for DynamoDB > -

[jira] [Assigned] (SPARK-10373) Move @since annotator to pyspark to be shared by all components

2015-09-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10373: Assignee: Davies Liu (was: Apache Spark) > Move @since annotator to pyspark to be shared

[jira] [Commented] (SPARK-10373) Move @since annotator to pyspark to be shared by all components

2015-09-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735581#comment-14735581 ] Apache Spark commented on SPARK-10373: -- User 'davies' has created a pull request for

[jira] [Assigned] (SPARK-10373) Move @since annotator to pyspark to be shared by all components

2015-09-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10373: Assignee: Apache Spark (was: Davies Liu) > Move @since annotator to pyspark to be shared

[jira] [Commented] (SPARK-10493) reduceByKey not returning distinct results

2015-09-08 Thread Frank Rosner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735598#comment-14735598 ] Frank Rosner commented on SPARK-10493: -- Thanks for submitting the issue, [~glenn.str

[jira] [Resolved] (SPARK-10441) Cannot write timestamp to JSON

2015-09-08 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-10441. -- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 Issue resolved b

[jira] [Commented] (SPARK-10493) reduceByKey not returning distinct results

2015-09-08 Thread Glenn Strycker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735626#comment-14735626 ] Glenn Strycker commented on SPARK-10493: Thanks for the speedy follow-up, [~frosn

[jira] [Commented] (SPARK-10482) Add Python interface for CountVectorizer

2015-09-08 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10482?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735639#comment-14735639 ] holdenk commented on SPARK-10482: - This seems to duplicate https://issues.apache.org/jira

[jira] [Commented] (SPARK-8632) Poor Python UDF performance because of RDD caching

2015-09-08 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735644#comment-14735644 ] Davies Liu commented on SPARK-8632: --- The upstream means child of current SparkPlan, coul

[jira] [Commented] (SPARK-10493) reduceByKey not returning distinct results

2015-09-08 Thread Glenn Strycker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735653#comment-14735653 ] Glenn Strycker commented on SPARK-10493: Note: this only seems to be occurring "a

[jira] [Assigned] (SPARK-9014) Allow Python spark API to use built-in exponential operator

2015-09-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9014: --- Assignee: (was: Apache Spark) > Allow Python spark API to use built-in exponential operat

[jira] [Assigned] (SPARK-9014) Allow Python spark API to use built-in exponential operator

2015-09-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9014: --- Assignee: Apache Spark > Allow Python spark API to use built-in exponential operator > --

[jira] [Commented] (SPARK-9014) Allow Python spark API to use built-in exponential operator

2015-09-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735664#comment-14735664 ] Apache Spark commented on SPARK-9014: - User '0x0FFF' has created a pull request for th

[jira] [Commented] (SPARK-10442) select cast('false' as boolean) returns true

2015-09-08 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14735668#comment-14735668 ] Yin Huai commented on SPARK-10442: -- [~lian cheng] Looks like postgresql support more str

[jira] [Updated] (SPARK-10468) Verify schema before Dataframe select API call

2015-09-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10468: -- Assignee: Vinod KC > Verify schema before Dataframe select API call > -

  1   2   3   >