[jira] [Commented] (SPARK-15747) Support SPARK_CONF_DIR/spark-defaults.d/*.conf drop-in style config files

2016-06-02 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313758#comment-15313758 ] Jeff Zhang commented on SPARK-15747: [~tmoschou] What problem do you meet for spark d

[jira] [Commented] (SPARK-15740) Word2VecSuite "big model load / save" caused OOM in maven jenkins builds

2016-06-02 Thread Antonio Murgia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313754#comment-15313754 ] Antonio Murgia commented on SPARK-15740: Looking into it right now. > Word2VecSu

[jira] [Commented] (SPARK-15751) Add generateAssociationRules in fpm in pyspark

2016-06-02 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313739#comment-15313739 ] Jeff Zhang commented on SPARK-15751: working on it . > Add generateAssociationRules

[jira] [Created] (SPARK-15751) Add generateAssociationRules in fpm in pyspark

2016-06-02 Thread Jeff Zhang (JIRA)
Jeff Zhang created SPARK-15751: -- Summary: Add generateAssociationRules in fpm in pyspark Key: SPARK-15751 URL: https://issues.apache.org/jira/browse/SPARK-15751 Project: Spark Issue Type: Improv

[jira] [Comment Edited] (SPARK-14811) ML, Graph 2.0 QA: API: New Scala APIs, docs

2016-06-02 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313736#comment-15313736 ] Yanbo Liang edited comment on SPARK-14811 at 6/3/16 6:39 AM: -

[jira] [Commented] (SPARK-14811) ML, Graph 2.0 QA: API: New Scala APIs, docs

2016-06-02 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313736#comment-15313736 ] Yanbo Liang commented on SPARK-14811: - I think we should have {@Since} for all prams

[jira] [Assigned] (SPARK-15750) Constructing FPGrowth fails when no numPartitions specified in pyspark

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15750: Assignee: Apache Spark > Constructing FPGrowth fails when no numPartitions specified in py

[jira] [Assigned] (SPARK-15750) Constructing FPGrowth fails when no numPartitions specified in pyspark

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15750: Assignee: (was: Apache Spark) > Constructing FPGrowth fails when no numPartitions spec

[jira] [Commented] (SPARK-15750) Constructing FPGrowth fails when no numPartitions specified in pyspark

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313732#comment-15313732 ] Apache Spark commented on SPARK-15750: -- User 'zjffdu' has created a pull request for

[jira] [Updated] (SPARK-15750) Constructing FPGrowth fails when no numPartitions specified in pyspark

2016-06-02 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated SPARK-15750: --- Summary: Constructing FPGrowth fails when no numPartitions specified in pyspark (was: Constructing F

[jira] [Updated] (SPARK-15750) Constructing FPGrowth fails when no numPartitions specified in pyspark

2016-06-02 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated SPARK-15750: --- Component/s: PySpark > Constructing FPGrowth fails when no numPartitions specified in pyspark > -

[jira] [Created] (SPARK-15750) Constructing FPGrowth fails when no numPartitions specified

2016-06-02 Thread Jeff Zhang (JIRA)
Jeff Zhang created SPARK-15750: -- Summary: Constructing FPGrowth fails when no numPartitions specified Key: SPARK-15750 URL: https://issues.apache.org/jira/browse/SPARK-15750 Project: Spark Issu

[jira] [Assigned] (SPARK-15749) Make the error message more meaningful

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15749: Assignee: (was: Apache Spark) > Make the error message more meaningful > -

[jira] [Commented] (SPARK-15749) Make the error message more meaningful

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15749?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313717#comment-15313717 ] Apache Spark commented on SPARK-15749: -- User 'huaxingao' has created a pull request

[jira] [Assigned] (SPARK-15749) Make the error message more meaningful

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15749: Assignee: Apache Spark > Make the error message more meaningful >

[jira] [Updated] (SPARK-15749) Make the error message more meaningful

2016-06-02 Thread Huaxin Gao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Huaxin Gao updated SPARK-15749: --- Description: For table test1 (C1 varchar (10), C2 varchar (10)), when I insert a row using sqlConte

[jira] [Created] (SPARK-15749) Make the error message more meaningful

2016-06-02 Thread Huaxin Gao (JIRA)
Huaxin Gao created SPARK-15749: -- Summary: Make the error message more meaningful Key: SPARK-15749 URL: https://issues.apache.org/jira/browse/SPARK-15749 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-15748) Replace inefficient foldLeft() call in PartitionStatistics

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313619#comment-15313619 ] Apache Spark commented on SPARK-15748: -- User 'JoshRosen' has created a pull request

[jira] [Resolved] (SPARK-14959) ​Problem Reading partitioned ORC or Parquet files

2016-06-02 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-14959. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13463 [https://github.

[jira] [Updated] (SPARK-14959) ​Problem Reading partitioned ORC or Parquet files

2016-06-02 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-14959: --- Assignee: Xin Wu > ​Problem Reading partitioned ORC or Parquet files > --

[jira] [Assigned] (SPARK-15748) Replace inefficient foldLeft() call in PartitionStatistics

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15748: Assignee: Josh Rosen (was: Apache Spark) > Replace inefficient foldLeft() call in Partiti

[jira] [Assigned] (SPARK-15748) Replace inefficient foldLeft() call in PartitionStatistics

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15748: Assignee: Apache Spark (was: Josh Rosen) > Replace inefficient foldLeft() call in Partiti

[jira] [Commented] (SPARK-14146) Imported implicits can't be found in Spark REPL in some cases

2016-06-02 Thread Prashant Sharma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14146?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313618#comment-15313618 ] Prashant Sharma commented on SPARK-14146: - Thanks for the reproducers, I am looki

[jira] [Resolved] (SPARK-15733) Makes the explain output less verbose by hiding some verbose output like None, null, empty List, and etc..

2016-06-02 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-15733. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13470 [https://github.

[jira] [Updated] (SPARK-15733) Makes the explain output less verbose by hiding some verbose output like None, null, empty List, and etc..

2016-06-02 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-15733: --- Assignee: Sean Zhong > Makes the explain output less verbose by hiding some verbose output like > No

[jira] [Created] (SPARK-15748) Replace inefficient foldLeft() call in PartitionStatistics

2016-06-02 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-15748: -- Summary: Replace inefficient foldLeft() call in PartitionStatistics Key: SPARK-15748 URL: https://issues.apache.org/jira/browse/SPARK-15748 Project: Spark Issue

[jira] [Updated] (SPARK-15747) Support SPARK_CONF_DIR/spark-defaults.d/*.conf drop-in style config files

2016-06-02 Thread Terry Moschou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Terry Moschou updated SPARK-15747: -- Description: Feature request to automatically source all files in {{SPARK_CONF_DIR/spark-defaul

[jira] [Created] (SPARK-15747) Support SPARK_CONF_DIR/spark-defaults.d/*.conf drop-in style config files

2016-06-02 Thread Terry Moschou (JIRA)
Terry Moschou created SPARK-15747: - Summary: Support SPARK_CONF_DIR/spark-defaults.d/*.conf drop-in style config files Key: SPARK-15747 URL: https://issues.apache.org/jira/browse/SPARK-15747 Project:

[jira] [Updated] (SPARK-15724) Add benchmarks for performance over wide schemas

2016-06-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-15724: --- Assignee: Eric Liang > Add benchmarks for performance over wide schemas > ---

[jira] [Resolved] (SPARK-15724) Add benchmarks for performance over wide schemas

2016-06-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-15724. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13456 [https://github.

[jira] [Commented] (SPARK-15723) SimpleDateParamSuite test is locale-fragile and relies on deprecated short TZ name

2016-06-02 Thread Brett Randall (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313479#comment-15313479 ] Brett Randall commented on SPARK-15723: --- Yes the fix is to the test only. I looked

[jira] [Updated] (SPARK-15746) SchemaUtils.checkColumnType with VectorUDT prints instance details in error message

2016-06-02 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-15746: --- Summary: SchemaUtils.checkColumnType with VectorUDT prints instance details in error message

[jira] [Created] (SPARK-15746) SchemaUtils.checkColumnType with VectorUDT prints instance details

2016-06-02 Thread Nick Pentreath (JIRA)
Nick Pentreath created SPARK-15746: -- Summary: SchemaUtils.checkColumnType with VectorUDT prints instance details Key: SPARK-15746 URL: https://issues.apache.org/jira/browse/SPARK-15746 Project: Spark

[jira] [Commented] (SPARK-15722) Wrong data when CTAS specifies schema

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313438#comment-15313438 ] Apache Spark commented on SPARK-15722: -- User 'andrewor14' has created a pull request

[jira] [Resolved] (SPARK-15732) Dataset generated code "generated.java" Fails with Certain Case Classes

2016-06-02 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-15732. Resolution: Fixed Fix Version/s: 2.0.0 Resolved by https://github.com/apache/spark/pull/1348

[jira] [Updated] (SPARK-15732) Dataset generated code "generated.java" Fails with Certain Case Classes

2016-06-02 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-15732: --- Assignee: Wenchen Fan > Dataset generated code "generated.java" Fails with Certain Case Classes > ---

[jira] [Assigned] (SPARK-15723) SimpleDateParamSuite test is locale-fragile and relies on deprecated short TZ name

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15723: Assignee: Apache Spark > SimpleDateParamSuite test is locale-fragile and relies on depreca

[jira] [Assigned] (SPARK-15723) SimpleDateParamSuite test is locale-fragile and relies on deprecated short TZ name

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15723: Assignee: (was: Apache Spark) > SimpleDateParamSuite test is locale-fragile and relies

[jira] [Commented] (SPARK-15723) SimpleDateParamSuite test is locale-fragile and relies on deprecated short TZ name

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313427#comment-15313427 ] Apache Spark commented on SPARK-15723: -- User 'javabrett' has created a pull request

[jira] [Assigned] (SPARK-15745) Use classloader's getResource() for reading resource files in HiveTests

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15745: Assignee: Apache Spark > Use classloader's getResource() for reading resource files in Hiv

[jira] [Commented] (SPARK-15745) Use classloader's getResource() for reading resource files in HiveTests

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313421#comment-15313421 ] Apache Spark commented on SPARK-15745: -- User 'sameeragarwal' has created a pull requ

[jira] [Assigned] (SPARK-15745) Use classloader's getResource() for reading resource files in HiveTests

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15745: Assignee: (was: Apache Spark) > Use classloader's getResource() for reading resource f

[jira] [Created] (SPARK-15745) Use classloader's getResource() for reading resource files in HiveTests

2016-06-02 Thread Sameer Agarwal (JIRA)
Sameer Agarwal created SPARK-15745: -- Summary: Use classloader's getResource() for reading resource files in HiveTests Key: SPARK-15745 URL: https://issues.apache.org/jira/browse/SPARK-15745 Project:

[jira] [Resolved] (SPARK-15736) Gracefully handle loss of DiskStore files

2016-06-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-15736. --- Resolution: Fixed Fix Version/s: 2.0.0 1.6.2 > Gracefully handle loss of Di

[jira] [Assigned] (SPARK-15744) Rename two TungstenAggregation*Suites and update error messages/comments

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15744: Assignee: (was: Apache Spark) > Rename two TungstenAggregation*Suites and update error

[jira] [Commented] (SPARK-15744) Rename two TungstenAggregation*Suites and update error messages/comments

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15744?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313405#comment-15313405 ] Apache Spark commented on SPARK-15744: -- User 'dongjoon-hyun' has created a pull requ

[jira] [Assigned] (SPARK-15744) Rename two TungstenAggregation*Suites and update error messages/comments

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15744: Assignee: Apache Spark > Rename two TungstenAggregation*Suites and update error messages/c

[jira] [Comment Edited] (SPARK-13868) Random forest accuracy exploration

2016-06-02 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313400#comment-15313400 ] Xusen Yin edited comment on SPARK-13868 at 6/3/16 12:40 AM: [

[jira] [Resolved] (SPARK-15718) better error message for writing bucketing data

2016-06-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-15718. --- Resolution: Fixed Fix Version/s: 2.0.0 > better error message for writing bucketing data > ---

[jira] [Updated] (SPARK-15744) Rename two TungstenAggregation*Suites and update error messages/comments

2016-06-02 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-15744: -- Description: For consistency, this issue updates some remaining `TungstenAggregation/SortBased

[jira] [Commented] (SPARK-13868) Random forest accuracy exploration

2016-06-02 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313400#comment-15313400 ] Xusen Yin commented on SPARK-13868: --- [~josephkb] [~tanwanirahul] Here is what I found:

[jira] [Created] (SPARK-15744) Rename two TungstenAggregation*Suites and update error messages/comments

2016-06-02 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-15744: - Summary: Rename two TungstenAggregation*Suites and update error messages/comments Key: SPARK-15744 URL: https://issues.apache.org/jira/browse/SPARK-15744 Project: S

[jira] [Assigned] (SPARK-15743) Prevent saving with all-column partitioning

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15743: Assignee: (was: Apache Spark) > Prevent saving with all-column partitioning >

[jira] [Commented] (SPARK-15743) Prevent saving with all-column partitioning

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313381#comment-15313381 ] Apache Spark commented on SPARK-15743: -- User 'dongjoon-hyun' has created a pull requ

[jira] [Assigned] (SPARK-15743) Prevent saving with all-column partitioning

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15743: Assignee: Apache Spark > Prevent saving with all-column partitioning > ---

[jira] [Commented] (SPARK-15741) PySpark Cleanup of _setDefault with seed=None

2016-06-02 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313376#comment-15313376 ] Bryan Cutler commented on SPARK-15741: -- >From what I gathered, explicitly setting a

[jira] [Created] (SPARK-15743) Prevent saving with all-column partitioning

2016-06-02 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-15743: - Summary: Prevent saving with all-column partitioning Key: SPARK-15743 URL: https://issues.apache.org/jira/browse/SPARK-15743 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-15732) Dataset generated code "generated.java" Fails with Certain Case Classes

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15732: Assignee: Apache Spark > Dataset generated code "generated.java" Fails with Certain Case C

[jira] [Assigned] (SPARK-15732) Dataset generated code "generated.java" Fails with Certain Case Classes

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15732: Assignee: (was: Apache Spark) > Dataset generated code "generated.java" Fails with Cer

[jira] [Commented] (SPARK-15742) Reduce collections allocations in Catalyst tree transformation methods

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313355#comment-15313355 ] Apache Spark commented on SPARK-15742: -- User 'JoshRosen' has created a pull request

[jira] [Assigned] (SPARK-15742) Reduce collections allocations in Catalyst tree transformation methods

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15742: Assignee: Josh Rosen (was: Apache Spark) > Reduce collections allocations in Catalyst tre

[jira] [Assigned] (SPARK-15742) Reduce collections allocations in Catalyst tree transformation methods

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15742: Assignee: Apache Spark (was: Josh Rosen) > Reduce collections allocations in Catalyst tre

[jira] [Commented] (SPARK-15732) Dataset generated code "generated.java" Fails with Certain Case Classes

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313354#comment-15313354 ] Apache Spark commented on SPARK-15732: -- User 'cloud-fan' has created a pull request

[jira] [Created] (SPARK-15742) Reduce collections allocations in Catalyst tree transformation methods

2016-06-02 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-15742: -- Summary: Reduce collections allocations in Catalyst tree transformation methods Key: SPARK-15742 URL: https://issues.apache.org/jira/browse/SPARK-15742 Project: Spark

[jira] [Commented] (SPARK-15732) Dataset generated code "generated.java" Fails with Certain Case Classes

2016-06-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313330#comment-15313330 ] Reynold Xin commented on SPARK-15732: - Yea we definitely should put a better error me

[jira] [Closed] (SPARK-15741) PySpark Cleanup of _setDefault with seed=None

2016-06-02 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler closed SPARK-15741. Resolution: Invalid Looks like I jumped the gun here, None values are not ignored and seems like s

[jira] [Commented] (SPARK-15732) Dataset generated code "generated.java" Fails with Certain Case Classes

2016-06-02 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313321#comment-15313321 ] Wenchen Fan commented on SPARK-15732: - It's really hard to support it, and I don't th

[jira] [Resolved] (SPARK-15668) ml.feature: update check schema to avoid confusion when user use MLlib.vector as input type

2016-06-02 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath resolved SPARK-15668. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13411 [https:/

[jira] [Assigned] (SPARK-15688) RelationalGroupedDataset.toDF should not add group by expressions that are already added in the aggregate expressions.

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15688: Assignee: Apache Spark > RelationalGroupedDataset.toDF should not add group by expressions

[jira] [Assigned] (SPARK-15688) RelationalGroupedDataset.toDF should not add group by expressions that are already added in the aggregate expressions.

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15688: Assignee: (was: Apache Spark) > RelationalGroupedDataset.toDF should not add group by

[jira] [Commented] (SPARK-15688) RelationalGroupedDataset.toDF should not add group by expressions that are already added in the aggregate expressions.

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313306#comment-15313306 ] Apache Spark commented on SPARK-15688: -- User 'dilipbiswal' has created a pull reques

[jira] [Updated] (SPARK-15741) PySpark Cleanup of _setDefault with seed=None

2016-06-02 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-15741: - Issue Type: Improvement (was: Sub-task) Parent: (was: SPARK-14771) > PySpark Cleanup

[jira] [Created] (SPARK-15741) PySpark Cleanup of _setDefault with seed=None

2016-06-02 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-15741: Summary: PySpark Cleanup of _setDefault with seed=None Key: SPARK-15741 URL: https://issues.apache.org/jira/browse/SPARK-15741 Project: Spark Issue Type: Sub

[jira] [Assigned] (SPARK-15725) Dynamic allocation hangs YARN app when executors time out

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15725: Assignee: (was: Apache Spark) > Dynamic allocation hangs YARN app when executors time

[jira] [Commented] (SPARK-15725) Dynamic allocation hangs YARN app when executors time out

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313285#comment-15313285 ] Apache Spark commented on SPARK-15725: -- User 'rdblue' has created a pull request for

[jira] [Assigned] (SPARK-15725) Dynamic allocation hangs YARN app when executors time out

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15725: Assignee: Apache Spark > Dynamic allocation hangs YARN app when executors time out > -

[jira] [Resolved] (SPARK-15734) Avoids printing internal row in explain output

2016-06-02 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-15734. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13471 [https://github.

[jira] [Updated] (SPARK-15139) PySpark TreeEnsemble missing methods

2016-06-02 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-15139: --- Assignee: holdenk > PySpark TreeEnsemble missing methods > --

[jira] [Resolved] (SPARK-15719) Disable writing Parquet summary files by default

2016-06-02 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-15719. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13455 [https://github.

[jira] [Comment Edited] (SPARK-15705) Spark won't read ORC schema from metastore for partitioned tables

2016-06-02 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313257#comment-15313257 ] Xin Wu edited comment on SPARK-15705 at 6/2/16 11:15 PM: - I can r

[jira] [Commented] (SPARK-15705) Spark won't read ORC schema from metastore for partitioned tables

2016-06-02 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313257#comment-15313257 ] Xin Wu commented on SPARK-15705: I can recreate it now. and will look into it. > Spark

[jira] [Resolved] (SPARK-15139) PySpark TreeEnsemble missing methods

2016-06-02 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath resolved SPARK-15139. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12919 [https:/

[jira] [Resolved] (SPARK-15092) toDebugString missing from ML DecisionTreeClassifier

2016-06-02 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath resolved SPARK-15092. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12919 [https:/

[jira] [Commented] (SPARK-15732) Dataset generated code "generated.java" Fails with Certain Case Classes

2016-06-02 Thread Bo Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313230#comment-15313230 ] Bo Meng commented on SPARK-15732: - There is no easy way to work around this issue since "

[jira] [Commented] (SPARK-15623) 2.0 python coverage ml.feature

2016-06-02 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313226#comment-15313226 ] Bryan Cutler commented on SPARK-15623: -- I took another spin through this and linked

[jira] [Assigned] (SPARK-15738) PySpark ml.feature RFormula missing string representation displaying formula

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15738: Assignee: (was: Apache Spark) > PySpark ml.feature RFormula missing string representat

[jira] [Assigned] (SPARK-15738) PySpark ml.feature RFormula missing string representation displaying formula

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15738: Assignee: Apache Spark > PySpark ml.feature RFormula missing string representation display

[jira] [Commented] (SPARK-15738) PySpark ml.feature RFormula missing string representation displaying formula

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313222#comment-15313222 ] Apache Spark commented on SPARK-15738: -- User 'BryanCutler' has created a pull reques

[jira] [Comment Edited] (SPARK-14811) ML, Graph 2.0 QA: API: New Scala APIs, docs

2016-06-02 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313208#comment-15313208 ] Nick Pentreath edited comment on SPARK-14811 at 6/2/16 10:31 PM: --

[jira] [Comment Edited] (SPARK-14811) ML, Graph 2.0 QA: API: New Scala APIs, docs

2016-06-02 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313208#comment-15313208 ] Nick Pentreath edited comment on SPARK-14811 at 6/2/16 10:31 PM: --

[jira] [Commented] (SPARK-14811) ML, Graph 2.0 QA: API: New Scala APIs, docs

2016-06-02 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313208#comment-15313208 ] Nick Pentreath commented on SPARK-14811: Question on this - we seem to be inconsi

[jira] [Commented] (SPARK-15740) Word2VecSuite "big model load / save" caused OOM in maven jenkins builds

2016-06-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313198#comment-15313198 ] Xiangrui Meng commented on SPARK-15740: --- [~tmnd91] Could you run the test and estim

[jira] [Comment Edited] (SPARK-15740) Word2VecSuite "big model load / save" caused OOM in maven jenkins builds

2016-06-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313198#comment-15313198 ] Xiangrui Meng edited comment on SPARK-15740 at 6/2/16 10:24 PM: ---

[jira] [Commented] (SPARK-15736) Gracefully handle loss of DiskStore files

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313196#comment-15313196 ] Apache Spark commented on SPARK-15736: -- User 'JoshRosen' has created a pull request

[jira] [Updated] (SPARK-15736) Gracefully handle loss of DiskStore files

2016-06-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-15736: --- Target Version/s: 1.6.2, 2.0.0 (was: 2.0.0) > Gracefully handle loss of DiskStore files > --

[jira] [Updated] (SPARK-15736) Gracefully handle loss of DiskStore files

2016-06-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-15736: --- Affects Version/s: 1.6.0 > Gracefully handle loss of DiskStore files > --

[jira] [Assigned] (SPARK-15740) Word2VecSuite "big model load / save" caused OOM in maven jenkins builds

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15740: Assignee: (was: Apache Spark) > Word2VecSuite "big model load / save" caused OOM in ma

[jira] [Commented] (SPARK-15740) Word2VecSuite "big model load / save" caused OOM in maven jenkins builds

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313195#comment-15313195 ] Apache Spark commented on SPARK-15740: -- User 'mengxr' has created a pull request for

[jira] [Assigned] (SPARK-15740) Word2VecSuite "big model load / save" caused OOM in maven jenkins builds

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15740: Assignee: Apache Spark > Word2VecSuite "big model load / save" caused OOM in maven jenkins

[jira] [Commented] (SPARK-15710) Exception with WHERE clause in SQL for non-default Hive database

2016-06-02 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15313193#comment-15313193 ] Xin Wu commented on SPARK-15710: hmm.. after another rebase of the master. it seems that

  1   2   3   >