[jira] [Commented] (SPARK-14525) DataFrameWriter's save method should delegate to jdbc for jdbc datasource

2016-04-12 Thread Justin Pihony (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238621#comment-15238621 ] Justin Pihony commented on SPARK-14525: --- I don't mind putting together a PR for this, however I am

[jira] [Commented] (SPARK-14540) Support Scala 2.12 closures and Java 8 lambdas in ClosureCleaner

2016-04-12 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238602#comment-15238602 ] Josh Rosen commented on SPARK-14540: I found a problem which seems to prevent the cleaning /

[jira] [Created] (SPARK-14592) Create table like

2016-04-12 Thread Yin Huai (JIRA)
Yin Huai created SPARK-14592: Summary: Create table like Key: SPARK-14592 URL: https://issues.apache.org/jira/browse/SPARK-14592 Project: Spark Issue Type: Sub-task Components: SQL

[jira] [Created] (SPARK-14591) DDLParser should accept decimal(precision)

2016-04-12 Thread Yin Huai (JIRA)
Yin Huai created SPARK-14591: Summary: DDLParser should accept decimal(precision) Key: SPARK-14591 URL: https://issues.apache.org/jira/browse/SPARK-14591 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-14127) [Table related commands] Describe table

2016-04-12 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238567#comment-15238567 ] Xiao Li commented on SPARK-14127: - Most of work are duplicate with `show table extended`. Thus,

[jira] [Updated] (SPARK-14586) SparkSQL doesn't parse decimal like Hive

2016-04-12 Thread Stephane Maarek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stephane Maarek updated SPARK-14586: Description: create a test_data.csv with the following {code:none} a, 2.0 ,3.0 {code}

[jira] [Commented] (SPARK-14499) Add tests to make sure drop partitions of an external table will not delete data

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14499?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238552#comment-15238552 ] Apache Spark commented on SPARK-14499: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14499) Add tests to make sure drop partitions of an external table will not delete data

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14499: Assignee: Apache Spark > Add tests to make sure drop partitions of an external table will

[jira] [Assigned] (SPARK-14499) Add tests to make sure drop partitions of an external table will not delete data

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14499: Assignee: (was: Apache Spark) > Add tests to make sure drop partitions of an external

[jira] [Assigned] (SPARK-14590) Update pull request template with link to jira

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14590: Assignee: (was: Apache Spark) > Update pull request template with link to jira >

[jira] [Commented] (SPARK-14590) Update pull request template with link to jira

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238532#comment-15238532 ] Apache Spark commented on SPARK-14590: -- User 'lresende' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14590) Update pull request template with link to jira

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14590: Assignee: Apache Spark > Update pull request template with link to jira >

[jira] [Created] (SPARK-14590) Update pull request template with link to jira

2016-04-12 Thread Luciano Resende (JIRA)
Luciano Resende created SPARK-14590: --- Summary: Update pull request template with link to jira Key: SPARK-14590 URL: https://issues.apache.org/jira/browse/SPARK-14590 Project: Spark Issue

[jira] [Assigned] (SPARK-14589) Enhance DB2 JDBC Dialect docker tests

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14589: Assignee: (was: Apache Spark) > Enhance DB2 JDBC Dialect docker tests >

[jira] [Commented] (SPARK-14589) Enhance DB2 JDBC Dialect docker tests

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238521#comment-15238521 ] Apache Spark commented on SPARK-14589: -- User 'lresende' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14589) Enhance DB2 JDBC Dialect docker tests

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14589: Assignee: Apache Spark > Enhance DB2 JDBC Dialect docker tests >

[jira] [Created] (SPARK-14589) Enhance DB2 JDBC Dialect docker tests

2016-04-12 Thread Luciano Resende (JIRA)
Luciano Resende created SPARK-14589: --- Summary: Enhance DB2 JDBC Dialect docker tests Key: SPARK-14589 URL: https://issues.apache.org/jira/browse/SPARK-14589 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-14311) Model persistence in SparkR

2016-04-12 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238500#comment-15238500 ] Yanbo Liang commented on SPARK-14311: - Sure, I can have a try. Another issue is R `Object` has

[jira] [Created] (SPARK-14588) Consider getting column stats from files (wherever feasible) to get better stats for joins

2016-04-12 Thread Rajesh Balamohan (JIRA)
Rajesh Balamohan created SPARK-14588: Summary: Consider getting column stats from files (wherever feasible) to get better stats for joins Key: SPARK-14588 URL:

[jira] [Created] (SPARK-14587) abstract class Receiver should be explicit about the return type of its methods

2016-04-12 Thread Jacek Laskowski (JIRA)
Jacek Laskowski created SPARK-14587: --- Summary: abstract class Receiver should be explicit about the return type of its methods Key: SPARK-14587 URL: https://issues.apache.org/jira/browse/SPARK-14587

[jira] [Assigned] (SPARK-14441) Consolidate DDL tests

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14441: Assignee: (was: Apache Spark) > Consolidate DDL tests > - > >

[jira] [Commented] (SPARK-14441) Consolidate DDL tests

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238480#comment-15238480 ] Apache Spark commented on SPARK-14441: -- User 'bomeng' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14441) Consolidate DDL tests

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14441: Assignee: Apache Spark > Consolidate DDL tests > - > >

[jira] [Commented] (SPARK-14554) disable whole stage codegen if there are too many input columns

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238469#comment-15238469 ] Apache Spark commented on SPARK-14554: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Commented] (SPARK-14409) Investigate adding a RankingEvaluator to ML

2016-04-12 Thread Yong Tang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238462#comment-15238462 ] Yong Tang commented on SPARK-14409: --- Thanks [~mlnick] for the review. I was planning to add MRR to

[jira] [Updated] (SPARK-14586) SparkSQL doesn't parse decimal like Hive

2016-04-12 Thread Stephane Maarek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stephane Maarek updated SPARK-14586: Description: create a test_data.csv with the following {code:none} a, 2.0 ,3.0 {code}

[jira] [Updated] (SPARK-14586) SparkSQL doesn't parse decimal like Hive

2016-04-12 Thread Stephane Maarek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stephane Maarek updated SPARK-14586: Description: create a test_data.csv with the following {code:none} a, 2.0 ,3.0 {code}

[jira] [Updated] (SPARK-14586) SparkSQL doesn't parse decimal like Hive

2016-04-12 Thread Stephane Maarek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stephane Maarek updated SPARK-14586: Description: create a test_data.csv with the following {code:none} a, 2.0 ,3.0 {code}

[jira] [Updated] (SPARK-14447) Speed up TungstenAggregate w/ keys using AggregateHashMap

2016-04-12 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-14447: --- Summary: Speed up TungstenAggregate w/ keys using AggregateHashMap (was: Integrate

[jira] [Commented] (SPARK-14447) Integrate AggregateHashMap in Aggregates with Keys

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238401#comment-15238401 ] Apache Spark commented on SPARK-14447: -- User 'sameeragarwal' has created a pull request for this

[jira] [Updated] (SPARK-14583) SparkSQL doesn't read hive table properly after MSCK REPAIR

2016-04-12 Thread Stephane Maarek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stephane Maarek updated SPARK-14583: Summary: SparkSQL doesn't read hive table properly after MSCK REPAIR (was: Spark doesn't

[jira] [Updated] (SPARK-14586) SparkSQL doesn't parse decimal like Hive

2016-04-12 Thread Stephane Maarek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stephane Maarek updated SPARK-14586: Description: create a test_data.csv with the following {code:none} a, 2.0 ,3.0 {code}

[jira] [Created] (SPARK-14586) SparkSQL doesn't parse decimal like Hive

2016-04-12 Thread Stephane Maarek (JIRA)
Stephane Maarek created SPARK-14586: --- Summary: SparkSQL doesn't parse decimal like Hive Key: SPARK-14586 URL: https://issues.apache.org/jira/browse/SPARK-14586 Project: Spark Issue Type:

[jira] [Updated] (SPARK-14375) Unit test for spark.ml KMeansSummary

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14375: -- Shepherd: Joseph K. Bradley Assignee: Yanbo Liang Target

[jira] [Created] (SPARK-14585) Provide accessor methods for Pipeline stages

2016-04-12 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-14585: - Summary: Provide accessor methods for Pipeline stages Key: SPARK-14585 URL: https://issues.apache.org/jira/browse/SPARK-14585 Project: Spark Issue

[jira] [Updated] (SPARK-14583) Spark doesn't read hive table properly after MSCK REPAIR

2016-04-12 Thread Stephane Maarek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stephane Maarek updated SPARK-14583: Description: it seems that Spark forgets or fails to read the metadata tblproperties after

[jira] [Updated] (SPARK-14583) Spark doesn't read hive table properly after MSCK REPAIR

2016-04-12 Thread Stephane Maarek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stephane Maarek updated SPARK-14583: Description: it seems that Spark forgets or fails to read the metadata tblproperties after

[jira] [Updated] (SPARK-14084) Parallel training jobs in model selection

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14084: -- Target Version/s: 2.1.0 (was: 2.0.0) > Parallel training jobs in model selection >

[jira] [Created] (SPARK-14584) Improve recognition of non-nullability in Dataset transformations

2016-04-12 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-14584: -- Summary: Improve recognition of non-nullability in Dataset transformations Key: SPARK-14584 URL: https://issues.apache.org/jira/browse/SPARK-14584 Project: Spark

[jira] [Resolved] (SPARK-13982) SparkR - KMeans predict: Output column name of features is an unclear, automatic genetared text

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13982?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-13982. --- Resolution: Fixed Assignee: Yanbo Liang Fix Version/s: 2.0.0

[jira] [Updated] (SPARK-13982) SparkR - KMeans predict: Output column name of features is an unclear, automatic generated text

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13982?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-13982: -- Summary: SparkR - KMeans predict: Output column name of features is an unclear,

[jira] [Commented] (SPARK-14059) Define R wrappers under org.apache.spark.ml.r

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238358#comment-15238358 ] Joseph K. Bradley commented on SPARK-14059: --- This task looks complete. Can I resolve it? >

[jira] [Commented] (SPARK-14583) Spark doesn't read hive table properly after MSCK REPAIR

2016-04-12 Thread Stephane Maarek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14583?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238352#comment-15238352 ] Stephane Maarek commented on SPARK-14583: - pretty much the same behavior if instead of MSCK

[jira] [Updated] (SPARK-13982) SparkR - KMeans predict: Output column name of features is an unclear, automatic genetared text

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13982?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-13982: -- Target Version/s: 2.0.0 > SparkR - KMeans predict: Output column name of features is

[jira] [Commented] (SPARK-14154) Simplify the implementation for Kolmogorov–Smirnov test

2016-04-12 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238346#comment-15238346 ] yuhao yang commented on SPARK-14154: Got your concern. I'll run some benchmark. > Simplify the

[jira] [Updated] (SPARK-14509) Add python CountVectorizerExample

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14509: -- Shepherd: Joseph K. Bradley Assignee: zhengruifeng Target

[jira] [Commented] (SPARK-14577) spark.sql.codegen.maxCaseBranches config option

2016-04-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238344#comment-15238344 ] Reynold Xin commented on SPARK-14577: - Yea we shouldn't change the architecture. >

[jira] [Created] (SPARK-14583) Spark doesn't read hive table properly after MSCK REPAIR

2016-04-12 Thread Stephane Maarek (JIRA)
Stephane Maarek created SPARK-14583: --- Summary: Spark doesn't read hive table properly after MSCK REPAIR Key: SPARK-14583 URL: https://issues.apache.org/jira/browse/SPARK-14583 Project: Spark

[jira] [Commented] (SPARK-14577) spark.sql.codegen.maxCaseBranches config option

2016-04-12 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238323#comment-15238323 ] Dongjoon Hyun commented on SPARK-14577: --- In the current Spark architecture, `sql/core` module is

[jira] [Commented] (SPARK-14582) Increase the parallelism for small tables

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238321#comment-15238321 ] Apache Spark commented on SPARK-14582: -- User 'davies' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14582) Increase the parallelism for small tables

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14582: Assignee: Apache Spark (was: Davies Liu) > Increase the parallelism for small tables >

[jira] [Assigned] (SPARK-14582) Increase the parallelism for small tables

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14582: Assignee: Davies Liu (was: Apache Spark) > Increase the parallelism for small tables >

[jira] [Resolved] (SPARK-14579) Fix a race condition in StreamExecution.processAllAvailable

2016-04-12 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-14579. -- Resolution: Fixed Fix Version/s: 2.0.0 > Fix a race condition in

[jira] [Created] (SPARK-14582) Increase the parallelism for small tables

2016-04-12 Thread Davies Liu (JIRA)
Davies Liu created SPARK-14582: -- Summary: Increase the parallelism for small tables Key: SPARK-14582 URL: https://issues.apache.org/jira/browse/SPARK-14582 Project: Spark Issue Type:

[jira] [Updated] (SPARK-10386) Model import/export for PrefixSpan

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10386?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-10386: -- Shepherd: Joseph K. Bradley (was: Xiangrui Meng) > Model import/export for PrefixSpan

[jira] [Resolved] (SPARK-14578) Can't load a json dataset with nested wide schema

2016-04-12 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14578. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12338

[jira] [Commented] (SPARK-8514) LU factorization on BlockMatrix

2016-04-12 Thread Jerome (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238300#comment-15238300 ] Jerome commented on SPARK-8514: --- Hello Joseph: Is this JIRA still under consideration? Best, Jerome On

[jira] [Commented] (SPARK-14529) Consolidate mllib and mllib-local into one mllib folder

2016-04-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238291#comment-15238291 ] DB Tsai commented on SPARK-14529: - We still can make graphx depend on mllib-local, and I plan to do so

[jira] [Updated] (SPARK-5992) Locality Sensitive Hashing (LSH) for MLlib

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5992: - Target Version/s: 2.1.0 (was: 2.0.0) > Locality Sensitive Hashing (LSH) for MLlib >

[jira] [Updated] (SPARK-12942) Provide option to allow control the precision of numerical type for DataFrameWriter

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-12942: -- Target Version/s: 2.0.0 Component/s: (was: ML) > Provide option to allow

[jira] [Updated] (SPARK-12942) Provide option to allow control the precision of numerical type for DataFrameWriter

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-12942: -- Target Version/s: (was: 2.0.0) > Provide option to allow control the precision of

[jira] [Updated] (SPARK-12942) Provide option to allow control the precision of numerical type for DataFrameWriter

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-12942: -- Target Version/s: (was: 2.0.0) > Provide option to allow control the precision of

[jira] [Updated] (SPARK-9478) Add class weights to Random Forest

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9478: - Target Version/s: 2.1.0 (was: 2.0.0) > Add class weights to Random Forest >

[jira] [Updated] (SPARK-8514) LU factorization on BlockMatrix

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-8514: - Target Version/s: (was: 2.0.0) > LU factorization on BlockMatrix >

[jira] [Updated] (SPARK-10078) Vector-free L-BFGS

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-10078: -- Target Version/s: (was: 2.0.0) > Vector-free L-BFGS > -- > >

[jira] [Comment Edited] (SPARK-13116) TungstenAggregate though it is supposedly capable of all processing unsafe & safe rows, fails if the input is safe rows

2016-04-12 Thread Martin Brandt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238264#comment-15238264 ] Martin Brandt edited comment on SPARK-13116 at 4/12/16 11:46 PM: - I am

[jira] [Commented] (SPARK-13116) TungstenAggregate though it is supposedly capable of all processing unsafe & safe rows, fails if the input is safe rows

2016-04-12 Thread Martin Brandt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238264#comment-15238264 ] Martin Brandt commented on SPARK-13116: --- I am seeing what looks like the issue described here, in

[jira] [Commented] (SPARK-14581) Improve filter push down

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238220#comment-15238220 ] Apache Spark commented on SPARK-14581: -- User 'davies' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14581) Improve filter push down

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14581: Assignee: Davies Liu (was: Apache Spark) > Improve filter push down >

[jira] [Assigned] (SPARK-14581) Improve filter push down

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14581: Assignee: Apache Spark (was: Davies Liu) > Improve filter push down >

[jira] [Created] (SPARK-14581) Improve filter push down

2016-04-12 Thread Davies Liu (JIRA)
Davies Liu created SPARK-14581: -- Summary: Improve filter push down Key: SPARK-14581 URL: https://issues.apache.org/jira/browse/SPARK-14581 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-14363) Executor OOM due to a memory leak in Sorter

2016-04-12 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14363. Resolution: Fixed Fix Version/s: 1.6.2 2.0.0 Issue resolved by pull

[jira] [Updated] (SPARK-14497) Use top instead of sortBy() to get top N frequent words as dict in CountVectorizer

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14497: -- Summary: Use top instead of sortBy() to get top N frequent words as dict in

[jira] [Commented] (SPARK-12414) Remove closure serializer

2016-04-12 Thread Dubkov Mikhail (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238171#comment-15238171 ] Dubkov Mikhail commented on SPARK-12414: [~srowen], [~andrewor14], As I see, you just hard coded

[jira] [Commented] (SPARK-14154) Simplify the implementation for Kolmogorov–Smirnov test

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238172#comment-15238172 ] Xiangrui Meng commented on SPARK-14154: --- Changed the priority to critical since we should decide

[jira] [Updated] (SPARK-14568) Log instrumentation in logistic regression as a first task

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14568: -- Component/s: ML > Log instrumentation in logistic regression as a first task >

[jira] [Updated] (SPARK-14154) Simplify the implementation for Kolmogorov–Smirnov test

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14154: -- Priority: Critical (was: Minor) > Simplify the implementation for Kolmogorov–Smirnov test >

[jira] [Commented] (SPARK-11157) Allow Spark to be built without assemblies

2016-04-12 Thread Sebastian Kochman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238151#comment-15238151 ] Sebastian Kochman commented on SPARK-11157: --- After this change, when I try to submit a Spark

[jira] [Updated] (SPARK-14568) Log instrumentation in logistic regression as a first task

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14568: -- Shepherd: Joseph K. Bradley > Log instrumentation in logistic regression as a first task >

[jira] [Updated] (SPARK-14568) Log instrumentation in logistic regression as a first task

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14568: -- Target Version/s: 2.0.0 > Log instrumentation in logistic regression as a first task >

[jira] [Updated] (SPARK-14568) Log instrumentation in logistic regression as a first task

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14568: -- Assignee: Timothy Hunter > Log instrumentation in logistic regression as a first task >

[jira] [Assigned] (SPARK-14576) Spark console should display Web UI url

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14576: Assignee: Apache Spark > Spark console should display Web UI url >

[jira] [Commented] (SPARK-14576) Spark console should display Web UI url

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238137#comment-15238137 ] Apache Spark commented on SPARK-14576: -- User 'seyfe' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14576) Spark console should display Web UI url

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14576: Assignee: (was: Apache Spark) > Spark console should display Web UI url >

[jira] [Updated] (SPARK-14564) Python Word2Vec missing setWindowSize method

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14564: -- Component/s: (was: MLlib) > Python Word2Vec missing setWindowSize method >

[jira] [Updated] (SPARK-14564) Python Word2Vec missing setWindowSize method

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14564: -- Labels: ml pyspark python word2vec (was: ml mllib pyspark python word2vec) > Python

[jira] [Updated] (SPARK-14580) HiveTypeCoercion.IfCoercion should preserve original predicates.

2016-04-12 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-14580: -- Description: Currently, `HiveTypeCoercion.IfCoercion` removes all predicates whose

[jira] [Updated] (SPARK-14580) HiveTypeCoercion.IfCoercion should preserve original predicates.

2016-04-12 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-14580: -- Description: Currently, `HiveTypeCoercion.IfCoercion` removes all predicates whose

[jira] [Commented] (SPARK-14577) spark.sql.codegen.maxCaseBranches config option

2016-04-12 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238129#comment-15238129 ] Dongjoon Hyun commented on SPARK-14577: --- Oh, sure! Thank you for guide. >

[jira] [Commented] (SPARK-14580) HiveTypeCoercion.IfCoercion should preserve original predicates.

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14580?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238127#comment-15238127 ] Apache Spark commented on SPARK-14580: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Assigned] (SPARK-14580) HiveTypeCoercion.IfCoercion should preserve original predicates.

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14580: Assignee: (was: Apache Spark) > HiveTypeCoercion.IfCoercion should preserve original

[jira] [Assigned] (SPARK-14580) HiveTypeCoercion.IfCoercion should preserve original predicates.

2016-04-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14580: Assignee: Apache Spark > HiveTypeCoercion.IfCoercion should preserve original predicates.

[jira] [Resolved] (SPARK-14547) Avoid DNS resolution for reusing connections

2016-04-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14547. - Resolution: Fixed Fix Version/s: 2.0.0 > Avoid DNS resolution for reusing connections >

[jira] [Commented] (SPARK-14550) OneHotEncoding wrapper in SparkR

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238120#comment-15238120 ] Joseph K. Bradley commented on SPARK-14550: --- Please see comment on [SPARK-14546] >

[jira] [Closed] (SPARK-14553) PCA wrapper for SparkR

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-14553. - Resolution: Later Please see comment on [SPARK-14546] > PCA wrapper for SparkR >

[jira] [Closed] (SPARK-14552) ReValue wrapper for SparkR

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-14552. - Resolution: Later Please see comment on [SPARK-14546] > ReValue wrapper for SparkR >

[jira] [Commented] (SPARK-14546) Scale Wrapper in SparkR

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238117#comment-15238117 ] Joseph K. Bradley commented on SPARK-14546: --- [~aloknsingh] Thanks for reporting these issues.

[jira] [Closed] (SPARK-14546) Scale Wrapper in SparkR

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14546?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-14546. - Resolution: Later > Scale Wrapper in SparkR > --- > >

[jira] [Closed] (SPARK-14550) OneHotEncoding wrapper in SparkR

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-14550. - Resolution: Later > OneHotEncoding wrapper in SparkR >

[jira] [Commented] (SPARK-14529) Consolidate mllib and mllib-local into one mllib folder

2016-04-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238105#comment-15238105 ] Joseph K. Bradley commented on SPARK-14529: --- Will this be confusing if, at some point, we

  1   2   3   >