[jira] [Assigned] (SPARK-14669) Some SQL metrics is broken when whole-stage codegen enabled

2016-04-15 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-14669: -- Assignee: Davies Liu > Some SQL metrics is broken when whole-stage codegen enabled >

[jira] [Resolved] (SPARK-14677) Make the max number of iterations configurable for Catalyst

2016-04-15 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-14677. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12434

[jira] [Assigned] (SPARK-14632) randomSplit method fails on dataframes with maps in schema

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14632: Assignee: Apache Spark > randomSplit method fails on dataframes with maps in schema >

[jira] [Commented] (SPARK-14632) randomSplit method fails on dataframes with maps in schema

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243967#comment-15243967 ] Apache Spark commented on SPARK-14632: -- User 'sbcd90' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14632) randomSplit method fails on dataframes with maps in schema

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14632: Assignee: (was: Apache Spark) > randomSplit method fails on dataframes with maps in

[jira] [Commented] (SPARK-14679) UI DAG visualization causes OOM generating data

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243926#comment-15243926 ] Apache Spark commented on SPARK-14679: -- User 'rdblue' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14679) UI DAG visualization causes OOM generating data

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14679: Assignee: Apache Spark > UI DAG visualization causes OOM generating data >

[jira] [Assigned] (SPARK-14679) UI DAG visualization causes OOM generating data

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14679: Assignee: (was: Apache Spark) > UI DAG visualization causes OOM generating data >

[jira] [Created] (SPARK-14679) UI DAG visualization causes OOM generating data

2016-04-15 Thread Ryan Blue (JIRA)
Ryan Blue created SPARK-14679: - Summary: UI DAG visualization causes OOM generating data Key: SPARK-14679 URL: https://issues.apache.org/jira/browse/SPARK-14679 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-14668) Move current_database to Catalyst

2016-04-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14668. - Resolution: Fixed Fix Version/s: 2.0.0 > Move current_database to Catalyst >

[jira] [Assigned] (SPARK-14649) DagScheduler runs duplicate tasks on fetch failure

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14649: Assignee: (was: Apache Spark) > DagScheduler runs duplicate tasks on fetch failure >

[jira] [Commented] (SPARK-14649) DagScheduler runs duplicate tasks on fetch failure

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14649?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243869#comment-15243869 ] Apache Spark commented on SPARK-14649: -- User 'sitalkedia' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14649) DagScheduler runs duplicate tasks on fetch failure

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14649: Assignee: Apache Spark > DagScheduler runs duplicate tasks on fetch failure >

[jira] [Assigned] (SPARK-14678) Add a file sink log to support versioning and compaction

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14678?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14678: Assignee: Apache Spark (was: Shixiong Zhu) > Add a file sink log to support versioning

[jira] [Assigned] (SPARK-14678) Add a file sink log to support versioning and compaction

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14678?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14678: Assignee: Shixiong Zhu (was: Apache Spark) > Add a file sink log to support versioning

[jira] [Commented] (SPARK-14678) Add a file sink log to support versioning and compaction

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14678?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243838#comment-15243838 ] Apache Spark commented on SPARK-14678: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Created] (SPARK-14678) Add a file sink log to support versioning and compaction

2016-04-15 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-14678: Summary: Add a file sink log to support versioning and compaction Key: SPARK-14678 URL: https://issues.apache.org/jira/browse/SPARK-14678 Project: Spark

[jira] [Commented] (SPARK-14676) Catch, wrap, and re-throw exceptions from Await.result in order to capture full stacktrace

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243790#comment-15243790 ] Apache Spark commented on SPARK-14676: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14677) Make the max number of iterations configurable for Catalyst

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14677: Assignee: Apache Spark (was: Reynold Xin) > Make the max number of iterations

[jira] [Assigned] (SPARK-14677) Make the max number of iterations configurable for Catalyst

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14677: Assignee: Reynold Xin (was: Apache Spark) > Make the max number of iterations

[jira] [Commented] (SPARK-14677) Make the max number of iterations configurable for Catalyst

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243794#comment-15243794 ] Apache Spark commented on SPARK-14677: -- User 'rxin' has created a pull request for this issue:

[jira] [Created] (SPARK-14677) Make the max number of iterations configurable for Catalyst

2016-04-15 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-14677: --- Summary: Make the max number of iterations configurable for Catalyst Key: SPARK-14677 URL: https://issues.apache.org/jira/browse/SPARK-14677 Project: Spark

[jira] [Commented] (SPARK-13944) Separate out local linear algebra as a standalone module without Spark dependency

2016-04-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243793#comment-15243793 ] Xiangrui Meng commented on SPARK-13944: --- `mllib-local` by the name is not scoped just for local

[jira] [Assigned] (SPARK-14676) Catch, wrap, and re-throw exceptions from Await.result in order to capture full stacktrace

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14676: Assignee: Apache Spark (was: Josh Rosen) > Catch, wrap, and re-throw exceptions from

[jira] [Assigned] (SPARK-14676) Catch, wrap, and re-throw exceptions from Await.result in order to capture full stacktrace

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14676: Assignee: Josh Rosen (was: Apache Spark) > Catch, wrap, and re-throw exceptions from

[jira] [Updated] (SPARK-14620) Use/benchmark a better hash in AggregateHashMap

2016-04-15 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-14620: - Assignee: Sameer Agarwal > Use/benchmark a better hash in AggregateHashMap >

[jira] [Commented] (SPARK-14675) ClassFormatError in codegen when using Aggregator

2016-04-15 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243788#comment-15243788 ] koert kuipers commented on SPARK-14675: --- for example it works fine if i use this aggregator

[jira] [Resolved] (SPARK-14620) Use/benchmark a better hash in AggregateHashMap

2016-04-15 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-14620. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12379

[jira] [Created] (SPARK-14676) Catch, wrap, and re-throw exceptions from Await.result in order to capture full stacktrace

2016-04-15 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-14676: -- Summary: Catch, wrap, and re-throw exceptions from Await.result in order to capture full stacktrace Key: SPARK-14676 URL: https://issues.apache.org/jira/browse/SPARK-14676

[jira] [Commented] (SPARK-14675) ClassFormatError in codegen when using Aggregator

2016-04-15 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243780#comment-15243780 ] koert kuipers commented on SPARK-14675: --- it works fine for other similar Aggregators, i think the

[jira] [Resolved] (SPARK-14628) Remove all the Options in TaskMetrics

2016-04-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14628. - Resolution: Fixed Assignee: Wenchen Fan Fix Version/s: 2.0.0 > Remove all the

[jira] [Commented] (SPARK-14675) ClassFormatError in codegen when using Aggregator

2016-04-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243770#comment-15243770 ] Reynold Xin commented on SPARK-14675: - cc [~cloud_fan] > ClassFormatError in codegen when using

[jira] [Created] (SPARK-14675) ClassFormatError in codegen when using Aggregator

2016-04-15 Thread koert kuipers (JIRA)
koert kuipers created SPARK-14675: - Summary: ClassFormatError in codegen when using Aggregator Key: SPARK-14675 URL: https://issues.apache.org/jira/browse/SPARK-14675 Project: Spark Issue

[jira] [Updated] (SPARK-14668) Move current_database to Catalyst

2016-04-15 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-14668: - Summary: Move current_database to Catalyst (was: Move current_database to sql/core) > Move

[jira] [Commented] (SPARK-14569) Log instrumentation in KMeans

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243749#comment-15243749 ] Apache Spark commented on SPARK-14569: -- User 'keypointt' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14569) Log instrumentation in KMeans

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14569: Assignee: (was: Apache Spark) > Log instrumentation in KMeans >

[jira] [Assigned] (SPARK-14569) Log instrumentation in KMeans

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14569: Assignee: Apache Spark > Log instrumentation in KMeans > - >

[jira] [Assigned] (SPARK-14674) Move HiveContext.hiveconf to HiveSessionState

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14674: Assignee: Apache Spark (was: Andrew Or) > Move HiveContext.hiveconf to HiveSessionState

[jira] [Assigned] (SPARK-14674) Move HiveContext.hiveconf to HiveSessionState

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14674: Assignee: Andrew Or (was: Apache Spark) > Move HiveContext.hiveconf to HiveSessionState

[jira] [Commented] (SPARK-14674) Move HiveContext.hiveconf to HiveSessionState

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243747#comment-15243747 ] Apache Spark commented on SPARK-14674: -- User 'andrewor14' has created a pull request for this issue:

[jira] [Created] (SPARK-14674) Move HiveContext.hiveconf to HiveSessionState

2016-04-15 Thread Andrew Or (JIRA)
Andrew Or created SPARK-14674: - Summary: Move HiveContext.hiveconf to HiveSessionState Key: SPARK-14674 URL: https://issues.apache.org/jira/browse/SPARK-14674 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-14647) Group SQLContext/HiveContext state into PersistentState

2016-04-15 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-14647: -- Issue Type: Sub-task (was: Bug) Parent: SPARK-14673 > Group SQLContext/HiveContext state into

[jira] [Updated] (SPARK-14672) Move HiveContext analyze logic to AnalyzeTable

2016-04-15 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-14672: -- Issue Type: Sub-task (was: Bug) Parent: SPARK-14673 > Move HiveContext analyze logic to

[jira] [Updated] (SPARK-14668) Move current_database to sql/core

2016-04-15 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-14668: -- Parent Issue: SPARK-14673 (was: SPARK-14118) > Move current_database to sql/core >

[jira] [Created] (SPARK-14673) Remove HiveContext

2016-04-15 Thread Andrew Or (JIRA)
Andrew Or created SPARK-14673: - Summary: Remove HiveContext Key: SPARK-14673 URL: https://issues.apache.org/jira/browse/SPARK-14673 Project: Spark Issue Type: Bug Components: SQL

[jira] [Assigned] (SPARK-14377) Review spark.ml parity for classification, except trees

2016-04-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-14377: - Assignee: Joseph K. Bradley > Review spark.ml parity for classification, except

[jira] [Updated] (SPARK-4591) Algorithm/model parity for spark.ml (Scala)

2016-04-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-4591: - Summary: Algorithm/model parity for spark.ml (Scala) (was: Algorithm/model parity audit

[jira] [Updated] (SPARK-4591) Algorithm/model parity audit for spark.ml (Scala)

2016-04-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-4591: - Summary: Algorithm/model parity audit for spark.ml (Scala) (was: Algorithm/model parity

[jira] [Issue Comment Deleted] (SPARK-11939) PySpark support model export/import for Pipeline API

2016-04-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-11939: -- Comment: was deleted (was: Noting the items which are still WIP in Scala. We can

[jira] [Assigned] (SPARK-14671) Pipeline.setStages needs to handle Array non-covariance

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14671: Assignee: Apache Spark (was: Joseph K. Bradley) > Pipeline.setStages needs to handle

[jira] [Commented] (SPARK-14671) Pipeline.setStages needs to handle Array non-covariance

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14671?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243721#comment-15243721 ] Apache Spark commented on SPARK-14671: -- User 'jkbradley' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14671) Pipeline.setStages needs to handle Array non-covariance

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14671: Assignee: Joseph K. Bradley (was: Apache Spark) > Pipeline.setStages needs to handle

[jira] [Assigned] (SPARK-14672) Move HiveContext analyze logic to AnalyzeTable

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14672: Assignee: Andrew Or (was: Apache Spark) > Move HiveContext analyze logic to AnalyzeTable

[jira] [Commented] (SPARK-14672) Move HiveContext analyze logic to AnalyzeTable

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243713#comment-15243713 ] Apache Spark commented on SPARK-14672: -- User 'andrewor14' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14672) Move HiveContext analyze logic to AnalyzeTable

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14672: Assignee: Apache Spark (was: Andrew Or) > Move HiveContext analyze logic to AnalyzeTable

[jira] [Commented] (SPARK-14564) Python Word2Vec missing setWindowSize method

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243708#comment-15243708 ] Apache Spark commented on SPARK-14564: -- User 'jasoncl' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14564) Python Word2Vec missing setWindowSize method

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14564: Assignee: (was: Apache Spark) > Python Word2Vec missing setWindowSize method >

[jira] [Assigned] (SPARK-14564) Python Word2Vec missing setWindowSize method

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14564: Assignee: Apache Spark > Python Word2Vec missing setWindowSize method >

[jira] [Created] (SPARK-14672) Move HiveContext analyze logic to AnalyzeTable

2016-04-15 Thread Andrew Or (JIRA)
Andrew Or created SPARK-14672: - Summary: Move HiveContext analyze logic to AnalyzeTable Key: SPARK-14672 URL: https://issues.apache.org/jira/browse/SPARK-14672 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-14586) SparkSQL doesn't parse decimal like Hive

2016-04-15 Thread Suresh Thalamati (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14586?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243702#comment-15243702 ] Suresh Thalamati commented on SPARK-14586: -- Thanks for reporting this issue , Stephane. which

[jira] [Updated] (SPARK-14567) Add instrumentation logs to MLlib training algorithms

2016-04-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14567: -- Target Version/s: 2.0.0 > Add instrumentation logs to MLlib training algorithms >

[jira] [Updated] (SPARK-14567) Add instrumentation logs to MLlib training algorithms

2016-04-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14567: -- Component/s: ML > Add instrumentation logs to MLlib training algorithms >

[jira] [Created] (SPARK-14671) Pipeline.setStages needs to handle Array non-covariance

2016-04-15 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-14671: - Summary: Pipeline.setStages needs to handle Array non-covariance Key: SPARK-14671 URL: https://issues.apache.org/jira/browse/SPARK-14671 Project: Spark

[jira] [Assigned] (SPARK-14670) Allow updating SQLMetrics on driver

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14670: Assignee: Apache Spark (was: Andrew Or) > Allow updating SQLMetrics on driver >

[jira] [Assigned] (SPARK-14670) Allow updating SQLMetrics on driver

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14670: Assignee: Andrew Or (was: Apache Spark) > Allow updating SQLMetrics on driver >

[jira] [Commented] (SPARK-14670) Allow updating SQLMetrics on driver

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243688#comment-15243688 ] Apache Spark commented on SPARK-14670: -- User 'andrewor14' has created a pull request for this issue:

[jira] [Created] (SPARK-14670) Allow updating SQLMetrics on driver

2016-04-15 Thread Andrew Or (JIRA)
Andrew Or created SPARK-14670: - Summary: Allow updating SQLMetrics on driver Key: SPARK-14670 URL: https://issues.apache.org/jira/browse/SPARK-14670 Project: Spark Issue Type: Bug

[jira] [Closed] (SPARK-8817) DataFrame should not allow duplicate colum names

2016-04-15 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] koert kuipers closed SPARK-8817. Resolution: Not A Problem I believe community disagrees with me and thinks its ok to have duplicate

[jira] [Assigned] (SPARK-7264) SparkR API for parallel functions

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7264: --- Assignee: Apache Spark > SparkR API for parallel functions >

[jira] [Commented] (SPARK-7264) SparkR API for parallel functions

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243655#comment-15243655 ] Apache Spark commented on SPARK-7264: - User 'thunterdb' has created a pull request for this issue:

[jira] [Assigned] (SPARK-7264) SparkR API for parallel functions

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7264: --- Assignee: (was: Apache Spark) > SparkR API for parallel functions >

[jira] [Updated] (SPARK-14661) Trim PCAModel by required explained variance

2016-04-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14661: -- Issue Type: New Feature (was: Improvement) > Trim PCAModel by required explained

[jira] [Updated] (SPARK-14661) Trim PCAModel by required explained variance

2016-04-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14661: -- Affects Version/s: (was: 2.0.0) > Trim PCAModel by required explained variance >

[jira] [Updated] (SPARK-14661) Trim PCAModel by required explained variance

2016-04-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14661: -- Component/s: ML > Trim PCAModel by required explained variance >

[jira] [Commented] (SPARK-14659) OneHotEncoder support drop first category alphabetically in the encoded vector

2016-04-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243642#comment-15243642 ] Joseph K. Bradley commented on SPARK-14659: --- This change SGTM. I'll target it for 2.0 >

[jira] [Updated] (SPARK-14659) OneHotEncoder support drop first category alphabetically in the encoded vector

2016-04-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14659: -- Target Version/s: 2.0.0 > OneHotEncoder support drop first category alphabetically in

[jira] [Updated] (SPARK-14389) OOM during BroadcastNestedLoopJoin

2016-04-15 Thread Steve Johnston (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Johnston updated SPARK-14389: --- Attachment: stderr_2.txt OOM stderr from run on EMR 4.5 with Spark 1.6.1 > OOM during

[jira] [Updated] (SPARK-14389) OOM during BroadcastNestedLoopJoin

2016-04-15 Thread Steve Johnston (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Johnston updated SPARK-14389: --- Attachment: controller_2.txt OOM controller from run on EMR 4.5 with Spark 1.6.1 > OOM

[jira] [Commented] (SPARK-7445) StringIndexer should handle binary labels properly

2016-04-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243624#comment-15243624 ] Joseph K. Bradley commented on SPARK-7445: -- Targeting at 2.0 since this will be a change of

[jira] [Updated] (SPARK-14389) OOM during BroadcastNestedLoopJoin

2016-04-15 Thread Steve Johnston (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Johnston updated SPARK-14389: --- Attachment: stdout_2.txt OOM stdout from run on EMR 4.5 with Spark 1.6.1 > OOM during

[jira] [Updated] (SPARK-14634) Add BisectingKMeansSummary

2016-04-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14634: -- Issue Type: New Feature (was: Bug) > Add BisectingKMeansSummary >

[jira] [Updated] (SPARK-14634) Add BisectingKMeansSummary

2016-04-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14634: -- Priority: Minor (was: Major) > Add BisectingKMeansSummary >

[jira] [Updated] (SPARK-7445) StringIndexer should handle binary labels properly

2016-04-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7445: - Target Version/s: 2.0.0 > StringIndexer should handle binary labels properly >

[jira] [Updated] (SPARK-14623) add label binarizer

2016-04-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14623: -- Issue Type: New Feature (was: Improvement) > add label binarizer >

[jira] [Updated] (SPARK-14623) add label binarizer

2016-04-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14623: -- Affects Version/s: (was: 1.6.1) > add label binarizer > > >

[jira] [Updated] (SPARK-14623) add label binarizer

2016-04-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14623: -- Shepherd: (was: Xiangrui Meng) > add label binarizer > > >

[jira] [Commented] (SPARK-14623) add label binarizer

2016-04-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243621#comment-15243621 ] Joseph K. Bradley commented on SPARK-14623: --- [~hujiayin] Thanks for this. However, this looks

[jira] [Commented] (SPARK-7264) SparkR API for parallel functions

2016-04-15 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243584#comment-15243584 ] Timothy Hunter commented on SPARK-7264: --- I will have a PR for this soon. > SparkR API for parallel

[jira] [Updated] (SPARK-14610) Remove superfluous split from random forest findSplitsForContinousFeature

2016-04-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14610: -- Priority: Minor (was: Major) > Remove superfluous split from random forest

[jira] [Updated] (SPARK-14389) OOM during BroadcastNestedLoopJoin

2016-04-15 Thread Steve Johnston (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Johnston updated SPARK-14389: --- Environment: OS: Amazon Linux AMI 2015.09 EMR: 4.3.0 & 4.5.0 Hadoop: Amazon 2.7.1 & 2.7.2

[jira] [Commented] (SPARK-14389) OOM during BroadcastNestedLoopJoin

2016-04-15 Thread Steve Johnston (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243549#comment-15243549 ] Steve Johnston commented on SPARK-14389: This is reproducible on Spark 1.6.1. > OOM during

[jira] [Assigned] (SPARK-14669) Some SQL metrics is broken when whole-stage codegen enabled

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14669: Assignee: Apache Spark > Some SQL metrics is broken when whole-stage codegen enabled >

[jira] [Commented] (SPARK-14669) Some SQL metrics is broken when whole-stage codegen enabled

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243532#comment-15243532 ] Apache Spark commented on SPARK-14669: -- User 'davies' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14669) Some SQL metrics is broken when whole-stage codegen enabled

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14669: Assignee: (was: Apache Spark) > Some SQL metrics is broken when whole-stage codegen

[jira] [Assigned] (SPARK-14668) Move current_database to sql/core

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14668: Assignee: Apache Spark (was: Yin Huai) > Move current_database to sql/core >

[jira] [Commented] (SPARK-14668) Move current_database to sql/core

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243529#comment-15243529 ] Apache Spark commented on SPARK-14668: -- User 'yhuai' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14668) Move current_database to sql/core

2016-04-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14668: Assignee: Yin Huai (was: Apache Spark) > Move current_database to sql/core >

[jira] [Created] (SPARK-14669) Some SQL metrics is broken when whole-stage codegen enabled

2016-04-15 Thread Davies Liu (JIRA)
Davies Liu created SPARK-14669: -- Summary: Some SQL metrics is broken when whole-stage codegen enabled Key: SPARK-14669 URL: https://issues.apache.org/jira/browse/SPARK-14669 Project: Spark

[jira] [Created] (SPARK-14668) Move current_database to sql/core

2016-04-15 Thread Yin Huai (JIRA)
Yin Huai created SPARK-14668: Summary: Move current_database to sql/core Key: SPARK-14668 URL: https://issues.apache.org/jira/browse/SPARK-14668 Project: Spark Issue Type: Sub-task

[jira] [Resolved] (SPARK-7861) Python wrapper for OneVsRest

2016-04-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-7861. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12124
