[jira] [Created] (SPARK-19487) Low latency execution for Spark

2017-02-06 Thread Shivaram Venkataraman (JIRA)
Shivaram Venkataraman created SPARK-19487: - Summary: Low latency execution for Spark Key: SPARK-19487 URL: https://issues.apache.org/jira/browse/SPARK-19487 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-18967) Locality preferences should be used when scheduling even when delay scheduling is turned off

2017-02-06 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout resolved SPARK-18967. Resolution: Fixed Fix Version/s: 2.2 > Locality preferences should be used when sche

[jira] [Commented] (SPARK-16441) Spark application hang when dynamic allocation is enabled

2017-02-06 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15855414#comment-15855414 ] Yuming Wang commented on SPARK-16441: - [~cenyuhai], [2.1.0|https://github.com/apache/

[jira] [Commented] (SPARK-19484) continue work to create a table with an empty schema

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15855402#comment-15855402 ] Apache Spark commented on SPARK-19484: -- User 'windpiger' has created a pull request

[jira] [Assigned] (SPARK-19484) continue work to create a table with an empty schema

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19484: Assignee: Apache Spark > continue work to create a table with an empty schema > --

[jira] [Assigned] (SPARK-19484) continue work to create a table with an empty schema

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19484: Assignee: (was: Apache Spark) > continue work to create a table with an empty schema >

[jira] [Updated] (SPARK-16441) Spark application hang when dynamic allocation is enabled

2017-02-06 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-16441: Attachment: SPARK-16441-compare-apply-PR-16819.zip > Spark application hang when dynamic allocation

[jira] [Created] (SPARK-19486) Investigate using multiple threads for task serialization

2017-02-06 Thread Shivaram Venkataraman (JIRA)
Shivaram Venkataraman created SPARK-19486: - Summary: Investigate using multiple threads for task serialization Key: SPARK-19486 URL: https://issues.apache.org/jira/browse/SPARK-19486 Project: S

[jira] [Created] (SPARK-19485) Launch tasks async i.e. dont wait for the network

2017-02-06 Thread Shivaram Venkataraman (JIRA)
Shivaram Venkataraman created SPARK-19485: - Summary: Launch tasks async i.e. dont wait for the network Key: SPARK-19485 URL: https://issues.apache.org/jira/browse/SPARK-19485 Project: Spark

[jira] [Created] (SPARK-19484) continue work to create a table with an empty schema

2017-02-06 Thread Song Jun (JIRA)
Song Jun created SPARK-19484: Summary: continue work to create a table with an empty schema Key: SPARK-19484 URL: https://issues.apache.org/jira/browse/SPARK-19484 Project: Spark Issue Type: Impr

[jira] [Resolved] (SPARK-19407) defaultFS is used FileSystem.get instead of getting it from uri scheme

2017-02-06 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-19407. -- Resolution: Fixed Assignee: Genmao Yu Fix Version/s: 2.2.0 2.

[jira] [Updated] (SPARK-19483) Add one RocketMQ plugin for the Apache Spark

2017-02-06 Thread Longda Feng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Longda Feng updated SPARK-19483: External issue URL: https://issues.apache.org/jira/browse/ROCKETMQ-81 External issue ID: ROCKET

[jira] [Created] (SPARK-19483) Add one RocketMQ plugin for the Apache Spark

2017-02-06 Thread Longda Feng (JIRA)
Longda Feng created SPARK-19483: --- Summary: Add one RocketMQ plugin for the Apache Spark Key: SPARK-19483 URL: https://issues.apache.org/jira/browse/SPARK-19483 Project: Spark Issue Type: Task

[jira] [Assigned] (SPARK-19482) Fail it if 'spark.master' is set with different value

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19482: Assignee: (was: Apache Spark) > Fail it if 'spark.master' is set with different value

[jira] [Assigned] (SPARK-19482) Fail it if 'spark.master' is set with different value

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19482: Assignee: Apache Spark > Fail it if 'spark.master' is set with different value > -

[jira] [Commented] (SPARK-19482) Fail it if 'spark.master' is set with different value

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19482?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15855240#comment-15855240 ] Apache Spark commented on SPARK-19482: -- User 'uncleGen' has created a pull request f

[jira] [Created] (SPARK-19482) Fail it if 'spark.master' is set with different value

2017-02-06 Thread Genmao Yu (JIRA)
Genmao Yu created SPARK-19482: - Summary: Fail it if 'spark.master' is set with different value Key: SPARK-19482 URL: https://issues.apache.org/jira/browse/SPARK-19482 Project: Spark Issue Type: I

[jira] [Assigned] (SPARK-19467) PySpark ML shouldn't use circular imports

2017-02-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-19467: - Assignee: Maciej Szymkiewicz > PySpark ML shouldn't use circular imports > -

[jira] [Resolved] (SPARK-19467) PySpark ML shouldn't use circular imports

2017-02-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-19467. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 16814 [h

[jira] [Resolved] (SPARK-19441) Remove IN type coercion from PromoteStrings

2017-02-06 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-19441. - Resolution: Fixed Issue resolved by pull request 16783 [https://github.com/apache/spark/pull/1678

[jira] [Commented] (SPARK-19479) Spark Mesos artifact split causes spark-core dependency to not pull in mesos impl

2017-02-06 Thread Charles Allen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15855102#comment-15855102 ] Charles Allen commented on SPARK-19479: --- [~mgummelt] that's actually a really good

[jira] [Commented] (SPARK-15573) Backwards-compatible persistence for spark.ml

2017-02-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15855097#comment-15855097 ] Joseph K. Bradley commented on SPARK-15573: --- It's a good point that we can't ma

[jira] [Commented] (SPARK-19208) MultivariateOnlineSummarizer performance optimization

2017-02-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15855090#comment-15855090 ] Joseph K. Bradley commented on SPARK-19208: --- You're right that sharing intermed

[jira] [Commented] (SPARK-19479) Spark Mesos artifact split causes spark-core dependency to not pull in mesos impl

2017-02-06 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15855091#comment-15855091 ] Michael Gummelt commented on SPARK-19479: - Yea, sorry for the inconvenience, but

[jira] [Commented] (SPARK-12157) Support numpy types as return values of Python UDFs

2017-02-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15855060#comment-15855060 ] Joseph K. Bradley commented on SPARK-12157: --- I don't know of any Python UDF per

[jira] [Commented] (SPARK-16824) Add API docs for VectorUDT

2017-02-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15855057#comment-15855057 ] Joseph K. Bradley commented on SPARK-16824: --- I think we didn't document it sinc

[jira] [Updated] (SPARK-16824) Add API docs for VectorUDT

2017-02-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-16824: -- Issue Type: Documentation (was: Improvement) > Add API docs for VectorUDT > --

[jira] [Updated] (SPARK-16824) Add API docs for VectorUDT

2017-02-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-16824: -- Component/s: MLlib > Add API docs for VectorUDT > -- > >

[jira] [Commented] (SPARK-18891) Support for specific collection types

2017-02-06 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-18891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15855011#comment-15855011 ] Michal Šenkýř commented on SPARK-18891: --- Thanks. Yes, I know about the Maps issue a

[jira] [Assigned] (SPARK-19481) Fix flaky test: o.a.s.repl.ReplSuite should clone and clean line object in ClosureCleaner

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19481: Assignee: Shixiong Zhu (was: Apache Spark) > Fix flaky test: o.a.s.repl.ReplSuite should

[jira] [Assigned] (SPARK-19481) Fix flaky test: o.a.s.repl.ReplSuite should clone and clean line object in ClosureCleaner

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19481: Assignee: Apache Spark (was: Shixiong Zhu) > Fix flaky test: o.a.s.repl.ReplSuite should

[jira] [Commented] (SPARK-19481) Fix flaky test: o.a.s.repl.ReplSuite should clone and clean line object in ClosureCleaner

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15854889#comment-15854889 ] Apache Spark commented on SPARK-19481: -- User 'zsxwing' has created a pull request fo

[jira] [Created] (SPARK-19481) Fix flaky test: o.a.s.repl.ReplSuite should clone and clean line object in ClosureCleaner

2017-02-06 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-19481: Summary: Fix flaky test: o.a.s.repl.ReplSuite should clone and clean line object in ClosureCleaner Key: SPARK-19481 URL: https://issues.apache.org/jira/browse/SPARK-19481

[jira] [Created] (SPARK-19480) Higher order functions in SQL

2017-02-06 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-19480: --- Summary: Higher order functions in SQL Key: SPARK-19480 URL: https://issues.apache.org/jira/browse/SPARK-19480 Project: Spark Issue Type: New Feature

[jira] [Resolved] (SPARK-19472) [SQL]SQLParser fails to resolve nested CASE WHEN statement with parentheses

2017-02-06 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-19472. - Resolution: Fixed Assignee: Herman van Hovell Fix Version/s: 2.2.0 2.1.

[jira] [Resolved] (SPARK-19479) Spark Mesos artifact split causes spark-core dependency to not pull in mesos impl

2017-02-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-19479. --- Resolution: Invalid As I said on the PR, I don't think this is for release notes because release not

[jira] [Created] (SPARK-19479) Spark Mesos artifact split causes spark-core dependency to not pull in mesos impl

2017-02-06 Thread Charles Allen (JIRA)
Charles Allen created SPARK-19479: - Summary: Spark Mesos artifact split causes spark-core dependency to not pull in mesos impl Key: SPARK-19479 URL: https://issues.apache.org/jira/browse/SPARK-19479 P

[jira] [Updated] (SPARK-19478) JDBC Sink

2017-02-06 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-19478: - Issue Type: New Feature (was: Bug) > JDBC Sink > - > > Key: SPAR

[jira] [Created] (SPARK-19478) JDBC Sink

2017-02-06 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-19478: Summary: JDBC Sink Key: SPARK-19478 URL: https://issues.apache.org/jira/browse/SPARK-19478 Project: Spark Issue Type: Bug Components: Struc

[jira] [Updated] (SPARK-19398) Log in TaskSetManager is not correct

2017-02-06 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout updated SPARK-19398: --- Fix Version/s: 2.2 > Log in TaskSetManager is not correct > -

[jira] [Resolved] (SPARK-19398) Log in TaskSetManager is not correct

2017-02-06 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout resolved SPARK-19398. Resolution: Fixed Assignee: jin xing > Log in TaskSetManager is not correct > ---

[jira] [Commented] (SPARK-19462) when spark.sql.adaptive.enabled is enabled, DF is not resilient to node/container failure

2017-02-06 Thread Ian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15854534#comment-15854534 ] Ian commented on SPARK-19462: - I appears that the state mutating of newPartitioning of org.a

[jira] [Created] (SPARK-19477) [SQL] Datasets created from a Dataframe with extra columns retain the extra columns

2017-02-06 Thread Don Drake (JIRA)
Don Drake created SPARK-19477: - Summary: [SQL] Datasets created from a Dataframe with extra columns retain the extra columns Key: SPARK-19477 URL: https://issues.apache.org/jira/browse/SPARK-19477 Project

[jira] [Commented] (SPARK-18069) Many examples in Python docstrings are incomplete

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15854468#comment-15854468 ] Apache Spark commented on SPARK-18069: -- User 'HyukjinKwon' has created a pull reques

[jira] [Commented] (SPARK-19439) PySpark's registerJavaFunction Should Support UDAFs

2017-02-06 Thread Keith Bourgoin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15854330#comment-15854330 ] Keith Bourgoin commented on SPARK-19439: SPARK-10915 refers to making it possible

[jira] [Created] (SPARK-19476) Running threads in Spark DataFrame foreachPartition() causes NullPointerException

2017-02-06 Thread Gal Topper (JIRA)
Gal Topper created SPARK-19476: -- Summary: Running threads in Spark DataFrame foreachPartition() causes NullPointerException Key: SPARK-19476 URL: https://issues.apache.org/jira/browse/SPARK-19476 Project

[jira] [Resolved] (SPARK-19080) simplify data source analysis

2017-02-06 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-19080. - Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 16269 [https://githu

[jira] [Assigned] (SPARK-19475) (ML|MLlib).linalg.DenseVector method delegation fails for __neg__

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19475: Assignee: (was: Apache Spark) > (ML|MLlib).linalg.DenseVector method delegation fails

[jira] [Commented] (SPARK-19475) (ML|MLlib).linalg.DenseVector method delegation fails for __neg__

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15854270#comment-15854270 ] Apache Spark commented on SPARK-19475: -- User 'zero323' has created a pull request fo

[jira] [Assigned] (SPARK-19475) (ML|MLlib).linalg.DenseVector method delegation fails for __neg__

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19475: Assignee: Apache Spark > (ML|MLlib).linalg.DenseVector method delegation fails for __neg__

[jira] [Created] (SPARK-19475) (ML|MLlib).linalg.DenseVector method delegation fails for __neg__

2017-02-06 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-19475: -- Summary: (ML|MLlib).linalg.DenseVector method delegation fails for __neg__ Key: SPARK-19475 URL: https://issues.apache.org/jira/browse/SPARK-19475 Project

[jira] [Commented] (SPARK-19468) Dataset slow because of unnecessary shuffles

2017-02-06 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15854242#comment-15854242 ] koert kuipers commented on SPARK-19468: --- so to summarize: RDD does what we would ex

[jira] [Assigned] (SPARK-19472) [SQL]SQLParser fails to resolve nested CASE WHEN statement with parentheses

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19472: Assignee: Apache Spark > [SQL]SQLParser fails to resolve nested CASE WHEN statement with p

[jira] [Assigned] (SPARK-19472) [SQL]SQLParser fails to resolve nested CASE WHEN statement with parentheses

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19472: Assignee: (was: Apache Spark) > [SQL]SQLParser fails to resolve nested CASE WHEN state

[jira] [Commented] (SPARK-19472) [SQL]SQLParser fails to resolve nested CASE WHEN statement with parentheses

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15854187#comment-15854187 ] Apache Spark commented on SPARK-19472: -- User 'hvanhovell' has created a pull request

[jira] [Commented] (SPARK-19468) Dataset slow because of unnecessary shuffles

2017-02-06 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15854152#comment-15854152 ] koert kuipers commented on SPARK-19468: --- inserting unnecessary shuffles makes thing

[jira] [Assigned] (SPARK-17663) SchedulableBuilder should handle invalid data access via scheduler.allocation.file

2017-02-06 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid reassigned SPARK-17663: Assignee: Eren Avsarogullari (was: Imran Rashid) > SchedulableBuilder should handle inval

[jira] [Assigned] (SPARK-17663) SchedulableBuilder should handle invalid data access via scheduler.allocation.file

2017-02-06 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid reassigned SPARK-17663: Assignee: Imran Rashid > SchedulableBuilder should handle invalid data access via > sched

[jira] [Resolved] (SPARK-17663) SchedulableBuilder should handle invalid data access via scheduler.allocation.file

2017-02-06 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid resolved SPARK-17663. -- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 15237 [https://git

[jira] [Commented] (SPARK-19439) PySpark's registerJavaFunction Should Support UDAFs

2017-02-06 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15854034#comment-15854034 ] Hyukjin Kwon commented on SPARK-19439: -- So, as you said, is this a duplicate of SPAR

[jira] [Resolved] (SPARK-19440) Window in pyspark doesn't have attributes unboundedPreceding, unboundedFollowing and currentRow

2017-02-06 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-19440. -- Resolution: Invalid It seems there are as below: {code} >>> from pyspark.sql import Window >>>

[jira] [Resolved] (SPARK-19442) Unable to add column to the dataset using Dataset.WithColumn() api

2017-02-06 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-19442. -- Resolution: Cannot Reproduce I am resolving this as I can't reproduce in the current master as

[jira] [Commented] (SPARK-19451) Long values in Window function

2017-02-06 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15853963#comment-15853963 ] Herman van Hovell commented on SPARK-19451: --- At the end of the day I would like

[jira] [Commented] (SPARK-19451) Long values in Window function

2017-02-06 Thread Julien Champ (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15853936#comment-15853936 ] Julien Champ commented on SPARK-19451: -- Glad to see that I'm not the only one convin

[jira] [Assigned] (SPARK-19471) [SQL]A confusing NullPointerException when creating table

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19471: Assignee: (was: Apache Spark) > [SQL]A confusing NullPointerException when creating ta

[jira] [Assigned] (SPARK-19471) [SQL]A confusing NullPointerException when creating table

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19471: Assignee: Apache Spark > [SQL]A confusing NullPointerException when creating table > -

[jira] [Commented] (SPARK-19471) [SQL]A confusing NullPointerException when creating table

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15853906#comment-15853906 ] Apache Spark commented on SPARK-19471: -- User 'yangw1234' has created a pull request

[jira] [Resolved] (SPARK-19469) PySpark should allow driver process on different machine

2017-02-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-19469. --- Resolution: Not A Problem I don't think this makes sense. The JVM and Python process must be colocate

[jira] [Resolved] (SPARK-18341) Eliminate use of SingularMatrixException in WeightedLeastSquares logic

2017-02-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18341. --- Resolution: Won't Fix > Eliminate use of SingularMatrixException in WeightedLeastSquares logic >

[jira] [Commented] (SPARK-19449) Inconsistent results between ml package RandomForestClassificationModel and mllib package RandomForestModel

2017-02-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15853831#comment-15853831 ] Sean Owen commented on SPARK-19449: --- This isn't a bug. It's not expected that, even if

[jira] [Comment Edited] (SPARK-10643) Support remote application download in client mode spark submit

2017-02-06 Thread wangqiaoshi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15853817#comment-15853817 ] wangqiaoshi edited comment on SPARK-10643 at 2/6/17 10:50 AM: -

[jira] [Resolved] (SPARK-19470) Spark 1.6.4 in Intellij can't use jetty 8

2017-02-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-19470. --- Resolution: Invalid There is no Spark 1.6.4, and you can't expect to change a dependency with no oth

[jira] [Created] (SPARK-19474) SparkSQL unsupports to change hive table's name\dataType

2017-02-06 Thread Xiaochen Ouyang (JIRA)
Xiaochen Ouyang created SPARK-19474: --- Summary: SparkSQL unsupports to change hive table's name\dataType Key: SPARK-19474 URL: https://issues.apache.org/jira/browse/SPARK-19474 Project: Spark

[jira] [Commented] (SPARK-10643) Support remote application download in client mode spark submit

2017-02-06 Thread wangqiaoshi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15853817#comment-15853817 ] wangqiaoshi commented on SPARK-10643: - +1. i think it would be useful when use azkab

[jira] [Commented] (SPARK-19468) Dataset slow because of unnecessary shuffles

2017-02-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15853809#comment-15853809 ] Sean Owen commented on SPARK-19468: --- I am unclear whether this is a bug report. You're

[jira] [Commented] (SPARK-19451) Long values in Window function

2017-02-06 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15853800#comment-15853800 ] Herman van Hovell commented on SPARK-19451: --- Yeah, you are right about that. We

[jira] [Commented] (SPARK-19451) Long values in Window function

2017-02-06 Thread Julien Champ (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15853793#comment-15853793 ] Julien Champ commented on SPARK-19451: -- Let's imagine that this window is used on ti

[jira] [Updated] (SPARK-16441) Spark application hang when dynamic allocation is enabled

2017-02-06 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-16441: Attachment: SPARK-16441-yarn-metrics.jpg SPARK-16441-threadDump.jpg

[jira] [Updated] (SPARK-16441) Spark application hang when dynamic allocation is enabled

2017-02-06 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-16441: Affects Version/s: 2.1.0 > Spark application hang when dynamic allocation is enabled >

[jira] [Commented] (SPARK-16441) Spark application hang when dynamic allocation is enabled

2017-02-06 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15853784#comment-15853784 ] Yuming Wang commented on SPARK-16441: - set {{spark.dynamicAllocation.maxExecutors}} t

[jira] [Commented] (SPARK-19451) Long values in Window function

2017-02-06 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15853782#comment-15853782 ] Herman van Hovell commented on SPARK-19451: --- [~jchamp] how may rows are in your

[jira] [Assigned] (SPARK-16441) Spark application hang when dynamic allocation is enabled

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16441: Assignee: (was: Apache Spark) > Spark application hang when dynamic allocation is enab

[jira] [Assigned] (SPARK-16441) Spark application hang when dynamic allocation is enabled

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16441: Assignee: Apache Spark > Spark application hang when dynamic allocation is enabled > -

[jira] [Commented] (SPARK-16441) Spark application hang when dynamic allocation is enabled

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15853775#comment-15853775 ] Apache Spark commented on SPARK-16441: -- User 'wangyum' has created a pull request fo

[jira] [Commented] (SPARK-19451) Long values in Window function

2017-02-06 Thread Genmao Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15853773#comment-15853773 ] Genmao Yu commented on SPARK-19451: --- [~jchamp] I have taken a fast look through the cod

[jira] [Comment Edited] (SPARK-19451) Long values in Window function

2017-02-06 Thread Genmao Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15853773#comment-15853773 ] Genmao Yu edited comment on SPARK-19451 at 2/6/17 9:58 AM: --- [~j

[jira] [Assigned] (SPARK-19451) Long values in Window function

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19451: Assignee: (was: Apache Spark) > Long values in Window function > -

[jira] [Commented] (SPARK-19451) Long values in Window function

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15853763#comment-15853763 ] Apache Spark commented on SPARK-19451: -- User 'uncleGen' has created a pull request f

[jira] [Assigned] (SPARK-19451) Long values in Window function

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19451: Assignee: Apache Spark > Long values in Window function > -- >

[jira] [Commented] (SPARK-17910) Allow users to update the comment of a column

2017-02-06 Thread Xiaochen Ouyang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15853748#comment-15853748 ] Xiaochen Ouyang commented on SPARK-17910: - Hey,I wonder that do we have a plan to

[jira] [Commented] (SPARK-19451) Long values in Window function

2017-02-06 Thread Julien Champ (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15853685#comment-15853685 ] Julien Champ commented on SPARK-19451: -- Thanks [~uncleGen] for your answer. I was t

[jira] [Commented] (SPARK-17213) Parquet String Pushdown for Non-Eq Comparisons Broken

2017-02-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15853673#comment-15853673 ] Apache Spark commented on SPARK-17213: -- User 'HyukjinKwon' has created a pull reques