[jira] [Created] (SPARK-20793) cache table will not refresh after insert data to some broadcast table

2017-05-17 Thread du (JIRA)
du created SPARK-20793: -- Summary: cache table will not refresh after insert data to some broadcast table Key: SPARK-20793 URL: https://issues.apache.org/jira/browse/SPARK-20793 Project: Spark Issue Typ

[jira] [Updated] (SPARK-20700) InferFiltersFromConstraints stackoverflows for query (v2)

2017-05-17 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-20700: Fix Version/s: 2.2.0 > InferFiltersFromConstraints stackoverflows for query (v2) >

[jira] [Resolved] (SPARK-20700) InferFiltersFromConstraints stackoverflows for query (v2)

2017-05-17 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-20700. - Resolution: Fixed > InferFiltersFromConstraints stackoverflows for query (v2) > -

[jira] [Assigned] (SPARK-20792) Support same timeout operations in mapGroupsWithState function in batch queries as in streaming queries

2017-05-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20792: Assignee: Apache Spark (was: Tathagata Das) > Support same timeout operations in mapGroup

[jira] [Assigned] (SPARK-20792) Support same timeout operations in mapGroupsWithState function in batch queries as in streaming queries

2017-05-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20792: Assignee: Tathagata Das (was: Apache Spark) > Support same timeout operations in mapGroup

[jira] [Commented] (SPARK-20792) Support same timeout operations in mapGroupsWithState function in batch queries as in streaming queries

2017-05-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16015226#comment-16015226 ] Apache Spark commented on SPARK-20792: -- User 'tdas' has created a pull request for t

[jira] [Created] (SPARK-20792) Support same timeout operations in mapGroupsWithState function in batch queries as in streaming queries

2017-05-17 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-20792: - Summary: Support same timeout operations in mapGroupsWithState function in batch queries as in streaming queries Key: SPARK-20792 URL: https://issues.apache.org/jira/browse/SPAR

[jira] [Commented] (SPARK-20506) ML, Graph 2.2 QA: Programming guide update and migration guide

2017-05-17 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16015215#comment-16015215 ] Yanbo Liang commented on SPARK-20506: - +1 [~mlnick] Adding section "Highlights in thi

[jira] [Updated] (SPARK-3181) Add Robust Regression Algorithm with Huber Estimator

2017-05-17 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-3181: --- Affects Version/s: 2.2.0 Target Version/s: 2.3.0 > Add Robust Regression Algorithm with Huber Est

[jira] [Commented] (SPARK-3181) Add Robust Regression Algorithm with Huber Estimator

2017-05-17 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16015198#comment-16015198 ] Yanbo Liang commented on SPARK-3181: [~mlnick] Yeah, the Breeze bug has been fixed. I

[jira] [Comment Edited] (SPARK-3181) Add Robust Regression Algorithm with Huber Estimator

2017-05-17 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16015198#comment-16015198 ] Yanbo Liang edited comment on SPARK-3181 at 5/18/17 4:49 AM: -

[jira] [Commented] (SPARK-20780) Spark Kafka10 Consumer Hangs

2017-05-17 Thread David Walsh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16015186#comment-16015186 ] David Walsh commented on SPARK-20780: - FYI [~srowen] it appears this is the underlyin

[jira] [Commented] (SPARK-10925) Exception when joining DataFrames

2017-05-17 Thread Haifeng Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16015176#comment-16015176 ] Haifeng Li commented on SPARK-10925: I tried the code with spark 2.1.1, I got same er

[jira] [Updated] (SPARK-10925) Exception when joining DataFrames

2017-05-17 Thread Haifeng Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haifeng Li updated SPARK-10925: --- Attachment: TestCase.scala > Exception when joining DataFrames > - >

[jira] [Closed] (SPARK-20777) Spark Streaming NullPointerException when restoring from hdfs checkpoint

2017-05-17 Thread Richard Moorhead (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Moorhead closed SPARK-20777. Resolution: Won't Fix Issue was multiple rdd transformations weren't checkpointed from where

[jira] [Resolved] (SPARK-20505) ML, Graph 2.2 QA: Update user guide for new features & APIs

2017-05-17 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-20505. - Resolution: Fixed Fix Version/s: 2.2.0 > ML, Graph 2.2 QA: Update user guide for new featu

[jira] [Updated] (SPARK-20786) Improve ceil and floor handle the value which is not expected

2017-05-17 Thread caoxuewen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] caoxuewen updated SPARK-20786: -- Summary: Improve ceil and floor handle the value which is not expected (was: Improve ceil handle the v

[jira] [Updated] (SPARK-20784) Spark hangs forever after a joinWith() and cache() in YARN client mode

2017-05-17 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-20784: - Summary: Spark hangs forever after a joinWith() and cache() in YARN client mode (was: Spark hang

[jira] [Commented] (SPARK-11597) improve performance of array and map encoder

2017-05-17 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16015065#comment-16015065 ] Wenchen Fan commented on SPARK-11597: - Actually this is already fixed by https://iss

[jira] [Updated] (SPARK-20504) ML 2.2 QA: API: Java compatibility, docs

2017-05-17 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weichen Xu updated SPARK-20504: --- Attachment: (updated)signature.diff (updated)process_script2.sh (updat

[jira] [Resolved] (SPARK-11597) improve performance of array and map encoder

2017-05-17 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-11597. - Resolution: Fixed Assignee: Wenchen Fan Fix Version/s: 2.1.0 > improve performanc

[jira] [Commented] (SPARK-11597) improve performance of array and map encoder

2017-05-17 Thread Min Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014993#comment-16014993 ] Min Shen commented on SPARK-11597: -- Is there any further update on this ticket? We have

[jira] [Commented] (SPARK-12139) REGEX Column Specification for Hive Queries

2017-05-17 Thread jane (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014984#comment-16014984 ] jane commented on SPARK-12139: -- This feature is very important. Please review. > REGEX Colu

[jira] [Commented] (SPARK-12139) REGEX Column Specification for Hive Queries

2017-05-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014982#comment-16014982 ] Apache Spark commented on SPARK-12139: -- User 'janewangfb' has created a pull request

[jira] [Resolved] (SPARK-13747) Concurrent execution in SQL doesn't work with Scala ForkJoinPool

2017-05-17 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-13747. -- Resolution: Fixed Fix Version/s: 2.2.0 Target Version/s: 2.1.0, 2.0.2 (was: 2.

[jira] [Commented] (SPARK-20791) Use Apache Arrow to Improve Spark createDataFrame from Pandas.DataFrame

2017-05-17 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014976#comment-16014976 ] Bryan Cutler commented on SPARK-20791: -- I will work on this pending SPARK-13534 bein

[jira] [Created] (SPARK-20791) Use Apache Arrow to Improve Spark createDataFrame from Pandas.DataFrame

2017-05-17 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-20791: Summary: Use Apache Arrow to Improve Spark createDataFrame from Pandas.DataFrame Key: SPARK-20791 URL: https://issues.apache.org/jira/browse/SPARK-20791 Project: Spar

[jira] [Commented] (SPARK-14141) Let user specify datatypes of pandas dataframe in toPandas()

2017-05-17 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014944#comment-16014944 ] Bryan Cutler commented on SPARK-14141: -- Take a look at SPARK-13534 which will make a

[jira] [Commented] (SPARK-17463) Serialization of accumulators in heartbeats is not thread-safe

2017-05-17 Thread Allen George (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014930#comment-16014930 ] Allen George commented on SPARK-17463: -- [~zsxwing] You're absolutely right: my apolo

[jira] [Commented] (SPARK-17463) Serialization of accumulators in heartbeats is not thread-safe

2017-05-17 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014799#comment-16014799 ] Shixiong Zhu commented on SPARK-17463: -- [~allengeorge] it's protected by `Collection

[jira] [Resolved] (SPARK-20788) Fix the Executor task reaper's false alarm warning logs

2017-05-17 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-20788. -- Resolution: Fixed Fix Version/s: 2.2.0 > Fix the Executor task reaper's false alarm warn

[jira] [Assigned] (SPARK-20790) ALS with implicit feedback ignores negative values

2017-05-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20790: Assignee: (was: Apache Spark) > ALS with implicit feedback ignores negative values > -

[jira] [Assigned] (SPARK-20790) ALS with implicit feedback ignores negative values

2017-05-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20790: Assignee: Apache Spark > ALS with implicit feedback ignores negative values >

[jira] [Commented] (SPARK-20790) ALS with implicit feedback ignores negative values

2017-05-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014771#comment-16014771 ] Apache Spark commented on SPARK-20790: -- User 'davideis' has created a pull request f

[jira] [Resolved] (SPARK-14584) Improve recognition of non-nullability in Dataset transformations

2017-05-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-14584. Resolution: Fixed Fix Version/s: 2.2.0 > Improve recognition of non-nullability in Dataset t

[jira] [Commented] (SPARK-14584) Improve recognition of non-nullability in Dataset transformations

2017-05-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014728#comment-16014728 ] Josh Rosen commented on SPARK-14584: [~maropu], yep, I think we can close it. I'll ma

[jira] [Comment Edited] (SPARK-20790) ALS with implicit feedback ignores negative values

2017-05-17 Thread David Eis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014723#comment-16014723 ] David Eis edited comment on SPARK-20790 at 5/17/17 8:41 PM: S

[jira] [Commented] (SPARK-20790) ALS with implicit feedback ignores negative values

2017-05-17 Thread David Eis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014723#comment-16014723 ] David Eis commented on SPARK-20790: --- See https://github.com/apache/spark/pull/5314/fil

[jira] [Commented] (SPARK-20790) ALS with implicit feedback ignores negative values

2017-05-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014704#comment-16014704 ] Sean Owen commented on SPARK-20790: --- This needs more detail. Where is the logic problem

[jira] [Resolved] (SPARK-20789) Spark monitoring UI - http://spark-master:8080/api/v1/applications is not responding with json msg

2017-05-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-20789. --- Resolution: Invalid Questions should go to the mailing list. > Spark monitoring UI - http://spark-ma

[jira] [Resolved] (SPARK-19555) Improve inefficient StringUtils.escapeLikeRegex() method

2017-05-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-19555. Resolution: Fixed Fix Version/s: 2.1.1 2.2.0 Fixed by SPARK-17647 in Spar

[jira] [Created] (SPARK-20790) ALS with implicit feedback ignores negative values

2017-05-17 Thread David Eis (JIRA)
David Eis created SPARK-20790: - Summary: ALS with implicit feedback ignores negative values Key: SPARK-20790 URL: https://issues.apache.org/jira/browse/SPARK-20790 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-20789) Spark monitoring UI - http://spark-master:8080/api/v1/applications is not responding with json msg

2017-05-17 Thread Ram Mettu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ram Mettu updated SPARK-20789: -- Summary: Spark monitoring UI - http://spark-master:8080/api/v1/applications is not responding with json

[jira] [Created] (SPARK-20789) Spark monitoring UI - http://spark-master:8080/api/v1/applications is not responding json message

2017-05-17 Thread Ram Mettu (JIRA)
Ram Mettu created SPARK-20789: - Summary: Spark monitoring UI - http://spark-master:8080/api/v1/applications is not responding json message Key: SPARK-20789 URL: https://issues.apache.org/jira/browse/SPARK-20789

[jira] [Comment Edited] (SPARK-17463) Serialization of accumulators in heartbeats is not thread-safe

2017-05-17 Thread Allen George (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014564#comment-16014564 ] Allen George edited comment on SPARK-17463 at 5/17/17 6:31 PM:

[jira] [Commented] (SPARK-17463) Serialization of accumulators in heartbeats is not thread-safe

2017-05-17 Thread Allen George (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014564#comment-16014564 ] Allen George commented on SPARK-17463: -- [~zsxwing] So, we're hitting this problem on

[jira] [Commented] (SPARK-19089) Support nested arrays/seqs in Datasets

2017-05-17 Thread Michal Šenkýř (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014536#comment-16014536 ] Michal Šenkýř commented on SPARK-19089: --- I would like to point out that SPARK-18891

[jira] [Assigned] (SPARK-20788) Fix the Executor task reaper's false alarm warning logs

2017-05-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20788: Assignee: Shixiong Zhu (was: Apache Spark) > Fix the Executor task reaper's false alarm w

[jira] [Commented] (SPARK-20788) Fix the Executor task reaper's false alarm warning logs

2017-05-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014531#comment-16014531 ] Apache Spark commented on SPARK-20788: -- User 'zsxwing' has created a pull request fo

[jira] [Assigned] (SPARK-20788) Fix the Executor task reaper's false alarm warning logs

2017-05-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20788: Assignee: Apache Spark (was: Shixiong Zhu) > Fix the Executor task reaper's false alarm w

[jira] [Commented] (SPARK-18891) Support for specific collection types

2017-05-17 Thread Michal Šenkýř (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014529#comment-16014529 ] Michal Šenkýř commented on SPARK-18891: --- I would also like to point out that the ch

[jira] [Created] (SPARK-20788) Fix the Executor task reaper's false alarm warning logs

2017-05-17 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-20788: Summary: Fix the Executor task reaper's false alarm warning logs Key: SPARK-20788 URL: https://issues.apache.org/jira/browse/SPARK-20788 Project: Spark Issue

[jira] [Assigned] (SPARK-20700) InferFiltersFromConstraints stackoverflows for query (v2)

2017-05-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20700: Assignee: Apache Spark (was: Jiang Xingbo) > InferFiltersFromConstraints stackoverflows f

[jira] [Assigned] (SPARK-20700) InferFiltersFromConstraints stackoverflows for query (v2)

2017-05-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20700: Assignee: Jiang Xingbo (was: Apache Spark) > InferFiltersFromConstraints stackoverflows f

[jira] [Commented] (SPARK-20700) InferFiltersFromConstraints stackoverflows for query (v2)

2017-05-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014458#comment-16014458 ] Apache Spark commented on SPARK-20700: -- User 'jiangxb1987' has created a pull reques

[jira] [Commented] (SPARK-1503) Implement Nesterov's accelerated first-order method

2017-05-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014394#comment-16014394 ] Joseph K. Bradley commented on SPARK-1503: -- I'm fine with us closing it since the

[jira] [Updated] (SPARK-20139) Spark UI reports partial success for completed stage while log shows all tasks are finished

2017-05-17 Thread Ivan Gozali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Gozali updated SPARK-20139: Attachment: screenshot-2.png Spark shows success but UI doesn't show all tasks completed. > Spark

[jira] [Issue Comment Deleted] (SPARK-20139) Spark UI reports partial success for completed stage while log shows all tasks are finished

2017-05-17 Thread Ivan Gozali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Gozali updated SPARK-20139: Comment: was deleted (was: Spark shows success but UI doesn't show all tasks completed.) > Spark U

[jira] [Commented] (SPARK-20139) Spark UI reports partial success for completed stage while log shows all tasks are finished

2017-05-17 Thread Ivan Gozali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014391#comment-16014391 ] Ivan Gozali commented on SPARK-20139: - Facing a similar issue with 8 [AWS m2.4xlarge

[jira] [Created] (SPARK-20787) PySpark can't handle datetimes before 1900

2017-05-17 Thread Keith Bourgoin (JIRA)
Keith Bourgoin created SPARK-20787: -- Summary: PySpark can't handle datetimes before 1900 Key: SPARK-20787 URL: https://issues.apache.org/jira/browse/SPARK-20787 Project: Spark Issue Type: Bu

[jira] [Assigned] (SPARK-20748) Built-in SQL Function Support - CH[A]R

2017-05-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20748: Assignee: (was: Apache Spark) > Built-in SQL Function Support - CH[A]R > -

[jira] [Assigned] (SPARK-20748) Built-in SQL Function Support - CH[A]R

2017-05-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20748: Assignee: Apache Spark > Built-in SQL Function Support - CH[A]R >

[jira] [Commented] (SPARK-20748) Built-in SQL Function Support - CH[A]R

2017-05-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014311#comment-16014311 ] Apache Spark commented on SPARK-20748: -- User 'wangyum' has created a pull request fo

[jira] [Commented] (SPARK-20712) [SQL] Spark can't read Hive table when column type has length greater than 4000 bytes

2017-05-17 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014105#comment-16014105 ] Wenchen Fan commented on SPARK-20712: - I can't reproduce this in master branch, can y

[jira] [Resolved] (SPARK-6349) Add probability estimates in SVMModel predict result

2017-05-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6349. -- Resolution: Won't Fix > Add probability estimates in SVMModel predict result > -

[jira] [Resolved] (SPARK-19303) Add evaluate method in clustering models

2017-05-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-19303. --- Resolution: Won't Fix > Add evaluate method in clustering models > --

[jira] [Resolved] (SPARK-19379) SparkAppHandle.getState not registering FAILED state upon Spark app failure in Local mode

2017-05-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-19379. --- Resolution: Won't Fix > SparkAppHandle.getState not registering FAILED state upon Spark app failure

[jira] [Resolved] (SPARK-18981) The last job hung when speculation is on

2017-05-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18981. --- Resolution: Won't Fix > The last job hung when speculation is on > --

[jira] [Commented] (SPARK-14174) Accelerate KMeans via Mini-Batch EM

2017-05-17 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014047#comment-16014047 ] Nick Pentreath commented on SPARK-14174: [~podongfeng] did you manage to look int

[jira] [Resolved] (SPARK-18867) Throw cause if IsolatedClientLoad can't create client

2017-05-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18867. --- Resolution: Won't Fix > Throw cause if IsolatedClientLoad can't create client > -

[jira] [Resolved] (SPARK-14974) spark sql job create too many files in HDFS when doing insert overwrite hive table

2017-05-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-14974. --- Resolution: Not A Problem > spark sql job create too many files in HDFS when doing insert overwrite h

[jira] [Resolved] (SPARK-16987) Add spark-default.conf property to define https port for spark history server

2017-05-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16987. --- Resolution: Duplicate > Add spark-default.conf property to define https port for spark history server

[jira] [Commented] (SPARK-6000) Batch K-Means clusters should support "mini-batch" updates

2017-05-17 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014046#comment-16014046 ] Nick Pentreath commented on SPARK-6000: --- Even though SPARK-14174 is later - it seems

[jira] [Resolved] (SPARK-16253) make spark sql compatible with hive sql that using python script transform like using 'xxx.py'

2017-05-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16253. --- Resolution: Not A Problem > make spark sql compatible with hive sql that using python script transfor

[jira] [Commented] (SPARK-6349) Add probability estimates in SVMModel predict result

2017-05-17 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014043#comment-16014043 ] Nick Pentreath commented on SPARK-6349: --- This is now covered by {{ml}}'s {{LinearSVC

[jira] [Commented] (SPARK-6417) Add Linear Programming algorithm

2017-05-17 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014041#comment-16014041 ] Nick Pentreath commented on SPARK-6417: --- I think it's fairly safe to say there is no

[jira] [Closed] (SPARK-6417) Add Linear Programming algorithm

2017-05-17 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath closed SPARK-6417. - Resolution: Won't Fix > Add Linear Programming algorithm > - > >

[jira] [Commented] (SPARK-7290) Add StringVectorizer

2017-05-17 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014039#comment-16014039 ] Nick Pentreath commented on SPARK-7290: --- Is this still desired? Seems it perhaps doe

[jira] [Resolved] (SPARK-14661) Trim PCAModel by required explained variance

2017-05-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-14661. --- Resolution: Won't Fix > Trim PCAModel by required explained variance > --

[jira] [Resolved] (SPARK-14289) Support multiple eviction strategies for cached RDD partitions

2017-05-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-14289. --- Resolution: Won't Fix > Support multiple eviction strategies for cached RDD partitions >

[jira] [Resolved] (SPARK-14293) Improve shuffle load balancing and minimize network data transmission

2017-05-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-14293. --- Resolution: Won't Fix > Improve shuffle load balancing and minimize network data transmission > -

[jira] [Commented] (SPARK-3181) Add Robust Regression Algorithm with Huber Estimator

2017-05-17 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014026#comment-16014026 ] Nick Pentreath commented on SPARK-3181: --- So the Breeze bug is fixed now right? Will

[jira] [Closed] (SPARK-5328) Update PySpark MLlib NaiveBayes API to take model type parameter for Bernoulli fit

2017-05-17 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath closed SPARK-5328. - Resolution: Won't Fix > Update PySpark MLlib NaiveBayes API to take model type parameter for > Be

[jira] [Commented] (SPARK-5328) Update PySpark MLlib NaiveBayes API to take model type parameter for Bernoulli fit

2017-05-17 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014005#comment-16014005 ] Nick Pentreath commented on SPARK-5328: --- This is pretty stale so I'll close it off,

[jira] [Commented] (SPARK-1503) Implement Nesterov's accelerated first-order method

2017-05-17 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16013999#comment-16013999 ] Nick Pentreath commented on SPARK-1503: --- I think it's safe to say this won't go into

[jira] [Comment Edited] (SPARK-1503) Implement Nesterov's accelerated first-order method

2017-05-17 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16013999#comment-16013999 ] Nick Pentreath edited comment on SPARK-1503 at 5/17/17 1:13 PM:

[jira] [Commented] (SPARK-1359) SGD implementation is not efficient

2017-05-17 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16013998#comment-16013998 ] Nick Pentreath commented on SPARK-1359: --- Do we care much about this now, since {{mll

[jira] [Closed] (SPARK-12015) Auto convert int to Double when required in pyspark.ml

2017-05-17 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath closed SPARK-12015. -- Resolution: Duplicate > Auto convert int to Double when required in pyspark.ml > --

[jira] [Commented] (SPARK-12015) Auto convert int to Double when required in pyspark.ml

2017-05-17 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16013996#comment-16013996 ] Nick Pentreath commented on SPARK-12015: This was fixed in SPARK-7425 - closing a

[jira] [Commented] (SPARK-12686) Support group-by push down into data sources

2017-05-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16013983#comment-16013983 ] Apache Spark commented on SPARK-12686: -- User 'kisimple' has created a pull request f

[jira] [Updated] (SPARK-20723) Random Forest Classifier should expose intermediateRDDStorageLevel similar to ALS

2017-05-17 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-20723: --- Target Version/s: (was: 2.3.0) > Random Forest Classifier should expose intermediateRDDStor

[jira] [Updated] (SPARK-20723) Random Forest Classifier should expose intermediateRDDStorageLevel similar to ALS

2017-05-17 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-20723: --- Affects Version/s: (was: 2.3.0) 2.2.0 > Random Forest Classifier s

[jira] [Commented] (SPARK-20723) Random Forest Classifier should expose intermediateRDDStorageLevel similar to ALS

2017-05-17 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16013972#comment-16013972 ] Nick Pentreath commented on SPARK-20723: Please don't set Target Version by the w

[jira] [Commented] (SPARK-20784) Spark hangs forever after a joinWith() and cache()

2017-05-17 Thread Mathieu D (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16013911#comment-16013911 ] Mathieu D commented on SPARK-20784: --- Changed the title. It's noted for the self reprodu

[jira] [Updated] (SPARK-20784) Spark hangs forever after a joinWith() and cache()

2017-05-17 Thread Mathieu D (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mathieu D updated SPARK-20784: -- Summary: Spark hangs forever after a joinWith() and cache() (was: Spark hangs forever) > Spark hangs

[jira] [Comment Edited] (SPARK-20784) Spark hangs forever

2017-05-17 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16013901#comment-16013901 ] Hyukjin Kwon edited comment on SPARK-20784 at 5/17/17 12:05 PM: ---

[jira] [Commented] (SPARK-20784) Spark hangs forever

2017-05-17 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16013901#comment-16013901 ] Hyukjin Kwon commented on SPARK-20784: -- Could you update the JIRA title? it sounds S

[jira] [Assigned] (SPARK-20786) Improve ceil handle the value which is not expected

2017-05-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20786: Assignee: (was: Apache Spark) > Improve ceil handle the value which is not expected >

[jira] [Assigned] (SPARK-20786) Improve ceil handle the value which is not expected

2017-05-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20786: Assignee: Apache Spark > Improve ceil handle the value which is not expected > ---

[jira] [Commented] (SPARK-20786) Improve ceil handle the value which is not expected

2017-05-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16013836#comment-16013836 ] Apache Spark commented on SPARK-20786: -- User 'heary-cao' has created a pull request
