[jira] [Resolved] (SPARK-13503) Support to specify the (writing) option for compression codec for TEXT

2016-02-25 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-13503. - Resolution: Fixed Assignee: Hyukjin Kwon Fix Version/s: 2.0.0 > Support to specif

[jira] [Commented] (SPARK-13289) Word2Vec generate infinite distances when numIterations>5

2016-02-25 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168619#comment-15168619 ] Nick Pentreath commented on SPARK-13289: Master branch should be building now. Ca

[jira] [Commented] (SPARK-13445) Selecting "data" with window function does not work unless aliased (using PARTITION BY)

2016-02-25 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168608#comment-15168608 ] Xiao Li commented on SPARK-13445: - Tried it in the latest 1.6 upstream. The query can be

[jira] [Assigned] (SPARK-13450) SortMergeJoin will OOM when join rows have lot of same keys

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13450: Assignee: Apache Spark > SortMergeJoin will OOM when join rows have lot of same keys > ---

[jira] [Commented] (SPARK-13450) SortMergeJoin will OOM when join rows have lot of same keys

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168595#comment-15168595 ] Apache Spark commented on SPARK-13450: -- User 'shenh062326' has created a pull reques

[jira] [Assigned] (SPARK-13450) SortMergeJoin will OOM when join rows have lot of same keys

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13450: Assignee: (was: Apache Spark) > SortMergeJoin will OOM when join rows have lot of same

[jira] [Updated] (SPARK-13445) Selecting "data" with window function does not work unless aliased (using PARTITION BY)

2016-02-25 Thread Devaraj K (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Devaraj K updated SPARK-13445: -- Summary: Selecting "data" with window function does not work unless aliased (using PARTITION BY) (was:

[jira] [Resolved] (SPARK-13487) User-facing RuntimeConfig interface

2016-02-25 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-13487. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11378 [https://github.com/

[jira] [Commented] (SPARK-13445) Seleting "data" with window function does not work unless aliased (using PARTITION BY)

2016-02-25 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168568#comment-15168568 ] Xiao Li commented on SPARK-13445: - Yeah. In 1.6.0, I can reproduce it. Will try to see if

[jira] [Updated] (SPARK-13445) Seleting "data" with window function does not work unless aliased (using PARTITION BY)

2016-02-25 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-13445: Affects Version/s: 1.6.0 > Seleting "data" with window function does not work unless aliased (using > PART

[jira] [Updated] (SPARK-12941) Spark-SQL JDBC Oracle dialect fails to map string datatypes to Oracle VARCHAR datatype

2016-02-25 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-12941: - Assignee: Thomas Sebastian > Spark-SQL JDBC Oracle dialect fails to map string datatypes to Oracle VARCHA

[jira] [Resolved] (SPARK-12941) Spark-SQL JDBC Oracle dialect fails to map string datatypes to Oracle VARCHAR datatype

2016-02-25 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-12941. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11306 [https://github.com/

[jira] [Commented] (SPARK-13445) Seleting "data" with window function does not work unless aliased (using PARTITION BY)

2016-02-25 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168532#comment-15168532 ] Xiao Li commented on SPARK-13445: - Let me try it in 1.6. > Seleting "data" with window

[jira] [Commented] (SPARK-13505) Python API for MaxAbsScaler

2016-02-25 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168529#comment-15168529 ] Nick Pentreath commented on SPARK-13505: [~holdenk] [~bryanc] [~sethah] any inter

[jira] [Commented] (SPARK-13445) Seleting "data" with window function does not work unless aliased (using PARTITION BY)

2016-02-25 Thread Simeon Simeonov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168527#comment-15168527 ] Simeon Simeonov commented on SPARK-13445: - I can reproduce it consistently in our

[jira] [Commented] (SPARK-13463) Support Column pruning for Dataset logical plan

2016-02-25 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168504#comment-15168504 ] Xiao Li commented on SPARK-13463: - Sure, will do it this weekend. Thanks! > Support Colu

[jira] [Commented] (SPARK-13445) Seleting "data" with window function does not work unless aliased (using PARTITION BY)

2016-02-25 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168501#comment-15168501 ] Xiao Li commented on SPARK-13445: - I am unable to reproduce the error that is described i

[jira] [Updated] (SPARK-13504) Add approxQuantile for SparkR

2016-02-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13504: -- Target Version/s: 2.0.0 Issue Type: New Feature (was: Improvement) > Add approxQuant

[jira] [Updated] (SPARK-13504) Add approxQuantile for SparkR

2016-02-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13504: -- Assignee: Yanbo Liang > Add approxQuantile for SparkR > - > >

[jira] [Resolved] (SPARK-13504) Add approxQuantile for SparkR

2016-02-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-13504. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11383 [https://g

[jira] [Assigned] (SPARK-13503) Support to specify the (writing) option for compression codec for TEXT

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13503: Assignee: Apache Spark > Support to specify the (writing) option for compression codec for

[jira] [Commented] (SPARK-13503) Support to specify the (writing) option for compression codec for TEXT

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168457#comment-15168457 ] Apache Spark commented on SPARK-13503: -- User 'HyukjinKwon' has created a pull reques

[jira] [Assigned] (SPARK-13503) Support to specify the (writing) option for compression codec for TEXT

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13503: Assignee: (was: Apache Spark) > Support to specify the (writing) option for compressio

[jira] [Updated] (SPARK-12363) PowerIterationClustering test case failed if we deprecated KMeans.setRuns

2016-02-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12363: -- Fix Version/s: 1.3.2 > PowerIterationClustering test case failed if we deprecated KMeans.setRun

[jira] [Updated] (SPARK-12363) PowerIterationClustering test case failed if we deprecated KMeans.setRuns

2016-02-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12363: -- Target Version/s: 1.3.2, 1.4.2, 1.5.3, 1.6.1, 2.0.0 (was: 2.0.0) > PowerIterationClustering te

[jira] [Updated] (SPARK-12363) PowerIterationClustering test case failed if we deprecated KMeans.setRuns

2016-02-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12363: -- Shepherd: Xiangrui Meng (was: Joseph K. Bradley) > PowerIterationClustering test case failed i

[jira] [Resolved] (SPARK-13033) PySpark ml.regression support export/import

2016-02-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-13033. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11000 [https://g

[jira] [Created] (SPARK-13505) Python API for MaxAbsScaler

2016-02-25 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-13505: - Summary: Python API for MaxAbsScaler Key: SPARK-13505 URL: https://issues.apache.org/jira/browse/SPARK-13505 Project: Spark Issue Type: New Feature

[jira] [Resolved] (SPARK-13028) Add MaxAbsScaler to ML.feature as a transformer

2016-02-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-13028. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 10939 [https://g

[jira] [Updated] (SPARK-13036) PySpark ml.feature support export/import

2016-02-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13036: -- Assignee: Xusen Yin > PySpark ml.feature support export/import > --

[jira] [Updated] (SPARK-13036) PySpark ml.feature support export/import

2016-02-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13036: -- Target Version/s: 2.0.0 > PySpark ml.feature support export/import > --

[jira] [Commented] (SPARK-13503) Support to specify the (writing) option for compression codec for TEXT

2016-02-25 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168439#comment-15168439 ] Hyukjin Kwon commented on SPARK-13503: -- I completely forgot I actually did this for

[jira] [Updated] (SPARK-13503) Support to specify the (writing) option for compression codec for TEXT

2016-02-25 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13503: - Description: CSV and JSON support to specify compression option for writing (this was done by [t

[jira] [Commented] (SPARK-13503) Support to specify the (writing) option for compression codec for JSON and Text

2016-02-25 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168431#comment-15168431 ] Hyukjin Kwon commented on SPARK-13503: -- [~rxin] Sure. > Support to specify the (wri

[jira] [Commented] (SPARK-13503) Support to specify the (writing) option for compression codec for JSON and Text

2016-02-25 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168412#comment-15168412 ] Reynold Xin commented on SPARK-13503: - Sounds good. Let's make sure we have the same

[jira] [Resolved] (SPARK-13361) Add benchmark codes for Encoder#compress() in CompressionSchemeBenchmark

2016-02-25 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-13361. - Resolution: Fixed Assignee: Takeshi Yamamuro Fix Version/s: 2.0.0 > Add benchmark

[jira] [Assigned] (SPARK-13504) Add approxQuantile for SparkR

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13504: Assignee: (was: Apache Spark) > Add approxQuantile for SparkR > --

[jira] [Commented] (SPARK-13504) Add approxQuantile for SparkR

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168403#comment-15168403 ] Apache Spark commented on SPARK-13504: -- User 'yanboliang' has created a pull request

[jira] [Assigned] (SPARK-13504) Add approxQuantile for SparkR

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13504: Assignee: Apache Spark > Add approxQuantile for SparkR > - > >

[jira] [Created] (SPARK-13504) Add approxQuantile for SparkR

2016-02-25 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-13504: --- Summary: Add approxQuantile for SparkR Key: SPARK-13504 URL: https://issues.apache.org/jira/browse/SPARK-13504 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-13503) Support to specify the (writing) option for compression codec for JSON and Text

2016-02-25 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-13503: - Description: CSV supports to specify compression option for writing (this was done [this PR|http

[jira] [Commented] (SPARK-13503) Support to specify the (writing) option for compression codec for JSON and Text

2016-02-25 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168319#comment-15168319 ] Hyukjin Kwon commented on SPARK-13503: -- [~rxin] I can work on this but just want to

[jira] [Created] (SPARK-13503) Support to specify the (writing) option for compression codec for JSON and Text

2016-02-25 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-13503: Summary: Support to specify the (writing) option for compression codec for JSON and Text Key: SPARK-13503 URL: https://issues.apache.org/jira/browse/SPARK-13503 Proje

[jira] [Commented] (SPARK-13484) Filter outer joined result using a non-nullable column from the right table

2016-02-25 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168246#comment-15168246 ] Takeshi Yamamuro commented on SPARK-13484: -- Thanks! > Filter outer joined resul

[jira] [Resolved] (SPARK-12757) Use reference counting to prevent blocks from being evicted during reads

2016-02-25 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-12757. --- Resolution: Fixed Fix Version/s: 2.0.0 > Use reference counting to prevent blocks from being e

[jira] [Resolved] (SPARK-13387) Add support for SPARK_DAEMON_JAVA_OPTS with MesosClusterDispatcher.

2016-02-25 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-13387. --- Resolution: Fixed Assignee: Timothy Chen Fix Version/s: 2.0.0 Target Vers

[jira] [Resolved] (SPARK-13501) Remove use of Guava Stopwatch class

2016-02-25 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-13501. --- Resolution: Fixed Fix Version/s: 2.0.0 > Remove use of Guava Stopwatch class > ---

[jira] [Updated] (SPARK-12009) Avoid re-allocate yarn container while driver want to stop all Executors

2016-02-25 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-12009: -- Fix Version/s: (was: 1.6.1) > Avoid re-allocate yarn container while driver want to stop all Execut

[jira] [Updated] (SPARK-12009) Avoid re-allocate yarn container while driver want to stop all Executors

2016-02-25 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-12009: -- Assignee: SuYan > Avoid re-allocate yarn container while driver want to stop all Executors > --

[jira] [Resolved] (SPARK-12009) Avoid re-allocate yarn container while driver want to stop all Executors

2016-02-25 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-12009. --- Resolution: Fixed Fix Version/s: 2.0.0 1.6.1 > Avoid re-allocate yarn conta

[jira] [Resolved] (SPARK-13483) URL address error in Spark web ui in YARN mode

2016-02-25 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-13483. -- Resolution: Duplicate > URL address error in Spark web ui in YARN mode > --

[jira] [Commented] (SPARK-13502) Missing ml.NaiveBayes in MLlib guide

2016-02-25 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168192#comment-15168192 ] Xusen Yin commented on SPARK-13502: --- FYI [~mengxr] > Missing ml.NaiveBayes in MLlib gu

[jira] [Created] (SPARK-13502) Missing ml.NaiveBayes in MLlib guide

2016-02-25 Thread Xusen Yin (JIRA)
Xusen Yin created SPARK-13502: - Summary: Missing ml.NaiveBayes in MLlib guide Key: SPARK-13502 URL: https://issues.apache.org/jira/browse/SPARK-13502 Project: Spark Issue Type: Documentation

[jira] [Closed] (SPARK-7768) Make user-defined type (UDT) API public

2016-02-25 Thread Jakob Odersky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jakob Odersky closed SPARK-7768. Resolution: Fixed > Make user-defined type (UDT) API public > --

[jira] [Comment Edited] (SPARK-13288) [1.6.0] Memory leak in Spark streaming

2016-02-25 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168174#comment-15168174 ] Shixiong Zhu edited comment on SPARK-13288 at 2/26/16 12:31 AM: ---

[jira] [Commented] (SPARK-13459) Separate Alive and Dead Executors in Executor Totals Table

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168175#comment-15168175 ] Apache Spark commented on SPARK-13459: -- User 'ajbozarth' has created a pull request

[jira] [Commented] (SPARK-13288) [1.6.0] Memory leak in Spark streaming

2016-02-25 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168174#comment-15168174 ] Shixiong Zhu commented on SPARK-13288: -- A head dump file would be very helpful. Howe

[jira] [Commented] (SPARK-11293) Spillable collections leak shuffle memory

2016-02-25 Thread Russell Alexander Spitzer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168168#comment-15168168 ] Russell Alexander Spitzer commented on SPARK-11293: --- Saw a similar issu

[jira] [Commented] (SPARK-7768) Make user-defined type (UDT) API public

2016-02-25 Thread Jakob Odersky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168155#comment-15168155 ] Jakob Odersky commented on SPARK-7768: -- [~marmbrus] UDTs are public now (in Scala at

[jira] [Assigned] (SPARK-11011) UserDefinedType serialization should be strongly typed

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11011: Assignee: Apache Spark > UserDefinedType serialization should be strongly typed >

[jira] [Commented] (SPARK-11011) UserDefinedType serialization should be strongly typed

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168135#comment-15168135 ] Apache Spark commented on SPARK-11011: -- User 'jodersky' has created a pull request f

[jira] [Assigned] (SPARK-11011) UserDefinedType serialization should be strongly typed

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11011: Assignee: (was: Apache Spark) > UserDefinedType serialization should be strongly typed

[jira] [Resolved] (SPARK-13468) Fix a corner case where the page UI should show DAG but it doesn't show

2016-02-25 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-13468. -- Resolution: Fixed Assignee: Liwei Lin Fix Version/s: 2.0.0 > Fix a corner case

[jira] [Commented] (SPARK-13487) User-facing RuntimeConfig interface

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168043#comment-15168043 ] Apache Spark commented on SPARK-13487: -- User 'rxin' has created a pull request for t

[jira] [Commented] (SPARK-13444) QuantileDiscretizer chooses bad splits on large DataFrames

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13444?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168037#comment-15168037 ] Apache Spark commented on SPARK-13444: -- User 'oliverpierson' has created a pull requ

[jira] [Commented] (SPARK-13501) Remove use of Guava Stopwatch class

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168025#comment-15168025 ] Apache Spark commented on SPARK-13501: -- User 'JoshRosen' has created a pull request

[jira] [Assigned] (SPARK-13501) Remove use of Guava Stopwatch class

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13501: Assignee: Apache Spark (was: Josh Rosen) > Remove use of Guava Stopwatch class >

[jira] [Assigned] (SPARK-13501) Remove use of Guava Stopwatch class

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13501: Assignee: Josh Rosen (was: Apache Spark) > Remove use of Guava Stopwatch class >

[jira] [Created] (SPARK-13501) Remove use of Guava Stopwatch class

2016-02-25 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-13501: -- Summary: Remove use of Guava Stopwatch class Key: SPARK-13501 URL: https://issues.apache.org/jira/browse/SPARK-13501 Project: Spark Issue Type: Bug Affects V

[jira] [Commented] (SPARK-13500) Add an example for LDA in PySpark

2016-02-25 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168021#comment-15168021 ] Bryan Cutler commented on SPARK-13500: -- I'm working on it :D > Add an example for L

[jira] [Created] (SPARK-13500) Add an example for LDA in PySpark

2016-02-25 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-13500: Summary: Add an example for LDA in PySpark Key: SPARK-13500 URL: https://issues.apache.org/jira/browse/SPARK-13500 Project: Spark Issue Type: Improvement

[jira] [Comment Edited] (SPARK-12878) Dataframe fails with nested User Defined Types

2016-02-25 Thread Jakob Odersky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12878?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15160119#comment-15160119 ] Jakob Odersky edited comment on SPARK-12878 at 2/25/16 10:22 PM: --

[jira] [Commented] (SPARK-10712) JVM crashes with spark.sql.tungsten.enabled = true

2016-02-25 Thread Jakob Odersky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15167911#comment-15167911 ] Jakob Odersky commented on SPARK-10712: --- Any news on this? Is it still an issue? >

[jira] [Resolved] (SPARK-13292) QuantileDiscretizer should take random seed in PySpark

2016-02-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-13292. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11362 [https://g

[jira] [Updated] (SPARK-12874) ML StringIndexer does not protect itself from column name duplication

2016-02-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12874: -- Fix Version/s: 1.6.2 > ML StringIndexer does not protect itself from column name duplication >

[jira] [Resolved] (SPARK-12874) ML StringIndexer does not protect itself from column name duplication

2016-02-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-12874. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11370 [https://g

[jira] [Updated] (SPARK-12874) ML StringIndexer does not protect itself from column name duplication

2016-02-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12874: -- Assignee: Yu Ishikawa > ML StringIndexer does not protect itself from column name duplication >

[jira] [Updated] (SPARK-13385) Enable AssociationRules to generate consequents with user-defined lengths

2016-02-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13385: -- Assignee: zhengruifeng > Enable AssociationRules to generate consequents with user-defined leng

[jira] [Updated] (SPARK-13385) Enable AssociationRules to generate consequents with user-defined lengths

2016-02-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13385: -- Shepherd: (was: Xiangrui Meng) > Enable AssociationRules to generate consequents with user-de

[jira] [Updated] (SPARK-11219) Make Parameter Description Format Consistent in PySpark.MLlib

2016-02-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11219: -- Assignee: Bryan Cutler > Make Parameter Description Format Consistent in PySpark.MLlib > --

[jira] [Assigned] (SPARK-13499) Optimize vectorized parquet reader for dictionary encoded data and RLE decoding

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13499: Assignee: Apache Spark > Optimize vectorized parquet reader for dictionary encoded data an

[jira] [Commented] (SPARK-13489) GSoC 2016 project ideas for MLlib

2016-02-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15167844#comment-15167844 ] Xiangrui Meng commented on SPARK-13489: --- We can roughly discuss the theme before wo

[jira] [Commented] (SPARK-13499) Optimize vectorized parquet reader for dictionary encoded data and RLE decoding

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13499?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15167843#comment-15167843 ] Apache Spark commented on SPARK-13499: -- User 'nongli' has created a pull request for

[jira] [Assigned] (SPARK-13499) Optimize vectorized parquet reader for dictionary encoded data and RLE decoding

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13499: Assignee: (was: Apache Spark) > Optimize vectorized parquet reader for dictionary enco

[jira] [Assigned] (SPARK-12042) Python API for mllib.stat.test.StreamingTest

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12042?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12042: Assignee: (was: Apache Spark) > Python API for mllib.stat.test.StreamingTest > ---

[jira] [Assigned] (SPARK-12042) Python API for mllib.stat.test.StreamingTest

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12042?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12042: Assignee: Apache Spark > Python API for mllib.stat.test.StreamingTest > --

[jira] [Commented] (SPARK-12042) Python API for mllib.stat.test.StreamingTest

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15167829#comment-15167829 ] Apache Spark commented on SPARK-12042: -- User 'yinxusen' has created a pull request f

[jira] [Commented] (SPARK-13489) GSoC 2016 project ideas for MLlib

2016-02-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15167826#comment-15167826 ] Xiangrui Meng commented on SPARK-13489: --- Yes, the features should be delivered to S

[jira] [Created] (SPARK-13499) Optimize vectorized parquet reader for dictionary encoded data and RLE decoding

2016-02-25 Thread Nong Li (JIRA)
Nong Li created SPARK-13499: --- Summary: Optimize vectorized parquet reader for dictionary encoded data and RLE decoding Key: SPARK-13499 URL: https://issues.apache.org/jira/browse/SPARK-13499 Project: Spark

[jira] [Updated] (SPARK-13464) Fix failed test test_reduce_by_key_and_window_with_none_invFunc in pyspark/streaming

2016-02-25 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-13464: - Affects Version/s: 1.3.1 > Fix failed test test_reduce_by_key_and_window_with_none_invFunc in >

[jira] [Resolved] (SPARK-13464) Fix failed test test_reduce_by_key_and_window_with_none_invFunc in pyspark/streaming

2016-02-25 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-13464. -- Resolution: Fixed Assignee: Liang-Chi Hsieh Fix Version/s: 1.3.2 > Fix failed t

[jira] [Resolved] (SPARK-13069) ActorHelper is not throttled by rate limiter

2016-02-25 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-13069. -- Resolution: Fixed Assignee: Lin Zhao Fix Version/s: 2.0.0 > ActorHelper is not

[jira] [Updated] (SPARK-13444) QuantileDiscretizer chooses bad splits on large DataFrames

2016-02-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13444: -- Fix Version/s: (was: 1.6.2) > QuantileDiscretizer chooses bad splits on large DataFrames >

[jira] [Updated] (SPARK-13444) QuantileDiscretizer chooses bad splits on large DataFrames

2016-02-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13444: -- Affects Version/s: 2.0.0 > QuantileDiscretizer chooses bad splits on large DataFrames > ---

[jira] [Assigned] (SPARK-13498) JDBCRDD should update some input metrics

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13498: Assignee: (was: Apache Spark) > JDBCRDD should update some input metrics > ---

[jira] [Created] (SPARK-13498) JDBCRDD should update some input metrics

2016-02-25 Thread Wayne Song (JIRA)
Wayne Song created SPARK-13498: -- Summary: JDBCRDD should update some input metrics Key: SPARK-13498 URL: https://issues.apache.org/jira/browse/SPARK-13498 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-13498) JDBCRDD should update some input metrics

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13498: Assignee: Apache Spark > JDBCRDD should update some input metrics > --

[jira] [Commented] (SPARK-13498) JDBCRDD should update some input metrics

2016-02-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15167793#comment-15167793 ] Apache Spark commented on SPARK-13498: -- User 'wsong' has created a pull request for

[jira] [Commented] (SPARK-13445) Seleting "data" with window function does not work unless aliased (using PARTITION BY)

2016-02-25 Thread Simeon Simeonov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15167761#comment-15167761 ] Simeon Simeonov commented on SPARK-13445: - [~smilegator] So, the real issue you s

[jira] [Resolved] (SPARK-12486) Executors are not always terminated successfully by the worker.

2016-02-25 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-12486. -- Resolution: Fixed Fix Version/s: 2.0.0 1.6.1 It has been resolved by https://

  1   2   >