[jira] [Updated] (SPARK-10022) Scala-Python method/parameter inconsistency check for ML during 1.5 QA

2015-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-10022: Summary: Scala-Python method/parameter inconsistency check for ML during 1.5 QA (was:

[jira] [Updated] (SPARK-10024) Implement RandomForestParams and TreeEnsembleParams for Python API

2015-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-10024: Description: Implement RandomForestParams, GBTParams and TreeEnsembleParams for Python API, and

[jira] [Updated] (SPARK-10024) Python API Tree related params clear up

2015-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-10024: Summary: Python API Tree related params clear up (was: Implement RandomForestParams and

[jira] [Updated] (SPARK-10024) Python API Tree related params clear up

2015-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-10024: Description: Implement RandomForestParams, GBTParams and TreeEnsembleParams for Python API, and

[jira] [Updated] (SPARK-9663) ML Python API coverage issues found during 1.5 QA

2015-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-9663: --- Description: This umbrella is for a list of Python API coverage issues which we should fix for the

[jira] [Created] (SPARK-10028) Add Python API for PrefixSpan

2015-08-16 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-10028: --- Summary: Add Python API for PrefixSpan Key: SPARK-10028 URL: https://issues.apache.org/jira/browse/SPARK-10028 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-9431) TimeIntervalType for for time intervals

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698589#comment-14698589 ] Apache Spark commented on SPARK-9431: - User 'yjshen' has created a pull request for

[jira] [Assigned] (SPARK-9431) TimeIntervalType for for time intervals

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9431: --- Assignee: (was: Apache Spark) TimeIntervalType for for time intervals

[jira] [Updated] (SPARK-9973) Wrong initial size of in-memory columnar buffers

2015-08-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-9973: -- Summary: Wrong initial size of in-memory columnar buffers (was: wrong buffle size) Wrong initial

[jira] [Updated] (SPARK-9973) wrong buffle size

2015-08-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-9973: -- Assignee: xukun wrong buffle size - Key: SPARK-9973

[jira] [Created] (SPARK-10024) Implement RandomForestParams and TreeEnsembleParams for Python API

2015-08-16 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-10024: --- Summary: Implement RandomForestParams and TreeEnsembleParams for Python API Key: SPARK-10024 URL: https://issues.apache.org/jira/browse/SPARK-10024 Project: Spark

[jira] [Updated] (SPARK-9663) ML Python API coverage issues found during 1.5 QA

2015-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-9663: --- Description: This umbrella is for a list of Python API coverage issues which we should fix for the

[jira] [Created] (SPARK-10025) Add Python API for ml.attribute

2015-08-16 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-10025: --- Summary: Add Python API for ml.attribute Key: SPARK-10025 URL: https://issues.apache.org/jira/browse/SPARK-10025 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-9793) PySpark DenseVector, SparseVector should override __eq__ and __hash__

2015-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-9793: --- Summary: PySpark DenseVector, SparseVector should override __eq__ and __hash__ (was: PySpark

[jira] [Created] (SPARK-10027) Add Python API missing methods for ml.feature

2015-08-16 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-10027: --- Summary: Add Python API missing methods for ml.feature Key: SPARK-10027 URL: https://issues.apache.org/jira/browse/SPARK-10027 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-9431) TimeIntervalType for for time intervals

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9431: --- Assignee: Apache Spark TimeIntervalType for for time intervals

[jira] [Updated] (SPARK-10029) Add Python examples for mllib IsotonicRegression user guide

2015-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-10029: Issue Type: Sub-task (was: Documentation) Parent: SPARK-8757 Add Python examples for

[jira] [Created] (SPARK-10029) Add Python examples for mllib IsotonicRegression user guide

2015-08-16 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-10029: --- Summary: Add Python examples for mllib IsotonicRegression user guide Key: SPARK-10029 URL: https://issues.apache.org/jira/browse/SPARK-10029 Project: Spark

[jira] [Updated] (SPARK-10022) Scala-Python method/parameter inconsistency check for ML during 1.5 QA

2015-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-10022: Description: The missing classes for PySpark were listed at SPARK-9663. Here we check and list the

[jira] [Updated] (SPARK-10022) Scala-Python method/parameter inconsistency check for ML MLlib during 1.5 QA

2015-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-10022: Description: The missing classes for PySpark were listed at SPARK-9663. Here we check and list the

[jira] [Resolved] (SPARK-10008) Shuffle locality can take precedence over narrow dependencies for RDDs with both

2015-08-16 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-10008. --- Resolution: Fixed Fix Version/s: 1.5.0 Shuffle locality can take precedence over

[jira] [Updated] (SPARK-9663) ML Python API coverage issues found during 1.5 QA

2015-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-9663: --- Description: This umbrella is for a list of Python API coverage issues which we should fix for the

[jira] [Updated] (SPARK-10022) Scala-Python method/parameter inconsistency check for ML MLlib during 1.5 QA

2015-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-10022: Summary: Scala-Python method/parameter inconsistency check for ML MLlib during 1.5 QA (was:

[jira] [Updated] (SPARK-10022) Scala-Python inconsistency check for ML MLlib during 1.5 QA

2015-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-10022: Description: Check the Scala-Python inconsistency of ML MLlib method/parameter (was: Check the

[jira] [Updated] (SPARK-9973) Wrong initial size of in-memory columnar buffers

2015-08-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-9973: -- Shepherd: Cheng Lian Sprint: Spark 1.5 doc/QA sprint Affects Version/s:

[jira] [Commented] (SPARK-9973) Wrong initial size of in-memory columnar buffers

2015-08-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698553#comment-14698553 ] Cheng Lian commented on SPARK-9973: --- I've updated the title and description. Wrong

[jira] [Updated] (SPARK-10024) Python API RF and GBT related params clear up

2015-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-10024: Description: Implement RandomForestParams, GBTParams and TreeEnsembleParams for Python API, and

[jira] [Updated] (SPARK-9663) ML Python API coverage issues found during 1.5 QA

2015-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-9663: --- Description: This umbrella is for a list of Python API coverage issues which we should fix for the

[jira] [Comment Edited] (SPARK-9662) ML 1.5 QA: API: Python API coverage

2015-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698565#comment-14698565 ] Yanbo Liang edited comment on SPARK-9662 at 8/16/15 7:37 AM: -

[jira] [Commented] (SPARK-9662) ML 1.5 QA: API: Python API coverage

2015-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698565#comment-14698565 ] Yanbo Liang commented on SPARK-9662: [~josephkb] I have finished checking for

[jira] [Created] (SPARK-10023) Unified DecisionTreeParams checkpointInterval between Scala and Python API.

2015-08-16 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-10023: --- Summary: Unified DecisionTreeParams checkpointInterval between Scala and Python API. Key: SPARK-10023 URL: https://issues.apache.org/jira/browse/SPARK-10023 Project:

[jira] [Updated] (SPARK-10024) Python API RF and GBT related params clear up

2015-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-10024: Summary: Python API RF and GBT related params clear up (was: Python API Tree related params clear

[jira] [Updated] (SPARK-10024) Python API RF and GBT related params clear up

2015-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-10024: Description: Implement RandomForestParams, GBTParams and TreeEnsembleParams for Python API, and

[jira] [Resolved] (SPARK-8844) head/collect is broken in SparkR

2015-08-16 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-8844. -- Resolution: Fixed Fix Version/s: 1.5.0 head/collect is broken in SparkR

[jira] [Commented] (SPARK-8844) head/collect is broken in SparkR

2015-08-16 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698564#comment-14698564 ] Shivaram Venkataraman commented on SPARK-8844: -- Resolved by

[jira] [Updated] (SPARK-8844) head/collect is broken in SparkR

2015-08-16 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-8844: - Assignee: Sun Rui head/collect is broken in SparkR

[jira] [Assigned] (SPARK-10029) Add Python examples for mllib IsotonicRegression user guide

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10029: Assignee: (was: Apache Spark) Add Python examples for mllib IsotonicRegression user

[jira] [Commented] (SPARK-10029) Add Python examples for mllib IsotonicRegression user guide

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698597#comment-14698597 ] Apache Spark commented on SPARK-10029: -- User 'yanboliang' has created a pull request

[jira] [Assigned] (SPARK-10029) Add Python examples for mllib IsotonicRegression user guide

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10029: Assignee: Apache Spark Add Python examples for mllib IsotonicRegression user guide

[jira] [Created] (SPARK-10022) Scala-Python inconsistency check for ML MLlib during 1.5 QA

2015-08-16 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-10022: --- Summary: Scala-Python inconsistency check for ML MLlib during 1.5 QA Key: SPARK-10022 URL: https://issues.apache.org/jira/browse/SPARK-10022 Project: Spark

[jira] [Updated] (SPARK-10030) Managed memory leak detected when cache table

2015-08-16 Thread wangwei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangwei updated SPARK-10030: Description: I test lastest spark-1.5.0 in standalone mode and follow the steps bellow, then issues

[jira] [Updated] (SPARK-10030) Managed memory leak detected when cache table

2015-08-16 Thread wangwei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangwei updated SPARK-10030: Description: I test lastest spark-1.5.0 in local, standalone, yarn mode and follow the steps bellow, then

[jira] [Assigned] (SPARK-7707) User guide and example code for KernelDensity

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7707: --- Assignee: (was: Apache Spark) User guide and example code for KernelDensity

[jira] [Commented] (SPARK-7707) User guide and example code for Statistics.kernelDensity

2015-08-16 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7707?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698634#comment-14698634 ] Sandy Ryza commented on SPARK-7707: --- [~mengxr] thoughts on which page this should land

[jira] [Updated] (SPARK-7707) User guide and example code for KernelDensity

2015-08-16 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza updated SPARK-7707: -- Summary: User guide and example code for KernelDensity (was: User guide and example code for

[jira] [Created] (SPARK-10030) Managed memory leak detected when cache table

2015-08-16 Thread wangwei (JIRA)
wangwei created SPARK-10030: --- Summary: Managed memory leak detected when cache table Key: SPARK-10030 URL: https://issues.apache.org/jira/browse/SPARK-10030 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-10030) Managed memory leak detected when cache table

2015-08-16 Thread wangwei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangwei updated SPARK-10030: Description: 1. create table cache_test(id int, name string) stored as textfile ; 2. load data local

[jira] [Updated] (SPARK-10030) Managed memory leak detected when cache table

2015-08-16 Thread wangwei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangwei updated SPARK-10030: Description: 1. create table cache_test(id int, name string) stored as textfile ; 2. load data local

[jira] [Updated] (SPARK-10030) Managed memory leak detected when cache table

2015-08-16 Thread wangwei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangwei updated SPARK-10030: Description: 1. create table cache_test(id int, name string) stored as textfile ; 2. load data local

[jira] [Updated] (SPARK-10030) Managed memory leak detected when cache table

2015-08-16 Thread wangwei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangwei updated SPARK-10030: Description: 1. create table cache_test(id int, name string) stored as textfile ; 2. load data local

[jira] [Updated] (SPARK-10030) Managed memory leak detected when cache table

2015-08-16 Thread wangwei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangwei updated SPARK-10030: Description: 1. create table cache_test(id int, name string) stored as textfile ; 2. load data local

[jira] [Updated] (SPARK-10030) Managed memory leak detected when cache table

2015-08-16 Thread wangwei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangwei updated SPARK-10030: Description: I test lastest spark-1.5.0 in local, standalone, yarn mode and follow the steps bellow, then

[jira] [Updated] (SPARK-10032) Add Python example for mllib LDAModel user guide

2015-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-10032: Affects Version/s: (was: 1.5.0) Add Python example for mllib LDAModel user guide

[jira] [Created] (SPARK-10032) Add Python example for mllib LDAModel user guide

2015-08-16 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-10032: --- Summary: Add Python example for mllib LDAModel user guide Key: SPARK-10032 URL: https://issues.apache.org/jira/browse/SPARK-10032 Project: Spark Issue Type:

[jira] [Updated] (SPARK-10032) Add Python example for mllib LDAModel user guide

2015-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-10032: Labels: 1.5.0 (was: ) Add Python example for mllib LDAModel user guide

[jira] [Assigned] (SPARK-10005) Parquet reader doesn't handle schema merging properly for nested structs

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10005: Assignee: Apache Spark (was: Cheng Lian) Parquet reader doesn't handle schema merging

[jira] [Updated] (SPARK-10005) Parquet reader doesn't handle schema merging properly for nested structs

2015-08-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-10005: --- Description: Spark shell snippet to reproduce this issue (note that both {{DataFrame}} written

[jira] [Commented] (SPARK-10005) Parquet reader doesn't handle schema merging properly for nested structs

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698631#comment-14698631 ] Apache Spark commented on SPARK-10005: -- User 'liancheng' has created a pull request

[jira] [Assigned] (SPARK-10005) Parquet reader doesn't handle schema merging properly for nested structs

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10005: Assignee: Cheng Lian (was: Apache Spark) Parquet reader doesn't handle schema merging

[jira] [Updated] (SPARK-10030) Managed memory leak detected when cache table

2015-08-16 Thread wangwei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangwei updated SPARK-10030: Description: I test lastest spark-1.5.0 in local, standalone, yarn mode and follow the steps bellow, then

[jira] [Commented] (SPARK-8918) Add @since tags to mllib.clustering

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698635#comment-14698635 ] Apache Spark commented on SPARK-8918: - User 'XiaoqingWang' has created a pull request

[jira] [Assigned] (SPARK-7707) User guide and example code for KernelDensity

2015-08-16 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza reassigned SPARK-7707: - Assignee: Sandy Ryza User guide and example code for KernelDensity

[jira] [Assigned] (SPARK-7707) User guide and example code for KernelDensity

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7707: --- Assignee: Apache Spark User guide and example code for KernelDensity

[jira] [Commented] (SPARK-7707) User guide and example code for KernelDensity

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7707?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698647#comment-14698647 ] Apache Spark commented on SPARK-7707: - User 'sryza' has created a pull request for

[jira] [Updated] (SPARK-10030) Managed memory leak detected when cache table

2015-08-16 Thread wangwei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangwei updated SPARK-10030: Description: 1. create table cache_test(id int, name string) stored as textfile ; 2. load data local

[jira] [Updated] (SPARK-10030) Managed memory leak detected when cache table

2015-08-16 Thread wangwei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangwei updated SPARK-10030: Description: 1. create table cache_test(id int, name string) stored as textfile ; 2. load data local

[jira] [Commented] (SPARK-10031) Join two UnsafeRows in SortMergeJoin if possible

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698609#comment-14698609 ] Apache Spark commented on SPARK-10031: -- User 'viirya' has created a pull request for

[jira] [Assigned] (SPARK-10031) Join two UnsafeRows in SortMergeJoin if possible

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10031: Assignee: (was: Apache Spark) Join two UnsafeRows in SortMergeJoin if possible

[jira] [Assigned] (SPARK-10031) Join two UnsafeRows in SortMergeJoin if possible

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10031: Assignee: Apache Spark Join two UnsafeRows in SortMergeJoin if possible

[jira] [Created] (SPARK-10031) Join two UnsafeRows in SortMergeJoin if possible

2015-08-16 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-10031: --- Summary: Join two UnsafeRows in SortMergeJoin if possible Key: SPARK-10031 URL: https://issues.apache.org/jira/browse/SPARK-10031 Project: Spark Issue

[jira] [Updated] (SPARK-10029) Add Python examples for mllib IsotonicRegression user guide

2015-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-10029: Labels: 1.5.0 (was: ) Add Python examples for mllib IsotonicRegression user guide

[jira] [Assigned] (SPARK-10032) Add Python example for mllib LDAModel user guide

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10032: Assignee: Apache Spark Add Python example for mllib LDAModel user guide

[jira] [Assigned] (SPARK-10032) Add Python example for mllib LDAModel user guide

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10032: Assignee: (was: Apache Spark) Add Python example for mllib LDAModel user guide

[jira] [Commented] (SPARK-10032) Add Python example for mllib LDAModel user guide

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698627#comment-14698627 ] Apache Spark commented on SPARK-10032: -- User 'yanboliang' has created a pull request

[jira] [Resolved] (SPARK-9973) Wrong initial size of in-memory columnar buffers

2015-08-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-9973. --- Resolution: Fixed Resolved by https://github.com/apache/spark/pull/8189 Wrong initial size of

[jira] [Updated] (SPARK-9973) Wrong initial size of in-memory columnar buffers

2015-08-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-9973: -- Fix Version/s: 1.5.0 Wrong initial size of in-memory columnar buffers

[jira] [Commented] (SPARK-10016) ML model broadcasts should be stored in private vars: spark.ml Word2Vec

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698808#comment-14698808 ] Apache Spark commented on SPARK-10016: -- User 'vinodkc' has created a pull request

[jira] [Assigned] (SPARK-10016) ML model broadcasts should be stored in private vars: spark.ml Word2Vec

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10016: Assignee: (was: Apache Spark) ML model broadcasts should be stored in private vars:

[jira] [Assigned] (SPARK-10016) ML model broadcasts should be stored in private vars: spark.ml Word2Vec

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10016: Assignee: Apache Spark ML model broadcasts should be stored in private vars: spark.ml

[jira] [Commented] (SPARK-9985) DataFrameWriter jdbc method ignore options that have been set

2015-08-16 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698729#comment-14698729 ] Shixiong Zhu commented on SPARK-9985: - I just realized SPARK-8463 didn't fix all

[jira] [Updated] (SPARK-10034) Can't analyze Sort on Aggregate with aggregation expression named _aggOrdering

2015-08-16 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-10034: Description: {code=scala} val df = Seq(1 - 2).toDF(i, j) val query = df.groupBy('i)

[jira] [Updated] (SPARK-10034) Can't analyze Sort on Aggregate with aggregation expression named _aggOrdering

2015-08-16 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-10034: Description: {code} val df = Seq(1 - 2).toDF(i, j) val query = df.groupBy('i)

[jira] [Commented] (SPARK-10036) DataFrameReader.json and DataFrameWriter.json don't load the JDBC driver class before creating JDBC connection

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698727#comment-14698727 ] Apache Spark commented on SPARK-10036: -- User 'zsxwing' has created a pull request

[jira] [Assigned] (SPARK-10036) DataFrameReader.json and DataFrameWriter.json don't load the JDBC driver class before creating JDBC connection

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10036: Assignee: Apache Spark DataFrameReader.json and DataFrameWriter.json don't load the JDBC

[jira] [Assigned] (SPARK-10036) DataFrameReader.json and DataFrameWriter.json don't load the JDBC driver class before creating JDBC connection

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10036: Assignee: (was: Apache Spark) DataFrameReader.json and DataFrameWriter.json don't

[jira] [Resolved] (SPARK-10005) Parquet reader doesn't handle schema merging properly for nested structs

2015-08-16 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-10005. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 8228

[jira] [Created] (SPARK-10033) Sort on

2015-08-16 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-10033: --- Summary: Sort on Key: SPARK-10033 URL: https://issues.apache.org/jira/browse/SPARK-10033 Project: Spark Issue Type: Bug Reporter: Wenchen Fan

[jira] [Assigned] (SPARK-10034) Can't analyze Sort on Aggregate with aggregation expression named _aggOrdering

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10034: Assignee: (was: Apache Spark) Can't analyze Sort on Aggregate with aggregation

[jira] [Commented] (SPARK-10034) Can't analyze Sort on Aggregate with aggregation expression named _aggOrdering

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698701#comment-14698701 ] Apache Spark commented on SPARK-10034: -- User 'cloud-fan' has created a pull request

[jira] [Assigned] (SPARK-10034) Can't analyze Sort on Aggregate with aggregation expression named _aggOrdering

2015-08-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10034: Assignee: Apache Spark Can't analyze Sort on Aggregate with aggregation expression named

[jira] [Commented] (SPARK-9985) DataFrameWriter jdbc method ignore options that have been set

2015-08-16 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698706#comment-14698706 ] Shixiong Zhu commented on SPARK-9985: - BTW, `sqlContext.load` will load the driver

[jira] [Created] (SPARK-10036) DataFrameReader.json and DataFrameWriter.json don't load the JDBC driver class before creating JDBC connection

2015-08-16 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-10036: Summary: DataFrameReader.json and DataFrameWriter.json don't load the JDBC driver class before creating JDBC connection Key: SPARK-10036 URL:

[jira] [Created] (SPARK-10034) Can't analyze Sort on Aggregate with aggregation expression named _aggOrdering

2015-08-16 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-10034: --- Summary: Can't analyze Sort on Aggregate with aggregation expression named _aggOrdering Key: SPARK-10034 URL: https://issues.apache.org/jira/browse/SPARK-10034

[jira] [Closed] (SPARK-10033) Sort on

2015-08-16 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan closed SPARK-10033. --- Resolution: Invalid Sort on Key: SPARK-10033 URL:

[jira] [Created] (SPARK-10035) Parquet filters does not process EqualNullSafe filter.

2015-08-16 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-10035: Summary: Parquet filters does not process EqualNullSafe filter. Key: SPARK-10035 URL: https://issues.apache.org/jira/browse/SPARK-10035 Project: Spark Issue

[jira] [Resolved] (SPARK-9985) DataFrameWriter jdbc method ignore options that have been set

2015-08-16 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-9985. - Resolution: Fixed Target Version/s: (was: 1.5.0) DataFrameWriter jdbc method ignore

[jira] [Commented] (SPARK-9985) DataFrameWriter jdbc method ignore options that have been set

2015-08-16 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698702#comment-14698702 ] Shixiong Zhu commented on SPARK-9985: - [~rlgarris_databricks] I think this has been

[jira] [Assigned] (SPARK-9760) SparkSubmit doesn't work with --packages when --repositories is not specified

2015-08-16 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman reassigned SPARK-9760: Assignee: Shivaram Venkataraman SparkSubmit doesn't work with --packages

[jira] [Resolved] (SPARK-9760) SparkSubmit doesn't work with --packages when --repositories is not specified

2015-08-16 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-9760. -- Resolution: Fixed Fix Version/s: 1.5.0 SparkSubmit doesn't work with

[jira] [Commented] (SPARK-7837) NPE when save as parquet in speculative tasks

2015-08-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698767#comment-14698767 ] Cheng Lian commented on SPARK-7837: --- Just a note to people who want to reproduce this

  1   2   >