[jira] [Resolved] (SPARK-18790) Keep a general offset history of stream batches

2016-12-11 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-18790. -- Resolution: Fixed Assignee: Tyson Condie Fix Version/s: 2.1.1

[jira] [Assigned] (SPARK-18828) Refactor SparkR build and test scripts

2016-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18828: Assignee: (was: Apache Spark) > Refactor SparkR build and test scripts >

[jira] [Assigned] (SPARK-18828) Refactor SparkR build and test scripts

2016-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18828: Assignee: Apache Spark > Refactor SparkR build and test scripts >

[jira] [Commented] (SPARK-18828) Refactor SparkR build and test scripts

2016-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15741200#comment-15741200 ] Apache Spark commented on SPARK-18828: -- User 'felixcheung' has created a pull request for this

[jira] [Created] (SPARK-18828) Refactor SparkR build and test scripts

2016-12-11 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18828: Summary: Refactor SparkR build and test scripts Key: SPARK-18828 URL: https://issues.apache.org/jira/browse/SPARK-18828 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-18570) Consider supporting other R formula operators

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18570: - Priority: Minor (was: Major) > Consider supporting other R formula operators >

[jira] [Updated] (SPARK-18569) Support R formula arithmetic

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18569: - Affects Version/s: (was: 2.2.0) Target Version/s: 2.2.0 > Support R formula arithmetic

[jira] [Updated] (SPARK-18569) Support R formula arithmetic

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18569: - Affects Version/s: 2.2.0 > Support R formula arithmetic > - > >

[jira] [Updated] (SPARK-18570) Consider supporting other R formula operators

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18570: - Target Version/s: 2.2.0 > Consider supporting other R formula operators >

[jira] [Updated] (SPARK-18348) Improve tree ensemble model summary

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18348: - Target Version/s: 2.2.0 > Improve tree ensemble model summary >

[jira] [Commented] (SPARK-10413) Model should support prediction on single instance

2016-12-11 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15741169#comment-15741169 ] Yanbo Liang commented on SPARK-10413: - [~anshbansal] Yeah, we will put this feature at a high

[jira] [Updated] (SPARK-10413) Model should support prediction on single instance

2016-12-11 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-10413: Labels: (was: 2.2.0) > Model should support prediction on single instance >

[jira] [Updated] (SPARK-10413) Model should support prediction on single instance

2016-12-11 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-10413: Labels: 2.2.0 (was: ) > Model should support prediction on single instance >

[jira] [Updated] (SPARK-10884) Support prediction on single instance for regression and classification related models

2016-12-11 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-10884: Labels: 2.2.0 (was: ) > Support prediction on single instance for regression and classification

[jira] [Assigned] (SPARK-10884) Support prediction on single instance for regression and classification related models

2016-12-11 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang reassigned SPARK-10884: --- Assignee: Yanbo Liang > Support prediction on single instance for regression and

[jira] [Assigned] (SPARK-18827) Cann't cache broadcast to disk

2016-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18827: Assignee: Apache Spark > Cann't cache broadcast to disk > --

[jira] [Commented] (SPARK-18827) Cann't cache broadcast to disk

2016-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15741061#comment-15741061 ] Apache Spark commented on SPARK-18827: -- User 'wangyum' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18827) Cann't cache broadcast to disk

2016-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18827: Assignee: (was: Apache Spark) > Cann't cache broadcast to disk >

[jira] [Assigned] (SPARK-18826) Make FileStream be able to start with most recent files

2016-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18826: Assignee: Shixiong Zhu (was: Apache Spark) > Make FileStream be able to start with most

[jira] [Assigned] (SPARK-18826) Make FileStream be able to start with most recent files

2016-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18826: Assignee: Apache Spark (was: Shixiong Zhu) > Make FileStream be able to start with most

[jira] [Commented] (SPARK-18826) Make FileStream be able to start with most recent files

2016-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18826?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15741011#comment-15741011 ] Apache Spark commented on SPARK-18826: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Commented] (SPARK-18827) Cann't cache broadcast to disk

2016-12-11 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15741009#comment-15741009 ] Yuming Wang commented on SPARK-18827: - I will create a PR later. > Cann't cache broadcast to disk >

[jira] [Created] (SPARK-18827) Cann't cache broadcast to disk

2016-12-11 Thread Yuming Wang (JIRA)
Yuming Wang created SPARK-18827: --- Summary: Cann't cache broadcast to disk Key: SPARK-18827 URL: https://issues.apache.org/jira/browse/SPARK-18827 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-18826) Make FileStream be able to start with most recent files

2016-12-11 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-18826: Summary: Make FileStream be able to start with most recent files Key: SPARK-18826 URL: https://issues.apache.org/jira/browse/SPARK-18826 Project: Spark

[jira] [Updated] (SPARK-15572) MLlib in R format: compatibility with other languages

2016-12-11 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-15572: Shepherd: Yanbo Liang > MLlib in R format: compatibility with other languages >

[jira] [Comment Edited] (SPARK-15572) MLlib in R format: compatibility with other languages

2016-12-11 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740967#comment-15740967 ] Yanbo Liang edited comment on SPARK-15572 at 12/12/16 4:50 AM: --- Sure,

[jira] [Commented] (SPARK-15572) MLlib in R format: compatibility with other languages

2016-12-11 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740967#comment-15740967 ] Yanbo Liang commented on SPARK-15572: - Sure, that great. I updated me as the shepherd. > MLlib in R

[jira] [Resolved] (SPARK-18325) SparkR 2.1 QA: Check for new R APIs requiring example code

2016-12-11 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-18325. - Resolution: Fixed Fix Version/s: 2.1.1 > SparkR 2.1 QA: Check for new R APIs requiring

[jira] [Commented] (SPARK-18325) SparkR 2.1 QA: Check for new R APIs requiring example code

2016-12-11 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740943#comment-15740943 ] Yanbo Liang commented on SPARK-18325: - Since PR 16148 has been merged, I think we can resolve this

[jira] [Commented] (SPARK-17147) Spark Streaming Kafka 0.10 Consumer Can't Handle Non-consecutive Offsets (i.e. Log Compaction)

2016-12-11 Thread caolan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740906#comment-15740906 ] caolan commented on SPARK-17147: I am using spark 2.0.0 + kafka 0.10 + compact mode topics even in some

[jira] [Assigned] (SPARK-18824) Add optimizer rule to reorder expensive Filter predicates like ScalaUDF

2016-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18824: Assignee: (was: Apache Spark) > Add optimizer rule to reorder expensive Filter

[jira] [Commented] (SPARK-18824) Add optimizer rule to reorder expensive Filter predicates like ScalaUDF

2016-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740903#comment-15740903 ] Apache Spark commented on SPARK-18824: -- User 'viirya' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18824) Add optimizer rule to reorder expensive Filter predicates like ScalaUDF

2016-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18824: Assignee: Apache Spark > Add optimizer rule to reorder expensive Filter predicates like

[jira] [Comment Edited] (SPARK-18332) SparkR 2.1 QA: Programming guide, migration guide, vignettes updates

2016-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740557#comment-15740557 ] Joseph K. Bradley edited comment on SPARK-18332 at 12/12/16 4:05 AM: -

[jira] [Created] (SPARK-18825) Eliminate duplicate links in SparkR API doc index

2016-12-11 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-18825: - Summary: Eliminate duplicate links in SparkR API doc index Key: SPARK-18825 URL: https://issues.apache.org/jira/browse/SPARK-18825 Project: Spark

[jira] [Created] (SPARK-18824) Add optimizer rule to reorder expensive Filter predicates like ScalaUDF

2016-12-11 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-18824: --- Summary: Add optimizer rule to reorder expensive Filter predicates like ScalaUDF Key: SPARK-18824 URL: https://issues.apache.org/jira/browse/SPARK-18824

[jira] [Commented] (SPARK-16073) Performance of Parquet encodings on saving primitive arrays

2016-12-11 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740807#comment-15740807 ] Kazuaki Ishizaki commented on SPARK-16073: -- It is an interesting topic. In the current

[jira] [Commented] (SPARK-18806) driverwrapper and executor doesn't exit when worker killed

2016-12-11 Thread liujianhui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740792#comment-15740792 ] liujianhui commented on SPARK-18806: no, it's a problem, sometimes there are exist two same driver!

[jira] [Commented] (SPARK-18820) Driver may send "LaunchTask" before executor receive "RegisteredExecutor"

2016-12-11 Thread jin xing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740754#comment-15740754 ] jin xing commented on SPARK-18820: -- [~lins05] Thanks a lot for your comment : ) In our company's

[jira] [Created] (SPARK-18823) Assignation by column name variable not available or bug?

2016-12-11 Thread Vicente Masip (JIRA)
Vicente Masip created SPARK-18823: - Summary: Assignation by column name variable not available or bug? Key: SPARK-18823 URL: https://issues.apache.org/jira/browse/SPARK-18823 Project: Spark

[jira] [Commented] (SPARK-18332) SparkR 2.1 QA: Programming guide, migration guide, vignettes updates

2016-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740557#comment-15740557 ] Joseph K. Bradley commented on SPARK-18332: --- Let's do it after the 2.1 release. We can always

[jira] [Commented] (SPARK-13587) Support virtualenv in PySpark

2016-12-11 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740419#comment-15740419 ] Nicholas Chammas commented on SPARK-13587: -- Thanks to a lot of help from [~quasi...@gmail.com]

[jira] [Commented] (SPARK-18813) MLlib 2.2 Roadmap

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740216#comment-15740216 ] Felix Cheung commented on SPARK-18813: -- This is great, Joseph. Thanks for putting down the framework

[jira] [Updated] (SPARK-18821) Bisecting k-means wrapper in SparkR

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18821: - Shepherd: Felix Cheung > Bisecting k-means wrapper in SparkR >

[jira] [Updated] (SPARK-18822) Support ML Pipeline in SparkR

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18822: - Shepherd: Felix Cheung > Support ML Pipeline in SparkR > - > >

[jira] [Updated] (SPARK-15767) Decision Tree Regression wrapper in SparkR

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-15767: - Shepherd: Felix Cheung > Decision Tree Regression wrapper in SparkR >

[jira] [Commented] (SPARK-18819) Failure to read single-row Parquet files

2016-12-11 Thread Michael Kamprath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740208#comment-15740208 ] Michael Kamprath commented on SPARK-18819: -- One more note, this issue only arises when doubles

[jira] [Updated] (SPARK-18822) Support ML Pipeline in SparkR

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18822: - Description: >From Joseph Bradley: " Supporting Pipelines and advanced use cases: There really

[jira] [Updated] (SPARK-18822) Support ML Pipeline in SparkR

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18822: - Description: >From Joseph Bradley: " Supporting Pipelines and advanced use cases: There really

[jira] [Comment Edited] (SPARK-18813) MLlib 2.2 Roadmap

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740184#comment-15740184 ] Felix Cheung edited comment on SPARK-18813 at 12/11/16 7:11 PM: I added a

[jira] [Commented] (SPARK-15581) MLlib 2.1 Roadmap

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740185#comment-15740185 ] Felix Cheung commented on SPARK-15581: -- re: Pipeline in R - certainly. opened

[jira] [Commented] (SPARK-18813) MLlib 2.2 Roadmap

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740184#comment-15740184 ] Felix Cheung commented on SPARK-18813: -- I added a couple of JIRAs for R that can be found with [this

[jira] [Commented] (SPARK-18822) Support ML Pipeline in SparkR

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740181#comment-15740181 ] Felix Cheung commented on SPARK-18822: -- I'll take a shot at this. > Support ML Pipeline in SparkR >

[jira] [Created] (SPARK-18822) Support ML Pipeline in SparkR

2016-12-11 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18822: Summary: Support ML Pipeline in SparkR Key: SPARK-18822 URL: https://issues.apache.org/jira/browse/SPARK-18822 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-18821) Bisecting k-means wrapper in SparkR

2016-12-11 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18821: Summary: Bisecting k-means wrapper in SparkR Key: SPARK-18821 URL: https://issues.apache.org/jira/browse/SPARK-18821 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-18332) SparkR 2.1 QA: Programming guide, migration guide, vignettes updates

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740172#comment-15740172 ] Felix Cheung commented on SPARK-18332: -- [~josephkb] they are because of the {code}@aliases{code}

[jira] [Updated] (SPARK-18226) SparkR displaying vector columns in incorrect way

2016-12-11 Thread Krishna Kalyan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kalyan updated SPARK-18226: --- Component/s: SparkR > SparkR displaying vector columns in incorrect way >

[jira] [Updated] (SPARK-18226) SparkR displaying vector columns in incorrect way

2016-12-11 Thread Krishna Kalyan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna Kalyan updated SPARK-18226: --- Component/s: (was: SparkR) > SparkR displaying vector columns in incorrect way >

[jira] [Commented] (SPARK-18820) Driver may send "LaunchTask" before executor receive "RegisteredExecutor"

2016-12-11 Thread Shuai Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15739970#comment-15739970 ] Shuai Lin commented on SPARK-18820: --- The driver first sends {{RegisteredExecutor}} message and then, if

[jira] [Updated] (SPARK-18820) Driver may send "LaunchTask" before executor receive "RegisteredExecutor"

2016-12-11 Thread jin xing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jin xing updated SPARK-18820: - Description: CoarseGrainedSchedulerBackend will update executorDataMap after receiving

[jira] [Created] (SPARK-18820) Driver may send "LaunchTask" before executor receive "RegisteredExecutor"

2016-12-11 Thread jin xing (JIRA)
jin xing created SPARK-18820: Summary: Driver may send "LaunchTask" before executor receive "RegisteredExecutor" Key: SPARK-18820 URL: https://issues.apache.org/jira/browse/SPARK-18820 Project: Spark

[jira] [Comment Edited] (SPARK-18642) Spark SQL: Catalyst is scanning undesired columns

2016-12-11 Thread Mohit (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15739526#comment-15739526 ] Mohit edited comment on SPARK-18642 at 12/11/16 10:42 AM: -- [~dongjoon] We will

[jira] [Commented] (SPARK-18642) Spark SQL: Catalyst is scanning undesired columns

2016-12-11 Thread Mohit (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15739526#comment-15739526 ] Mohit commented on SPARK-18642: --- [~dongjoon] Please share your findings in form of 'touch-points' from the

[jira] [Resolved] (SPARK-18196) Optimise CompactBuffer implementation

2016-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18196. --- Resolution: Won't Fix For now looks like a "wontfix" as it doesn't result in a speedup. > Optimise

[jira] [Issue Comment Deleted] (SPARK-18819) Failure to read single-row Parquet files

2016-12-11 Thread Michael Kamprath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Kamprath updated SPARK-18819: - Comment: was deleted (was: Possibly. I can dump the file created using

[jira] [Commented] (SPARK-18819) Failure to read single-row Parquet files

2016-12-11 Thread Michael Kamprath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15739488#comment-15739488 ] Michael Kamprath commented on SPARK-18819: -- Possibly. I can dump the file created using

[jira] [Commented] (SPARK-18819) Failure to read single-row Parquet files

2016-12-11 Thread Michael Kamprath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15739487#comment-15739487 ] Michael Kamprath commented on SPARK-18819: -- Possibly. I can dump the file created using

[jira] [Commented] (SPARK-18750) spark should be able to control the number of executor and should not throw stack overslow

2016-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15739460#comment-15739460 ] Sean Owen commented on SPARK-18750: --- I'm going to close this as a duplicate of SPARK-18769 unless

[jira] [Commented] (SPARK-18819) Failure to read single-row Parquet files

2016-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15739458#comment-15739458 ] Sean Owen commented on SPARK-18819: --- Surely, this is specific to ARM if it doesn't occur on x86? I

[jira] [Updated] (SPARK-18819) Failure to read single-row Parquet files

2016-12-11 Thread Michael Kamprath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Kamprath updated SPARK-18819: - Description: When I create a data frame in PySpark with a small row count (less than

[jira] [Comment Edited] (SPARK-18819) Failure to read single-row Parquet files

2016-12-11 Thread Michael Kamprath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15739452#comment-15739452 ] Michael Kamprath edited comment on SPARK-18819 at 12/11/16 9:42 AM:

[jira] [Commented] (SPARK-18819) Failure to read single-row Parquet files

2016-12-11 Thread Michael Kamprath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15739452#comment-15739452 ] Michael Kamprath commented on SPARK-18819: -- Sure. The complete error message is: {{code}}

[jira] [Resolved] (SPARK-18653) Dataset.show() generates incorrect padding for Unicode Character

2016-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18653?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18653. --- Resolution: Won't Fix > Dataset.show() generates incorrect padding for Unicode Character >

[jira] [Updated] (SPARK-18819) Failure to read single-row Parquet files

2016-12-11 Thread Michael Kamprath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Kamprath updated SPARK-18819: - Description: When I create a data frame in PySpark with a small row count (less than

[jira] [Updated] (SPARK-18628) Update handle invalid documentation string

2016-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18628: -- Assignee: Krishna Kalyan > Update handle invalid documentation string >

[jira] [Resolved] (SPARK-18628) Update handle invalid documentation string

2016-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18628. --- Resolution: Fixed Fix Version/s: 2.2.0 2.1.1 Issue resolved by pull

[jira] [Commented] (SPARK-9487) Use the same num. worker threads in Scala/Python unit tests

2016-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15739432#comment-15739432 ] Sean Owen commented on SPARK-9487: -- I think this is going around in circles. You already have an open

[jira] [Updated] (SPARK-18809) Kinesis deaggregation issue on master

2016-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18809: -- Assignee: Brian ONeill > Kinesis deaggregation issue on master > -

[jira] [Updated] (SPARK-18809) Kinesis deaggregation issue on master

2016-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18809: -- Priority: Minor (was: Major) Issue Type: Improvement (was: Bug) > Kinesis deaggregation issue

[jira] [Resolved] (SPARK-18809) Kinesis deaggregation issue on master

2016-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18809. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 16236

[jira] [Commented] (SPARK-18819) Failure to read single-row Parquet files

2016-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15739422#comment-15739422 ] Sean Owen commented on SPARK-18819: --- This doesn't say anything about the underlying error though.

[jira] [Resolved] (SPARK-18799) Spark SQL expose interface for plug-gable parser extension

2016-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18799. --- Resolution: Duplicate > Spark SQL expose interface for plug-gable parser extension >

[jira] [Commented] (SPARK-18786) pySpark SQLContext.getOrCreate(sc) take stopped sparkContext

2016-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15739415#comment-15739415 ] Sean Owen commented on SPARK-18786: --- I agree it's surprising and maybe fixable, but this may be in the

[jira] [Updated] (SPARK-18786) pySpark SQLContext.getOrCreate(sc) take stopped sparkContext

2016-12-11 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated SPARK-18786: --- Component/s: PySpark > pySpark SQLContext.getOrCreate(sc) take stopped sparkContext >

[jira] [Updated] (SPARK-18710) Add offset to GeneralizedLinearRegression models

2016-12-11 Thread Wayne Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wayne Zhang updated SPARK-18710: Shepherd: Yanbo Liang (was: Sean Owen) Remaining Estimate: 10h (was: 336h)