[jira] [Commented] (SPARK-16304) LinkageError should not crash Spark executor

2016-06-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356639#comment-15356639 ] Sean Owen commented on SPARK-16304: --- The executor isn't shared though, it's specific to

[jira] [Commented] (SPARK-9999) Dataset API on top of Catalyst/DataFrame

2016-06-29 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356633#comment-15356633 ] Maciej BryƄski commented on SPARK-: --- [~rxin] What about Python API ? What's the

[jira] [Updated] (SPARK-16257) spark-ec2 script not updated for 1.6.2 release

2016-06-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-16257: -- Assignee: Brian Uri > spark-ec2 script not updated for 1.6.2 release >

[jira] [Resolved] (SPARK-16257) spark-ec2 script not updated for 1.6.2 release

2016-06-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16257. --- Resolution: Fixed Fix Version/s: 1.6.3 Issue resolved by pull request 13947 [https://github.co

[jira] [Assigned] (SPARK-16318) xpath_int, xpath_short, xpath_long, xpath_float, xpath_double, xpath_string

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16318: Assignee: Apache Spark > xpath_int, xpath_short, xpath_long, xpath_float, xpath_double, xp

[jira] [Commented] (SPARK-16318) xpath_int, xpath_short, xpath_long, xpath_float, xpath_double, xpath_string

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356629#comment-15356629 ] Apache Spark commented on SPARK-16318: -- User 'petermaxlee' has created a pull reques

[jira] [Assigned] (SPARK-16318) xpath_int, xpath_short, xpath_long, xpath_float, xpath_double, xpath_string

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16318: Assignee: (was: Apache Spark) > xpath_int, xpath_short, xpath_long, xpath_float, xpath

[jira] [Created] (SPARK-16318) xpath_int, xpath_short, xpath_long, xpath_float, xpath_double, xpath_string

2016-06-29 Thread Peter Lee (JIRA)
Peter Lee created SPARK-16318: - Summary: xpath_int, xpath_short, xpath_long, xpath_float, xpath_double, xpath_string Key: SPARK-16318 URL: https://issues.apache.org/jira/browse/SPARK-16318 Project: Spark

[jira] [Commented] (SPARK-16203) regexp_extract to return an ArrayType(StringType())

2016-06-29 Thread Max Moroz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356624#comment-15356624 ] Max Moroz commented on SPARK-16203: --- [~hvanhovell] UDF: yes, that's what I did - but I

[jira] [Commented] (SPARK-16304) LinkageError should not crash Spark executor

2016-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356623#comment-15356623 ] Reynold Xin commented on SPARK-16304: - I don't think the error for a single class sho

[jira] [Commented] (SPARK-16304) LinkageError should not crash Spark executor

2016-06-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356621#comment-15356621 ] Sean Owen commented on SPARK-16304: --- In this case it does seem fatal. It can't be recov

[jira] [Resolved] (SPARK-16256) Add Structured Streaming Programming Guide

2016-06-29 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-16256. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13978 [https://g

[jira] [Commented] (SPARK-16317) Add file filtering interface for FileFormat

2016-06-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356605#comment-15356605 ] Sean Owen commented on SPARK-16317: --- The JDK already provides FilenameFilter; probably

[jira] [Updated] (SPARK-16317) Add file filtering interface for FileFormat

2016-06-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-16317: -- Priority: Minor (was: Major) > Add file filtering interface for FileFormat > -

[jira] [Created] (SPARK-16317) Add file filtering interface for FileFormat

2016-06-29 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-16317: -- Summary: Add file filtering interface for FileFormat Key: SPARK-16317 URL: https://issues.apache.org/jira/browse/SPARK-16317 Project: Spark Issue Type: Improveme

[jira] [Resolved] (SPARK-12177) Update KafkaDStreams to new Kafka 0.10 Consumer API

2016-06-29 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-12177. --- Resolution: Fixed Assignee: Cody Koeninger Fix Version/s: 2.0.0 > Update Kafk

[jira] [Resolved] (SPARK-16294) Labelling support for the include_example Jekyll plugin

2016-06-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-16294. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13972 [https://g

[jira] [Commented] (SPARK-16290) text type features column for classification

2016-06-29 Thread mahendra singh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356548#comment-15356548 ] mahendra singh commented on SPARK-16290: [~srowen] Hi srowen , have one issue w

[jira] [Updated] (SPARK-16316) dataframe except API returning wrong result in spark 1.5.0

2016-06-29 Thread Jacky Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacky Li updated SPARK-16316: - Description: Version: spark 1.5.0 Use case: use except API to do subtract between two dataframe scala>

[jira] [Updated] (SPARK-16316) dataframe except API returning wrong result in spark 1.5.0

2016-06-29 Thread Jacky Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacky Li updated SPARK-16316: - Description: Version: spark 1.5.0 Use case: use except API to do subtract between two dataframe scala>

[jira] [Created] (SPARK-16316) dataframe except API returning wrong result in spark 1.5.0

2016-06-29 Thread Jacky Li (JIRA)
Jacky Li created SPARK-16316: Summary: dataframe except API returning wrong result in spark 1.5.0 Key: SPARK-16316 URL: https://issues.apache.org/jira/browse/SPARK-16316 Project: Spark Issue Type

[jira] [Created] (SPARK-16315) Implement code generation for elt function

2016-06-29 Thread Peter Lee (JIRA)
Peter Lee created SPARK-16315: - Summary: Implement code generation for elt function Key: SPARK-16315 URL: https://issues.apache.org/jira/browse/SPARK-16315 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-16287) Implement str_to_map SQL function

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356524#comment-15356524 ] Apache Spark commented on SPARK-16287: -- User 'techaddict' has created a pull request

[jira] [Assigned] (SPARK-16287) Implement str_to_map SQL function

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16287: Assignee: (was: Apache Spark) > Implement str_to_map SQL function > --

[jira] [Assigned] (SPARK-16287) Implement str_to_map SQL function

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16287: Assignee: Apache Spark > Implement str_to_map SQL function > -

[jira] [Assigned] (SPARK-16311) Improve metadata refresh

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16311: Assignee: Apache Spark > Improve metadata refresh > > >

[jira] [Assigned] (SPARK-16311) Improve metadata refresh

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16311: Assignee: (was: Apache Spark) > Improve metadata refresh > >

[jira] [Commented] (SPARK-16311) Improve metadata refresh

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356516#comment-15356516 ] Apache Spark commented on SPARK-16311: -- User 'petermaxlee' has created a pull reques

[jira] [Commented] (SPARK-16314) Spark application got stuck when NM running executor is restarted

2016-06-29 Thread Yesha Vora (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356401#comment-15356401 ] Yesha Vora commented on SPARK-16314: Thanks [~jerryshao] for analysis. {code} Lookin

[jira] [Created] (SPARK-16314) Spark application got stuck when NM running executor is restarted

2016-06-29 Thread Yesha Vora (JIRA)
Yesha Vora created SPARK-16314: -- Summary: Spark application got stuck when NM running executor is restarted Key: SPARK-16314 URL: https://issues.apache.org/jira/browse/SPARK-16314 Project: Spark

[jira] [Commented] (SPARK-16101) Refactoring CSV data source to be consistent with JSON data source

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356393#comment-15356393 ] Apache Spark commented on SPARK-16101: -- User 'HyukjinKwon' has created a pull reques

[jira] [Assigned] (SPARK-16101) Refactoring CSV data source to be consistent with JSON data source

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16101: Assignee: Apache Spark > Refactoring CSV data source to be consistent with JSON data sourc

[jira] [Assigned] (SPARK-16101) Refactoring CSV data source to be consistent with JSON data source

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16101: Assignee: (was: Apache Spark) > Refactoring CSV data source to be consistent with JSON

[jira] [Assigned] (SPARK-16313) Spark should not silently drop exceptions in file listing

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16313: Assignee: Apache Spark (was: Reynold Xin) > Spark should not silently drop exceptions in

[jira] [Assigned] (SPARK-16313) Spark should not silently drop exceptions in file listing

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16313: Assignee: Reynold Xin (was: Apache Spark) > Spark should not silently drop exceptions in

[jira] [Commented] (SPARK-16313) Spark should not silently drop exceptions in file listing

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356364#comment-15356364 ] Apache Spark commented on SPARK-16313: -- User 'rxin' has created a pull request for t

[jira] [Created] (SPARK-16313) Spark should not silently drop exceptions in file listing

2016-06-29 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-16313: --- Summary: Spark should not silently drop exceptions in file listing Key: SPARK-16313 URL: https://issues.apache.org/jira/browse/SPARK-16313 Project: Spark Issue

[jira] [Commented] (SPARK-16292) Failed to create spark client

2016-06-29 Thread Arcflash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356301#comment-15356301 ] Arcflash commented on SPARK-16292: -- Thanks ,I check my settings and it works fine > Fai

[jira] [Updated] (SPARK-16274) Implement xpath_boolean

2016-06-29 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-16274: Assignee: Peter Lee > Implement xpath_boolean > --- > > Key: SP

[jira] [Resolved] (SPARK-16274) Implement xpath_boolean

2016-06-29 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-16274. - Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 13964 [https://githu

[jira] [Created] (SPARK-16312) Docs for Kafka 0.10 consumer integration

2016-06-29 Thread Cody Koeninger (JIRA)
Cody Koeninger created SPARK-16312: -- Summary: Docs for Kafka 0.10 consumer integration Key: SPARK-16312 URL: https://issues.apache.org/jira/browse/SPARK-16312 Project: Spark Issue Type: Sub-

[jira] [Commented] (SPARK-16144) Add a separate Rd for ML generic methods: read.ml, write.ml, summary, predict

2016-06-29 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356277#comment-15356277 ] Yanbo Liang commented on SPARK-16144: - Sure. > Add a separate Rd for ML generic met

[jira] [Commented] (SPARK-16311) Improve metadata refresh

2016-06-29 Thread Peter Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356275#comment-15356275 ] Peter Lee commented on SPARK-16311: --- I can work on this one ... > Improve metadata re

[jira] [Updated] (SPARK-16311) Improve metadata refresh

2016-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16311: Target Version/s: 2.0.0 > Improve metadata refresh > > > K

[jira] [Updated] (SPARK-16311) Improve metadata refresh

2016-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16311: Description: When the underlying file changes, it can be very confusing to users when they see a F

[jira] [Updated] (SPARK-16311) Improve metadata refresh

2016-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16311: Description: When the underlying file changes, it can be very confusing to users when they see a F

[jira] [Updated] (SPARK-16311) "refresh" should work on temporary tables or views or Dataset/DataFrame

2016-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16311: Description: When the underlying file changes, it can be very confusing to users The refresh com

[jira] [Updated] (SPARK-16311) Improve metadata refresh

2016-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16311: Summary: Improve metadata refresh (was: "refresh" should work on temporary tables or views or Data

[jira] [Updated] (SPARK-16311) "refresh" should work on temporary tables or views or Dataset/DataFrame

2016-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16311: Summary: "refresh" should work on temporary tables or views or Dataset/DataFrame (was: "refresh" s

[jira] [Commented] (SPARK-16114) Add network word count example

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15356255#comment-15356255 ] Apache Spark commented on SPARK-16114: -- User 'jjthomas' has created a pull request f

[jira] [Resolved] (SPARK-16267) Replace deprecated `CREATE TEMPORARY TABLE` from testsuites

2016-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-16267. - Resolution: Fixed Assignee: Dongjoon Hyun Fix Version/s: 2.0.0 > Replace deprecat

[jira] [Resolved] (SPARK-16134) optimizer rules for typed filter

2016-06-29 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-16134. Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 13846 [https://github.

[jira] [Commented] (SPARK-16308) SparkR csv source should have the same default na.string as R

2016-06-29 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15355992#comment-15355992 ] Felix Cheung commented on SPARK-16308: -- could someone please close this bug (I can't

[jira] [Commented] (SPARK-16309) SparkR csv source should have the same default na.string as R

2016-06-29 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15355991#comment-15355991 ] Felix Cheung commented on SPARK-16309: -- could someone please close this bug (I can't

[jira] [Closed] (SPARK-16308) SparkR csv source should have the same default na.string as R

2016-06-29 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung closed SPARK-16308. > SparkR csv source should have the same default na.string as R > -

[jira] [Resolved] (SPARK-16308) SparkR csv source should have the same default na.string as R

2016-06-29 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-16308. -- Resolution: Duplicate > SparkR csv source should have the same default na.string as R > ---

[jira] [Commented] (SPARK-16310) SparkR csv source should have the same default na.string as R

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15355985#comment-15355985 ] Apache Spark commented on SPARK-16310: -- User 'felixcheung' has created a pull reques

[jira] [Commented] (SPARK-16309) SparkR csv source should have the same default na.string as R

2016-06-29 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15355988#comment-15355988 ] Felix Cheung commented on SPARK-16309: -- dup of https://issues.apache.org/jira/browse

[jira] [Commented] (SPARK-16308) SparkR csv source should have the same default na.string as R

2016-06-29 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15355989#comment-15355989 ] Felix Cheung commented on SPARK-16308: -- dup of https://issues.apache.org/jira/browse

[jira] [Assigned] (SPARK-16310) SparkR csv source should have the same default na.string as R

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16310: Assignee: Apache Spark > SparkR csv source should have the same default na.string as R > -

[jira] [Assigned] (SPARK-16310) SparkR csv source should have the same default na.string as R

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16310: Assignee: (was: Apache Spark) > SparkR csv source should have the same default na.stri

[jira] [Assigned] (SPARK-16021) Zero out freed memory in test to help catch correctness bugs

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16021: Assignee: Apache Spark > Zero out freed memory in test to help catch correctness bugs > --

[jira] [Created] (SPARK-16310) SparkR csv source should have the same default na.string as R

2016-06-29 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-16310: Summary: SparkR csv source should have the same default na.string as R Key: SPARK-16310 URL: https://issues.apache.org/jira/browse/SPARK-16310 Project: Spark

[jira] [Commented] (SPARK-16304) LinkageError should not crash Spark executor

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15355949#comment-15355949 ] Apache Spark commented on SPARK-16304: -- User 'petermaxlee' has created a pull reques

[jira] [Resolved] (SPARK-16228) "Percentile" needs explicit cast to double

2016-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-16228. - Resolution: Fixed Assignee: Dongjoon Hyun Fix Version/s: 2.0.0 > "Percentile" nee

[jira] [Assigned] (SPARK-16021) Zero out freed memory in test to help catch correctness bugs

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16021: Assignee: Apache Spark > Zero out freed memory in test to help catch correctness bugs > --

[jira] [Assigned] (SPARK-16304) LinkageError should not crash Spark executor

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16304: Assignee: Apache Spark > LinkageError should not crash Spark executor > --

[jira] [Commented] (SPARK-16021) Zero out freed memory in test to help catch correctness bugs

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15355951#comment-15355951 ] Apache Spark commented on SPARK-16021: -- User 'ericl' has created a pull request for

[jira] [Resolved] (SPARK-16306) Improve testing for DecisionTree variances

2016-06-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16306. --- Resolution: Duplicate > Improve testing for DecisionTree variances >

[jira] [Resolved] (SPARK-16305) LinkageError should not crash Spark executor

2016-06-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16305. --- Resolution: Duplicate > LinkageError should not crash Spark executor > --

[jira] [Assigned] (SPARK-16307) Improve testing for DecisionTree variances

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16307: Assignee: (was: Apache Spark) > Improve testing for DecisionTree variances > -

[jira] [Commented] (SPARK-16307) Improve testing for DecisionTree variances

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15355896#comment-15355896 ] Apache Spark commented on SPARK-16307: -- User 'MechCoder' has created a pull request

[jira] [Assigned] (SPARK-16307) Improve testing for DecisionTree variances

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16307: Assignee: Apache Spark > Improve testing for DecisionTree variances >

[jira] [Created] (SPARK-16307) Improve testing for DecisionTree variances

2016-06-29 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-16307: --- Summary: Improve testing for DecisionTree variances Key: SPARK-16307 URL: https://issues.apache.org/jira/browse/SPARK-16307 Project: Spark Issue Type: Test

[jira] [Resolved] (SPARK-16006) Attemping to write empty DataFrame with no fields throw non-intuitive exception

2016-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-16006. - Resolution: Fixed Assignee: Dongjoon Hyun Fix Version/s: 2.0.0 > Attemping to wri

[jira] [Created] (SPARK-16306) Improve testing for DecisionTree variances

2016-06-29 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-16306: --- Summary: Improve testing for DecisionTree variances Key: SPARK-16306 URL: https://issues.apache.org/jira/browse/SPARK-16306 Project: Spark Issue Type: Test

[jira] [Created] (SPARK-16305) LinkageError should not crash Spark executor

2016-06-29 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-16305: --- Summary: LinkageError should not crash Spark executor Key: SPARK-16305 URL: https://issues.apache.org/jira/browse/SPARK-16305 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-16304) LinkageError should not crash Spark executor

2016-06-29 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-16304: --- Summary: LinkageError should not crash Spark executor Key: SPARK-16304 URL: https://issues.apache.org/jira/browse/SPARK-16304 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-16260) PySpark ML Example Improvements and Cleanup

2016-06-29 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15355810#comment-15355810 ] Miao Wang commented on SPARK-16260: --- [~bryanc]I can help on the QA. Will you create sub

[jira] [Updated] (SPARK-16044) input_file_name() returns empty strings in data sources based on NewHadoopRDD.

2016-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16044: Fix Version/s: 1.6.3 > input_file_name() returns empty strings in data sources based on NewHadoopRD

[jira] [Created] (SPARK-16303) Update SQL examples and programming guide

2016-06-29 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-16303: -- Summary: Update SQL examples and programming guide Key: SPARK-16303 URL: https://issues.apache.org/jira/browse/SPARK-16303 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-16302) Set the right number of partitions for reading data from a local collection.

2016-06-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-16302: -- Priority: Minor (was: Major) Issue Type: Improvement (was: Bug) > Set the right number of parti

[jira] [Assigned] (SPARK-16198) Change the access level of the predict method in spark.ml.Predictor to public

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16198: Assignee: (was: Apache Spark) > Change the access level of the predict method in spark

[jira] [Commented] (SPARK-16198) Change the access level of the predict method in spark.ml.Predictor to public

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15355687#comment-15355687 ] Apache Spark commented on SPARK-16198: -- User 'husseinhazimeh' has created a pull req

[jira] [Assigned] (SPARK-16198) Change the access level of the predict method in spark.ml.Predictor to public

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16198: Assignee: Apache Spark > Change the access level of the predict method in spark.ml.Predict

[jira] [Assigned] (SPARK-16302) Set the right number of partitions for reading data from a local collection.

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16302: Assignee: (was: Apache Spark) > Set the right number of partitions for reading data fr

[jira] [Assigned] (SPARK-16302) Set the right number of partitions for reading data from a local collection.

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16302: Assignee: Apache Spark > Set the right number of partitions for reading data from a local

[jira] [Commented] (SPARK-16302) Set the right number of partitions for reading data from a local collection.

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15355662#comment-15355662 ] Apache Spark commented on SPARK-16302: -- User 'lianhuiwang' has created a pull reques

[jira] [Comment Edited] (SPARK-16144) Add a separate Rd for ML generic methods: read.ml, write.ml, summary, predict

2016-06-29 Thread Xin Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15355648#comment-15355648 ] Xin Ren edited comment on SPARK-16144 at 6/29/16 6:57 PM: -- Sure,

[jira] [Commented] (SPARK-16144) Add a separate Rd for ML generic methods: read.ml, write.ml, summary, predict

2016-06-29 Thread Xin Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15355648#comment-15355648 ] Xin Ren commented on SPARK-16144: - Sure, thank Xiangrui :) > Add a separate Rd for ML ge

[jira] [Updated] (SPARK-16302) Set the right number of partitions for reading data from a local collection.

2016-06-29 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lianhui Wang updated SPARK-16302: - Summary: Set the right number of partitions for reading data from a local collection. (was: Set

[jira] [Updated] (SPARK-16302) Set the default number of partitions for reading data from a local collection.

2016-06-29 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lianhui Wang updated SPARK-16302: - Summary: Set the default number of partitions for reading data from a local collection. (was: Lo

[jira] [Commented] (SPARK-16256) Add Structured Streaming Programming Guide

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15355629#comment-15355629 ] Apache Spark commented on SPARK-16256: -- User 'tdas' has created a pull request for t

[jira] [Created] (SPARK-16302) LocalTableScanExec always use defaultParallelism tasks even though it is very small seq.

2016-06-29 Thread Lianhui Wang (JIRA)
Lianhui Wang created SPARK-16302: Summary: LocalTableScanExec always use defaultParallelism tasks even though it is very small seq. Key: SPARK-16302 URL: https://issues.apache.org/jira/browse/SPARK-16302

[jira] [Resolved] (SPARK-14480) Remove meaningless StringIteratorReader for CSV data source for better performance

2016-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14480. - Resolution: Fixed Assignee: Hyukjin Kwon Fix Version/s: 2.1.0 > Remove meaningles

[jira] [Commented] (SPARK-16144) Add a separate Rd for ML generic methods: read.ml, write.ml, summary, predict

2016-06-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15355601#comment-15355601 ] Xiangrui Meng commented on SPARK-16144: --- [~iamshrek] Thanks for helping! I asked Ya

[jira] [Commented] (SPARK-16144) Add a separate Rd for ML generic methods: read.ml, write.ml, summary, predict

2016-06-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15355599#comment-15355599 ] Xiangrui Meng commented on SPARK-16144: --- [~yanboliang] Do you have time to make a f

[jira] [Updated] (SPARK-16144) Add a separate Rd for ML generic methods: read.ml, write.ml, summary, predict

2016-06-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-16144: -- Assignee: Yanbo Liang (was: Xin Ren) > Add a separate Rd for ML generic methods: read.ml, writ

[jira] [Resolved] (SPARK-16140) Group k-means method in generated doc

2016-06-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-16140. --- Resolution: Fixed Fix Version/s: 2.1.0 2.0.1 > Group k-means method

[jira] [Commented] (SPARK-16301) Analyzer rule for resolving using joins should respect case sensitivity setting

2016-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15355596#comment-15355596 ] Apache Spark commented on SPARK-16301: -- User 'yhuai' has created a pull request for

  1   2   >