[jira] [Commented] (SPARK-20313) Possible lack of join optimization when partitions are in the join condition

2017-04-17 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15972142#comment-15972142 ] Takeshi Yamamuro commented on SPARK-20313: -- What's the issue that you'd like to point out? I

[jira] [Commented] (SPARK-20320) AnalysisException: Columns of grouping_id (count(value#17L)) does not match grouping columns (count(value#17L))

2017-04-17 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15972126#comment-15972126 ] Takeshi Yamamuro commented on SPARK-20320: -- Is this query (putting `AggregateFunction` like

[jira] [Commented] (SPARK-20169) Groupby Bug with Sparksql

2017-04-17 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15972123#comment-15972123 ] Takeshi Yamamuro commented on SPARK-20169: -- I tried this query in v2.1 and master though, I

[jira] [Commented] (SPARK-20312) query optimizer calls udf with null values when it doesn't expect them

2017-04-17 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971997#comment-15971997 ] Takeshi Yamamuro commented on SPARK-20312: -- I made the query a bit simpler and tried though, I

[jira] [Commented] (SPARK-20299) NullPointerException when null and string are in a tuple while encoding Dataset

2017-04-17 Thread Umesh Chaudhary (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971977#comment-15971977 ] Umesh Chaudhary commented on SPARK-20299: - [~marmbrus], the issue seems to caused by changes in

[jira] [Created] (SPARK-20363) sessionstate.get is get the same object in hive project, when I use spark-beeline

2017-04-17 Thread QQShu1 (JIRA)
QQShu1 created SPARK-20363: -- Summary: sessionstate.get is get the same object in hive project, when I use spark-beeline Key: SPARK-20363 URL: https://issues.apache.org/jira/browse/SPARK-20363 Project:

[jira] [Assigned] (SPARK-20311) SQL "range(N) as alias" or "range(N) alias" doesn't work

2017-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20311: Assignee: Apache Spark > SQL "range(N) as alias" or "range(N) alias" doesn't work >

[jira] [Commented] (SPARK-20311) SQL "range(N) as alias" or "range(N) alias" doesn't work

2017-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971932#comment-15971932 ] Apache Spark commented on SPARK-20311: -- User 'maropu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-20311) SQL "range(N) as alias" or "range(N) alias" doesn't work

2017-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20311: Assignee: (was: Apache Spark) > SQL "range(N) as alias" or "range(N) alias" doesn't

[jira] [Updated] (SPARK-20349) ListFunctions returns duplicate functions after using persistent functions

2017-04-17 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-20349: Fix Version/s: (was: 2.1.2) > ListFunctions returns duplicate functions after using persistent

[jira] [Commented] (SPARK-16742) Kerberos support for Spark on Mesos

2017-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971897#comment-15971897 ] Apache Spark commented on SPARK-16742: -- User 'mgummelt' has created a pull request for this issue:

[jira] [Commented] (SPARK-20361) JVM locale affects SQL type names

2017-04-17 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971896#comment-15971896 ] Hyukjin Kwon commented on SPARK-20361: -- Seems now fixed as below just for sure: {code} >>> locale =

[jira] [Commented] (SPARK-20287) Kafka Consumer should be able to subscribe to more than one topic partition

2017-04-17 Thread Stephane Maarek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971893#comment-15971893 ] Stephane Maarek commented on SPARK-20287: - [~c...@koeninger.org] It makes sense. I didn't

[jira] [Closed] (SPARK-20287) Kafka Consumer should be able to subscribe to more than one topic partition

2017-04-17 Thread Stephane Maarek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stephane Maarek closed SPARK-20287. --- Resolution: Not A Problem > Kafka Consumer should be able to subscribe to more than one

[jira] [Commented] (SPARK-19986) Make pyspark.streaming.tests.CheckpointTests more stable

2017-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971863#comment-15971863 ] Apache Spark commented on SPARK-19986: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Resolved] (SPARK-20362) spark submit not considering user defined Configs (Pyspark)

2017-04-17 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-20362. Resolution: Duplicate > spark submit not considering user defined Configs (Pyspark) >

[jira] [Created] (SPARK-20362) spark submit not considering user defined Configs (Pyspark)

2017-04-17 Thread Harish (JIRA)
Harish created SPARK-20362: -- Summary: spark submit not considering user defined Configs (Pyspark) Key: SPARK-20362 URL: https://issues.apache.org/jira/browse/SPARK-20362 Project: Spark Issue Type:

[jira] [Commented] (SPARK-20361) JVM locale affects SQL type names

2017-04-17 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971734#comment-15971734 ] Maciej Szymkiewicz commented on SPARK-20361: Indeed. > JVM locale affects SQL type names >

[jira] [Closed] (SPARK-20361) JVM locale affects SQL type names

2017-04-17 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz closed SPARK-20361. -- Resolution: Fixed > JVM locale affects SQL type names >

[jira] [Commented] (SPARK-20361) JVM locale affects SQL type names

2017-04-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971723#comment-15971723 ] Sean Owen commented on SPARK-20361: --- This is the same as

[jira] [Updated] (SPARK-20361) JVM locale affects SQL type names

2017-04-17 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-20361: --- Description: Steps to reproduce: {code} from pyspark.sql.types import IntegerType

[jira] [Created] (SPARK-20361) JVM locale affects SQL type names

2017-04-17 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-20361: -- Summary: JVM locale affects SQL type names Key: SPARK-20361 URL: https://issues.apache.org/jira/browse/SPARK-20361 Project: Spark Issue Type:

[jira] [Commented] (SPARK-17647) SQL LIKE does not handle backslashes correctly

2017-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17647?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971675#comment-15971675 ] Apache Spark commented on SPARK-17647: -- User 'felixcheung' has created a pull request for this

[jira] [Assigned] (SPARK-20360) Create repr functions for interpreters to use

2017-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20360: Assignee: (was: Apache Spark) > Create repr functions for interpreters to use >

[jira] [Commented] (SPARK-20360) Create repr functions for interpreters to use

2017-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971651#comment-15971651 ] Apache Spark commented on SPARK-20360: -- User 'rgbkrk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-20360) Create repr functions for interpreters to use

2017-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20360: Assignee: Apache Spark > Create repr functions for interpreters to use >

[jira] [Comment Edited] (SPARK-18085) Better History Server scalability for many / large applications

2017-04-17 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971594#comment-15971594 ] Marcelo Vanzin edited comment on SPARK-18085 at 4/17/17 8:57 PM: - I'm

[jira] [Commented] (SPARK-14245) webUI should display the user

2017-04-17 Thread Alex Bozarth (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971633#comment-15971633 ] Alex Bozarth commented on SPARK-14245: -- Given it's been a year since I fixed that PR I honestly

[jira] [Commented] (SPARK-20349) ListFunctions returns duplicate functions after using persistent functions

2017-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971604#comment-15971604 ] Apache Spark commented on SPARK-20349: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Commented] (SPARK-18085) Better History Server scalability for many / large applications

2017-04-17 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971594#comment-15971594 ] Marcelo Vanzin commented on SPARK-18085: I'm getting close to a point where I think the code can

[jira] [Created] (SPARK-20360) Create repr functions for interpreters to use

2017-04-17 Thread Kyle Kelley (JIRA)
Kyle Kelley created SPARK-20360: --- Summary: Create repr functions for interpreters to use Key: SPARK-20360 URL: https://issues.apache.org/jira/browse/SPARK-20360 Project: Spark Issue Type:

[jira] [Commented] (SPARK-20359) Catalyst EliminateOuterJoin optimization can cause NPE

2017-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971569#comment-15971569 ] Apache Spark commented on SPARK-20359: -- User 'koertkuipers' has created a pull request for this

[jira] [Assigned] (SPARK-20359) Catalyst EliminateOuterJoin optimization can cause NPE

2017-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20359: Assignee: (was: Apache Spark) > Catalyst EliminateOuterJoin optimization can cause

[jira] [Assigned] (SPARK-20359) Catalyst EliminateOuterJoin optimization can cause NPE

2017-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20359: Assignee: Apache Spark > Catalyst EliminateOuterJoin optimization can cause NPE >

[jira] [Created] (SPARK-20359) Catalyst EliminateOuterJoin optimization can cause NPE

2017-04-17 Thread koert kuipers (JIRA)
koert kuipers created SPARK-20359: - Summary: Catalyst EliminateOuterJoin optimization can cause NPE Key: SPARK-20359 URL: https://issues.apache.org/jira/browse/SPARK-20359 Project: Spark

[jira] [Commented] (SPARK-14245) webUI should display the user

2017-04-17 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971536#comment-15971536 ] Imran Rashid commented on SPARK-14245: -- thanks, I should have looked in the PR first, sorry. But I

[jira] [Assigned] (SPARK-20358) Executors failing stage on interrupted exception thrown by cancelled tasks

2017-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20358: Assignee: (was: Apache Spark) > Executors failing stage on interrupted exception

[jira] [Commented] (SPARK-20358) Executors failing stage on interrupted exception thrown by cancelled tasks

2017-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971516#comment-15971516 ] Apache Spark commented on SPARK-20358: -- User 'ericl' has created a pull request for this issue:

[jira] [Assigned] (SPARK-20358) Executors failing stage on interrupted exception thrown by cancelled tasks

2017-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20358: Assignee: Apache Spark > Executors failing stage on interrupted exception thrown by

[jira] [Created] (SPARK-20358) Executors failing stage on interrupted exception thrown by cancelled tasks

2017-04-17 Thread Eric Liang (JIRA)
Eric Liang created SPARK-20358: -- Summary: Executors failing stage on interrupted exception thrown by cancelled tasks Key: SPARK-20358 URL: https://issues.apache.org/jira/browse/SPARK-20358 Project:

[jira] [Updated] (SPARK-20357) Expose Calendar.getWeekYear() as Spark SQL date function to be consistent with weekofyear()

2017-04-17 Thread Jeeyoung Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeeyoung Kim updated SPARK-20357: - Description: Since weeks and years are extracted using different boundaries (weeks happen every

[jira] [Updated] (SPARK-20357) Expose Calendar.getWeekYear() as Spark SQL date function to be consistent with weekofyear()

2017-04-17 Thread Jeeyoung Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeeyoung Kim updated SPARK-20357: - Affects Version/s: 2.1.0 > Expose Calendar.getWeekYear() as Spark SQL date function to be

[jira] [Updated] (SPARK-20357) Expose Calendar.getWeekYear() as Spark SQL date function to be consistent with weekofyear()

2017-04-17 Thread Jeeyoung Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeeyoung Kim updated SPARK-20357: - Description: Since weeks and years are extracted using different boundaries (weeks happen every

[jira] [Created] (SPARK-20357) Expose Calendar.getWeekYear() as Spark SQL date function to be consistent with weekofyear()

2017-04-17 Thread Jeeyoung Kim (JIRA)
Jeeyoung Kim created SPARK-20357: Summary: Expose Calendar.getWeekYear() as Spark SQL date function to be consistent with weekofyear() Key: SPARK-20357 URL: https://issues.apache.org/jira/browse/SPARK-20357

[jira] [Commented] (SPARK-20299) NullPointerException when null and string are in a tuple while encoding Dataset

2017-04-17 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971491#comment-15971491 ] Michael Armbrust commented on SPARK-20299: -- What input are you looking for? >

[jira] [Resolved] (SPARK-17647) SQL LIKE does not handle backslashes correctly

2017-04-17 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-17647. - Resolution: Fixed Assignee: Xiangrui Meng Fix Version/s: 2.2.0

[jira] [Commented] (SPARK-14245) webUI should display the user

2017-04-17 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971365#comment-15971365 ] Thomas Graves commented on SPARK-14245: --- see the commend in the PR, I think there was a race with

[jira] [Commented] (SPARK-14245) webUI should display the user

2017-04-17 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971337#comment-15971337 ] Imran Rashid commented on SPARK-14245: -- Hi [~ajbozarth] [~tgraves] -- I was just taking a look at

[jira] [Resolved] (SPARK-20349) ListFunctions returns duplicate functions after using persistent functions

2017-04-17 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-20349. - Resolution: Fixed Fix Version/s: 2.2.0 2.1.2 > ListFunctions returns duplicate

[jira] [Created] (SPARK-20356) Spark sql group by returns incorrect results after join + distinct transformations

2017-04-17 Thread Chris Kipers (JIRA)
Chris Kipers created SPARK-20356: Summary: Spark sql group by returns incorrect results after join + distinct transformations Key: SPARK-20356 URL: https://issues.apache.org/jira/browse/SPARK-20356

[jira] [Commented] (SPARK-20355) Display Spark version on history page

2017-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971120#comment-15971120 ] Apache Spark commented on SPARK-20355: -- User 'redsanket' has created a pull request for this issue:

[jira] [Assigned] (SPARK-20355) Display Spark version on history page

2017-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20355: Assignee: Apache Spark > Display Spark version on history page >

[jira] [Assigned] (SPARK-20355) Display Spark version on history page

2017-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20355: Assignee: (was: Apache Spark) > Display Spark version on history page >

[jira] [Created] (SPARK-20355) Display Spark version on history page

2017-04-17 Thread Sanket Reddy (JIRA)
Sanket Reddy created SPARK-20355: Summary: Display Spark version on history page Key: SPARK-20355 URL: https://issues.apache.org/jira/browse/SPARK-20355 Project: Spark Issue Type:

[jira] [Commented] (SPARK-16599) java.util.NoSuchElementException: None.get at at org.apache.spark.storage.BlockInfoManager.releaseAllLocksForTask(BlockInfoManager.scala:343)

2017-04-17 Thread cen yuhai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971081#comment-15971081 ] cen yuhai commented on SPARK-16599: --- [~srowen] I also encounter this problem >

[jira] [Commented] (SPARK-20340) Size estimate very wrong in ExternalAppendOnlyMap from CoGroupedRDD, cause OOM

2017-04-17 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971071#comment-15971071 ] Thomas Graves commented on SPARK-20340: --- Right, I figured it was probably for performance, the

[jira] [Commented] (SPARK-20339) Issue in regex_replace in Apache Spark Java

2017-04-17 Thread Nischay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971054#comment-15971054 ] Nischay commented on SPARK-20339: - Sure I'll not add redundant code in future, also I'll use

[jira] [Commented] (SPARK-19659) Fetch big blocks to disk when shuffle-read

2017-04-17 Thread jin xing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971025#comment-15971025 ] jin xing commented on SPARK-19659: -- [~cloud_fan] I refined the the pr. In current change, I'd propose:

[jira] [Commented] (SPARK-19951) Add string concatenate operator || to Spark SQL

2017-04-17 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971001#comment-15971001 ] Takeshi Yamamuro commented on SPARK-19951: -- Since this operation is supported in PostgreSQL and

[jira] [Comment Edited] (SPARK-20336) spark.read.csv() with wholeFile=True option fails to read non ASCII unicode characters

2017-04-17 Thread HanCheol Cho (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15970944#comment-15970944 ] HanCheol Cho edited comment on SPARK-20336 at 4/17/17 10:32 AM: This

[jira] [Comment Edited] (SPARK-20336) spark.read.csv() with wholeFile=True option fails to read non ASCII unicode characters

2017-04-17 Thread HanCheol Cho (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15970944#comment-15970944 ] HanCheol Cho edited comment on SPARK-20336 at 4/17/17 10:31 AM: This

[jira] [Resolved] (SPARK-20310) Dependency convergence error for scala-xml

2017-04-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-20310. --- Resolution: Not A Problem Reopen if that suggestion doesn't work, and, if there is a change you are

[jira] [Commented] (SPARK-20336) spark.read.csv() with wholeFile=True option fails to read non ASCII unicode characters

2017-04-17 Thread HanCheol Cho (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15970944#comment-15970944 ] HanCheol Cho commented on SPARK-20336: -- This time, I checked whether PySpark uses the same version

[jira] [Comment Edited] (SPARK-20347) Provide AsyncRDDActions in Python

2017-04-17 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15970933#comment-15970933 ] Maciej Szymkiewicz edited comment on SPARK-20347 at 4/17/17 10:22 AM:

[jira] [Comment Edited] (SPARK-20347) Provide AsyncRDDActions in Python

2017-04-17 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15970933#comment-15970933 ] Maciej Szymkiewicz edited comment on SPARK-20347 at 4/17/17 10:17 AM:

[jira] [Comment Edited] (SPARK-20347) Provide AsyncRDDActions in Python

2017-04-17 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15970933#comment-15970933 ] Maciej Szymkiewicz edited comment on SPARK-20347 at 4/17/17 10:16 AM:

[jira] [Commented] (SPARK-20347) Provide AsyncRDDActions in Python

2017-04-17 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15970933#comment-15970933 ] Maciej Szymkiewicz commented on SPARK-20347: This is a nice idea but I wonder what would be

[jira] [Resolved] (SPARK-16892) flatten function to get flat array (or map) column from array of array (or array of map) column

2017-04-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16892. --- Resolution: Not A Problem > flatten function to get flat array (or map) column from array of array

[jira] [Closed] (SPARK-20352) PySpark SparkSession initialization take longer every iteration in a single application

2017-04-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen closed SPARK-20352. - > PySpark SparkSession initialization take longer every iteration in a single > application >

[jira] [Resolved] (SPARK-20352) PySpark SparkSession initialization take longer every iteration in a single application

2017-04-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-20352. --- Resolution: Not A Problem "My code takes too long to run" is not a JIRA. You haven't addressed the

[jira] [Reopened] (SPARK-20352) PySpark SparkSession initialization take longer every iteration in a single application

2017-04-17 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] hosein reopened SPARK-20352: > PySpark SparkSession initialization take longer every iteration in a single > application >

[jira] [Commented] (SPARK-20352) PySpark SparkSession initialization take longer every iteration in a single application

2017-04-17 Thread hosein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15970906#comment-15970906 ] hosein commented on SPARK-20352: I monitor execution time of every line in my code and this line: spark

[jira] [Resolved] (SPARK-20352) PySpark SparkSession initialization take longer every iteration in a single application

2017-04-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-20352. --- Resolution: Not A Problem Fix Version/s: (was: 2.1.0) At the least, it's not supported to

[jira] [Resolved] (SPARK-19976) DirectStream API throws OffsetOutOfRange Exception

2017-04-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-19976. --- Resolution: Not A Problem > DirectStream API throws OffsetOutOfRange Exception >

[jira] [Updated] (SPARK-20353) Implement Tensorflow TFRecords file format

2017-04-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-20353: -- Priority: Minor (was: Major) I think this is too app-specific to live in Spark, and should just be in

[jira] [Commented] (SPARK-20299) NullPointerException when null and string are in a tuple while encoding Dataset

2017-04-17 Thread Umesh Chaudhary (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15970857#comment-15970857 ] Umesh Chaudhary commented on SPARK-20299: - [~lwlin], I want to work on this but waiting on inputs

[jira] [Commented] (SPARK-19368) Very bad performance in BlockMatrix.toIndexedRowMatrix()

2017-04-17 Thread Angelos Kaltsikis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15970853#comment-15970853 ] Angelos Kaltsikis commented on SPARK-19368: --- By any chance this will get fixed soon? > Very

[jira] [Updated] (SPARK-20335) Children expressions of Hive UDF impacts the determinism of Hive UDF

2017-04-17 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-20335: Fix Version/s: 2.1.1 > Children expressions of Hive UDF impacts the determinism of Hive UDF >

[jira] [Commented] (SPARK-20299) NullPointerException when null and string are in a tuple while encoding Dataset

2017-04-17 Thread Liwei Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15970811#comment-15970811 ] Liwei Lin commented on SPARK-20299: --- hi [~umesh9...@gmail.com], are you planning to work on this? In

[jira] [Commented] (SPARK-20354) /api/v1/applications’ return sparkUser is null in REST API.

2017-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15970772#comment-15970772 ] Apache Spark commented on SPARK-20354: -- User 'guoxiaolongzte' has created a pull request for this

[jira] [Assigned] (SPARK-20354) /api/v1/applications’ return sparkUser is null in REST API.

2017-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20354: Assignee: Apache Spark > /api/v1/applications’ return sparkUser is null in REST API. >

[jira] [Assigned] (SPARK-20354) /api/v1/applications’ return sparkUser is null in REST API.

2017-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20354: Assignee: (was: Apache Spark) > /api/v1/applications’ return sparkUser is null in

[jira] [Updated] (SPARK-20354) When I request access to the 'http: //ip:port/api/v1/applications' link, return 'sparkUser' is empty in REST API.

2017-04-17 Thread guoxiaolongzte (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] guoxiaolongzte updated SPARK-20354: --- Description: When I request access to the 'http: //ip:port/api/v1/applications' link, get

[jira] [Created] (SPARK-20354) /api/v1/applications’ return sparkUser is null in REST API.

2017-04-17 Thread guoxiaolongzte (JIRA)
guoxiaolongzte created SPARK-20354: -- Summary: /api/v1/applications’ return sparkUser is null in REST API. Key: SPARK-20354 URL: https://issues.apache.org/jira/browse/SPARK-20354 Project: Spark

[jira] [Created] (SPARK-20353) Implement Tensorflow TFRecords file format

2017-04-17 Thread Mathew Wicks (JIRA)
Mathew Wicks created SPARK-20353: Summary: Implement Tensorflow TFRecords file format Key: SPARK-20353 URL: https://issues.apache.org/jira/browse/SPARK-20353 Project: Spark Issue Type: