[jira] [Resolved] (SPARK-23778) SparkContext.emptyRDD confuses SparkContext.union

2018-06-19 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-23778. - Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21333

[jira] [Assigned] (SPARK-23778) SparkContext.emptyRDD confuses SparkContext.union

2018-06-19 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-23778: --- Assignee: Marco Gaido > SparkContext.emptyRDD confuses SparkContext.union >

[jira] [Created] (SPARK-24598) SPARK SQL:Datatype overflow conditions gives incorrect result

2018-06-19 Thread navya (JIRA)
navya created SPARK-24598: - Summary: SPARK SQL:Datatype overflow conditions gives incorrect result Key: SPARK-24598 URL: https://issues.apache.org/jira/browse/SPARK-24598 Project: Spark Issue Type:

[jira] [Commented] (SPARK-24493) Kerberos Ticket Renewal is failing in Hadoop 2.8+ and Hadoop 3

2018-06-19 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517720#comment-16517720 ] Saisai Shao commented on SPARK-24493: - This BUG is fixed in HDFS-12670, Spark should bump its

[jira] [Commented] (SPARK-10781) Allow certain number of failed tasks and allow job to succeed

2018-06-19 Thread Hieu Tri Huynh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517716#comment-16517716 ] Hieu Tri Huynh commented on SPARK-10781: I attached a proposed solution for this Jira. Hope to

[jira] [Updated] (SPARK-10781) Allow certain number of failed tasks and allow job to succeed

2018-06-19 Thread Hieu Tri Huynh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hieu Tri Huynh updated SPARK-10781: --- Attachment: SPARK_10781_Proposed_Solution.pdf > Allow certain number of failed tasks and

[jira] [Resolved] (SPARK-24593) can not find hive table after spark streaming started

2018-06-19 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24593. -- Resolution: Invalid Sounds more like a question for now. Let's redirect questions to dev/user

[jira] [Resolved] (SPARK-24592) can not find hive table after spark streaming start

2018-06-19 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24592. -- Resolution: Duplicate Looks a duplicate of SPARK-24593. > can not find hive table after

[jira] [Resolved] (SPARK-24595) What about additional support on deeply nested column?

2018-06-19 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24595. -- Resolution: Invalid Let's redirect the question to dev/user mailing list before filing here

[jira] [Resolved] (SPARK-24336) Support 'pass through' transformation in BasicOperators

2018-06-19 Thread Jungtaek Lim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim resolved SPARK-24336. -- Resolution: Invalid > Support 'pass through' transformation in BasicOperators >

[jira] [Created] (SPARK-24597) Spark ML Pipeline Should support non-linear models => DAGPipeline

2018-06-19 Thread Michael Dreibelbis (JIRA)
Michael Dreibelbis created SPARK-24597: -- Summary: Spark ML Pipeline Should support non-linear models => DAGPipeline Key: SPARK-24597 URL: https://issues.apache.org/jira/browse/SPARK-24597

[jira] [Assigned] (SPARK-24596) Non-cascading Cache Invalidation

2018-06-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24596: Assignee: (was: Apache Spark) > Non-cascading Cache Invalidation >

[jira] [Assigned] (SPARK-24596) Non-cascading Cache Invalidation

2018-06-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24596: Assignee: Apache Spark > Non-cascading Cache Invalidation >

[jira] [Commented] (SPARK-24596) Non-cascading Cache Invalidation

2018-06-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517599#comment-16517599 ] Apache Spark commented on SPARK-24596: -- User 'maryannxue' has created a pull request for this

[jira] [Resolved] (SPARK-24583) Wrong schema type in InsertIntoDataSourceCommand

2018-06-19 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-24583. - Resolution: Fixed Assignee: Maryann Xue Fix Version/s: 2.3.2 > Wrong schema type in

[jira] [Created] (SPARK-24596) Non-cascading Cache Invalidation

2018-06-19 Thread Maryann Xue (JIRA)
Maryann Xue created SPARK-24596: --- Summary: Non-cascading Cache Invalidation Key: SPARK-24596 URL: https://issues.apache.org/jira/browse/SPARK-24596 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-24565) Add API for in Structured Streaming for exposing output rows of each microbatch as a DataFrame

2018-06-19 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-24565. -- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21571

[jira] [Comment Edited] (SPARK-24578) Reading remote cache block behavior changes and causes timeout issue

2018-06-19 Thread Wenbo Zhao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517532#comment-16517532 ] Wenbo Zhao edited comment on SPARK-24578 at 6/19/18 8:55 PM: - woop,

[jira] [Commented] (SPARK-24578) Reading remote cache block behavior changes and causes timeout issue

2018-06-19 Thread Wenbo Zhao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517532#comment-16517532 ] Wenbo Zhao commented on SPARK-24578: woop, [~attilapiros], sorry, I didn't you have created a PR. 

[jira] [Commented] (SPARK-24578) Reading remote cache block behavior changes and causes timeout issue

2018-06-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517529#comment-16517529 ] Apache Spark commented on SPARK-24578: -- User 'WenboZhao' has created a pull request for this issue:

[jira] [Commented] (SPARK-24578) Reading remote cache block behavior changes and causes timeout issue

2018-06-19 Thread Attila Zsolt Piros (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517515#comment-16517515 ] Attila Zsolt Piros commented on SPARK-24578: [~wbzhao] oh sorry I read your comment late,

[jira] [Resolved] (SPARK-24534) Add a way to bypass entrypoint.sh script if no spark cmd is passed

2018-06-19 Thread Erik Erlandson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Erik Erlandson resolved SPARK-24534. Resolution: Fixed Fix Version/s: 2.4.0 > Add a way to bypass entrypoint.sh script

[jira] [Assigned] (SPARK-24578) Reading remote cache block behavior changes and causes timeout issue

2018-06-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24578: Assignee: Apache Spark > Reading remote cache block behavior changes and causes timeout

[jira] [Assigned] (SPARK-24578) Reading remote cache block behavior changes and causes timeout issue

2018-06-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24578: Assignee: (was: Apache Spark) > Reading remote cache block behavior changes and

[jira] [Commented] (SPARK-24578) Reading remote cache block behavior changes and causes timeout issue

2018-06-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517505#comment-16517505 ] Apache Spark commented on SPARK-24578: -- User 'attilapiros' has created a pull request for this

[jira] [Commented] (SPARK-24578) Reading remote cache block behavior changes and causes timeout issue

2018-06-19 Thread Wenbo Zhao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517465#comment-16517465 ] Wenbo Zhao commented on SPARK-24578: [~attilapiros] if don't mind, I could create a PR for it :) >

[jira] [Commented] (SPARK-24578) Reading remote cache block behavior changes and causes timeout issue

2018-06-19 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517461#comment-16517461 ] Marcelo Vanzin commented on SPARK-24578: Ah, I see. That makes sense. (I actually took at look

[jira] [Commented] (SPARK-24578) Reading remote cache block behavior changes and causes timeout issue

2018-06-19 Thread Wenbo Zhao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517458#comment-16517458 ] Wenbo Zhao commented on SPARK-24578: Hi [~vanzin].  the commit 

[jira] [Commented] (SPARK-24578) Reading remote cache block behavior changes and causes timeout issue

2018-06-19 Thread Attila Zsolt Piros (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517452#comment-16517452 ] Attila Zsolt Piros commented on SPARK-24578: [~wbzhao] Yes, you are right, readerIndex is

[jira] [Commented] (SPARK-24578) Reading remote cache block behavior changes and causes timeout issue

2018-06-19 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517449#comment-16517449 ] Marcelo Vanzin commented on SPARK-24578: Attila's suggestion looks good, but I wonder what

[jira] [Resolved] (SPARK-12436) If all values of a JSON field is null, JSON's inferSchema should return NullType instead of StringType

2018-06-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-12436. --- Resolution: Resolved Resolved by SPARK-23772 (an alternative solution to this JIRA) > If

[jira] [Updated] (SPARK-24587) RDD.takeOrdered uses reduce, pulling all partition data to the driver

2018-06-19 Thread Ryan Deak (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Deak updated SPARK-24587: -- Description: *NOTE*: _This is likely a *very* impactful change, and likely only matters when {{num}}

[jira] [Updated] (SPARK-24295) Purge Structured streaming FileStreamSinkLog metadata compact file data.

2018-06-19 Thread Iqbal Singh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Iqbal Singh updated SPARK-24295: Priority: Major (was: Minor) > Purge Structured streaming FileStreamSinkLog metadata compact

[jira] [Updated] (SPARK-24295) Purge Structured streaming FileStreamSinkLog metadata compact file data.

2018-06-19 Thread Iqbal Singh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Iqbal Singh updated SPARK-24295: Issue Type: Bug (was: Wish) > Purge Structured streaming FileStreamSinkLog metadata compact file

[jira] [Commented] (SPARK-24578) Reading remote cache block behavior changes and causes timeout issue

2018-06-19 Thread Wenbo Zhao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517441#comment-16517441 ] Wenbo Zhao commented on SPARK-24578: [~irashid] Yes, that is exactly what I saw in our side. >

[jira] [Commented] (SPARK-24578) Reading remote cache block behavior changes and causes timeout issue

2018-06-19 Thread Wenbo Zhao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517440#comment-16517440 ] Wenbo Zhao commented on SPARK-24578: Hi [~attilapiros], I guess what you suggest is  {code:java}

[jira] [Commented] (SPARK-24578) Reading remote cache block behavior changes and causes timeout issue

2018-06-19 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517439#comment-16517439 ] Imran Rashid commented on SPARK-24578: -- I think [~attilapiros] may be right -- can you send a PR to

[jira] [Commented] (SPARK-24578) Reading remote cache block behavior changes and causes timeout issue

2018-06-19 Thread Attila Zsolt Piros (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517416#comment-16517416 ] Attila Zsolt Piros commented on SPARK-24578: I have written a small test I know it is a bit 

[jira] [Commented] (SPARK-650) Add a "setup hook" API for running initialization code on each executor

2018-06-19 Thread Avi minsky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517398#comment-16517398 ] Avi minsky commented on SPARK-650: -- We encountered an issue with the combination of lazy static loading

[jira] [Resolved] (SPARK-24556) ReusedExchange should rewrite output partitioning also when child's partitioning is RangePartitioning

2018-06-19 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-24556. - Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21564

[jira] [Assigned] (SPARK-24556) ReusedExchange should rewrite output partitioning also when child's partitioning is RangePartitioning

2018-06-19 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-24556: --- Assignee: yucai > ReusedExchange should rewrite output partitioning also when child's >

[jira] [Resolved] (SPARK-24521) Fix ineffective test in CachedTableSuite

2018-06-19 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-24521. - Resolution: Fixed Fix Version/s: 2.4.0 2.3.2 > Fix ineffective test in

[jira] [Assigned] (SPARK-24521) Fix ineffective test in CachedTableSuite

2018-06-19 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li reassigned SPARK-24521: --- Assignee: Li Jin > Fix ineffective test in CachedTableSuite >

[jira] [Commented] (SPARK-23427) spark.sql.autoBroadcastJoinThreshold causing OOM exception in the driver

2018-06-19 Thread Dean Wampler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517326#comment-16517326 ] Dean Wampler commented on SPARK-23427: -- Hi, Kazuaki. Any update on this issue? Any pointers on what

[jira] [Commented] (SPARK-24578) Reading remote cache block behavior changes and causes timeout issue

2018-06-19 Thread Attila Zsolt Piros (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517287#comment-16517287 ] Attila Zsolt Piros commented on SPARK-24578: The copyByteBuf() along with transferTo() is

[jira] [Comment Edited] (SPARK-24498) Add JDK compiler for runtime codegen

2018-06-19 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16516917#comment-16516917 ] Takeshi Yamamuro edited comment on SPARK-24498 at 6/19/18 3:12 PM: ---

[jira] [Created] (SPARK-24595) What about additional support on deeply nested column?

2018-06-19 Thread Zejun Li (JIRA)
Zejun Li created SPARK-24595: Summary: What about additional support on deeply nested column? Key: SPARK-24595 URL: https://issues.apache.org/jira/browse/SPARK-24595 Project: Spark Issue Type:

[jira] [Commented] (SPARK-24578) Reading remote cache block behavior changes and causes timeout issue

2018-06-19 Thread Wenbo Zhao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517180#comment-16517180 ] Wenbo Zhao commented on SPARK-24578: After digging more details, this commit 

[jira] [Updated] (SPARK-24519) MapStatus has 2000 hardcoded

2018-06-19 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves updated SPARK-24519: -- Description: MapStatus uses hardcoded value of 2000 partitions to determine if it should use

[jira] [Commented] (SPARK-24498) Add JDK compiler for runtime codegen

2018-06-19 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517097#comment-16517097 ] Kazuaki Ishizaki commented on SPARK-24498: -- [~maropu] thank you, let us use this as a start

[jira] [Commented] (SPARK-24594) Introduce metrics for YARN executor allocation problems

2018-06-19 Thread Attila Zsolt Piros (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517028#comment-16517028 ] Attila Zsolt Piros commented on SPARK-24594: I am working on this. > Introduce metrics for

[jira] [Updated] (SPARK-24594) Introduce metrics for YARN executor allocation problems

2018-06-19 Thread Attila Zsolt Piros (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Attila Zsolt Piros updated SPARK-24594: --- Description: Within SPARK-16630 it come up to introduce metrics for YARN executor

[jira] [Created] (SPARK-24594) Introduce metrics for YARN executor allocation problems

2018-06-19 Thread Attila Zsolt Piros (JIRA)
Attila Zsolt Piros created SPARK-24594: -- Summary: Introduce metrics for YARN executor allocation problems Key: SPARK-24594 URL: https://issues.apache.org/jira/browse/SPARK-24594 Project: Spark

[jira] [Comment Edited] (SPARK-21063) Spark return an empty result from remote hadoop cluster

2018-06-19 Thread Paul Staab (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16516990#comment-16516990 ] Paul Staab edited comment on SPARK-21063 at 6/19/18 12:01 PM: -- I was able

[jira] [Comment Edited] (SPARK-21063) Spark return an empty result from remote hadoop cluster

2018-06-19 Thread Paul Staab (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16516990#comment-16516990 ] Paul Staab edited comment on SPARK-21063 at 6/19/18 11:59 AM: -- I was able

[jira] [Comment Edited] (SPARK-21063) Spark return an empty result from remote hadoop cluster

2018-06-19 Thread Paul Staab (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16516990#comment-16516990 ] Paul Staab edited comment on SPARK-21063 at 6/19/18 11:59 AM: -- I was able

[jira] [Comment Edited] (SPARK-21063) Spark return an empty result from remote hadoop cluster

2018-06-19 Thread Paul Staab (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16516990#comment-16516990 ] Paul Staab edited comment on SPARK-21063 at 6/19/18 11:58 AM: -- I was able

[jira] [Commented] (SPARK-21063) Spark return an empty result from remote hadoop cluster

2018-06-19 Thread Paul Staab (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16516990#comment-16516990 ] Paul Staab commented on SPARK-21063: I was able to find a workaround for this problem on Spark

[jira] [Commented] (SPARK-24498) Add JDK compiler for runtime codegen

2018-06-19 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16516917#comment-16516917 ] Takeshi Yamamuro commented on SPARK-24498: -- Probably, kiszk will make polite code for this (and

[jira] [Updated] (SPARK-24593) can not find hive table after spark streaming started

2018-06-19 Thread lhq (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lhq updated SPARK-24593: Environment: {code:java} // demo code /* * Licensed to the Apache Software Foundation (ASF) under one or more *

[jira] [Created] (SPARK-24593) can not find hive table after spark streaming started

2018-06-19 Thread lhq (JIRA)
lhq created SPARK-24593: --- Summary: can not find hive table after spark streaming started Key: SPARK-24593 URL: https://issues.apache.org/jira/browse/SPARK-24593 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-24592) can not find hive table after spark streaming start

2018-06-19 Thread lhq (JIRA)
lhq created SPARK-24592: --- Summary: can not find hive table after spark streaming start Key: SPARK-24592 URL: https://issues.apache.org/jira/browse/SPARK-24592 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-24458) Invalid PythonUDF check_1(), requires attributes from more than one child

2018-06-19 Thread Ruben Berenguel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16516908#comment-16516908 ] Ruben Berenguel commented on SPARK-24458: - That's what I thought [~AbdealiJK] (I had `1, 2, 3`)

[jira] [Commented] (SPARK-24458) Invalid PythonUDF check_1(), requires attributes from more than one child

2018-06-19 Thread Abdeali Kothari (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16516899#comment-16516899 ] Abdeali Kothari commented on SPARK-24458: - I can be anything. I had a file with 1 column, 1 row

[jira] [Commented] (SPARK-24458) Invalid PythonUDF check_1(), requires attributes from more than one child

2018-06-19 Thread Ruben Berenguel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16516890#comment-16516890 ] Ruben Berenguel commented on SPARK-24458: - [~AbdealiJK] what does your `a.csv` file contain in

[jira] [Comment Edited] (SPARK-5158) Allow for keytab-based HDFS security in Standalone mode

2018-06-19 Thread Jelmer Kuperus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16516856#comment-16516856 ] Jelmer Kuperus edited comment on SPARK-5158 at 6/19/18 9:33 AM: I ended

[jira] [Comment Edited] (SPARK-5158) Allow for keytab-based HDFS security in Standalone mode

2018-06-19 Thread Jelmer Kuperus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16516856#comment-16516856 ] Jelmer Kuperus edited comment on SPARK-5158 at 6/19/18 9:32 AM: I ended

[jira] [Commented] (SPARK-24467) VectorAssemblerEstimator

2018-06-19 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16516861#comment-16516861 ] Nick Pentreath commented on SPARK-24467: One option is to do that same as we did for one hot

[jira] [Commented] (SPARK-5158) Allow for keytab-based HDFS security in Standalone mode

2018-06-19 Thread Jelmer Kuperus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16516856#comment-16516856 ] Jelmer Kuperus commented on SPARK-5158: --- I ended up with the following workaround which at first

[jira] [Commented] (SPARK-24423) Add a new option `query` for JDBC sources

2018-06-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16516732#comment-16516732 ] Apache Spark commented on SPARK-24423: -- User 'dilipbiswal' has created a pull request for this

[jira] [Assigned] (SPARK-24423) Add a new option `query` for JDBC sources

2018-06-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24423: Assignee: Apache Spark > Add a new option `query` for JDBC sources >

[jira] [Assigned] (SPARK-24423) Add a new option `query` for JDBC sources

2018-06-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24423: Assignee: (was: Apache Spark) > Add a new option `query` for JDBC sources >