[jira] [Commented] (SPARK-22344) Prevent R CMD check from using /tmp

2017-11-01 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16233751#comment-16233751 ] Felix Cheung commented on SPARK-22344: -- Hmm yes we do have to know if it has just fo

[jira] [Commented] (SPARK-19644) Memory leak in Spark Streaming

2017-11-01 Thread deng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16233756#comment-16233756 ] deng commented on SPARK-19644: -- did the issue has been fixed? I am using spark 2.1.0 and i a

[jira] [Commented] (SPARK-22344) Prevent R CMD check from using /tmp

2017-11-01 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16233762#comment-16233762 ] Felix Cheung commented on SPARK-22344: -- Maybe just delete the directory returned fro

[jira] [Commented] (SPARK-21725) spark thriftserver insert overwrite table partition select

2017-11-01 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16233858#comment-16233858 ] Marco Gaido commented on SPARK-21725: - [~zhangxin0112zx] Can you share the spark-thri

[jira] [Commented] (SPARK-21725) spark thriftserver insert overwrite table partition select

2017-11-01 Thread xinzhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16233875#comment-16233875 ] xinzhang commented on SPARK-21725: -- That is my target package log (+mysql) [https://gith

[jira] [Comment Edited] (SPARK-21725) spark thriftserver insert overwrite table partition select

2017-11-01 Thread xinzhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16233875#comment-16233875 ] xinzhang edited comment on SPARK-21725 at 11/1/17 10:06 AM: [

[jira] [Resolved] (SPARK-22172) Worker hangs when the external shuffle service port is already in use

2017-11-01 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao resolved SPARK-22172. - Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19396 [https://githu

[jira] [Assigned] (SPARK-22172) Worker hangs when the external shuffle service port is already in use

2017-11-01 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao reassigned SPARK-22172: --- Assignee: Devaraj K > Worker hangs when the external shuffle service port is already in use

[jira] [Commented] (SPARK-22405) Enrich the event information and add new event of ExternalCatalogEvent

2017-11-01 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16233902#comment-16233902 ] Herman van Hovell commented on SPARK-22405: --- I think adding additional events i

[jira] [Commented] (SPARK-2984) FileNotFoundException on _temporary directory

2017-11-01 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16233924#comment-16233924 ] Steve Loughran commented on SPARK-2984: --- Darron: different stack trace, different pa

[jira] [Comment Edited] (SPARK-21725) spark thriftserver insert overwrite table partition select

2017-11-01 Thread xinzhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16233875#comment-16233875 ] xinzhang edited comment on SPARK-21725 at 11/1/17 11:05 AM: [

[jira] [Commented] (SPARK-2984) FileNotFoundException on _temporary directory

2017-11-01 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16233937#comment-16233937 ] Steve Loughran commented on SPARK-2984: --- [~soumdmw] you asked bq. is there a simple

[jira] [Comment Edited] (SPARK-21725) spark thriftserver insert overwrite table partition select

2017-11-01 Thread xinzhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16233875#comment-16233875 ] xinzhang edited comment on SPARK-21725 at 11/1/17 11:17 AM: [

[jira] [Comment Edited] (SPARK-21725) spark thriftserver insert overwrite table partition select

2017-11-01 Thread xinzhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16233875#comment-16233875 ] xinzhang edited comment on SPARK-21725 at 11/1/17 11:18 AM: [

[jira] [Resolved] (SPARK-22347) UDF is evaluated when 'F.when' condition is false

2017-11-01 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-22347. - Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19617 [https://githu

[jira] [Assigned] (SPARK-22347) UDF is evaluated when 'F.when' condition is false

2017-11-01 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-22347: --- Assignee: Liang-Chi Hsieh > UDF is evaluated when 'F.when' condition is false >

[jira] [Created] (SPARK-22407) Add rdd id column on storage page to speed up navigating

2017-11-01 Thread zhoukang (JIRA)
zhoukang created SPARK-22407: Summary: Add rdd id column on storage page to speed up navigating Key: SPARK-22407 URL: https://issues.apache.org/jira/browse/SPARK-22407 Project: Spark Issue Type:

[jira] [Updated] (SPARK-22407) Add rdd id column on storage page to speed up navigating

2017-11-01 Thread zhoukang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhoukang updated SPARK-22407: - Attachment: add-rddid.png rdd-cache.png > Add rdd id column on storage page to speed up n

[jira] [Updated] (SPARK-22407) Add rdd id column on storage page to speed up navigating

2017-11-01 Thread zhoukang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhoukang updated SPARK-22407: - Description: We can add rdd id column on storage page to speed up nagigating when many rdds are cached.

[jira] [Commented] (SPARK-22407) Add rdd id column on storage page to speed up navigating

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16233993#comment-16233993 ] Apache Spark commented on SPARK-22407: -- User 'caneGuy' has created a pull request fo

[jira] [Assigned] (SPARK-22407) Add rdd id column on storage page to speed up navigating

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22407: Assignee: Apache Spark > Add rdd id column on storage page to speed up navigating > --

[jira] [Assigned] (SPARK-22407) Add rdd id column on storage page to speed up navigating

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22407: Assignee: (was: Apache Spark) > Add rdd id column on storage page to speed up navigati

[jira] [Commented] (SPARK-21088) CrossValidator, TrainValidationSplit should collect all models when fitting: Python API

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234087#comment-16234087 ] Apache Spark commented on SPARK-21088: -- User 'WeichenXu123' has created a pull reque

[jira] [Assigned] (SPARK-21088) CrossValidator, TrainValidationSplit should collect all models when fitting: Python API

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21088: Assignee: (was: Apache Spark) > CrossValidator, TrainValidationSplit should collect al

[jira] [Assigned] (SPARK-21088) CrossValidator, TrainValidationSplit should collect all models when fitting: Python API

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21088: Assignee: Apache Spark > CrossValidator, TrainValidationSplit should collect all models wh

[jira] [Resolved] (SPARK-19112) add codec for ZStandard

2017-11-01 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-19112. --- Resolution: Fixed Assignee: Sital Kedia Fix Version/s: 2.3.0 > add co

[jira] [Commented] (SPARK-21725) spark thriftserver insert overwrite table partition select

2017-11-01 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234105#comment-16234105 ] Marco Gaido commented on SPARK-21725: - I tried using a mysql metastore and the target

[jira] [Commented] (SPARK-22371) dag-scheduler-event-loop thread stopped with error Attempted to access garbage collected accumulator 5605982

2017-11-01 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234112#comment-16234112 ] Marco Gaido commented on SPARK-22371: - Could you please provide an easy way to reprod

[jira] [Commented] (SPARK-21725) spark thriftserver insert overwrite table partition select

2017-11-01 Thread xinzhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234149#comment-16234149 ] xinzhang commented on SPARK-21725: -- I can't believe it. I build hadoop 2.8 last night. I

[jira] [Resolved] (SPARK-22190) Add Spark executor task metrics to Dropwizard metrics

2017-11-01 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-22190. - Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19426 [https://githu

[jira] [Assigned] (SPARK-22190) Add Spark executor task metrics to Dropwizard metrics

2017-11-01 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-22190: --- Assignee: Luca Canali > Add Spark executor task metrics to Dropwizard metrics >

[jira] [Commented] (SPARK-22398) Partition directories with leading 0s cause wrong results

2017-11-01 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234192#comment-16234192 ] Marco Gaido commented on SPARK-22398: - [~viirya] sorry for the unrequested ping, I sa

[jira] [Commented] (SPARK-2984) FileNotFoundException on _temporary directory

2017-11-01 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234225#comment-16234225 ] Ryan Blue commented on SPARK-2984: -- I don't have a good solution here. You could maybe is

[jira] [Created] (SPARK-22408) RelationalGroupedDataset's distinct pivot value calculation can launch many stages

2017-11-01 Thread Patrick Woody (JIRA)
Patrick Woody created SPARK-22408: - Summary: RelationalGroupedDataset's distinct pivot value calculation can launch many stages Key: SPARK-22408 URL: https://issues.apache.org/jira/browse/SPARK-22408

[jira] [Updated] (SPARK-22408) RelationalGroupedDataset's distinct pivot value calculation launches unnecessary stages

2017-11-01 Thread Patrick Woody (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Woody updated SPARK-22408: -- Summary: RelationalGroupedDataset's distinct pivot value calculation launches unnecessary stage

[jira] [Assigned] (SPARK-22408) RelationalGroupedDataset's distinct pivot value calculation launches unnecessary stages

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22408: Assignee: Apache Spark > RelationalGroupedDataset's distinct pivot value calculation launc

[jira] [Commented] (SPARK-22408) RelationalGroupedDataset's distinct pivot value calculation launches unnecessary stages

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234295#comment-16234295 ] Apache Spark commented on SPARK-22408: -- User 'pwoody' has created a pull request for

[jira] [Assigned] (SPARK-22408) RelationalGroupedDataset's distinct pivot value calculation launches unnecessary stages

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22408: Assignee: (was: Apache Spark) > RelationalGroupedDataset's distinct pivot value calcul

[jira] [Commented] (SPARK-21725) spark thriftserver insert overwrite table partition select

2017-11-01 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234332#comment-16234332 ] Marco Gaido commented on SPARK-21725: - I don't have any idea about which is the diffe

[jira] [Updated] (SPARK-22409) Add function type argument to pandas_udf

2017-11-01 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Li Jin updated SPARK-22409: --- Priority: Major (was: Trivial) > Add function type argument to pandas_udf >

[jira] [Created] (SPARK-22409) Add function type argument to pandas_udf

2017-11-01 Thread Li Jin (JIRA)
Li Jin created SPARK-22409: -- Summary: Add function type argument to pandas_udf Key: SPARK-22409 URL: https://issues.apache.org/jira/browse/SPARK-22409 Project: Spark Issue Type: Sub-task C

[jira] [Commented] (SPARK-22344) Prevent R CMD check from using /tmp

2017-11-01 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234372#comment-16234372 ] Shivaram Venkataraman commented on SPARK-22344: --- Right I was considering th

[jira] [Commented] (SPARK-22409) Add function type argument to pandas_udf

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234376#comment-16234376 ] Apache Spark commented on SPARK-22409: -- User 'icexelloss' has created a pull request

[jira] [Assigned] (SPARK-22409) Add function type argument to pandas_udf

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22409: Assignee: Apache Spark > Add function type argument to pandas_udf > --

[jira] [Assigned] (SPARK-22409) Add function type argument to pandas_udf

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22409: Assignee: (was: Apache Spark) > Add function type argument to pandas_udf > ---

[jira] [Commented] (SPARK-20928) SPIP: Continuous Processing Mode for Structured Streaming

2017-11-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234428#comment-16234428 ] Reynold Xin commented on SPARK-20928: - Maybe we can add some information metadata (li

[jira] [Created] (SPARK-22410) Excessive spill for Pyspark UDF when a row has shrunk

2017-11-01 Thread JIRA
Clément Stenac created SPARK-22410: -- Summary: Excessive spill for Pyspark UDF when a row has shrunk Key: SPARK-22410 URL: https://issues.apache.org/jira/browse/SPARK-22410 Project: Spark Iss

[jira] [Assigned] (SPARK-22372) Make YARN client extend SparkApplication

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22372: Assignee: Apache Spark > Make YARN client extend SparkApplication > --

[jira] [Commented] (SPARK-22372) Make YARN client extend SparkApplication

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234476#comment-16234476 ] Apache Spark commented on SPARK-22372: -- User 'vanzin' has created a pull request for

[jira] [Assigned] (SPARK-22372) Make YARN client extend SparkApplication

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22372: Assignee: (was: Apache Spark) > Make YARN client extend SparkApplication > ---

[jira] [Created] (SPARK-22411) Heuristic to combine splits in DataSourceScanExec isn't accurate when dynamic allocation is enabled

2017-11-01 Thread Vinitha Reddy Gankidi (JIRA)
Vinitha Reddy Gankidi created SPARK-22411: - Summary: Heuristic to combine splits in DataSourceScanExec isn't accurate when dynamic allocation is enabled Key: SPARK-22411 URL: https://issues.apache.org/jira

[jira] [Commented] (SPARK-19644) Memory leak in Spark Streaming

2017-11-01 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234638#comment-16234638 ] Shixiong Zhu commented on SPARK-19644: -- I happened to investigate a similar issue an

[jira] [Commented] (SPARK-15689) Data source API v2

2017-11-01 Thread Russell Spitzer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234640#comment-16234640 ] Russell Spitzer commented on SPARK-15689: - Something I just noticed, it may be he

[jira] [Comment Edited] (SPARK-15689) Data source API v2

2017-11-01 Thread Russell Spitzer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234640#comment-16234640 ] Russell Spitzer edited comment on SPARK-15689 at 11/1/17 7:58 PM: -

[jira] [Commented] (SPARK-19644) Memory leak in Spark Streaming

2017-11-01 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234643#comment-16234643 ] Shixiong Zhu commented on SPARK-19644: -- By the way, you can confirm this issue by ch

[jira] [Updated] (SPARK-19644) Memory leak in Spark Streaming (Encoder/Scala Reflection)

2017-11-01 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-19644: - Summary: Memory leak in Spark Streaming (Encoder/Scala Reflection) (was: Memory leak in Spark St

[jira] [Updated] (SPARK-19644) Memory leak in Spark Streaming

2017-11-01 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-19644: - Component/s: SQL > Memory leak in Spark Streaming > -- > >

[jira] [Updated] (SPARK-19644) Memory leak in Spark Streaming (Encoder/Scala Reflection)

2017-11-01 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-19644: - Component/s: Structured Streaming > Memory leak in Spark Streaming (Encoder/Scala Reflection) > -

[jira] [Updated] (SPARK-19644) Memory leak in Spark Streaming (Encoder/Scala Reflection)

2017-11-01 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-19644: - Description: I am using streaming on the production for some aggregation and fetching data from

[jira] [Updated] (SPARK-19644) Memory leak in Spark Streaming (Encoder/Scala Reflection)

2017-11-01 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-19644: - Description: I am using streaming on the production for some aggregation and fetching data from

[jira] [Commented] (SPARK-19644) Memory leak in Spark Streaming (Encoder/Scala Reflection)

2017-11-01 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234655#comment-16234655 ] Shixiong Zhu commented on SPARK-19644: -- I added more components since it also affect

[jira] [Updated] (SPARK-19644) Memory leak in Spark Streaming (Encoder/Scala Reflection)

2017-11-01 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-19644: - Description: I am using streaming on the production for some aggregation and fetching data from

[jira] [Assigned] (SPARK-22411) Heuristic to combine splits in DataSourceScanExec isn't accurate when dynamic allocation is enabled

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22411: Assignee: Apache Spark > Heuristic to combine splits in DataSourceScanExec isn't accurate

[jira] [Commented] (SPARK-22411) Heuristic to combine splits in DataSourceScanExec isn't accurate when dynamic allocation is enabled

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22411?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234676#comment-16234676 ] Apache Spark commented on SPARK-22411: -- User 'vgankidi' has created a pull request f

[jira] [Assigned] (SPARK-22411) Heuristic to combine splits in DataSourceScanExec isn't accurate when dynamic allocation is enabled

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22411: Assignee: (was: Apache Spark) > Heuristic to combine splits in DataSourceScanExec isn'

[jira] [Commented] (SPARK-18673) Dataframes doesn't work on Hadoop 3.x; Hive rejects Hadoop version

2017-11-01 Thread Aihua Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234679#comment-16234679 ] Aihua Xu commented on SPARK-18673: -- Hive is working on the Hadoop3.x support (HIVE-15016

[jira] [Created] (SPARK-22412) Fix incorrect comment in DataSourceScanExec

2017-11-01 Thread Vinitha Reddy Gankidi (JIRA)
Vinitha Reddy Gankidi created SPARK-22412: - Summary: Fix incorrect comment in DataSourceScanExec Key: SPARK-22412 URL: https://issues.apache.org/jira/browse/SPARK-22412 Project: Spark

[jira] [Updated] (SPARK-22412) Fix incorrect comment in DataSourceScanExec

2017-11-01 Thread Vinitha Reddy Gankidi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinitha Reddy Gankidi updated SPARK-22412: -- Component/s: (was: Spark Core) SQL > Fix incorrect comment

[jira] [Updated] (SPARK-22412) Fix incorrect comment in DataSourceScanExec

2017-11-01 Thread Vinitha Reddy Gankidi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinitha Reddy Gankidi updated SPARK-22412: -- Component/s: (was: Documentation) Spark Core > Fix incorre

[jira] [Assigned] (SPARK-22412) Fix incorrect comment in DataSourceScanExec

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22412: Assignee: Apache Spark > Fix incorrect comment in DataSourceScanExec > ---

[jira] [Assigned] (SPARK-22412) Fix incorrect comment in DataSourceScanExec

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22412: Assignee: (was: Apache Spark) > Fix incorrect comment in DataSourceScanExec >

[jira] [Commented] (SPARK-22412) Fix incorrect comment in DataSourceScanExec

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234702#comment-16234702 ] Apache Spark commented on SPARK-22412: -- User 'vgankidi' has created a pull request f

[jira] [Updated] (SPARK-22412) Fix incorrect comment in DataSourceScanExec

2017-11-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-22412: -- Priority: Trivial (was: Minor) > Fix incorrect comment in DataSourceScanExec > ---

[jira] [Commented] (SPARK-22412) Fix incorrect comment in DataSourceScanExec

2017-11-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234715#comment-16234715 ] Sean Owen commented on SPARK-22412: --- We generally don't make a JIRA for a one line comm

[jira] [Created] (SPARK-22413) Type coercion for IN is not coherent between Literals and subquery

2017-11-01 Thread Marco Gaido (JIRA)
Marco Gaido created SPARK-22413: --- Summary: Type coercion for IN is not coherent between Literals and subquery Key: SPARK-22413 URL: https://issues.apache.org/jira/browse/SPARK-22413 Project: Spark

[jira] [Assigned] (SPARK-22413) Type coercion for IN is not coherent between Literals and subquery

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22413: Assignee: Apache Spark > Type coercion for IN is not coherent between Literals and subquer

[jira] [Commented] (SPARK-22413) Type coercion for IN is not coherent between Literals and subquery

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234731#comment-16234731 ] Apache Spark commented on SPARK-22413: -- User 'mgaido91' has created a pull request f

[jira] [Assigned] (SPARK-22413) Type coercion for IN is not coherent between Literals and subquery

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22413: Assignee: (was: Apache Spark) > Type coercion for IN is not coherent between Literals

[jira] [Commented] (SPARK-22412) Fix incorrect comment in DataSourceScanExec

2017-11-01 Thread Vinitha Reddy Gankidi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234743#comment-16234743 ] Vinitha Reddy Gankidi commented on SPARK-22412: --- Okay, thanks for letting m

[jira] [Created] (SPARK-22414) Can't set driver env variables on yarn

2017-11-01 Thread Flavio Brasil (JIRA)
Flavio Brasil created SPARK-22414: - Summary: Can't set driver env variables on yarn Key: SPARK-22414 URL: https://issues.apache.org/jira/browse/SPARK-22414 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-22415) lint-r fails if lint-r.R installs any new packages

2017-11-01 Thread Joel Croteau (JIRA)
Joel Croteau created SPARK-22415: Summary: lint-r fails if lint-r.R installs any new packages Key: SPARK-22415 URL: https://issues.apache.org/jira/browse/SPARK-22415 Project: Spark Issue Type

[jira] [Commented] (SPARK-22414) Can't set driver env variables on yarn

2017-11-01 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234775#comment-16234775 ] Marcelo Vanzin commented on SPARK-22414: Have you tried {{spark.yarn.appMasterEnv

[jira] [Created] (SPARK-22416) Move OrcOptions from `sql/hive` to `sql/core`

2017-11-01 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-22416: - Summary: Move OrcOptions from `sql/hive` to `sql/core` Key: SPARK-22416 URL: https://issues.apache.org/jira/browse/SPARK-22416 Project: Spark Issue Type: B

[jira] [Assigned] (SPARK-22416) Move OrcOptions from `sql/hive` to `sql/core`

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22416: Assignee: Apache Spark > Move OrcOptions from `sql/hive` to `sql/core` > -

[jira] [Assigned] (SPARK-22416) Move OrcOptions from `sql/hive` to `sql/core`

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22416: Assignee: (was: Apache Spark) > Move OrcOptions from `sql/hive` to `sql/core` > --

[jira] [Commented] (SPARK-22416) Move OrcOptions from `sql/hive` to `sql/core`

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234784#comment-16234784 ] Apache Spark commented on SPARK-22416: -- User 'dongjoon-hyun' has created a pull requ

[jira] [Updated] (SPARK-22416) Move OrcOptions from `sql/hive` to `sql/core`

2017-11-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-22416: -- Priority: Minor (was: Major) Issue Type: Task (was: Bug) > Move OrcOptions from `sql/hive` to `

[jira] [Commented] (SPARK-15689) Data source API v2

2017-11-01 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234811#comment-16234811 ] Wenchen Fan commented on SPARK-15689: - how would a count(agg function) exist in filte

[jira] [Commented] (SPARK-15689) Data source API v2

2017-11-01 Thread Russell Spitzer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234820#comment-16234820 ] Russell Spitzer commented on SPARK-15689: - It does not, we can tell that a count

[jira] [Created] (SPARK-22417) createDataFrame from a pandas.DataFrame reads datetime64 values as longs

2017-11-01 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-22417: Summary: createDataFrame from a pandas.DataFrame reads datetime64 values as longs Key: SPARK-22417 URL: https://issues.apache.org/jira/browse/SPARK-22417 Project: Spa

[jira] [Commented] (SPARK-18838) High latency of event processing for large jobs

2017-11-01 Thread Anthony Truchet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234891#comment-16234891 ] Anthony Truchet commented on SPARK-18838: - I'm interested to work on a backport f

[jira] [Comment Edited] (SPARK-18838) High latency of event processing for large jobs

2017-11-01 Thread Anthony Truchet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234891#comment-16234891 ] Anthony Truchet edited comment on SPARK-18838 at 11/1/17 10:42 PM:

[jira] [Commented] (SPARK-15689) Data source API v2

2017-11-01 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234892#comment-16234892 ] Wenchen Fan commented on SPARK-15689: - Spark wants to get `unhandledFilters` first so

[jira] [Commented] (SPARK-22414) Can't set driver env variables on yarn

2017-11-01 Thread Flavio Brasil (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234893#comment-16234893 ] Flavio Brasil commented on SPARK-22414: --- Sorry, I didn't see this config. It'd be h

[jira] [Resolved] (SPARK-22414) Can't set driver env variables on yarn

2017-11-01 Thread Flavio Brasil (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Flavio Brasil resolved SPARK-22414. --- Resolution: Not A Problem > Can't set driver env variables on yarn >

[jira] [Commented] (SPARK-15689) Data source API v2

2017-11-01 Thread Russell Spitzer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234899#comment-16234899 ] Russell Spitzer commented on SPARK-15689: - I think knowing whether or not the cou

[jira] [Comment Edited] (SPARK-15689) Data source API v2

2017-11-01 Thread Russell Spitzer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234899#comment-16234899 ] Russell Spitzer edited comment on SPARK-15689 at 11/1/17 10:52 PM:

[jira] [Created] (SPARK-22418) Add test cases for NULL Handling

2017-11-01 Thread Xiao Li (JIRA)
Xiao Li created SPARK-22418: --- Summary: Add test cases for NULL Handling Key: SPARK-22418 URL: https://issues.apache.org/jira/browse/SPARK-22418 Project: Spark Issue Type: Test Components:

[jira] [Assigned] (SPARK-22243) streaming job failed to restart from checkpoint

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22243: Assignee: Apache Spark > streaming job failed to restart from checkpoint > ---

[jira] [Commented] (SPARK-22243) streaming job failed to restart from checkpoint

2017-11-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16235019#comment-16235019 ] Apache Spark commented on SPARK-22243: -- User 'ChenjunZou' has created a pull request

  1   2   >