[jira] [Commented] (SPARK-19861) watermark should not be a negative time.

2017-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15900631#comment-15900631 ] Apache Spark commented on SPARK-19861: -- User 'uncleGen' has created a pull request f

[jira] [Created] (SPARK-19862) In SparkEnv.scala,shortShuffleMgrNames tungsten-sort can be deleted.

2017-03-07 Thread guoxiaolong (JIRA)
guoxiaolong created SPARK-19862: --- Summary: In SparkEnv.scala,shortShuffleMgrNames tungsten-sort can be deleted. Key: SPARK-19862 URL: https://issues.apache.org/jira/browse/SPARK-19862 Project: Spark

[jira] [Commented] (SPARK-13969) Extend input format that feature hashing can handle

2017-03-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15900662#comment-15900662 ] Joseph K. Bradley commented on SPARK-13969: --- Noticing this JIRA again. I feel

[jira] [Created] (SPARK-19863) Whether or not use CachedKafkaConsumer need to be configured, when you use DirectKafkaInputDStream to connect the kafka in a Spark Streaming application

2017-03-07 Thread LvDongrong (JIRA)
LvDongrong created SPARK-19863: -- Summary: Whether or not use CachedKafkaConsumer need to be configured, when you use DirectKafkaInputDStream to connect the kafka in a Spark Streaming application Key: SPARK-19863 URL

[jira] [Commented] (SPARK-19863) Whether or not use CachedKafkaConsumer need to be configured, when you use DirectKafkaInputDStream to connect the kafka in a Spark Streaming application

2017-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15900669#comment-15900669 ] Apache Spark commented on SPARK-19863: -- User 'lvdongr' has created a pull request fo

[jira] [Assigned] (SPARK-19863) Whether or not use CachedKafkaConsumer need to be configured, when you use DirectKafkaInputDStream to connect the kafka in a Spark Streaming application

2017-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19863: Assignee: Apache Spark > Whether or not use CachedKafkaConsumer need to be configured, whe

[jira] [Assigned] (SPARK-19863) Whether or not use CachedKafkaConsumer need to be configured, when you use DirectKafkaInputDStream to connect the kafka in a Spark Streaming application

2017-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19863: Assignee: (was: Apache Spark) > Whether or not use CachedKafkaConsumer need to be conf

[jira] [Created] (SPARK-19864) add makeQualifiedPath in CatalogUtils to optimize some code

2017-03-07 Thread Song Jun (JIRA)
Song Jun created SPARK-19864: Summary: add makeQualifiedPath in CatalogUtils to optimize some code Key: SPARK-19864 URL: https://issues.apache.org/jira/browse/SPARK-19864 Project: Spark Issue Ty

[jira] [Resolved] (SPARK-19843) UTF8String => (int / long) conversion expensive for invalid inputs

2017-03-07 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-19843. - Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17184 [https://githu

[jira] [Assigned] (SPARK-19843) UTF8String => (int / long) conversion expensive for invalid inputs

2017-03-07 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-19843: --- Assignee: Tejas Patil > UTF8String => (int / long) conversion expensive for invalid inputs >

[jira] [Commented] (SPARK-19864) add makeQualifiedPath in CatalogUtils to optimize some code

2017-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15900675#comment-15900675 ] Apache Spark commented on SPARK-19864: -- User 'windpiger' has created a pull request

[jira] [Assigned] (SPARK-19864) add makeQualifiedPath in CatalogUtils to optimize some code

2017-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19864: Assignee: Apache Spark > add makeQualifiedPath in CatalogUtils to optimize some code > ---

[jira] [Assigned] (SPARK-19864) add makeQualifiedPath in CatalogUtils to optimize some code

2017-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19864: Assignee: (was: Apache Spark) > add makeQualifiedPath in CatalogUtils to optimize some

[jira] [Resolved] (SPARK-18389) Disallow cyclic view reference

2017-03-07 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-18389. - Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17152 [https://githu

[jira] [Assigned] (SPARK-18389) Disallow cyclic view reference

2017-03-07 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-18389: --- Assignee: Jiang Xingbo > Disallow cyclic view reference > -- > >

[jira] [Created] (SPARK-19865) remove the view identifier in SubqueryAlias

2017-03-07 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-19865: --- Summary: remove the view identifier in SubqueryAlias Key: SPARK-19865 URL: https://issues.apache.org/jira/browse/SPARK-19865 Project: Spark Issue Type: Sub-tas

[jira] [Resolved] (SPARK-19841) StreamingDeduplicateExec.watermarkPredicate should filter rows based on keys

2017-03-07 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-19841. -- Resolution: Fixed Fix Version/s: 2.2.0 > StreamingDeduplicateExec.watermarkPredicate sho

[jira] [Resolved] (SPARK-19859) The new watermark should override the old one

2017-03-07 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-19859. -- Resolution: Fixed Fix Version/s: 2.2.0 2.1.1 > The new watermark shou

[jira] [Resolved] (SPARK-17629) Add local version of Word2Vec findSynonyms for spark.ml

2017-03-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-17629. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 16811 [h

[jira] [Created] (SPARK-19866) Add local version of Word2Vec findSynonyms for spark.ml: Python API

2017-03-07 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-19866: - Summary: Add local version of Word2Vec findSynonyms for spark.ml: Python API Key: SPARK-19866 URL: https://issues.apache.org/jira/browse/SPARK-19866 Project

[jira] [Updated] (SPARK-19866) Add local version of Word2Vec findSynonyms for spark.ml: Python API

2017-03-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19866: -- Shepherd: Joseph K. Bradley > Add local version of Word2Vec findSynonyms for spark.ml:

[jira] [Resolved] (SPARK-19348) pyspark.ml.Pipeline gets corrupted under multi threaded use

2017-03-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-19348. --- Resolution: Fixed Fix Version/s: 2.0.3 2.1.1 Issue resolved

[jira] [Updated] (SPARK-19860) DataFrame join get conflict error if two frames has a same name column.

2017-03-07 Thread wuchang (JIRA)
amount1=11370812), Row(fdate=u'20170217', in_amount1=8208985), Row(fdate=u'20170203', in_amount1=8175477), Row(fdate=u'20170222', in_amount1=11032303), Row(fdate=u'20170216', in_amount1=11986702), Row(fdate=u'20170209', in_amount1=9082380), Row(fdate=u&#x

[jira] [Updated] (SPARK-19659) Fetch big blocks to disk when shuffle-read

2017-03-07 Thread jin xing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jin xing updated SPARK-19659: - Attachment: SPARK-19659-design-v2.pdf > Fetch big blocks to disk when shuffle-read >

[jira] [Commented] (SPARK-19659) Fetch big blocks to disk when shuffle-read

2017-03-07 Thread jin xing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15900741#comment-15900741 ] jin xing commented on SPARK-19659: -- [~irashid] [~rxin] I uploaded SPARK-19659-design-v2.

[jira] [Comment Edited] (SPARK-19659) Fetch big blocks to disk when shuffle-read

2017-03-07 Thread jin xing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15900741#comment-15900741 ] jin xing edited comment on SPARK-19659 at 3/8/17 5:47 AM: -- [~ira

[jira] [Commented] (SPARK-19843) UTF8String => (int / long) conversion expensive for invalid inputs

2017-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15900773#comment-15900773 ] Apache Spark commented on SPARK-19843: -- User 'tejasapatil' has created a pull reques

[jira] [Commented] (SPARK-19812) YARN shuffle service fails to relocate recovery DB directories

2017-03-07 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15900821#comment-15900821 ] Saisai Shao commented on SPARK-19812: - [~tgraves], I'm not quite sure what you mean h

[jira] [Commented] (SPARK-15463) Support for creating a dataframe from CSV in Dataset[String]

2017-03-07 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15900824#comment-15900824 ] Takeshi Yamamuro commented on SPARK-15463: -- Have you seen https://github.com/apa

[jira] [Commented] (SPARK-13969) Extend input format that feature hashing can handle

2017-03-07 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15900825#comment-15900825 ] Nick Pentreath commented on SPARK-13969: I think {{HashingTF}} and {{FeatureHashe

[jira] [Comment Edited] (SPARK-19812) YARN shuffle service fails to relocate recovery DB directories

2017-03-07 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15900821#comment-15900821 ] Saisai Shao edited comment on SPARK-19812 at 3/8/17 7:40 AM: -

[jira] [Comment Edited] (SPARK-19812) YARN shuffle service fails to relocate recovery DB directories

2017-03-07 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15900821#comment-15900821 ] Saisai Shao edited comment on SPARK-19812 at 3/8/17 7:42 AM: -

[jira] [Commented] (SPARK-6951) History server slow startup if the event log directory is large

2017-03-07 Thread Cui Xixin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15900871#comment-15900871 ] Cui Xixin commented on SPARK-6951: -- In my case,the inprogress file is the main reason, so

<    1   2