[jira] [Commented] (SPARK-22308) Support unit tests of spark code using ScalaTest using suites other than FunSuite

2017-11-02 Thread Nathan Kronenfeld (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237121#comment-16237121 ] Nathan Kronenfeld commented on SPARK-22308: --- ok, found the problem - it was the new tests, they

[jira] [Commented] (SPARK-22427) StackOverFlowError when using FPGrowth

2017-11-02 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237112#comment-16237112 ] Kazuaki Ishizaki commented on SPARK-22427: -- Thank you for reporting an issue. Could you please

[jira] [Assigned] (SPARK-22254) clean up the implementation of `growToSize` in CompactBuffer

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22254: Assignee: Apache Spark > clean up the implementation of `growToSize` in CompactBuffer >

[jira] [Assigned] (SPARK-22254) clean up the implementation of `growToSize` in CompactBuffer

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22254: Assignee: (was: Apache Spark) > clean up the implementation of `growToSize` in

[jira] [Commented] (SPARK-22254) clean up the implementation of `growToSize` in CompactBuffer

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237035#comment-16237035 ] Apache Spark commented on SPARK-22254: -- User 'kiszk' has created a pull request for this issue:

[jira] [Commented] (SPARK-21791) ORC should support column names with dot

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237095#comment-16237095 ] Apache Spark commented on SPARK-21791: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Commented] (SPARK-15474) ORC data source fails to write and read back empty dataframe

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237094#comment-16237094 ] Apache Spark commented on SPARK-15474: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Commented] (SPARK-20682) Add new ORCFileFormat based on Apache ORC

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237093#comment-16237093 ] Apache Spark commented on SPARK-20682: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Updated] (SPARK-20682) Add new ORCFileFormat based on Apache ORC

2017-11-02 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-20682: -- Summary: Add new ORCFileFormat based on Apache ORC (was: Support a new faster ORC data source

[jira] [Updated] (SPARK-22433) Linear regression R^2 train/test terminology related

2017-11-02 Thread Teng Peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Teng Peng updated SPARK-22433: -- Description: Traditional statistics is traditional statistics. Their goal, framework, and

[jira] [Updated] (SPARK-22433) Linear regression R^2 train/test terminology related

2017-11-02 Thread Teng Peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Teng Peng updated SPARK-22433: -- Description: Traditional statistics is traditional statistics. Their goal, framework, and

[jira] [Updated] (SPARK-22433) Linear regression R^2 train/test terminology related

2017-11-02 Thread Teng Peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Teng Peng updated SPARK-22433: -- Description: Traditional statistics is traditional statistics. Their goal, framework, and

[jira] [Created] (SPARK-22433) Linear regression R^2 train/test terminology related

2017-11-02 Thread Teng Peng (JIRA)
Teng Peng created SPARK-22433: - Summary: Linear regression R^2 train/test terminology related Key: SPARK-22433 URL: https://issues.apache.org/jira/browse/SPARK-22433 Project: Spark Issue Type:

[jira] [Commented] (SPARK-22405) Enrich the event information and add new event of ExternalCatalogEvent

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16236950#comment-16236950 ] Apache Spark commented on SPARK-22405: -- User 'jerryshao' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22405) Enrich the event information and add new event of ExternalCatalogEvent

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22405: Assignee: (was: Apache Spark) > Enrich the event information and add new event of

[jira] [Assigned] (SPARK-22405) Enrich the event information and add new event of ExternalCatalogEvent

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22405: Assignee: Apache Spark > Enrich the event information and add new event of

[jira] [Commented] (SPARK-22426) Spark AM launching containers on node where External spark shuffle service failed to initialize

2017-11-02 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16236939#comment-16236939 ] Saisai Shao commented on SPARK-22426: - This kind of scenario was handled in SPARK-13669 regarding to

[jira] [Commented] (SPARK-14516) Clustering evaluator

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16236915#comment-16236915 ] Apache Spark commented on SPARK-14516: -- User 'yanboliang' has created a pull request for this issue:

[jira] [Assigned] (SPARK-21087) CrossValidator, TrainValidationSplit should collect all models when fitting: Scala API

2017-11-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-21087: - Assignee: Weichen Xu > CrossValidator, TrainValidationSplit should collect all

[jira] [Updated] (SPARK-21087) CrossValidator, TrainValidationSplit should collect all models when fitting: Scala API

2017-11-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21087: -- Shepherd: Joseph K. Bradley > CrossValidator, TrainValidationSplit should collect all

[jira] [Assigned] (SPARK-22211) LimitPushDown optimization for FullOuterJoin generates wrong results

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22211: Assignee: (was: Apache Spark) > LimitPushDown optimization for FullOuterJoin

[jira] [Commented] (SPARK-22211) LimitPushDown optimization for FullOuterJoin generates wrong results

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16236861#comment-16236861 ] Apache Spark commented on SPARK-22211: -- User 'henryr' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22211) LimitPushDown optimization for FullOuterJoin generates wrong results

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22211: Assignee: Apache Spark > LimitPushDown optimization for FullOuterJoin generates wrong

[jira] [Updated] (SPARK-22429) Streaming checkpointing code does not retry after failure due to NullPointerException

2017-11-02 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-22429: - Component/s: (was: Structured Streaming) DStreams > Streaming checkpointing

[jira] [Commented] (SPARK-22147) BlockId.hashCode allocates a StringBuilder/String on each call

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16236789#comment-16236789 ] Apache Spark commented on SPARK-22147: -- User 'BryanCutler' has created a pull request for this

[jira] [Updated] (SPARK-22306) INFER_AND_SAVE overwrites important metadata in Parquet Metastore table

2017-11-02 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-22306: Fix Version/s: 2.3.0 > INFER_AND_SAVE overwrites important metadata in Parquet Metastore table >

[jira] [Created] (SPARK-22432) Allow long creation site to be logged for RDDs

2017-11-02 Thread Michael Mior (JIRA)
Michael Mior created SPARK-22432: Summary: Allow long creation site to be logged for RDDs Key: SPARK-22432 URL: https://issues.apache.org/jira/browse/SPARK-22432 Project: Spark Issue Type:

[jira] [Created] (SPARK-22431) Creating Permanent view with illegal type

2017-11-02 Thread Herman van Hovell (JIRA)
Herman van Hovell created SPARK-22431: - Summary: Creating Permanent view with illegal type Key: SPARK-22431 URL: https://issues.apache.org/jira/browse/SPARK-22431 Project: Spark Issue

[jira] [Created] (SPARK-22430) Unknown tag warnings when building R docs with Roxygen 6.0.1

2017-11-02 Thread Joel Croteau (JIRA)
Joel Croteau created SPARK-22430: Summary: Unknown tag warnings when building R docs with Roxygen 6.0.1 Key: SPARK-22430 URL: https://issues.apache.org/jira/browse/SPARK-22430 Project: Spark

[jira] [Commented] (SPARK-22429) Streaming checkpointing code does not retry after failure due to NullPointerException

2017-11-02 Thread Tristan Stevens (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16236492#comment-16236492 ] Tristan Stevens commented on SPARK-22429: - [~srowen] I've raised a PR against branch-2.2. master

[jira] [Assigned] (SPARK-22429) Streaming checkpointing code does not retry after failure due to NullPointerException

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22429: Assignee: Apache Spark > Streaming checkpointing code does not retry after failure due to

[jira] [Assigned] (SPARK-22429) Streaming checkpointing code does not retry after failure due to NullPointerException

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22429: Assignee: (was: Apache Spark) > Streaming checkpointing code does not retry after

[jira] [Commented] (SPARK-22429) Streaming checkpointing code does not retry after failure due to NullPointerException

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16236489#comment-16236489 ] Apache Spark commented on SPARK-22429: -- User 'tmgstevens' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22401) Missing 2.1.2 tag in git

2017-11-02 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk reassigned SPARK-22401: --- Assignee: holdenk > Missing 2.1.2 tag in git > > > Key:

[jira] [Resolved] (SPARK-22401) Missing 2.1.2 tag in git

2017-11-02 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-22401. - Resolution: Fixed > Missing 2.1.2 tag in git > > > Key:

[jira] [Updated] (SPARK-22401) Missing 2.1.2 tag in git

2017-11-02 Thread Holden Karau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Holden Karau updated SPARK-22401: - Fix Version/s: 2.1.2 > Missing 2.1.2 tag in git > > >

[jira] [Commented] (SPARK-22401) Missing 2.1.2 tag in git

2017-11-02 Thread Holden Karau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16236407#comment-16236407 ] Holden Karau commented on SPARK-22401: -- Pushed, looking at the scripts they are all for tagging the

[jira] [Resolved] (SPARK-20807) Add compression/decompression of data to ColumnVector

2017-11-02 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kazuaki Ishizaki resolved SPARK-20807. -- Resolution: Won't Fix > Add compression/decompression of data to ColumnVector >

[jira] [Commented] (SPARK-21505) A dynamic join operator to improve the join reliability

2017-11-02 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16236365#comment-16236365 ] Zhan Zhang commented on SPARK-21505: Any comments on this feature? Do you think the design is OK? If

[jira] [Assigned] (SPARK-22243) streaming job failed to restart from checkpoint

2017-11-02 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu reassigned SPARK-22243: Assignee: StephenZou > streaming job failed to restart from checkpoint >

[jira] [Resolved] (SPARK-22243) streaming job failed to restart from checkpoint

2017-11-02 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-22243. -- Resolution: Fixed Fix Version/s: 2.3.0 > streaming job failed to restart from

[jira] [Commented] (SPARK-22429) Streaming checkpointing code does not retry after failure due to NullPointerException

2017-11-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16236258#comment-16236258 ] Sean Owen commented on SPARK-22429: --- Sounds straightforward -- feel free to open a pull request. >

[jira] [Resolved] (SPARK-22416) Move OrcOptions from `sql/hive` to `sql/core`

2017-11-02 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-22416. - Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19636

[jira] [Assigned] (SPARK-22416) Move OrcOptions from `sql/hive` to `sql/core`

2017-11-02 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-22416: --- Assignee: Dongjoon Hyun > Move OrcOptions from `sql/hive` to `sql/core` >

[jira] [Commented] (SPARK-22254) clean up the implementation of `growToSize` in CompactBuffer

2017-11-02 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16236105#comment-16236105 ] Kazuaki Ishizaki commented on SPARK-22254: -- I started working on this, and will submit a PR

[jira] [Commented] (SPARK-22344) Prevent R CMD check from using /tmp

2017-11-02 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16236103#comment-16236103 ] Felix Cheung commented on SPARK-22344: -- Yes to both. If SPARK_HOME is set before calling

[jira] [Created] (SPARK-22429) Streaming checkpointing code does not retry after failure due to NullPointerException

2017-11-02 Thread Tristan Stevens (JIRA)
Tristan Stevens created SPARK-22429: --- Summary: Streaming checkpointing code does not retry after failure due to NullPointerException Key: SPARK-22429 URL: https://issues.apache.org/jira/browse/SPARK-22429

[jira] [Resolved] (SPARK-22329) Use NEVER_INFER for `spark.sql.hive.caseSensitiveInferenceMode` by default

2017-11-02 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-22329. --- Resolution: Won't Fix > Use NEVER_INFER for `spark.sql.hive.caseSensitiveInferenceMode` by

[jira] [Resolved] (SPARK-22369) PySpark: Document methods of spark.catalog interface

2017-11-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-22369. - Resolution: Fixed Assignee: Hyukjin Kwon Fix Version/s: 2.3.0 > PySpark:

[jira] [Commented] (SPARK-22419) Hive and Hive Thriftserver jars missing from "without hadoop" build

2017-11-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235783#comment-16235783 ] Sean Owen commented on SPARK-22419: --- Yes, it's useful for future reference. Spark should work fine with

[jira] [Comment Edited] (SPARK-22419) Hive and Hive Thriftserver jars missing from "without hadoop" build

2017-11-02 Thread Adam Kramer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235774#comment-16235774 ] Adam Kramer edited comment on SPARK-22419 at 11/2/17 2:01 PM: -- I'll assume

[jira] [Commented] (SPARK-22419) Hive and Hive Thriftserver jars missing from "without hadoop" build

2017-11-02 Thread Adam Kramer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235774#comment-16235774 ] Adam Kramer commented on SPARK-22419: - I'll assume it's on purpose for my stated reasons above.

[jira] [Commented] (SPARK-22306) INFER_AND_SAVE overwrites important metadata in Parquet Metastore table

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235725#comment-16235725 ] Apache Spark commented on SPARK-22306: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Resolved] (SPARK-22145) Issues with driver re-starting on mesos (supervise)

2017-11-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22145. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19374

[jira] [Assigned] (SPARK-22145) Issues with driver re-starting on mesos (supervise)

2017-11-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-22145: - Assignee: Stavros Kontopoulos > Issues with driver re-starting on mesos (supervise) >

[jira] [Resolved] (SPARK-21725) spark thriftserver insert overwrite table partition select

2017-11-02 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Gaido resolved SPARK-21725. - Resolution: Not A Bug > spark thriftserver insert overwrite table partition select >

[jira] [Commented] (SPARK-22398) Partition directories with leading 0s cause wrong results

2017-11-02 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235710#comment-16235710 ] Marco Gaido commented on SPARK-22398: - [~viirya] I see your point. Thanks for your answer. >

[jira] [Commented] (SPARK-22398) Partition directories with leading 0s cause wrong results

2017-11-02 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235708#comment-16235708 ] Marco Gaido commented on SPARK-22398: - [~hyukjin.kwon] I think that here there are two points: 1)

[jira] [Resolved] (SPARK-22408) RelationalGroupedDataset's distinct pivot value calculation launches unnecessary stages

2017-11-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-22408. - Resolution: Fixed Assignee: Patrick Woody Fix Version/s: 2.3.0 >

[jira] [Commented] (SPARK-11421) Add the ability to add a jar to the current class loader

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235649#comment-16235649 ] Apache Spark commented on SPARK-11421: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Resolved] (SPARK-22306) INFER_AND_SAVE overwrites important metadata in Parquet Metastore table

2017-11-02 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-22306. - Resolution: Fixed Fix Version/s: 2.2.1 Issue resolved by pull request 19622

[jira] [Assigned] (SPARK-22306) INFER_AND_SAVE overwrites important metadata in Parquet Metastore table

2017-11-02 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-22306: --- Assignee: Wenchen Fan > INFER_AND_SAVE overwrites important metadata in Parquet Metastore

[jira] [Commented] (SPARK-22428) Document spark properties for configuring the ContextCleaner

2017-11-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235559#comment-16235559 ] Sean Owen commented on SPARK-22428: --- It's probably OK to do so, but not all properties are meant to be

[jira] [Created] (SPARK-22428) Document spark properties for configuring the ContextCleaner

2017-11-02 Thread Andreas Maier (JIRA)
Andreas Maier created SPARK-22428: - Summary: Document spark properties for configuring the ContextCleaner Key: SPARK-22428 URL: https://issues.apache.org/jira/browse/SPARK-22428 Project: Spark

[jira] [Commented] (SPARK-22410) Excessive spill for Pyspark UDF when a row has shrunk

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235522#comment-16235522 ] Apache Spark commented on SPARK-22410: -- User 'viirya' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22410) Excessive spill for Pyspark UDF when a row has shrunk

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22410: Assignee: Apache Spark > Excessive spill for Pyspark UDF when a row has shrunk >

[jira] [Assigned] (SPARK-22410) Excessive spill for Pyspark UDF when a row has shrunk

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22410: Assignee: (was: Apache Spark) > Excessive spill for Pyspark UDF when a row has shrunk

[jira] [Commented] (SPARK-22426) Spark AM launching containers on node where External spark shuffle service failed to initialize

2017-11-02 Thread Prabhu Joseph (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235500#comment-16235500 ] Prabhu Joseph commented on SPARK-22426: --- Node and NodeManager process is fine, External Spark

[jira] [Commented] (SPARK-22426) Spark AM launching containers on node where External spark shuffle service failed to initialize

2017-11-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235461#comment-16235461 ] Sean Owen commented on SPARK-22426: --- If the node has failed, YARN already can't or won't launch

[jira] [Created] (SPARK-22427) StackOverFlowError when using FPGrowth

2017-11-02 Thread lyt (JIRA)
lyt created SPARK-22427: --- Summary: StackOverFlowError when using FPGrowth Key: SPARK-22427 URL: https://issues.apache.org/jira/browse/SPARK-22427 Project: Spark Issue Type: Bug Components:

[jira] [Created] (SPARK-22426) Spark AM launching containers on node where External spark shuffle service failed to initialize

2017-11-02 Thread Prabhu Joseph (JIRA)
Prabhu Joseph created SPARK-22426: - Summary: Spark AM launching containers on node where External spark shuffle service failed to initialize Key: SPARK-22426 URL: https://issues.apache.org/jira/browse/SPARK-22426

[jira] [Updated] (SPARK-22426) Spark AM launching containers on node where External spark shuffle service failed to initialize

2017-11-02 Thread Prabhu Joseph (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prabhu Joseph updated SPARK-22426: -- Component/s: YARN > Spark AM launching containers on node where External spark shuffle service

[jira] [Commented] (SPARK-21911) Parallel Model Evaluation for ML Tuning: PySpark

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235444#comment-16235444 ] Apache Spark commented on SPARK-21911: -- User 'WeichenXu123' has created a pull request for this

[jira] [Resolved] (SPARK-22102) Reusing CliSessionState didn't set correct METASTOREWAREHOUSE

2017-11-02 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-22102. - Resolution: Cannot Reproduce Master branch cannot reproduce > Reusing CliSessionState didn't

[jira] [Reopened] (SPARK-22102) Reusing CliSessionState didn't set correct METASTOREWAREHOUSE

2017-11-02 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reopened SPARK-22102: - > Reusing CliSessionState didn't set correct METASTOREWAREHOUSE >

[jira] [Resolved] (SPARK-22102) Reusing CliSessionState didn't set correct METASTOREWAREHOUSE

2017-11-02 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-22102. - Resolution: Fixed > Reusing CliSessionState didn't set correct METASTOREWAREHOUSE >

[jira] [Commented] (SPARK-16986) "Started" time, "Completed" time and "Last Updated" time in history server UI are not user local time

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235406#comment-16235406 ] Apache Spark commented on SPARK-16986: -- User 'wangyum' has created a pull request for this issue:

[jira] [Created] (SPARK-22425) add output files information to EventLogger

2017-11-02 Thread Long Tian (JIRA)
Long Tian created SPARK-22425: - Summary: add output files information to EventLogger Key: SPARK-22425 URL: https://issues.apache.org/jira/browse/SPARK-22425 Project: Spark Issue Type:

[jira] [Comment Edited] (SPARK-22424) Task not finished for a long time in monitor UI, but I found it finished in logs

2017-11-02 Thread chengning (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235384#comment-16235384 ] chengning edited comment on SPARK-22424 at 11/2/17 8:41 AM: sorry, my picture

[jira] [Comment Edited] (SPARK-22424) Task not finished for a long time in monitor UI, but I found it finished in logs

2017-11-02 Thread chengning (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235384#comment-16235384 ] chengning edited comment on SPARK-22424 at 11/2/17 8:31 AM: sorry, my

[jira] [Commented] (SPARK-22424) Task not finished for a long time in monitor UI, but I found it finished in logs

2017-11-02 Thread chengning (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235384#comment-16235384 ] chengning commented on SPARK-22424: --- !1.jpg|thumbnail! Sorry, my picture did not display; I post it

[jira] [Updated] (SPARK-22424) Task not finished for a long time in monitor UI, but I found it finished in logs

2017-11-02 Thread chengning (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] chengning updated SPARK-22424: -- Attachment: 1.jpg > Task not finished for a long time in monitor UI, but I found it finished in >

[jira] [Comment Edited] (SPARK-22424) Task not finished for a long time in monitor UI, but I found it finished in logs

2017-11-02 Thread chengning (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235379#comment-16235379 ] chengning edited comment on SPARK-22424 at 11/2/17 8:25 AM: I have another

[jira] [Commented] (SPARK-22424) Task not finished for a long time in monitor UI, but I found it finished in logs

2017-11-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235381#comment-16235381 ] Sean Owen commented on SPARK-22424: --- I'm not following. You're circling different tasks. But again the

[jira] [Commented] (SPARK-22424) Task not finished for a long time in monitor UI, but I found it finished in logs

2017-11-02 Thread chengning (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235379#comment-16235379 ] chengning commented on SPARK-22424: --- I have another picture that shows it clearly: !1.png|thumbnail!

[jira] [Updated] (SPARK-22424) Task not finished for a long time in monitor UI, but I found it finished in logs

2017-11-02 Thread chengning (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] chengning updated SPARK-22424: -- Attachment: 1.png > Task not finished for a long time in monitor UI, but I found it finished in >

[jira] [Commented] (SPARK-22424) Task not finished for a long time in monitor UI, but I found it finished in logs

2017-11-02 Thread chengning (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235365#comment-16235365 ] chengning commented on SPARK-22424: --- Oh, I saw that the state is really SUCCESS, but Event Timeline

[jira] [Updated] (SPARK-22424) Task not finished for a long time in monitor UI, but I found it finished in logs

2017-11-02 Thread chengning (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] chengning updated SPARK-22424: -- Description: Task not finished for a long time in monitor UI, but I found it finished in logs Thanks

[jira] [Updated] (SPARK-22424) Task not finished for a long time in monitor UI, but I found it finished in logs

2017-11-02 Thread chengning (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] chengning updated SPARK-22424: -- Attachment: C33oL.jpg > Task not finished for a long time in monitor UI, but I found it finished in >

[jira] [Commented] (SPARK-22424) Task not finished for a long time in monitor UI, but I found it finished in logs

2017-11-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235344#comment-16235344 ] Sean Owen commented on SPARK-22424: --- This shows task 52 finished in both logs and UI. > Task not

[jira] [Created] (SPARK-22424) Task not finished for a long time in monitor UI, but I found it finished in logs

2017-11-02 Thread chengning (JIRA)
chengning created SPARK-22424: - Summary: Task not finished for a long time in monitor UI, but I found it finished in logs Key: SPARK-22424 URL: https://issues.apache.org/jira/browse/SPARK-22424 Project:

[jira] [Updated] (SPARK-22423) Scala test source files like TestHiveSingleton.scala should be in scala source root

2017-11-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-22423: -- Summary: Scala test source files like TestHiveSingleton.scala should be in scala source root (was:

[jira] [Assigned] (SPARK-22423) The TestHiveSingleton.scala file should be in scala directory

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22423: Assignee: Apache Spark > The TestHiveSingleton.scala file should be in scala directory >

[jira] [Commented] (SPARK-22423) The TestHiveSingleton.scala file should be in scala directory

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235336#comment-16235336 ] Apache Spark commented on SPARK-22423: -- User 'xubo245' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22423) The TestHiveSingleton.scala file should be in scala directory

2017-11-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22423: Assignee: (was: Apache Spark) > The TestHiveSingleton.scala file should be in scala

[jira] [Resolved] (SPARK-22419) Hive and Hive Thriftserver jars missing from "without hadoop" build

2017-11-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22419?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22419. --- Resolution: Not A Problem Fix Version/s: (was: 2.1.1) This is on purpose anyway, and

[jira] [Reopened] (SPARK-22419) Hive and Hive Thriftserver jars missing from "without hadoop" build

2017-11-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22419?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reopened SPARK-22419: --- > Hive and Hive Thriftserver jars missing from "without hadoop" build >

[jira] [Created] (SPARK-22423) The TestHiveSingleton.scala file should be in scala directory

2017-11-02 Thread xubo245 (JIRA)
xubo245 created SPARK-22423: --- Summary: The TestHiveSingleton.scala file should be in scala directory Key: SPARK-22423 URL: https://issues.apache.org/jira/browse/SPARK-22423 Project: Spark Issue

[jira] [Comment Edited] (SPARK-21725) spark thriftserver insert overwrite table partition select

2017-11-02 Thread xinzhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235119#comment-16235119 ] xinzhang edited comment on SPARK-21725 at 11/2/17 7:26 AM: --- [~mgaido] Finally.I

[jira] [Resolved] (SPARK-22421) is there a plan for Structured streaming monitoring UI ?

2017-11-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22421. --- Resolution: Invalid Questions to the mailing list please > is there a plan for Structured
