[jira] [Created] (SPARK-25600) Make use of TypeCoercion.findTightestCommonType while inferring CSV schema

2018-10-03 Thread Dilip Biswal (JIRA)
Dilip Biswal created SPARK-25600: Summary: Make use of TypeCoercion.findTightestCommonType while inferring CSV schema Key: SPARK-25600 URL: https://issues.apache.org/jira/browse/SPARK-25600 Project: S

[jira] [Assigned] (SPARK-25600) Make use of TypeCoercion.findTightestCommonType while inferring CSV schema

2018-10-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25600: Assignee: (was: Apache Spark) > Make use of TypeCoercion.findTightestCommonType while

[jira] [Assigned] (SPARK-25600) Make use of TypeCoercion.findTightestCommonType while inferring CSV schema

2018-10-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25600: Assignee: Apache Spark > Make use of TypeCoercion.findTightestCommonType while inferring

[jira] [Commented] (SPARK-25600) Make use of TypeCoercion.findTightestCommonType while inferring CSV schema

2018-10-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16636605#comment-16636605 ] Apache Spark commented on SPARK-25600: -- User 'dilipbiswal' has created a pull reque

[jira] [Commented] (SPARK-25582) Error in Spark logs when using the org.apache.spark:spark-sql_2.11:2.2.0 Java library

2018-10-03 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16636616#comment-16636616 ] Marco Gaido commented on SPARK-25582: - Hi [~onyssius]. Sorry for the trouble, it sho

[jira] [Assigned] (SPARK-25595) Ignore corrupt Avro file if flag IGNORE_CORRUPT_FILES enabled

2018-10-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-25595: Assignee: Gengliang Wang > Ignore corrupt Avro file if flag IGNORE_CORRUPT_FILES enabled

[jira] [Resolved] (SPARK-25595) Ignore corrupt Avro file if flag IGNORE_CORRUPT_FILES enabled

2018-10-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25595. -- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 22611 [https://gi

[jira] [Updated] (SPARK-23153) Support application dependencies in submission client's local file system

2018-10-03 Thread Rob Vesse (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rob Vesse updated SPARK-23153: -- Description: Currently local dependencies are not supported with Spark on K8S i.e. if the user has cod

[jira] [Updated] (SPARK-22978) Register Scalar Vectorized UDFs for SQL Statement

2018-10-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-22978: - Summary: Register Scalar Vectorized UDFs for SQL Statement (was: Register Vectorized UDFs for S

[jira] [Created] (SPARK-25601) Register Grouped aggregate UDF Vectorized UDFs for SQL Statement

2018-10-03 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-25601: Summary: Register Grouped aggregate UDF Vectorized UDFs for SQL Statement Key: SPARK-25601 URL: https://issues.apache.org/jira/browse/SPARK-25601 Project: Spark

[jira] [Updated] (SPARK-25583) Add newly added History server related configurations in the documentation

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25583: -- Fix Version/s: 2.3.3 > Add newly added History server related configurations in the documentat

[jira] [Assigned] (SPARK-25589) Add BloomFilterBenchmark

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-25589: - Assignee: Dongjoon Hyun > Add BloomFilterBenchmark > > >

[jira] [Resolved] (SPARK-25589) Add BloomFilterBenchmark

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-25589. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 22605 [https://

[jira] [Resolved] (SPARK-25483) Refactor UnsafeArrayDataBenchmark to use main method

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-25483. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 22491 [https://

[jira] [Assigned] (SPARK-25483) Refactor UnsafeArrayDataBenchmark to use main method

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-25483: - Assignee: Yuming Wang > Refactor UnsafeArrayDataBenchmark to use main method >

[jira] [Updated] (SPARK-25483) Refactor UnsafeArrayDataBenchmark to use main method

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25483: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Refactor UnsafeArrayDataBen

[jira] [Updated] (SPARK-25565) Add scala style checker to check add Locale.ROOT to .toLowerCase and .toUpperCase for internal calls

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25565: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Add scala style checker to

[jira] [Assigned] (SPARK-25601) Register Grouped aggregate UDF Vectorized UDFs for SQL Statement

2018-10-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25601: Assignee: (was: Apache Spark) > Register Grouped aggregate UDF Vectorized UDFs for SQ

[jira] [Updated] (SPARK-25589) Add BloomFilterBenchmark

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25589: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Add BloomFilterBenchmark >

[jira] [Assigned] (SPARK-25601) Register Grouped aggregate UDF Vectorized UDFs for SQL Statement

2018-10-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25601: Assignee: Apache Spark > Register Grouped aggregate UDF Vectorized UDFs for SQL Statement

[jira] [Commented] (SPARK-25601) Register Grouped aggregate UDF Vectorized UDFs for SQL Statement

2018-10-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16636818#comment-16636818 ] Apache Spark commented on SPARK-25601: -- User 'HyukjinKwon' has created a pull reque

[jira] [Updated] (SPARK-25549) High level API to collect RDD statistics

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25549: -- Affects Version/s: (was: 2.5.0) 3.0.0 > High level API to collect R

[jira] [Updated] (SPARK-25542) Flaky test: OpenHashMapSuite

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25542: -- Affects Version/s: (was: 2.5.0) 2.4.0 > Flaky test: OpenHashMapSuit

[jira] [Updated] (SPARK-25553) Add EmptyInterpolatedStringChecker to scalastyle-config.xml

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25553: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Add EmptyInterpolatedString

[jira] [Updated] (SPARK-25532) A stable and efficient row representation

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25532: -- Affects Version/s: (was: 2.5.0) 3.0.0 > A stable and efficient row

[jira] [Updated] (SPARK-25534) Make `SQLHelper` trait

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25534: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Make `SQLHelper` trait > --

[jira] [Updated] (SPARK-25539) Update lz4-java to get speed improvement

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25539: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Update lz4-java to get spee

[jira] [Updated] (SPARK-25530) data source v2 write side API refactoring

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25530: -- Target Version/s: 3.0.0 (was: 2.5.0) > data source v2 write side API refactoring > --

[jira] [Updated] (SPARK-25530) data source v2 write side API refactoring

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25530: -- Affects Version/s: (was: 2.5.0) 3.0.0 > data source v2 write side A

[jira] [Updated] (SPARK-25531) new write APIs for data source v2

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25531: -- Affects Version/s: (was: 2.5.0) 3.0.0 > new write APIs for data sou

[jira] [Updated] (SPARK-25528) data source V2 read side API refactoring

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25528: -- Target Version/s: 3.0.0 (was: 2.5.0) > data source V2 read side API refactoring > ---

[jira] [Updated] (SPARK-25528) data source V2 read side API refactoring

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25528: -- Affects Version/s: (was: 2.5.0) 3.0.0 > data source V2 read side AP

[jira] [Commented] (SPARK-25530) data source v2 write side API refactoring

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16636822#comment-16636822 ] Dongjoon Hyun commented on SPARK-25530: --- I updated the versions from 2.5.0 to 3.0.

[jira] [Updated] (SPARK-25515) Add a config property for disabling auto deletion of PODS for debugging.

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25515: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Add a config property for d

[jira] [Updated] (SPARK-25508) Refactor OrcReadBenchmark to use main method

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25508: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Refactor OrcReadBenchmark t

[jira] [Updated] (SPARK-25510) Create a new trait SqlBasedBenchmark

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25510: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Create a new trait SqlBase

[jira] [Updated] (SPARK-25488) Refactor MiscBenchmark to use main method

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25488: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Refactor MiscBenchmark to u

[jira] [Updated] (SPARK-25486) Refactor SortBenchmark to use main method

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25486: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Refactor SortBenchmark to u

[jira] [Updated] (SPARK-25492) Refactor WideSchemaBenchmark to use main method

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25492: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Refactor WideSchemaBenchmar

[jira] [Updated] (SPARK-25485) Refactor UnsafeProjectionBenchmark to use main method

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25485: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Refactor UnsafeProjectionBe

[jira] [Updated] (SPARK-25481) Refactor ColumnarBatchBenchmark to use main method

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25481: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Refactor ColumnarBatchBench

[jira] [Updated] (SPARK-25479) Refactor DatasetBenchmark to use main method

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25479: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Refactor DatasetBenchmark t

[jira] [Updated] (SPARK-25478) Refactor CompressionSchemeBenchmark to use main method

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25478: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Refactor CompressionSchemeB

[jira] [Updated] (SPARK-25476) Refactor AggregateBenchmark to use main method

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25476: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Refactor AggregateBenchmark

[jira] [Updated] (SPARK-25475) Refactor all benchmark to save the result as a separate file

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25475: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Refactor all benchmark to s

[jira] [Updated] (SPARK-25458) Support FOR ALL COLUMNS in ANALYZE TABLE

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25458: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Support FOR ALL COLUMNS in

[jira] [Updated] (SPARK-25444) Refactor GenArrayData.genCodeToCreateArrayData() method

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25444: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Refactor GenArrayData.genCo

[jira] [Updated] (SPARK-25442) Support STS to run in K8S deployment with spark deployment mode as cluster

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25442: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Support STS to run in K8S d

[jira] [Updated] (SPARK-25457) IntegralDivide (div) should not always return long

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25457: -- Affects Version/s: (was: 2.5.0) 3.0.0 > IntegralDivide (div) should

[jira] [Updated] (SPARK-25390) data source V2 API refactoring

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25390: -- Affects Version/s: (was: 2.5.0) 3.0.0 > data source V2 API refactor

[jira] [Updated] (SPARK-25423) Output "dataFilters" in DataSourceScanExec.metadata

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25423: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Output "dataFilters" in Dat

[jira] [Updated] (SPARK-25390) data source V2 API refactoring

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25390: -- Target Version/s: 3.0.0 (was: 2.5.0) > data source V2 API refactoring > -

[jira] [Updated] (SPARK-16323) Avoid unnecessary cast when doing integral divide

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16323?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-16323: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Avoid unnecessary cast when

[jira] [Updated] (SPARK-25436) Bump master branch version to 2.5.0-SNAPSHOT

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25436: -- Affects Version/s: (was: 2.5.0) 3.0.0 > Bump master branch version

[jira] [Commented] (SPARK-25436) Bump master branch version to 2.5.0-SNAPSHOT

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16636833#comment-16636833 ] Dongjoon Hyun commented on SPARK-25436: --- I updated the versions to 3.0.0 since we

[jira] [Created] (SPARK-25602) range metrics can be wrong if the result rows are not fully consumed

2018-10-03 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-25602: --- Summary: range metrics can be wrong if the result rows are not fully consumed Key: SPARK-25602 URL: https://issues.apache.org/jira/browse/SPARK-25602 Project: Spark

[jira] [Commented] (SPARK-25062) Clean up BlockLocations in FileStatus objects

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16636900#comment-16636900 ] Dongjoon Hyun commented on SPARK-25062: --- Hi, [~petertoth]. According to your descr

[jira] [Updated] (SPARK-25062) Clean up BlockLocations in FileStatus objects

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25062: -- Issue Type: Improvement (was: Bug) > Clean up BlockLocations in FileStatus objects >

[jira] [Comment Edited] (SPARK-25062) Clean up BlockLocations in FileStatus objects

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16636900#comment-16636900 ] Dongjoon Hyun edited comment on SPARK-25062 at 10/3/18 12:43 PM: -

[jira] [Commented] (SPARK-21402) Java encoders - switch fields on collectAsList

2018-10-03 Thread Paul Praet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16636957#comment-16636957 ] Paul Praet commented on SPARK-21402: Still there in Spark 2.3.1. > Java encoders -

[jira] [Assigned] (SPARK-25602) range metrics can be wrong if the result rows are not fully consumed

2018-10-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25602: Assignee: Wenchen Fan (was: Apache Spark) > range metrics can be wrong if the result row

[jira] [Commented] (SPARK-25602) range metrics can be wrong if the result rows are not fully consumed

2018-10-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16636994#comment-16636994 ] Apache Spark commented on SPARK-25602: -- User 'cloud-fan' has created a pull request

[jira] [Assigned] (SPARK-25602) range metrics can be wrong if the result rows are not fully consumed

2018-10-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25602: Assignee: Apache Spark (was: Wenchen Fan) > range metrics can be wrong if the result row

[jira] [Commented] (SPARK-25602) range metrics can be wrong if the result rows are not fully consumed

2018-10-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16636995#comment-16636995 ] Apache Spark commented on SPARK-25602: -- User 'cloud-fan' has created a pull request

[jira] [Created] (SPARK-25603) `Projection` expression pushdown through `coalesce` and `limit`

2018-10-03 Thread DB Tsai (JIRA)
DB Tsai created SPARK-25603: --- Summary: `Projection` expression pushdown through `coalesce` and `limit` Key: SPARK-25603 URL: https://issues.apache.org/jira/browse/SPARK-25603 Project: Spark Issue

[jira] [Assigned] (SPARK-25538) incorrect row counts after distinct()

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-25538: - Assignee: Marco Gaido > incorrect row counts after distinct() > ---

[jira] [Resolved] (SPARK-25538) incorrect row counts after distinct()

2018-10-03 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-25538. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 22602 [https://

[jira] [Commented] (SPARK-25062) Clean up BlockLocations in FileStatus objects

2018-10-03 Thread Andrei Stankevich (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16637089#comment-16637089 ] Andrei Stankevich commented on SPARK-25062: --- Hi [~dongjoon], yes, it an improv

[jira] [Created] (SPARK-25604) Reduce the overall time costs in Jenkins tests

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25604: --- Summary: Reduce the overall time costs in Jenkins tests Key: SPARK-25604 URL: https://issues.apache.org/jira/browse/SPARK-25604 Project: Spark Issue Type: Umbrella

[jira] [Created] (SPARK-25605) CastSuite: cast string to timestamp 2 mins 31 sec

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25605: --- Summary: CastSuite: cast string to timestamp 2 mins 31 sec Key: SPARK-25605 URL: https://issues.apache.org/jira/browse/SPARK-25605 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-25606) DateExpressionsSuite: Hour 1 min

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25606: --- Summary: DateExpressionsSuite: Hour 1 min Key: SPARK-25606 URL: https://issues.apache.org/jira/browse/SPARK-25606 Project: Spark Issue Type: Sub-task Compone

[jira] [Commented] (SPARK-25501) Kafka delegation token support

2018-10-03 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16637240#comment-16637240 ] Thomas Graves commented on SPARK-25501: --- did you post SPIP to the dev list, I didn

[jira] [Created] (SPARK-25607) HashAggregationQueryWithControlledFallbackSuite: single distinct column set 42 seconds

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25607: --- Summary: HashAggregationQueryWithControlledFallbackSuite: single distinct column set 42 seconds Key: SPARK-25607 URL: https://issues.apache.org/jira/browse/SPARK-25607 Project:

[jira] [Created] (SPARK-25608) HashAggregationQueryWithControlledFallbackSuite: multiple distinct multiple columns sets 38 seconds

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25608: --- Summary: HashAggregationQueryWithControlledFallbackSuite: multiple distinct multiple columns sets 38 seconds Key: SPARK-25608 URL: https://issues.apache.org/jira/browse/SPARK-25608

[jira] [Commented] (SPARK-25501) Kafka delegation token support

2018-10-03 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16637248#comment-16637248 ] Thomas Graves commented on SPARK-25501: --- the spip title has "Structured Streaming"

[jira] [Created] (SPARK-25609) DataFrameSuite: SPARK-22226: splitExpressions should not generate codes beyond 64KB 49 seconds

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25609: --- Summary: DataFrameSuite: SPARK-6: splitExpressions should not generate codes beyond 64KB 49 seconds Key: SPARK-25609 URL: https://issues.apache.org/jira/browse/SPARK-25609

[jira] [Created] (SPARK-25610) DatasetCacheSuite: cache UDF result correctly 25 seconds

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25610: --- Summary: DatasetCacheSuite: cache UDF result correctly 25 seconds Key: SPARK-25610 URL: https://issues.apache.org/jira/browse/SPARK-25610 Project: Spark Issue Type: Su

[jira] [Created] (SPARK-25611) CompressionCodecSuite: both table-level and session-level compression are set 2 min 20 sec

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25611: --- Summary: CompressionCodecSuite: both table-level and session-level compression are set 2 min 20 sec Key: SPARK-25611 URL: https://issues.apache.org/jira/browse/SPARK-25611 Proj

[jira] [Created] (SPARK-25612) CompressionCodecSuite: table-level compression is not set but session-level compressions 47 seconds

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25612: --- Summary: CompressionCodecSuite: table-level compression is not set but session-level compressions 47 seconds Key: SPARK-25612 URL: https://issues.apache.org/jira/browse/SPARK-25612

[jira] [Created] (SPARK-25613) HiveSparkSubmitSuite: dir 1 min 3 seconds

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25613: --- Summary: HiveSparkSubmitSuite: dir 1 min 3 seconds Key: SPARK-25613 URL: https://issues.apache.org/jira/browse/SPARK-25613 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-25614) HiveSparkSubmitSuite: SPARK-18989: DESC TABLE should not fail with format class not found 38 seconds

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25614: --- Summary: HiveSparkSubmitSuite: SPARK-18989: DESC TABLE should not fail with format class not found 38 seconds Key: SPARK-25614 URL: https://issues.apache.org/jira/browse/SPARK-25614

[jira] [Created] (SPARK-25615) KafkaSinkSuite: streaming - write to non-existing topic 1 min

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25615: --- Summary: KafkaSinkSuite: streaming - write to non-existing topic 1 min Key: SPARK-25615 URL: https://issues.apache.org/jira/browse/SPARK-25615 Project: Spark Issue Ty

[jira] [Created] (SPARK-25616) KafkaSinkSuite: generic - write big data with small producer buffer 57 secs

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25616: --- Summary: KafkaSinkSuite: generic - write big data with small producer buffer 57 secs Key: SPARK-25616 URL: https://issues.apache.org/jira/browse/SPARK-25616 Project: Spark

[jira] [Created] (SPARK-25617) KafkaContinuousSinkSuite: generic - write big data with small producer buffer 56 secs

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25617: --- Summary: KafkaContinuousSinkSuite: generic - write big data with small producer buffer 56 secs Key: SPARK-25617 URL: https://issues.apache.org/jira/browse/SPARK-25617 Project:

[jira] [Created] (SPARK-25618) KafkaContinuousSourceStressForDontFailOnDataLossSuite: stress test for failOnDataLoss=false 1 min 1 sec

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25618: --- Summary: KafkaContinuousSourceStressForDontFailOnDataLossSuite: stress test for failOnDataLoss=false 1 min 1 sec Key: SPARK-25618 URL: https://issues.apache.org/jira/browse/SPARK-25618

[jira] [Commented] (SPARK-25501) Kafka delegation token support

2018-10-03 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16637268#comment-16637268 ] Gabor Somogyi commented on SPARK-25501: --- Yeah, it's posted on the dev list. To an

[jira] [Created] (SPARK-25619) WithAggregationKinesisStreamSuite: split and merge shards in a stream 2 min 15 sec

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25619: --- Summary: WithAggregationKinesisStreamSuite: split and merge shards in a stream 2 min 15 sec Key: SPARK-25619 URL: https://issues.apache.org/jira/browse/SPARK-25619 Project: Spa

[jira] [Created] (SPARK-25620) WithAggregationKinesisStreamSuite: failure recovery 1 min 36 seconds

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25620: --- Summary: WithAggregationKinesisStreamSuite: failure recovery 1 min 36 seconds Key: SPARK-25620 URL: https://issues.apache.org/jira/browse/SPARK-25620 Project: Spark I

[jira] [Resolved] (SPARK-25582) Error in Spark logs when using the org.apache.spark:spark-sql_2.11:2.2.0 Java library

2018-10-03 Thread Thomas Brugiere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Brugiere resolved SPARK-25582. - Resolution: Later > Error in Spark logs when using the org.apache.spark:spark-sql_2.11:2

[jira] [Reopened] (SPARK-25582) Error in Spark logs when using the org.apache.spark:spark-sql_2.11:2.2.0 Java library

2018-10-03 Thread Thomas Brugiere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Brugiere reopened SPARK-25582: - > Error in Spark logs when using the org.apache.spark:spark-sql_2.11:2.2.0 Java > library >

[jira] [Updated] (SPARK-25620) WithAggregationKinesisStreamSuite: failure recovery 1 min 36 seconds

2018-10-03 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-25620: Description: org.apache.spark.streaming.kinesis.WithAggregationKinesisStreamSuite.failure recovery Took

[jira] [Updated] (SPARK-25619) WithAggregationKinesisStreamSuite: split and merge shards in a stream 2 min 15 sec

2018-10-03 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-25619: Description: org.apache.spark.streaming.kinesis.WithAggregationKinesisStreamSuite.split and merge shards

[jira] [Created] (SPARK-25621) BucketedReadWithHiveSupportSuite: read partitioning bucketed tables having composite filters 45 sec

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25621: --- Summary: BucketedReadWithHiveSupportSuite: read partitioning bucketed tables having composite filters 45 sec Key: SPARK-25621 URL: https://issues.apache.org/jira/browse/SPARK-25621

[jira] [Created] (SPARK-25622) BucketedReadWithHiveSupportSuite: read partitioning bucketed tables with bucket pruning filters - 42 seconds

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25622: --- Summary: BucketedReadWithHiveSupportSuite: read partitioning bucketed tables with bucket pruning filters - 42 seconds Key: SPARK-25622 URL: https://issues.apache.org/jira/browse/SPARK-25622

[jira] [Created] (SPARK-25623) LogisticRegressionSuite: multinomial logistic regression with intercept with L1 regularization 1 min 10 sec

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25623: --- Summary: LogisticRegressionSuite: multinomial logistic regression with intercept with L1 regularization 1 min 10 sec Key: SPARK-25623 URL: https://issues.apache.org/jira/browse/SPARK-25623

[jira] [Created] (SPARK-25624) LogisticRegressionSuite.multinomial logistic regression with intercept with elasticnet regularization 56 seconds

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25624: --- Summary: LogisticRegressionSuite.multinomial logistic regression with intercept with elasticnet regularization 56 seconds Key: SPARK-25624 URL: https://issues.apache.org/jira/browse/SPARK-2

[jira] [Created] (SPARK-25625) LogisticRegressionSuite.binary logistic regression with intercept with ElasticNet regularization - 33 sec

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25625: --- Summary: LogisticRegressionSuite.binary logistic regression with intercept with ElasticNet regularization - 33 sec Key: SPARK-25625 URL: https://issues.apache.org/jira/browse/SPARK-25625

[jira] [Created] (SPARK-25626) HiveClientSuites: getPartitionsByFilter returns all partitions when hive.metastore.try.direct.sql=false 46 sec

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25626: --- Summary: HiveClientSuites: getPartitionsByFilter returns all partitions when hive.metastore.try.direct.sql=false 46 sec Key: SPARK-25626 URL: https://issues.apache.org/jira/browse/SPARK-256

[jira] [Created] (SPARK-25627) ContinuousStressSuite - 8 mins 13 sec

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25627: --- Summary: ContinuousStressSuite - 8 mins 13 sec Key: SPARK-25627 URL: https://issues.apache.org/jira/browse/SPARK-25627 Project: Spark Issue Type: Sub-task Co

[jira] [Created] (SPARK-25628) DistributedSuite: recover from repeated node failures during shuffle-reduce 40 seconds

2018-10-03 Thread Xiao Li (JIRA)
Xiao Li created SPARK-25628: --- Summary: DistributedSuite: recover from repeated node failures during shuffle-reduce 40 seconds Key: SPARK-25628 URL: https://issues.apache.org/jira/browse/SPARK-25628 Project:

  1   2   >