[jira] [Updated] (SPARK-22967) VersionSuite failed on Windows caused by Windows format path

2018-01-08 Thread wuyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wuyi updated SPARK-22967: - Summary: VersionSuite failed on Windows caused by Windows format path (was: VersionSuite failed on Windows

[jira] [Commented] (SPARK-21293) R document update structured streaming

2018-01-08 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16317805#comment-16317805 ] Felix Cheung commented on SPARK-21293: -- leaving it open for the rest of items > R document update

[jira] [Resolved] (SPARK-21292) R document Catalog function metadata refresh

2018-01-08 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-21292. -- Resolution: Fixed Fix Version/s: 2.3.0 Target Version/s: 2.3.0 > R document

[jira] [Updated] (SPARK-21290) R document Programmatically Specifying the Schema in SQL guide

2018-01-08 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-21290: - Target Version/s: (was: 2.3.0) > R document Programmatically Specifying the Schema in SQL

[jira] [Assigned] (SPARK-21292) R document Catalog function metadata refresh

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21292: Assignee: Felix Cheung (was: Apache Spark) > R document Catalog function metadata

[jira] [Assigned] (SPARK-21292) R document Catalog function metadata refresh

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21292: Assignee: Apache Spark (was: Felix Cheung) > R document Catalog function metadata

[jira] [Commented] (SPARK-21292) R document Catalog function metadata refresh

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16317745#comment-16317745 ] Apache Spark commented on SPARK-21292: -- User 'felixcheung' has created a pull request for this

[jira] [Commented] (SPARK-21293) R document update structured streaming

2018-01-08 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16317739#comment-16317739 ] Felix Cheung commented on SPARK-21293: -- not done: Join Operations Streaming Deduplication > R

[jira] [Assigned] (SPARK-21293) R document update structured streaming

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21293: Assignee: Felix Cheung (was: Apache Spark) > R document update structured streaming >

[jira] [Commented] (SPARK-21293) R document update structured streaming

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16317736#comment-16317736 ] Apache Spark commented on SPARK-21293: -- User 'felixcheung' has created a pull request for this

[jira] [Assigned] (SPARK-21293) R document update structured streaming

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21293: Assignee: Apache Spark (was: Felix Cheung) > R document update structured streaming >

[jira] [Assigned] (SPARK-23000) Flaky test suite DataSourceWithHiveMetastoreCatalogSuite in Spark 2.3

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23000: Assignee: Apache Spark (was: Xiao Li) > Flaky test suite

[jira] [Commented] (SPARK-23000) Flaky test suite DataSourceWithHiveMetastoreCatalogSuite in Spark 2.3

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16317699#comment-16317699 ] Apache Spark commented on SPARK-23000: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23000) Flaky test suite DataSourceWithHiveMetastoreCatalogSuite in Spark 2.3

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23000: Assignee: Xiao Li (was: Apache Spark) > Flaky test suite

[jira] [Created] (SPARK-23000) Flaky test suite DataSourceWithHiveMetastoreCatalogSuite in Spark 2.3

2018-01-08 Thread Xiao Li (JIRA)
Xiao Li created SPARK-23000: --- Summary: Flaky test suite DataSourceWithHiveMetastoreCatalogSuite in Spark 2.3 Key: SPARK-23000 URL: https://issues.apache.org/jira/browse/SPARK-23000 Project: Spark

[jira] [Commented] (SPARK-22972) Couldn't find corresponding Hive SerDe for data source provider org.apache.spark.sql.hive.orc.

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16317668#comment-16317668 ] Apache Spark commented on SPARK-22972: -- User 'xubo245' has created a pull request for this issue:

[jira] [Resolved] (SPARK-22984) Fix incorrect bitmap copying and offset shifting in GenerateUnsafeRowJoiner

2018-01-08 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-22984. - Resolution: Fixed Fix Version/s: 2.3.0 2.2.2 > Fix incorrect bitmap

[jira] [Assigned] (SPARK-22990) Fix method isFairScheduler in JobsTab and StagesTab

2018-01-08 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-22990: --- Assignee: Gengliang Wang > Fix method isFairScheduler in JobsTab and StagesTab >

[jira] [Assigned] (SPARK-22999) 'show databases like command' can remove the like keyword

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22999: Assignee: (was: Apache Spark) > 'show databases like command' can remove the like

[jira] [Assigned] (SPARK-22999) 'show databases like command' can remove the like keyword

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22999: Assignee: Apache Spark > 'show databases like command' can remove the like keyword >

[jira] [Resolved] (SPARK-22990) Fix method isFairScheduler in JobsTab and StagesTab

2018-01-08 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-22990. - Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 20186

[jira] [Commented] (SPARK-22999) 'show databases like command' can remove the like keyword

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16317571#comment-16317571 ] Apache Spark commented on SPARK-22999: -- User 'guoxiaolongzte' has created a pull request for this

[jira] [Created] (SPARK-22999) 'show databases like command' can remove the like keyword

2018-01-08 Thread guoxiaolongzte (JIRA)
guoxiaolongzte created SPARK-22999: -- Summary: 'show databases like command' can remove the like keyword Key: SPARK-22999 URL: https://issues.apache.org/jira/browse/SPARK-22999 Project: Spark

[jira] [Assigned] (SPARK-22972) Couldn't find corresponding Hive SerDe for data source provider org.apache.spark.sql.hive.orc.

2018-01-08 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li reassigned SPARK-22972: --- Assignee: xubo245 > Couldn't find corresponding Hive SerDe for data source provider >

[jira] [Resolved] (SPARK-22972) Couldn't find corresponding Hive SerDe for data source provider org.apache.spark.sql.hive.orc.

2018-01-08 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-22972. - Resolution: Fixed Fix Version/s: 2.3.0 > Couldn't find corresponding Hive SerDe for data source

[jira] [Updated] (SPARK-22386) Data Source V2 improvements

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22386?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-22386: --- Target Version/s: 2.3.0 > Data Source V2 improvements

[jira] [Assigned] (SPARK-21646) Add new type coercion rules to compatible with Hive

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21646: Assignee: Apache Spark > Add new type coercion rules to compatible with Hive >

[jira] [Assigned] (SPARK-21646) Add new type coercion rules to compatible with Hive

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21646: Assignee: (was: Apache Spark) > Add new type coercion rules to compatible with Hive >

[jira] [Reopened] (SPARK-21646) Add new type coercion rules to compatible with Hive

2018-01-08 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li reopened SPARK-21646: - > Add new type coercion rules to compatible with Hive

[jira] [Resolved] (SPARK-22722) Test Coverage for Type Coercion Compatibility

2018-01-08 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-22722. - Resolution: Fixed > Test Coverage for Type Coercion Compatibility >

[jira] [Commented] (SPARK-22998) Value for SPARK_MOUNTED_CLASSPATH in executor pods is not set

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16317451#comment-16317451 ] Apache Spark commented on SPARK-22998: -- User 'liyinan926' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22998) Value for SPARK_MOUNTED_CLASSPATH in executor pods is not set

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22998: Assignee: (was: Apache Spark) > Value for SPARK_MOUNTED_CLASSPATH in executor pods is

[jira] [Assigned] (SPARK-22998) Value for SPARK_MOUNTED_CLASSPATH in executor pods is not set

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22998: Assignee: Apache Spark > Value for SPARK_MOUNTED_CLASSPATH in executor pods is not set >

[jira] [Created] (SPARK-22998) Value for SPARK_MOUNTED_CLASSPATH in executor pods is not set

2018-01-08 Thread Yinan Li (JIRA)
Yinan Li created SPARK-22998: Summary: Value for SPARK_MOUNTED_CLASSPATH in executor pods is not set Key: SPARK-22998 URL: https://issues.apache.org/jira/browse/SPARK-22998 Project: Spark Issue

[jira] [Commented] (SPARK-22994) Require a single container image for Spark-on-K8S

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16317383#comment-16317383 ] Apache Spark commented on SPARK-22994: -- User 'vanzin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22994) Require a single container image for Spark-on-K8S

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22994: Assignee: Apache Spark > Require a single container image for Spark-on-K8S >

[jira] [Assigned] (SPARK-22994) Require a single container image for Spark-on-K8S

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22994: Assignee: (was: Apache Spark) > Require a single container image for Spark-on-K8S >

[jira] [Commented] (SPARK-22997) Add additional defenses against use of freed MemoryBlocks

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16317373#comment-16317373 ] Apache Spark commented on SPARK-22997: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22997) Add additional defenses against use of freed MemoryBlocks

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22997: Assignee: Apache Spark (was: Josh Rosen) > Add additional defenses against use of freed

[jira] [Assigned] (SPARK-22997) Add additional defenses against use of freed MemoryBlocks

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22997: Assignee: Josh Rosen (was: Apache Spark) > Add additional defenses against use of freed

[jira] [Commented] (SPARK-22976) Worker cleanup can remove running driver directories

2018-01-08 Thread Russell Spitzer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16317367#comment-16317367 ] Russell Spitzer commented on SPARK-22976: - Made a PR against 2.0 but it's valid against all

[jira] [Assigned] (SPARK-22976) Worker cleanup can remove running driver directories

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22976: Assignee: Apache Spark > Worker cleanup can remove running driver directories >

[jira] [Assigned] (SPARK-22976) Worker cleanup can remove running driver directories

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22976: Assignee: (was: Apache Spark) > Worker cleanup can remove running driver directories

[jira] [Commented] (SPARK-22976) Worker cleanup can remove running driver directories

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16317364#comment-16317364 ] Apache Spark commented on SPARK-22976: -- User 'RussellSpitzer' has created a pull request for this

[jira] [Created] (SPARK-22997) Add additional defenses against use of freed MemoryBlocks

2018-01-08 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-22997: -- Summary: Add additional defenses against use of freed MemoryBlocks Key: SPARK-22997 URL: https://issues.apache.org/jira/browse/SPARK-22997 Project: Spark Issue

[jira] [Created] (SPARK-22996) update R to pass newest version of lintr checks

2018-01-08 Thread shane knapp (JIRA)
shane knapp created SPARK-22996: --- Summary: update R to pass newest version of lintr checks Key: SPARK-22996 URL: https://issues.apache.org/jira/browse/SPARK-22996 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-22975) MetricsReporter producing NullPointerException when there was no progress reported

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22975: Assignee: Apache Spark > MetricsReporter producing NullPointerException when there was no

[jira] [Commented] (SPARK-22975) MetricsReporter producing NullPointerException when there was no progress reported

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22975?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16317257#comment-16317257 ] Apache Spark commented on SPARK-22975: -- User 'mgaido91' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22975) MetricsReporter producing NullPointerException when there was no progress reported

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22975: Assignee: (was: Apache Spark) > MetricsReporter producing NullPointerException when

[jira] [Commented] (SPARK-22995) Spark UI stdout/stderr links point to executors internal address

2018-01-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16317202#comment-16317202 ] Sean Owen commented on SPARK-22995: --- Questions should start on the mailing list. Internal address

[jira] [Created] (SPARK-22995) Spark UI stdout/stderr links point to executors internal address

2018-01-08 Thread Jhon Cardenas (JIRA)
Jhon Cardenas created SPARK-22995: - Summary: Spark UI stdout/stderr links point to executors internal address Key: SPARK-22995 URL: https://issues.apache.org/jira/browse/SPARK-22995 Project: Spark

[jira] [Commented] (SPARK-22980) Using pandas_udf when inputs are not Pandas's Series or DataFrame

2018-01-08 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16317100#comment-16317100 ] Hyukjin Kwon commented on SPARK-22980: -- Could we just fix it by adding a simple note that the length

[jira] [Resolved] (SPARK-22912) Support v2 streaming sources and sinks in MicroBatchExecution

2018-01-08 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-22912. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 20097

[jira] [Updated] (SPARK-18569) Support R formula arithmetic

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-18569: --- Target Version/s: 2.4.0 (was: 2.3.0) > Support R formula arithmetic >

[jira] [Resolved] (SPARK-16026) Cost-based Optimizer Framework

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal resolved SPARK-16026. Resolution: Fixed Assignee: Zhenhua Wang Fix Version/s: 2.3.0 Resolving

[jira] [Updated] (SPARK-4502) Spark SQL reads unneccesary nested fields from Parquet

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-4502: -- Target Version/s: 2.4.0 (was: 2.3.0) > Spark SQL reads unneccesary nested fields from Parquet

[jira] [Commented] (SPARK-4502) Spark SQL reads unneccesary nested fields from Parquet

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16317062#comment-16317062 ] Sameer Agarwal commented on SPARK-4502: --- +1 This is an extremely useful feature and we should

[jira] [Assigned] (SPARK-22992) Remove assumption of cluster domain in Kubernetes mode

2018-01-08 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-22992: -- Assignee: Anirudh Ramanathan > Remove assumption of cluster domain in Kubernetes mode

[jira] [Resolved] (SPARK-22992) Remove assumption of cluster domain in Kubernetes mode

2018-01-08 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-22992. Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 20187

[jira] [Updated] (SPARK-9576) DataFrame API improvement umbrella ticket (in Spark 2.x)

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-9576: -- Target Version/s: 2.4.0 (was: 2.3.0) > DataFrame API improvement umbrella ticket (in Spark

[jira] [Updated] (SPARK-7768) Make user-defined type (UDT) API public

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-7768: -- Target Version/s: 2.4.0 (was: 2.3.0) > Make user-defined type (UDT) API public >

[jira] [Updated] (SPARK-12978) Skip unnecessary final group-by when input data already clustered with group-by keys

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-12978: --- Target Version/s: 2.4.0 (was: 2.3.0) > Skip unnecessary final group-by when input data

[jira] [Updated] (SPARK-13184) Support minPartitions parameter for JSON and CSV datasources as options

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-13184: --- Target Version/s: 2.4.0 (was: 2.3.0) > Support minPartitions parameter for JSON and CSV

[jira] [Updated] (SPARK-13682) Finalize the public API for FileFormat

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-13682: --- Target Version/s: 2.4.0 (was: 2.3.0) > Finalize the public API for FileFormat >

[jira] [Updated] (SPARK-14098) Generate Java code to build CachedColumnarBatch and get values from CachedColumnarBatch when DataFrame.cache() is called

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-14098: --- Target Version/s: 2.4.0 (was: 2.3.0) > Generate Java code to build CachedColumnarBatch and

[jira] [Updated] (SPARK-14543) SQL/Hive insertInto has unexpected results

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-14543: --- Target Version/s: 2.4.0 (was: 2.3.0) > SQL/Hive insertInto has unexpected results >

[jira] [Updated] (SPARK-15420) Repartition and sort before Parquet writes

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-15420: --- Target Version/s: 2.4.0 (was: 2.3.0) > Repartition and sort before Parquet writes >

[jira] [Updated] (SPARK-15380) Generate code that stores a float/double value in each column from ColumnarBatch when DataFrame.cache() is used

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-15380: --- Target Version/s: 2.4.0 (was: 2.3.0) > Generate code that stores a float/double value in

[jira] [Commented] (SPARK-15420) Repartition and sort before Parquet writes

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16317050#comment-16317050 ] Sameer Agarwal commented on SPARK-15420: re-targeting this for 2.4.0 > Repartition and sort

[jira] [Updated] (SPARK-15117) Generate code that get a value in each compressed column from CachedBatch when DataFrame.cache() is called

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-15117: --- Target Version/s: 2.4.0 (was: 2.3.0) > Generate code that get a value in each compressed

[jira] [Updated] (SPARK-15690) Fast single-node (single-process) in-memory shuffle

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-15690: --- Target Version/s: 2.4.0 (was: 2.3.0) > Fast single-node (single-process) in-memory shuffle

[jira] [Updated] (SPARK-15691) Refactor and improve Hive support

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-15691: --- Target Version/s: 2.4.0 (was: 2.3.0) > Refactor and improve Hive support >

[jira] [Updated] (SPARK-15693) Write schema definition out for file-based data sources to avoid schema inference

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-15693: --- Target Version/s: 2.4.0 (was: 2.3.0) > Write schema definition out for file-based data

[jira] [Updated] (SPARK-15694) Implement ScriptTransformation in sql/core

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-15694: --- Target Version/s: 2.4.0 (was: 2.3.0) > Implement ScriptTransformation in sql/core >

[jira] [Updated] (SPARK-16011) SQL metrics include duplicated attempts

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-16011: --- Target Version/s: 2.4.0 (was: 2.3.0) > SQL metrics include duplicated attempts >

[jira] [Updated] (SPARK-15867) Use bucket files for TABLESAMPLE BUCKET

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-15867: --- Target Version/s: 2.4.0 (was: 2.3.0) > Use bucket files for TABLESAMPLE BUCKET >

[jira] [Updated] (SPARK-16196) Optimize in-memory scan performance using ColumnarBatches

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-16196: --- Target Version/s: 2.4.0 (was: 2.3.0) > Optimize in-memory scan performance using

[jira] [Updated] (SPARK-16217) Support SELECT INTO statement

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-16217: --- Target Version/s: 2.4.0 (was: 2.3.0) > Support SELECT INTO statement >

[jira] [Updated] (SPARK-16275) Implement all the Hive fallback functions

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-16275: --- Target Version/s: 2.4.0 (was: 2.3.0) > Implement all the Hive fallback functions >

[jira] [Updated] (SPARK-16317) Add file filtering interface for FileFormat

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-16317: --- Target Version/s: 2.4.0 (was: 2.3.0) > Add file filtering interface for FileFormat >

[jira] [Updated] (SPARK-16323) Avoid unnecessary cast when doing integral divide

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16323?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-16323: --- Target Version/s: 2.4.0 (was: 2.3.0) > Avoid unnecessary cast when doing integral divide >

[jira] [Updated] (SPARK-16412) Generate Java code that gets an array in each column of CachedBatch when DataFrame.cache() is called

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-16412: --- Target Version/s: 2.4.0 (was: 2.3.0) > Generate Java code that gets an array in each column

[jira] [Updated] (SPARK-16390) Dataset API improvements

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-16390: --- Target Version/s: 2.4.0 (was: 2.3.0) > Dataset API improvements >

[jira] [Updated] (SPARK-16483) Unifying struct fields and columns

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-16483: --- Target Version/s: 2.4.0 (was: 2.3.0) > Unifying struct fields and columns >

[jira] [Updated] (SPARK-16452) basic INFORMATION_SCHEMA support

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-16452: --- Target Version/s: 2.4.0 (was: 2.3.0) > basic INFORMATION_SCHEMA support >

[jira] [Resolved] (SPARK-17626) TPC-DS performance improvements using star-schema heuristics

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal resolved SPARK-17626. Resolution: Done Assignee: Ioana Delaney Fix Version/s: 2.2.0

[jira] [Updated] (SPARK-17915) Prepare ColumnVector implementation for UnsafeData

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-17915: --- Target Version/s: 2.4.0 (was: 2.3.0) > Prepare ColumnVector implementation for UnsafeData >

[jira] [Commented] (SPARK-17915) Prepare ColumnVector implementation for UnsafeData

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16317015#comment-16317015 ] Sameer Agarwal commented on SPARK-17915: [~kiszk] is this JIRA still relevant? > Prepare

[jira] [Updated] (SPARK-17939) Spark-SQL Nullability: Optimizations vs. Enforcement Clarification

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-17939: --- Target Version/s: 2.4.0 (was: 2.3.0) > Spark-SQL Nullability: Optimizations vs. Enforcement

[jira] [Updated] (SPARK-18084) write.partitionBy() does not recognize nested columns that select() can access

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-18084: --- Target Version/s: 2.4.0 (was: 2.3.0) > write.partitionBy() does not recognize nested

[jira] [Updated] (SPARK-18057) Update structured streaming kafka from 10.0.1 to 10.2.0

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-18057: --- Target Version/s: 2.4.0 (was: 2.3.0) > Update structured streaming kafka from 10.0.1 to

[jira] [Updated] (SPARK-18134) SQL: MapType in Group BY and Joins not working

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-18134: --- Target Version/s: 2.4.0 (was: 2.3.0) > SQL: MapType in Group BY and Joins not working >

[jira] [Updated] (SPARK-18455) General support for correlated subquery processing

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-18455: --- Target Version/s: 2.4.0 (was: 2.3.0) > General support for correlated subquery processing >

[jira] [Updated] (SPARK-18245) Improving support for bucketed table

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-18245: --- Target Version/s: 2.4.0 (was: 2.3.0) > Improving support for bucketed table >

[jira] [Created] (SPARK-22994) Require a single container image for Spark-on-K8S

2018-01-08 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-22994: -- Summary: Require a single container image for Spark-on-K8S Key: SPARK-22994 URL: https://issues.apache.org/jira/browse/SPARK-22994 Project: Spark Issue

[jira] [Updated] (SPARK-18388) Running aggregation on many columns throws SOE

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-18388: --- Target Version/s: 2.4.0 (was: 2.3.0) > Running aggregation on many columns throws SOE >

[jira] [Assigned] (SPARK-22993) checkpointInterval param doc should be clearer

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22993: Assignee: Apache Spark > checkpointInterval param doc should be clearer >

[jira] [Updated] (SPARK-18543) SaveAsTable(CTAS) using overwrite could change table definition

2018-01-08 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-18543: --- Target Version/s: 2.4.0 (was: 2.3.0) > SaveAsTable(CTAS) using overwrite could change table

[jira] [Assigned] (SPARK-22993) checkpointInterval param doc should be clearer

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22993: Assignee: (was: Apache Spark) > checkpointInterval param doc should be clearer >

[jira] [Commented] (SPARK-22993) checkpointInterval param doc should be clearer

2018-01-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16317011#comment-16317011 ] Apache Spark commented on SPARK-22993: -- User 'sethah' has created a pull request for this issue:
