[jira] [Resolved] (SPARK-24114) improve instrumentation for spark.ml.recommendation

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24114. -- Resolution: Incomplete > improve instrumentation for spark.ml.recommendation > ---

[jira] [Resolved] (SPARK-24281) Option to disable Spark UI's web filter in YARN-client mode

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24281. -- Resolution: Incomplete > Option to disable Spark UI's web filter in YARN-client mode > ---

[jira] [Resolved] (SPARK-24848) When a stage fails onStageCompleted is called before onTaskEnd

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24848. -- Resolution: Incomplete > When a stage fails onStageCompleted is called before onTaskEnd >

[jira] [Resolved] (SPARK-21024) CSV parse mode handles Univocity parser exceptions

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21024. -- Resolution: Incomplete > CSV parse mode handles Univocity parser exceptions >

[jira] [Resolved] (SPARK-23910) Publish executor memory utilisation in heartbeat events

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23910. -- Resolution: Incomplete > Publish executor memory utilisation in heartbeat events > ---

[jira] [Resolved] (SPARK-21199) Its not possible to impute Vector types

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21199. -- Resolution: Incomplete > Its not possible to impute Vector types > ---

[jira] [Resolved] (SPARK-25022) Add spark.executor.pyspark.memory support to Mesos

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25022. -- Resolution: Incomplete > Add spark.executor.pyspark.memory support to Mesos >

[jira] [Resolved] (SPARK-24425) Regression from 1.6 to 2.x - Spark no longer respects input partitions, unnecessary shuffle required

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24425. -- Resolution: Incomplete > Regression from 1.6 to 2.x - Spark no longer respects input partition

[jira] [Resolved] (SPARK-24835) col function ignores drop

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24835. -- Resolution: Incomplete > col function ignores drop > - > >

[jira] [Resolved] (SPARK-24527) select column alias should support quotation marks

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24527. -- Resolution: Incomplete > select column alias should support quotation marks >

[jira] [Resolved] (SPARK-23507) Migrate file-based data sources to data source v2

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23507. -- Resolution: Incomplete > Migrate file-based data sources to data source v2 > -

[jira] [Resolved] (SPARK-22031) KMeans - Compute cost for a single vector

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22031. -- Resolution: Incomplete > KMeans - Compute cost for a single vector > -

[jira] [Resolved] (SPARK-18388) Running aggregation on many columns throws SOE

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-18388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-18388. -- Resolution: Incomplete > Running aggregation on many columns throws SOE >

[jira] [Resolved] (SPARK-24832) Improve inputMetrics's bytesRead update for ColumnarBatch

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24832. -- Resolution: Incomplete > Improve inputMetrics's bytesRead update for ColumnarBatch > -

[jira] [Resolved] (SPARK-24738) [HistoryServer] FsHistoryProvider clean outdated event logs at start

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24738. -- Resolution: Incomplete > [HistoryServer] FsHistoryProvider clean outdated event logs at start

[jira] [Resolved] (SPARK-21071) make WritableColumnVector public

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21071. -- Resolution: Incomplete > make WritableColumnVector public > >

[jira] [Resolved] (SPARK-25563) Spark application hangs If container allocate on lost Nodemanager

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25563. -- Resolution: Incomplete > Spark application hangs If container allocate on lost Nodemanager > -

[jira] [Resolved] (SPARK-25334) Default SessionCatalog should support UDFs

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25334. -- Resolution: Incomplete > Default SessionCatalog should support UDFs >

[jira] [Resolved] (SPARK-23607) Use HDFS extended attributes to store application summary to improve the Spark History Server performance

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23607. -- Resolution: Incomplete > Use HDFS extended attributes to store application summary to improve

[jira] [Resolved] (SPARK-24702) Unable to cast to calendar interval in spark sql.

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24702. -- Resolution: Incomplete > Unable to cast to calendar interval in spark sql. > -

[jira] [Resolved] (SPARK-25057) Unable to start spark on master URL

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25057. -- Resolution: Incomplete > Unable to start spark on master URL > ---

[jira] [Resolved] (SPARK-20869) Master should clear failed apps when worker down

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20869. -- Resolution: Incomplete > Master should clear failed apps when worker down > --

[jira] [Resolved] (SPARK-23800) Support partial function and callable object with pandas UDF

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23800. -- Resolution: Incomplete > Support partial function and callable object with pandas UDF > --

[jira] [Resolved] (SPARK-23813) [SparkSQL] the result is different between hive and spark when use PARSE_URL()

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23813. -- Resolution: Incomplete > [SparkSQL] the result is different between hive and spark when use >

[jira] [Resolved] (SPARK-24245) Flaky test: KafkaContinuousSinkSuite

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24245. -- Resolution: Incomplete > Flaky test: KafkaContinuousSinkSuite > --

[jira] [Resolved] (SPARK-19250) In security cluster, spark beeline connect to hive metastore failed

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-19250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-19250. -- Resolution: Incomplete > In security cluster, spark beeline connect to hive metastore failed >

[jira] [Resolved] (SPARK-24025) Join of bucketed and non-bucketed tables can give two exchanges and sorts for non-bucketed side

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24025. -- Resolution: Incomplete > Join of bucketed and non-bucketed tables can give two exchanges and s

[jira] [Resolved] (SPARK-24843) Spark2 job (in cluster mode) is unable to execute steps in HBase (error# java.lang.NoClassDefFoundError: org/apache/hadoop/hbase/CompatibilityFactory)

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24843. -- Resolution: Incomplete > Spark2 job (in cluster mode) is unable to execute steps in HBase (err

[jira] [Resolved] (SPARK-21084) Improvements to dynamic allocation for notebook use cases

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21084. -- Resolution: Incomplete > Improvements to dynamic allocation for notebook use cases > -

[jira] [Resolved] (SPARK-23631) Add summary to RandomForestClassificationModel

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23631. -- Resolution: Incomplete > Add summary to RandomForestClassificationModel >

[jira] [Resolved] (SPARK-24939) Support YARN Shared Cache in Spark

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24939. -- Resolution: Incomplete > Support YARN Shared Cache in Spark >

[jira] [Resolved] (SPARK-24049) Add a feature to not start speculative tasks when average task duration is less than a configurable absolute number

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24049. -- Resolution: Incomplete > Add a feature to not start speculative tasks when average task durati

[jira] [Resolved] (SPARK-24689) java.io.NotSerializableException: org.apache.spark.mllib.clustering.DistributedLDAModel

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24689?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24689. -- Resolution: Incomplete > java.io.NotSerializableException: > org.apache.spark.mllib.clusterin

[jira] [Resolved] (SPARK-15777) Catalog federation

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-15777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-15777. -- Resolution: Incomplete > Catalog federation > -- > > Key: SPAR

[jira] [Resolved] (SPARK-21040) On executor/worker decommission consider speculatively re-launching current tasks

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21040. -- Resolution: Incomplete > On executor/worker decommission consider speculatively re-launching c

[jira] [Resolved] (SPARK-24905) Spark 2.3 Internal URL env variable

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24905. -- Resolution: Incomplete > Spark 2.3 Internal URL env variable > ---

[jira] [Resolved] (SPARK-24494) Give users possibility to skip own classes in SparkContext.getCallSite()

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24494. -- Resolution: Incomplete > Give users possibility to skip own classes in SparkContext.getCallSit

[jira] [Resolved] (SPARK-23796) There's no API to change state RDD's name

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23796. -- Resolution: Incomplete > There's no API to change state RDD's name > -

[jira] [Resolved] (SPARK-14604) Modify design of ML model summaries

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-14604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-14604. -- Resolution: Incomplete > Modify design of ML model summaries > ---

[jira] [Resolved] (SPARK-23837) Create table as select gives exception if the spark generated alias name contains comma

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23837. -- Resolution: Incomplete > Create table as select gives exception if the spark generated alias n

[jira] [Resolved] (SPARK-24837) Add kafka as spark metrics sink

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24837. -- Resolution: Incomplete > Add kafka as spark metrics sink > --- > >

[jira] [Resolved] (SPARK-24273) Failure while using .checkpoint method to private S3 store via S3A connector

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24273. -- Resolution: Incomplete > Failure while using .checkpoint method to private S3 store via S3A co

[jira] [Resolved] (SPARK-25351) Handle Pandas category type when converting from Python with Arrow

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25351. -- Resolution: Incomplete > Handle Pandas category type when converting from Python with Arrow >

[jira] [Resolved] (SPARK-23669) Executors fetch jars and name the jars with md5 prefix

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23669. -- Resolution: Incomplete > Executors fetch jars and name the jars with md5 prefix >

[jira] [Resolved] (SPARK-15573) Backwards-compatible persistence for spark.ml

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-15573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-15573. -- Resolution: Incomplete > Backwards-compatible persistence for spark.ml > -

[jira] [Resolved] (SPARK-20732) Copy cache data when node is being shut down

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20732. -- Resolution: Incomplete > Copy cache data when node is being shut down > --

[jira] [Resolved] (SPARK-24163) Support "ANY" or sub-query for Pivot "IN" clause

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24163. -- Resolution: Incomplete > Support "ANY" or sub-query for Pivot "IN" clause > --

[jira] [Resolved] (SPARK-20744) Predicates with multiple columns do not work

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20744. -- Resolution: Incomplete > Predicates with multiple columns do not work > --

[jira] [Resolved] (SPARK-24100) Add the CompressionCodec to the saveAsTextFiles interface.

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24100. -- Resolution: Incomplete > Add the CompressionCodec to the saveAsTextFiles interface. >

[jira] [Resolved] (SPARK-24750) HiveCaseSensitiveInferenceMode with INFER_AND_SAVE will show WRITE permission denied even if select table operation

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24750. -- Resolution: Incomplete > HiveCaseSensitiveInferenceMode with INFER_AND_SAVE will show WRITE pe

[jira] [Resolved] (SPARK-25217) Error thrown when creating BlockMatrix

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25217. -- Resolution: Incomplete > Error thrown when creating BlockMatrix >

[jira] [Resolved] (SPARK-22359) Improve the test coverage of window functions

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22359. -- Resolution: Incomplete > Improve the test coverage of window functions > -

[jira] [Resolved] (SPARK-18600) BZ2 CRC read error needs better reporting

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-18600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-18600. -- Resolution: Incomplete > BZ2 CRC read error needs better reporting > -

[jira] [Resolved] (SPARK-21962) Distributed Tracing in Spark

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21962. -- Resolution: Incomplete > Distributed Tracing in Spark > > >

[jira] [Resolved] (SPARK-20443) The blockSize of MLLIB ALS should be setting by the User

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20443. -- Resolution: Incomplete > The blockSize of MLLIB ALS should be setting by the User > -

[jira] [Resolved] (SPARK-24607) Distribute by rand() can lead to data inconsistency

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24607. -- Resolution: Incomplete > Distribute by rand() can lead to data inconsistency > ---

[jira] [Resolved] (SPARK-25107) Spark 2.2.0 Upgrade Issue : Throwing TreeNodeException: makeCopy, tree: CatalogRelation Errors

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25107. -- Resolution: Incomplete > Spark 2.2.0 Upgrade Issue : Throwing TreeNodeException: makeCopy, tre

[jira] [Resolved] (SPARK-23797) SparkSQL performance on small TPCDS tables is very low when compared to Drill or Presto

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23797. -- Resolution: Incomplete > SparkSQL performance on small TPCDS tables is very low when compared

[jira] [Resolved] (SPARK-24974) Spark put all file's paths into SharedInMemoryCache even for unused partitions.

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24974. -- Resolution: Incomplete > Spark put all file's paths into SharedInMemoryCache even for unused

[jira] [Resolved] (SPARK-24955) spark continuing to execute on a task despite not reading all data from a downed machine

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24955. -- Resolution: Incomplete > spark continuing to execute on a task despite not reading all data fr

[jira] [Resolved] (SPARK-24550) Add support for Kubernetes specific metrics

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24550. -- Resolution: Incomplete > Add support for Kubernetes specific metrics > ---

[jira] [Resolved] (SPARK-25585) Allow users to specify scale of result in Decimal arithmetic

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25585. -- Resolution: Incomplete > Allow users to specify scale of result in Decimal arithmetic > --

[jira] [Resolved] (SPARK-21406) Add logLikelihood to GLR families

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21406. -- Resolution: Incomplete > Add logLikelihood to GLR families > -

[jira] [Resolved] (SPARK-12878) Dataframe fails with nested User Defined Types

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-12878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-12878. -- Resolution: Incomplete > Dataframe fails with nested User Defined Types >

[jira] [Resolved] (SPARK-9636) Treat $SPARK_HOME as write-only

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-9636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-9636. - Resolution: Incomplete > Treat $SPARK_HOME as write-only > --- > >

[jira] [Resolved] (SPARK-23181) Add compatibility tests for SHS serialized data / disk format

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23181. -- Resolution: Incomplete > Add compatibility tests for SHS serialized data / disk format > -

[jira] [Resolved] (SPARK-8614) Row order preservation for operations on MLlib IndexedRowMatrix

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-8614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-8614. - Resolution: Incomplete > Row order preservation for operations on MLlib IndexedRowMatrix > --

[jira] [Resolved] (SPARK-24585) Adding ability to audit file system before and after test to ensure all files are cleaned up.

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24585. -- Resolution: Incomplete > Adding ability to audit file system before and after test to ensure a

[jira] [Resolved] (SPARK-18245) Improving support for bucketed table

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-18245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-18245. -- Resolution: Incomplete > Improving support for bucketed table > --

[jira] [Resolved] (SPARK-24431) wrong areaUnderPR calculation in BinaryClassificationEvaluator

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24431. -- Resolution: Incomplete > wrong areaUnderPR calculation in BinaryClassificationEvaluator > ---

[jira] [Resolved] (SPARK-25219) KMeans Clustering - Text Data - Results are incorrect

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25219. -- Resolution: Incomplete > KMeans Clustering - Text Data - Results are incorrect > -

[jira] [Resolved] (SPARK-23236) Make it easier to find the rest API, especially in local mode

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23236. -- Resolution: Incomplete > Make it easier to find the rest API, especially in local mode > -

[jira] [Resolved] (SPARK-25180) Spark standalone failure in Utils.doFetchFile() if nslookup of local hostname fails

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25180. -- Resolution: Incomplete > Spark standalone failure in Utils.doFetchFile() if nslookup of local

[jira] [Resolved] (SPARK-24208) Cannot resolve column in self join after applying Pandas UDF

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24208. -- Resolution: Incomplete > Cannot resolve column in self join after applying Pandas UDF > --

[jira] [Resolved] (SPARK-22245) dataframe should always put partition columns at the end

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22245. -- Resolution: Incomplete > dataframe should always put partition columns at the end > --

[jira] [Resolved] (SPARK-21940) Support timezone for timestamps in SparkR

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21940. -- Resolution: Incomplete > Support timezone for timestamps in SparkR > -

[jira] [Resolved] (SPARK-21353) add checkValue in spark.internal.config about how to correctly set configurations

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21353. -- Resolution: Incomplete > add checkValue in spark.internal.config about how to correctly set >

[jira] [Resolved] (SPARK-4285) Transpose RDD[Vector] to column store for ML

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-4285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-4285. - Resolution: Incomplete > Transpose RDD[Vector] to column store for ML > -

[jira] [Resolved] (SPARK-20295) when spark.sql.adaptive.enabled is enabled, have conflict with Exchange Resue

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20295. -- Resolution: Incomplete > when spark.sql.adaptive.enabled is enabled, have conflict with Excha

[jira] [Resolved] (SPARK-20618) Support Custom Partitioners in PySpark

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20618. -- Resolution: Incomplete > Support Custom Partitioners in PySpark >

[jira] [Resolved] (SPARK-12126) JDBC datasource processes filters only commonly pushed down.

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-12126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-12126. -- Resolution: Incomplete > JDBC datasource processes filters only commonly pushed down. > --

[jira] [Resolved] (SPARK-22964) don't allow task restarts for continuous processing

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22964. -- Resolution: Incomplete > don't allow task restarts for continuous processing > ---

[jira] [Resolved] (SPARK-20629) Copy shuffle data when nodes are being shut down

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20629. -- Resolution: Incomplete > Copy shuffle data when nodes are being shut down > --

[jira] [Resolved] (SPARK-24293) Serialized shuffle supports mapSideCombine

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24293. -- Resolution: Incomplete > Serialized shuffle supports mapSideCombine >

[jira] [Resolved] (SPARK-23740) Add FPGrowth Param for filtering out very common items

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23740. -- Resolution: Incomplete > Add FPGrowth Param for filtering out very common items >

[jira] [Resolved] (SPARK-15041) adding mode strategy for ml.feature.Imputer for categorical features

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-15041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-15041. -- Resolution: Incomplete > adding mode strategy for ml.feature.Imputer for categorical features

[jira] [Resolved] (SPARK-8582) Optimize checkpointing to avoid computing an RDD twice

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-8582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-8582. - Resolution: Incomplete > Optimize checkpointing to avoid computing an RDD twice > ---

[jira] [Resolved] (SPARK-21076) R dapply doesn't return array or raw columns when array have different length

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21076. -- Resolution: Incomplete > R dapply doesn't return array or raw columns when array have differen

[jira] [Resolved] (SPARK-16418) DataFrame.filter fails if it references a window function

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-16418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-16418. -- Resolution: Incomplete > DataFrame.filter fails if it references a window function > -

[jira] [Resolved] (SPARK-22565) Session-based windowing

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22565. -- Resolution: Incomplete > Session-based windowing > --- > >

[jira] [Resolved] (SPARK-24969) SQL: to_date function can't parse date strings in different locales.

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24969. -- Resolution: Incomplete > SQL: to_date function can't parse date strings in different locales.

[jira] [Resolved] (SPARK-15516) Schema merging in driver fails for parquet when merging LongType and IntegerType

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-15516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-15516. -- Resolution: Incomplete > Schema merging in driver fails for parquet when merging LongType and

[jira] [Resolved] (SPARK-19241) remove hive generated table properties if they are not useful in Spark

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-19241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-19241. -- Resolution: Incomplete > remove hive generated table properties if they are not useful in Spar

[jira] [Resolved] (SPARK-24081) Spark SQL drops the table while writing into table in "overwrite" mode.

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24081. -- Resolution: Incomplete > Spark SQL drops the table while writing into table in "overwrite" mo

[jira] [Resolved] (SPARK-15690) Fast single-node (single-process) in-memory shuffle

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-15690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-15690. -- Resolution: Incomplete > Fast single-node (single-process) in-memory shuffle > ---

[jira] [Resolved] (SPARK-24745) Map function does not keep rdd name

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24745. -- Resolution: Incomplete > Map function does not keep rdd name > --

[jira] [Resolved] (SPARK-22731) Add a test for ROWID type to OracleIntegrationSuite

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22731. -- Resolution: Incomplete > Add a test for ROWID type to OracleIntegrationSuite > ---

[jira] [Resolved] (SPARK-23858) Need to apply pyarrow adjustments to complex types with DateType/TimestampType

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23858. -- Resolution: Incomplete > Need to apply pyarrow adjustments to complex types with > DateType/T

[jira] [Resolved] (SPARK-23237) Add UI / endpoint for threaddumps for executors with active tasks

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23237. -- Resolution: Incomplete > Add UI / endpoint for threaddumps for executors with active tasks > -

[jira] [Resolved] (SPARK-24266) Spark client terminates while driver is still running

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24266. -- Resolution: Incomplete > Spark client terminates while driver is still running > -

<    1   2   3   4   5   6   >