[jira] [Resolved] (SPARK-13077) Expose BlockGenerator functionality in the public API

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-13077. -- Resolution: Incomplete > Expose BlockGenerator functionality in the public API >

[jira] [Resolved] (SPARK-17189) [MINOR] Looses the interface from UnsafeRow to InternalRow in AggregationIterator if UnsafeRow specific method is not used

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-17189. -- Resolution: Incomplete > [MINOR] Looses the interface from UnsafeRow to InternalRow in >

[jira] [Resolved] (SPARK-14586) SparkSQL doesn't parse decimal like Hive

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-14586. -- Resolution: Incomplete > SparkSQL doesn't parse decimal like Hive >

[jira] [Resolved] (SPARK-17020) Materialization of RDD via DataFrame.rdd forces a poor re-distribution of data

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-17020. -- Resolution: Incomplete > Materialization of RDD via DataFrame.rdd forces a poor

[jira] [Resolved] (SPARK-16149) API consistency discussion: CountVectorizer.{minDF -> minDocFreq, minTF -> minTermFreq}

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-16149. -- Resolution: Incomplete > API consistency discussion: CountVectorizer.{minDF -> minDocFreq,

[jira] [Resolved] (SPARK-15385) Jobs never complete for ClusterManagers that don't implement killTask

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-15385. -- Resolution: Incomplete > Jobs never complete for ClusterManagers that don't implement

[jira] [Resolved] (SPARK-7610) Design clustering abstractions for Pipelines API

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-7610. - Resolution: Incomplete > Design clustering abstractions for Pipelines API >

[jira] [Resolved] (SPARK-10069) Python's ReduceByKeyAndWindow DStream Keeps Growing

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-10069. -- Resolution: Incomplete > Python's ReduceByKeyAndWindow DStream Keeps Growing >

[jira] [Resolved] (SPARK-11168) Explore precise behavior of treeAggregate

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-11168. -- Resolution: Incomplete > Explore precise behavior of treeAggregate >

[jira] [Resolved] (SPARK-16204) Row() interface

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-16204. -- Resolution: Incomplete > Row() interface >

[jira] [Resolved] (SPARK-17151) Decide how to handle inferring number of classes in Multinomial logistic regression

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-17151. -- Resolution: Incomplete > Decide how to handle inferring number of classes in Multinomial

[jira] [Resolved] (SPARK-9004) Add s3 bytes read/written metrics

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9004?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-9004. - Resolution: Incomplete > Add s3 bytes read/written metrics >

[jira] [Resolved] (SPARK-13336) Add non-numerical summaries to DataFrame.describe

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-13336. -- Resolution: Incomplete > Add non-numerical summaries to DataFrame.describe >

[jira] [Resolved] (SPARK-15582) Support for Groovy closures

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-15582. -- Resolution: Incomplete > Support for Groovy closures >

[jira] [Resolved] (SPARK-17836) Use cross validation to determine the number of clusters for EM or KMeans algorithms

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-17836. -- Resolution: Incomplete > Use cross validation to determine the number of clusters for EM or

[jira] [Resolved] (SPARK-7786) Allow StreamingListener to be specified in SparkConf and loaded when creating StreamingContext

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-7786. - Resolution: Incomplete > Allow StreamingListener to be specified in SparkConf and loaded when

[jira] [Resolved] (SPARK-13073) creating R like summary for logistic Regression in Spark - Scala

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-13073. -- Resolution: Incomplete > creating R like summary for logistic Regression in Spark - Scala >

[jira] [Resolved] (SPARK-7492) Convert LocalDataFrame to LocalMatrix

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-7492. - Resolution: Incomplete > Convert LocalDataFrame to LocalMatrix >

[jira] [Resolved] (SPARK-16227) Json schema inference fails when `:` exists in file path

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-16227. -- Resolution: Incomplete > Json schema inference fails when `:` exists in file path >

[jira] [Resolved] (SPARK-7341) Fix the flaky test: org.apache.spark.streaming.InputStreamsSuite.socket input stream

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-7341. - Resolution: Incomplete > Fix the flaky test: org.apache.spark.streaming.InputStreamsSuite.socket

[jira] [Resolved] (SPARK-16093) Spark2.0 take no effect after set spark.sql.autoBroadcastJoinThreshold = 1

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-16093. -- Resolution: Incomplete > Spark2.0 take no effect after set

[jira] [Resolved] (SPARK-17935) Add KafkaForeachWriter in external kafka-0.8.0 for structured streaming module

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-17935. -- Resolution: Incomplete > Add KafkaForeachWriter in external kafka-0.8.0 for structured

[jira] [Resolved] (SPARK-9778) remove unnecessary evaluation from SortOrder

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-9778. - Resolution: Incomplete > remove unnecessary evaluation from SortOrder >

[jira] [Resolved] (SPARK-11770) Spark SQL field resolution error in GROUP BY HAVING clause

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-11770. -- Resolution: Incomplete > Spark SQL field resolution error in GROUP BY HAVING clause >

[jira] [Resolved] (SPARK-13116) TungstenAggregate though it is supposedly capable of all processing unsafe & safe rows, fails if the input is safe rows

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-13116. -- Resolution: Incomplete > TungstenAggregate though it is supposedly capable of all processing

[jira] [Resolved] (SPARK-8724) Need documentation on how to deploy or use SparkR in Spark 1.4.0+

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-8724. - Resolution: Incomplete > Need documentation on how to deploy or use SparkR in Spark 1.4.0+ >

[jira] [Resolved] (SPARK-14539) Fetching delegation tokens in Hive-Thriftserver fails when hive.server2.enable.doAs = True

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-14539. -- Resolution: Incomplete > Fetching delegation tokens in Hive-Thriftserver fails when >

[jira] [Resolved] (SPARK-13611) import Aggregator doesn't work in Spark Shell

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-13611. -- Resolution: Incomplete > import Aggregator doesn't work in Spark Shell >

[jira] [Resolved] (SPARK-9484) Word2Vec import/export for original binary format

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-9484. - Resolution: Incomplete > Word2Vec import/export for original binary format >

[jira] [Resolved] (SPARK-12571) AWS credentials not available for read.parquet in SQLContext

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-12571. -- Resolution: Incomplete > AWS credentials not available for read.parquet in SQLContext >

[jira] [Resolved] (SPARK-10569) Kryo serialization fails on sortByKey operation on registered RDDs

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-10569. -- Resolution: Incomplete > Kryo serialization fails on sortByKey operation on registered RDDs >

[jira] [Resolved] (SPARK-10387) Code generation for decision tree

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-10387. -- Resolution: Incomplete > Code generation for decision tree >

[jira] [Resolved] (SPARK-13909) DataFrames DISK_ONLY persistence leads to OOME

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-13909. -- Resolution: Incomplete > DataFrames DISK_ONLY persistence leads to OOME >

[jira] [Resolved] (SPARK-16741) spark.speculation causes duplicate rows in df.write.jdbc()

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-16741. -- Resolution: Incomplete > spark.speculation causes duplicate rows in df.write.jdbc() >

[jira] [Resolved] (SPARK-11159) Nested SQL UDF raises java.lang.UnsupportedOperationException: Cannot evaluate expression

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-11159. -- Resolution: Incomplete > Nested SQL UDF raises java.lang.UnsupportedOperationException:

[jira] [Resolved] (SPARK-10555) Add INotifyDStream to Spark Streaming

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-10555. -- Resolution: Incomplete > Add INotifyDStream to Spark Streaming >

[jira] [Resolved] (SPARK-10055) San Francisco Crime Classification

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-10055. -- Resolution: Incomplete > San Francisco Crime Classification >

[jira] [Resolved] (SPARK-13886) ArrayType of BinaryType not supported in Row.equals method

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-13886. -- Resolution: Incomplete > ArrayType of BinaryType not supported in Row.equals method >

[jira] [Resolved] (SPARK-17146) Add RandomizedSearch to the CrossValidator API

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-17146. -- Resolution: Incomplete > Add RandomizedSearch to the CrossValidator API >

[jira] [Resolved] (SPARK-15507) ClassCastException: SomeCaseClass cannot be cast to org.apache.spark.sql.Row

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-15507. -- Resolution: Incomplete > ClassCastException: SomeCaseClass cannot be cast to

[jira] [Resolved] (SPARK-18253) ML Instrumentation logging requires too much manual implementation

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-18253. -- Resolution: Incomplete > ML Instrumentation logging requires too much manual implementation >

[jira] [Resolved] (SPARK-12448) Add UserDefinedType support to Cast

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-12448. -- Resolution: Incomplete > Add UserDefinedType support to Cast >

[jira] [Resolved] (SPARK-15376) DataFrame write.jdbc() inserts more rows than acutal

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-15376. -- Resolution: Incomplete > DataFrame write.jdbc() inserts more rows than acutal >

[jira] [Resolved] (SPARK-9953) ML Vector, Matrix semantic equality + hashcode

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-9953. - Resolution: Incomplete > ML Vector, Matrix semantic equality + hashcode >

[jira] [Resolved] (SPARK-17059) Allow FileFormat to specify partition pruning strategy

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-17059. -- Resolution: Incomplete > Allow FileFormat to specify partition pruning strategy >

[jira] [Resolved] (SPARK-14924) Tuning estimatorParamMaps with OneVsRest.classifier fails during persistence

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-14924. -- Resolution: Incomplete > Tuning estimatorParamMaps with OneVsRest.classifier fails during

[jira] [Resolved] (SPARK-10869) Auto-normalization of semi-structured schema from a dataframe

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-10869. -- Resolution: Incomplete > Auto-normalization of semi-structured schema from a dataframe >

[jira] [Resolved] (SPARK-10297) When save data to a data source table, we should bound the size of a saved file

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-10297. -- Resolution: Incomplete > When save data to a data source table, we should bound the size of a

[jira] [Resolved] (SPARK-15588) Paginate Stage Table in Stages tab, Job Table in Jobs tab, and Query Table in SQL tab

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-15588. -- Resolution: Incomplete > Paginate Stage Table in Stages tab, Job Table in Jobs tab, and Query

[jira] [Resolved] (SPARK-11365) consolidate aggregates for summary statistics in weighted least squares

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-11365. -- Resolution: Incomplete > consolidate aggregates for summary statistics in weighted least

[jira] [Resolved] (SPARK-14622) Retain lost executors status

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-14622. -- Resolution: Incomplete > Retain lost executors status >

[jira] [Resolved] (SPARK-9300) histogram_numeric aggregate function

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-9300. - Resolution: Incomplete > histogram_numeric aggregate function >

[jira] [Resolved] (SPARK-12083) java.lang.IllegalArgumentException: requirement failed: Overflowed precision (q98)

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-12083. -- Resolution: Incomplete > java.lang.IllegalArgumentException: requirement failed: Overflowed

[jira] [Resolved] (SPARK-18595) Handling ignoreIfExists in HiveExternalCatalog createTable API

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-18595. -- Resolution: Incomplete > Handling ignoreIfExists in HiveExternalCatalog createTable API >

[jira] [Resolved] (SPARK-10031) Join two UnsafeRows in SortMergeJoin if possible

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-10031. -- Resolution: Incomplete > Join two UnsafeRows in SortMergeJoin if possible >

[jira] [Resolved] (SPARK-9427) Add expression functions in SparkR

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9427?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-9427. - Resolution: Incomplete > Add expression functions in SparkR >

[jira] [Resolved] (SPARK-17325) Inconsistent Spillable threshold and AppendOnlyMap growing threshold may trigger out-of-memory errors

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-17325. -- Resolution: Incomplete > Inconsistent Spillable threshold and AppendOnlyMap growing threshold

[jira] [Resolved] (SPARK-11943) Rapidly starting and stopping SparkContexts in local-cluster mode may cause JVM to exit

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11943?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-11943. -- Resolution: Incomplete > Rapidly starting and stopping SparkContexts in local-cluster mode

[jira] [Resolved] (SPARK-7038) [Streaming] Spark Sink requires spark assembly in classpath

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-7038. - Resolution: Incomplete > [Streaming] Spark Sink requires spark assembly in classpath >

[jira] [Resolved] (SPARK-15897) Function Registry should just take in FunctionIdentifier for type safety and avoid duplicating

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-15897. -- Resolution: Incomplete > Function Registry should just take in FunctionIdentifier for type

[jira] [Resolved] (SPARK-15201) Handle integer overflow correctly in hash code computation

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-15201. -- Resolution: Incomplete > Handle integer overflow correctly in hash code computation >

[jira] [Resolved] (SPARK-9696) Add random seed Param to PySpark ML Pipeline

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-9696. - Resolution: Incomplete > Add random seed Param to PySpark ML Pipeline >

[jira] [Resolved] (SPARK-16039) Spark SQL - Number of rows inserted by Insert Sql

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-16039. -- Resolution: Incomplete > Spark SQL - Number of rows inserted by Insert Sql >

[jira] [Resolved] (SPARK-17789) Don't force users to set k for KMeans if initial model is set

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-17789. -- Resolution: Incomplete > Don't force users to set k for KMeans if initial model is set >

[jira] [Resolved] (SPARK-14354) Let Expand take name expressions and infer output attributes

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-14354. -- Resolution: Incomplete > Let Expand take name expressions and infer output attributes >

[jira] [Resolved] (SPARK-11801) Notify driver when OOM is thrown before executor JVM is killed

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-11801. -- Resolution: Incomplete > Notify driver when OOM is thrown before executor JVM is killed >

[jira] [Resolved] (SPARK-9111) Dumping the memory info when an executor dies abnormally

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-9111. - Resolution: Incomplete > Dumping the memory info when an executor dies abnormally >

[jira] [Resolved] (SPARK-13946) PySpark DataFrames allows you to silently use aggregate expressions derived from different table expressions

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-13946. -- Resolution: Incomplete > PySpark DataFrames allows you to silently use aggregate expressions

[jira] [Resolved] (SPARK-14557) Reading textfile (created though CTAS) doesn't work when pathFilter is enabled.

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-14557. -- Resolution: Incomplete > Reading textfile (created though CTAS) doesn't work when pathFilter

[jira] [Resolved] (SPARK-13434) Reduce Spark RandomForest memory footprint

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-13434. -- Resolution: Incomplete > Reduce Spark RandomForest memory footprint >

[jira] [Resolved] (SPARK-16417) spark 1.5.2 receiver store(single-record) with ahead log enabled makes executor crash if there is an exception when BlockGenerator storing block

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-16417. -- Resolution: Incomplete > spark 1.5.2 receiver store(single-record) with ahead log enabled

[jira] [Resolved] (SPARK-15499) Add python testsuite with remote debug and single test parameter to help developer debug code easier.

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-15499. -- Resolution: Incomplete > Add python testsuite with remote debug and single test parameter to

[jira] [Resolved] (SPARK-9269) Add Set to the matching type in ArrayConverter

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-9269. - Resolution: Incomplete > Add Set to the matching type in ArrayConverter >

[jira] [Resolved] (SPARK-12117) Column Aliases are Ignored in callUDF while using struct()

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-12117. -- Resolution: Incomplete > Column Aliases are Ignored in callUDF while using struct() >

[jira] [Resolved] (SPARK-9038) Missing TaskEnd event when task attempt is superseded by another (speculative) attempt

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-9038. - Resolution: Incomplete > Missing TaskEnd event when task attempt is superseded by another >

[jira] [Resolved] (SPARK-17691) Add aggregate function to collect list with maximum number of elements

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-17691. -- Resolution: Incomplete > Add aggregate function to collect list with maximum number of

[jira] [Resolved] (SPARK-6816) Add SparkConf API to configure SparkR

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-6816. - Resolution: Incomplete > Add SparkConf API to configure SparkR >

[jira] [Resolved] (SPARK-15389) DataFrame filter by isNotNull fails in complex, large case

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-15389. -- Resolution: Incomplete > DataFrame filter by isNotNull fails in complex, large case >

[jira] [Resolved] (SPARK-7653) ML Pipeline and meta-algs should take random seed param

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7653?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-7653. - Resolution: Incomplete > ML Pipeline and meta-algs should take random seed param >

[jira] [Resolved] (SPARK-14054) Support parameters for UDTs

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-14054. -- Resolution: Incomplete > Support parameters for UDTs >

[jira] [Resolved] (SPARK-8983) ML Tuning Cross-Validation Improvements

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-8983. - Resolution: Incomplete > ML Tuning Cross-Validation Improvements >

[jira] [Resolved] (SPARK-17600) Cannot set public address for Worker and Master Web UI

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-17600. -- Resolution: Incomplete > Cannot set public address for Worker and Master Web UI >

[jira] [Resolved] (SPARK-12178) Expose reporting of StreamInputInfo for custom made streams

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-12178. -- Resolution: Incomplete > Expose reporting of StreamInputInfo for custom made streams >

[jira] [Resolved] (SPARK-12622) spark-submit fails on executors when jar has a space in it

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-12622. -- Resolution: Incomplete > spark-submit fails on executors when jar has a space in it >

[jira] [Resolved] (SPARK-9599) Dynamic partitioning based on key-distribution

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-9599. - Resolution: Incomplete > Dynamic partitioning based on key-distribution >

[jira] [Resolved] (SPARK-13683) Finalize the public interface for OutputWriter[Factory]

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-13683. -- Resolution: Incomplete > Finalize the public interface for OutputWriter[Factory] >

[jira] [Resolved] (SPARK-9442) java.lang.ArithmeticException: / by zero when reading Parquet

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-9442. - Resolution: Incomplete > java.lang.ArithmeticException: / by zero when reading Parquet >

[jira] [Resolved] (SPARK-17156) Add multiclass logistic regression Scala Example

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-17156. -- Resolution: Incomplete > Add multiclass logistic regression Scala Example >

[jira] [Resolved] (SPARK-14926) OneVsRest labelMetadata uses incorrect name

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-14926. -- Resolution: Incomplete > OneVsRest labelMetadata uses incorrect name >

[jira] [Resolved] (SPARK-7597) Make default doc build avoid search engine indexing

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-7597. - Resolution: Incomplete > Make default doc build avoid search engine indexing >

[jira] [Resolved] (SPARK-9237) Added Top N Column Values for DataFrames

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-9237. - Resolution: Incomplete > Added Top N Column Values for DataFrames >

[jira] [Resolved] (SPARK-12099) Standalone and Mesos Should use OnOutOfMemoryError handlers

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-12099. -- Resolution: Incomplete > Standalone and Mesos Should use OnOutOfMemoryError handlers >

[jira] [Resolved] (SPARK-10659) DataFrames and SparkSQL saveAsParquetFile does not preserve REQUIRED (not nullable) flag in schema

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-10659. -- Resolution: Incomplete > DataFrames and SparkSQL saveAsParquetFile does not preserve REQUIRED

[jira] [Resolved] (SPARK-12319) ExchangeCoordinatorSuite fails on big-endian platforms

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-12319. -- Resolution: Incomplete > ExchangeCoordinatorSuite fails on big-endian platforms >

[jira] [Resolved] (SPARK-7398) Add back-pressure to Spark Streaming (umbrella JIRA)

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-7398. - Resolution: Incomplete > Add back-pressure to Spark Streaming (umbrella JIRA) >

[jira] [Resolved] (SPARK-10588) Saving a DataFrame containing only nulls to JSON doesn't work

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-10588. -- Resolution: Incomplete > Saving a DataFrame containing only nulls to JSON doesn't work >

[jira] [Resolved] (SPARK-8842) Spark SQL - Insert into table Issue

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-8842. - Resolution: Incomplete > Spark SQL - Insert into table Issue >

[jira] [Resolved] (SPARK-10936) UDAF "mode" for categorical variables

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-10936. -- Resolution: Incomplete > UDAF "mode" for categorical variables >

[jira] [Resolved] (SPARK-17135) Consolidate code in linear/logistic regression where possible

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-17135. -- Resolution: Incomplete > Consolidate code in linear/logistic regression where possible >

[jira] [Resolved] (SPARK-11907) Allowing errors as values in DataFrames (like 'Either Left/Right')

2019-05-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-11907. -- Resolution: Incomplete > Allowing errors as values in DataFrames (like 'Either Left/Right') >
