[jira] [Assigned] (SPARK-20317) There is no need to use 'return' keyword in scala method

2017-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20317: Assignee: (was: Apache Spark) > There is no need to use 'return' keyword in scala

[jira] [Commented] (SPARK-20317) There is no need to use 'return' keyword in scala method

2017-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967190#comment-15967190 ] Apache Spark commented on SPARK-20317: -- User 'ouyangxiaochen' has created a pull request for this

[jira] [Assigned] (SPARK-20317) There is no need to use 'return' keyword in scala method

2017-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20317: Assignee: Apache Spark > There is no need to use 'return' keyword in scala method >

[jira] [Comment Edited] (SPARK-20081) RandomForestClassifier doesn't seem to support more than 100 labels

2017-04-13 Thread 颜发才
[ https://issues.apache.org/jira/browse/SPARK-20081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967196#comment-15967196 ] Yan Facai (颜发才) edited comment on SPARK-20081 at 4/13/17 6:48 AM: --

[jira] [Created] (SPARK-20317) There is no need to use 'return' keyword in scala method

2017-04-13 Thread Xiaochen Ouyang (JIRA)
Xiaochen Ouyang created SPARK-20317: --- Summary: There is no need to use 'return' keyword in scala method Key: SPARK-20317 URL: https://issues.apache.org/jira/browse/SPARK-20317 Project: Spark

[jira] [Updated] (SPARK-20081) RandomForestClassifier doesn't seem to support more than 100 labels

2017-04-13 Thread 颜发才
[ https://issues.apache.org/jira/browse/SPARK-20081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Facai (颜发才) updated SPARK-20081: Component/s: ML > RandomForestClassifier doesn't seem to support more than 100 labels >

[jira] [Commented] (SPARK-20081) RandomForestClassifier doesn't seem to support more than 100 labels

2017-04-13 Thread 颜发才
[ https://issues.apache.org/jira/browse/SPARK-20081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967204#comment-15967204 ] Yan Facai (颜发才) commented on SPARK-20081: - How about adding a `setNumClass` to shortcut infer

[jira] [Created] (SPARK-20318) Use Catalyst type for min/max in ColumnStat for ease of estimation

2017-04-13 Thread Zhenhua Wang (JIRA)
Zhenhua Wang created SPARK-20318: Summary: Use Catalyst type for min/max in ColumnStat for ease of estimation Key: SPARK-20318 URL: https://issues.apache.org/jira/browse/SPARK-20318 Project: Spark

[jira] [Commented] (SPARK-20081) RandomForestClassifier doesn't seem to support more than 100 labels

2017-04-13 Thread 颜发才
[ https://issues.apache.org/jira/browse/SPARK-20081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967196#comment-15967196 ] Yan Facai (颜发才) commented on SPARK-20081: - [~creinig] Christian, RandomForestClassifier use

[jira] [Commented] (SPARK-20328) HadoopRDDs create a MapReduce JobConf, but are not MapReduce jobs

2017-04-13 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968232#comment-15968232 ] Michael Gummelt commented on SPARK-20328: - cc [~colorant] [~hfeng] [~vanzin] > HadoopRDDs create

[jira] [Commented] (SPARK-20327) Add CLI support for YARN-3926

2017-04-13 Thread Mark Grover (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968149#comment-15968149 ] Mark Grover commented on SPARK-20327: - Daniel, we don't assign JIRAs in Spark. Folks issue a PR and

[jira] [Created] (SPARK-20328) HadoopRDDs create a MapReduce JobConf, but are not MapReduce jobs

2017-04-13 Thread Michael Gummelt (JIRA)
Michael Gummelt created SPARK-20328: --- Summary: HadoopRDDs create a MapReduce JobConf, but are not MapReduce jobs Key: SPARK-20328 URL: https://issues.apache.org/jira/browse/SPARK-20328 Project:

[jira] [Commented] (SPARK-19909) Batches will fail in case that temporary checkpoint dir is on local file system while metadata dir is on HDFS

2017-04-13 Thread Rob (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19909?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968092#comment-15968092 ] Rob commented on SPARK-19909: - Is there an easy way to avoid this issue while waiting for it to be resolved?

[jira] [Resolved] (SPARK-20038) FileFormatWriter.ExecuteWriteTask.releaseResources() implementations to be re-entrant

2017-04-13 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid resolved SPARK-20038. -- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17364

[jira] [Resolved] (SPARK-20232) Better combineByKey documentation: clarify memory allocation, better example

2017-04-13 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-20232. - Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17545

[jira] [Assigned] (SPARK-20038) FileFormatWriter.ExecuteWriteTask.releaseResources() implementations to be re-entrant

2017-04-13 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid reassigned SPARK-20038: Assignee: Steve Loughran > FileFormatWriter.ExecuteWriteTask.releaseResources()

[jira] [Commented] (SPARK-20081) RandomForestClassifier doesn't seem to support more than 100 labels

2017-04-13 Thread Christian Reiniger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967328#comment-15967328 ] Christian Reiniger commented on SPARK-20081: Thanks for the feedback. We had to implement

[jira] [Commented] (SPARK-20081) RandomForestClassifier doesn't seem to support more than 100 labels

2017-04-13 Thread 颜发才
[ https://issues.apache.org/jira/browse/SPARK-20081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967347#comment-15967347 ] Yan Facai (颜发才) commented on SPARK-20081: - Yes, you should use `builder.putLong("num_vals",

[jira] [Comment Edited] (SPARK-20081) RandomForestClassifier doesn't seem to support more than 100 labels

2017-04-13 Thread 颜发才
[ https://issues.apache.org/jira/browse/SPARK-20081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967347#comment-15967347 ] Yan Facai (颜发才) edited comment on SPARK-20081 at 4/13/17 9:40 AM: -- Yes,

[jira] [Assigned] (SPARK-20318) Use Catalyst type for min/max in ColumnStat for ease of estimation

2017-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20318: Assignee: Apache Spark > Use Catalyst type for min/max in ColumnStat for ease of

[jira] [Assigned] (SPARK-20318) Use Catalyst type for min/max in ColumnStat for ease of estimation

2017-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20318: Assignee: (was: Apache Spark) > Use Catalyst type for min/max in ColumnStat for ease

[jira] [Commented] (SPARK-20318) Use Catalyst type for min/max in ColumnStat for ease of estimation

2017-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967210#comment-15967210 ] Apache Spark commented on SPARK-20318: -- User 'wzhfy' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-19141) VectorAssembler metadata causing memory issues

2017-04-13 Thread 颜发才
[ https://issues.apache.org/jira/browse/SPARK-19141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967246#comment-15967246 ] Yan Facai (颜发才) edited comment on SPARK-19141 at 4/13/17 7:42 AM: --

[jira] [Commented] (SPARK-19141) VectorAssembler metadata causing memory issues

2017-04-13 Thread 颜发才
[ https://issues.apache.org/jira/browse/SPARK-19141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967246#comment-15967246 ] Yan Facai (颜发才) commented on SPARK-19141: - `VectorAssembler` will create attribute (name) for

[jira] [Resolved] (SPARK-20265) Improve Prefix'span pre-processing efficiency

2017-04-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-20265. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17575

[jira] [Assigned] (SPARK-20265) Improve Prefix'span pre-processing efficiency

2017-04-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-20265: - Assignee: Cyril de Vogelaere > Improve Prefix'span pre-processing efficiency >

[jira] [Comment Edited] (SPARK-20081) RandomForestClassifier doesn't seem to support more than 100 labels

2017-04-13 Thread 颜发才
[ https://issues.apache.org/jira/browse/SPARK-20081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967347#comment-15967347 ] Yan Facai (颜发才) edited comment on SPARK-20081 at 4/13/17 9:40 AM: -- Yes,

[jira] [Updated] (SPARK-20320) AnalysisException: Columns of grouping_id (count(value#17L)) does not match grouping columns (count(value#17L))

2017-04-13 Thread Jacek Laskowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacek Laskowski updated SPARK-20320: Shepherd: Davies Liu (was: Herman van Hovell) Description: I'm not questioning the

[jira] [Commented] (SPARK-20319) Already quoted identifiers are getting wrapped with additional quotes

2017-04-13 Thread Umesh Chaudhary (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967229#comment-15967229 ] Umesh Chaudhary commented on SPARK-20319: - I am working on fix, please assign this to me. >

[jira] [Updated] (SPARK-20319) Already quoted identifiers are getting wrapped with additional quotes

2017-04-13 Thread Umesh Chaudhary (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Umesh Chaudhary updated SPARK-20319: Description: The issue was caused by

[jira] [Commented] (SPARK-20317) There is no need to use 'return' keyword in scala method

2017-04-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967276#comment-15967276 ] Sean Owen commented on SPARK-20317: --- [~ouyangxc] please don't bother with things like this in JIRA >

[jira] [Comment Edited] (SPARK-20316) In SparkSQLCLIDriver, val and var should strictly follow the Scala syntax

2017-04-13 Thread Xiaochen Ouyang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20316?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967284#comment-15967284 ] Xiaochen Ouyang edited comment on SPARK-20316 at 4/13/17 8:39 AM: --

[jira] [Resolved] (SPARK-20037) impossible to set kafka offsets using kafka 0.10 and spark 2.0.0

2017-04-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-20037. --- Resolution: Duplicate > impossible to set kafka offsets using kafka 0.10 and spark 2.0.0 >

[jira] [Closed] (SPARK-20317) There is no need to use 'return' keyword in scala method

2017-04-13 Thread Xiaochen Ouyang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiaochen Ouyang closed SPARK-20317. --- Resolution: Duplicate merge this issue to #SPARK-20316 > There is no need to use 'return'

[jira] [Created] (SPARK-20319) Already quoted identifiers are getting wrapped with additional quotes

2017-04-13 Thread Umesh Chaudhary (JIRA)
Umesh Chaudhary created SPARK-20319: --- Summary: Already quoted identifiers are getting wrapped with additional quotes Key: SPARK-20319 URL: https://issues.apache.org/jira/browse/SPARK-20319 Project:

[jira] [Commented] (SPARK-20316) In SparkSQLCLIDriver, val and var should strictly follow the Scala syntax

2017-04-13 Thread Xiaochen Ouyang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20316?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967284#comment-15967284 ] Xiaochen Ouyang commented on SPARK-20316: - OK,I'll try to work on this, but it may be take a lot

[jira] [Created] (SPARK-20320) AnalysisException: Columns of grouping_id (count(value#17L)) does not match grouping columns (count(value#17L))

2017-04-13 Thread Jacek Laskowski (JIRA)
Jacek Laskowski created SPARK-20320: --- Summary: AnalysisException: Columns of grouping_id (count(value#17L)) does not match grouping columns (count(value#17L)) Key: SPARK-20320 URL:

[jira] [Comment Edited] (SPARK-20081) RandomForestClassifier doesn't seem to support more than 100 labels

2017-04-13 Thread 颜发才
[ https://issues.apache.org/jira/browse/SPARK-20081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967347#comment-15967347 ] Yan Facai (颜发才) edited comment on SPARK-20081 at 4/13/17 9:42 AM: -- Yes,

[jira] [Assigned] (SPARK-20189) Fix spark kinesis testcases to remove deprecated createStream and use Builders

2017-04-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-20189: - Assignee: Yash Sharma Priority: Minor (was: Major) > Fix spark kinesis testcases to remove

[jira] [Created] (SPARK-20330) CLONE - SparkContext.localProperties leaked

2017-04-13 Thread Oleg White (JIRA)
Oleg White created SPARK-20330: -- Summary: CLONE - SparkContext.localProperties leaked Key: SPARK-20330 URL: https://issues.apache.org/jira/browse/SPARK-20330 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-20328) HadoopRDDs create a MapReduce JobConf, but are not MapReduce jobs

2017-04-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968423#comment-15968423 ] Marcelo Vanzin commented on SPARK-20328: bq. But it shouldn't need delegation tokens at all,

[jira] [Commented] (SPARK-20328) HadoopRDDs create a MapReduce JobConf, but are not MapReduce jobs

2017-04-13 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968469#comment-15968469 ] Michael Gummelt commented on SPARK-20328: - bq. I have no idea what that means. I'm pretty sure a

[jira] [Updated] (SPARK-16900) Complete-mode output for file sinks

2017-04-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-16900: - Component/s: (was: DStreams) Structured Streaming > Complete-mode output

[jira] [Comment Edited] (SPARK-20328) HadoopRDDs create a MapReduce JobConf, but are not MapReduce jobs

2017-04-13 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968411#comment-15968411 ] Michael Gummelt edited comment on SPARK-20328 at 4/13/17 11:59 PM: --- bq.

[jira] [Comment Edited] (SPARK-20328) HadoopRDDs create a MapReduce JobConf, but are not MapReduce jobs

2017-04-13 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968416#comment-15968416 ] Michael Gummelt edited comment on SPARK-20328 at 4/13/17 11:59 PM: --- bq.

[jira] [Commented] (SPARK-20328) HadoopRDDs create a MapReduce JobConf, but are not MapReduce jobs

2017-04-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968343#comment-15968343 ] Marcelo Vanzin commented on SPARK-20328: Hmm... that seems related to delegation token support.

[jira] [Created] (SPARK-20331) Broaden support for Hive partition pruning predicate pushdown

2017-04-13 Thread Michael Allman (JIRA)
Michael Allman created SPARK-20331: -- Summary: Broaden support for Hive partition pruning predicate pushdown Key: SPARK-20331 URL: https://issues.apache.org/jira/browse/SPARK-20331 Project: Spark

[jira] [Commented] (SPARK-20328) HadoopRDDs create a MapReduce JobConf, but are not MapReduce jobs

2017-04-13 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968411#comment-15968411 ] Michael Gummelt commented on SPARK-20328: - > The Mesos backend (I mean the code in Spark, not the

[jira] [Assigned] (SPARK-18127) Add hooks and extension points to Spark

2017-04-13 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell reassigned SPARK-18127: - Assignee: Sameer Agarwal (was: Herman van Hovell) > Add hooks and extension

[jira] [Commented] (SPARK-20321) Spark UI cannot be shutdown in spark streaming app

2017-04-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968290#comment-15968290 ] Shixiong Zhu commented on SPARK-20321: -- You cannot stop a StreamingContext in foreachRDD. For

[jira] [Closed] (SPARK-20330) CLONE - SparkContext.localProperties leaked

2017-04-13 Thread Oleg White (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Oleg White closed SPARK-20330. -- Resolution: Duplicate > CLONE - SparkContext.localProperties leaked >

[jira] [Commented] (SPARK-20328) HadoopRDDs create a MapReduce JobConf, but are not MapReduce jobs

2017-04-13 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968388#comment-15968388 ] Michael Gummelt commented on SPARK-20328: - Hey [~vanzin], thanks for the response. Everything

[jira] [Commented] (SPARK-20328) HadoopRDDs create a MapReduce JobConf, but are not MapReduce jobs

2017-04-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968396#comment-15968396 ] Marcelo Vanzin commented on SPARK-20328: bq. The problem can't be solved in the Mesos backend I

[jira] [Commented] (SPARK-20293) In the page of 'jobs' or 'stages' of history server web ui,,click the 'Go' button, query paging data, the page error

2017-04-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968473#comment-15968473 ] Hyukjin Kwon commented on SPARK-20293: -- I can't reproduce in the current master. See -

[jira] [Commented] (SPARK-20328) HadoopRDDs create a MapReduce JobConf, but are not MapReduce jobs

2017-04-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968474#comment-15968474 ] Marcelo Vanzin commented on SPARK-20328: bq. Since the driver is authenticated, it can request

[jira] [Updated] (SPARK-20328) HadoopRDDs create a MapReduce JobConf, but are not MapReduce jobs

2017-04-13 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Gummelt updated SPARK-20328: Description: In order to obtain {{InputSplit}} information, {{HadoopRDD}} creates a

[jira] [Updated] (SPARK-20312) query optimizer calls udf with null values when it doesn't expect them

2017-04-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-20312: - Component/s: (was: Spark Core) SQL > query optimizer calls udf with null

[jira] [Updated] (SPARK-20328) HadoopRDDs create a MapReduce JobConf, but are not MapReduce jobs

2017-04-13 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Gummelt updated SPARK-20328: Description: In order to obtain {{InputSplit}} information, {{HadoopRDD}} creates a

[jira] [Comment Edited] (SPARK-20328) HadoopRDDs create a MapReduce JobConf, but are not MapReduce jobs

2017-04-13 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968416#comment-15968416 ] Michael Gummelt edited comment on SPARK-20328 at 4/14/17 12:02 AM: --- bq.

[jira] [Commented] (SPARK-20328) HadoopRDDs create a MapReduce JobConf, but are not MapReduce jobs

2017-04-13 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968432#comment-15968432 ] Michael Gummelt commented on SPARK-20328: - bq. It depends. e.g. on YARN, when you submit in

[jira] [Commented] (SPARK-20328) HadoopRDDs create a MapReduce JobConf, but are not MapReduce jobs

2017-04-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968440#comment-15968440 ] Marcelo Vanzin commented on SPARK-20328: bq. that the driver is already logged in via the

[jira] [Updated] (SPARK-20331) Broaden support for Hive partition pruning predicate pushdown

2017-04-13 Thread Michael Allman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Allman updated SPARK-20331: --- Description: Spark 2.1 introduced scalable support for Hive tables with huge numbers of

[jira] [Commented] (SPARK-20331) Broaden support for Hive partition pruning predicate pushdown

2017-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968439#comment-15968439 ] Apache Spark commented on SPARK-20331: -- User 'mallman' has created a pull request for this issue:

[jira] [Assigned] (SPARK-20331) Broaden support for Hive partition pruning predicate pushdown

2017-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20331: Assignee: Apache Spark > Broaden support for Hive partition pruning predicate pushdown >

[jira] [Assigned] (SPARK-20331) Broaden support for Hive partition pruning predicate pushdown

2017-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20331: Assignee: (was: Apache Spark) > Broaden support for Hive partition pruning predicate

[jira] [Created] (SPARK-20329) Resolution error when HAVING clause uses GROUP BY expression that involves implicit type coercion

2017-04-13 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-20329: -- Summary: Resolution error when HAVING clause uses GROUP BY expression that involves implicit type coercion Key: SPARK-20329 URL: https://issues.apache.org/jira/browse/SPARK-20329

[jira] [Comment Edited] (SPARK-20328) HadoopRDDs create a MapReduce JobConf, but are not MapReduce jobs

2017-04-13 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968388#comment-15968388 ] Michael Gummelt edited comment on SPARK-20328 at 4/13/17 11:27 PM: --- Hey

[jira] [Comment Edited] (SPARK-20328) HadoopRDDs create a MapReduce JobConf, but are not MapReduce jobs

2017-04-13 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968388#comment-15968388 ] Michael Gummelt edited comment on SPARK-20328 at 4/13/17 11:27 PM: --- Hey

[jira] [Commented] (SPARK-20328) HadoopRDDs create a MapReduce JobConf, but are not MapReduce jobs

2017-04-13 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968416#comment-15968416 ] Michael Gummelt commented on SPARK-20328: - > It shouldn't need to do it not for the reasons you

[jira] [Updated] (SPARK-19946) DebugFilesystem.assertNoOpenStreams should report the open streams to help debugging

2017-04-13 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-19946: -- Fix Version/s: 2.1.1 > DebugFilesystem.assertNoOpenStreams should report the open

[jira] [Assigned] (SPARK-20232) Better combineByKey documentation: clarify memory allocation, better example

2017-04-13 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk reassigned SPARK-20232: --- Assignee: David Gingrich > Better combineByKey documentation: clarify memory allocation, better

[jira] [Created] (SPARK-20325) Spark Structured Streaming documentation Update: checkpoint configuration

2017-04-13 Thread Kate Eri (JIRA)
Kate Eri created SPARK-20325: Summary: Spark Structured Streaming documentation Update: checkpoint configuration Key: SPARK-20325 URL: https://issues.apache.org/jira/browse/SPARK-20325 Project: Spark

[jira] [Assigned] (SPARK-20233) Apply star-join filter heuristics to dynamic programming join enumeration

2017-04-13 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-20233: --- Assignee: Ioana Delaney > Apply star-join filter heuristics to dynamic programming join

[jira] [Resolved] (SPARK-20233) Apply star-join filter heuristics to dynamic programming join enumeration

2017-04-13 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-20233. - Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17546

[jira] [Resolved] (SPARK-20189) Fix spark kinesis testcases to remove deprecated createStream and use Builders

2017-04-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-20189. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17506

[jira] [Updated] (SPARK-20316) In SparkSQLCLIDriver, val and var should strictly follow the Scala syntax

2017-04-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-20316: -- Priority: Trivial (was: Major) This is not worth fixing alone. Either try to fix all such instances

[jira] [Commented] (SPARK-20310) Dependency convergence error for scala-xml

2017-04-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967275#comment-15967275 ] Sean Owen commented on SPARK-20310: --- Yes but you can and should manage the dependency up to a common

[jira] [Resolved] (SPARK-20284) Make SerializationStream and DeserializationStream extend Closeable

2017-04-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-20284. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17598

[jira] [Assigned] (SPARK-20284) Make SerializationStream and DeserializationStream extend Closeable

2017-04-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-20284: - Assignee: Sergei Lebedev > Make SerializationStream and DeserializationStream extend Closeable

[jira] [Commented] (SPARK-20319) Already quoted identifiers are getting wrapped with additional quotes

2017-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967318#comment-15967318 ] Apache Spark commented on SPARK-20319: -- User 'umesh9794' has created a pull request for this issue:

[jira] [Assigned] (SPARK-20319) Already quoted identifiers are getting wrapped with additional quotes

2017-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20319: Assignee: Apache Spark > Already quoted identifiers are getting wrapped with additional

[jira] [Assigned] (SPARK-20319) Already quoted identifiers are getting wrapped with additional quotes

2017-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20319: Assignee: (was: Apache Spark) > Already quoted identifiers are getting wrapped with

[jira] [Commented] (SPARK-20199) GradientBoostedTreesModel doesn't have Column Sampling Rate Paramenter

2017-04-13 Thread pralabhkumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967574#comment-15967574 ] pralabhkumar commented on SPARK-20199: -- Shouldn't there be a parameter in GBMParameters() val

[jira] [Commented] (SPARK-19946) DebugFilesystem.assertNoOpenStreams should report the open streams to help debugging

2017-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967599#comment-15967599 ] Apache Spark commented on SPARK-19946: -- User 'bogdanrdc' has created a pull request for this issue:

[jira] [Commented] (SPARK-16742) Kerberos support for Spark on Mesos

2017-04-13 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968545#comment-15968545 ] Saisai Shao commented on SPARK-16742: - [~mgummelt], do you have a design doc of the kerberos support

[jira] [Updated] (SPARK-20333) Fix HashPartitioner in DAGSchedulerSuite

2017-04-13 Thread jin xing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jin xing updated SPARK-20333: - Description: In test "don't submit stage until its dependencies map outputs are registered

[jira] [Created] (SPARK-20334) Return a better error message when correlated predicates contain aggregate expression that has mixture of outer and local references

2017-04-13 Thread Dilip Biswal (JIRA)
Dilip Biswal created SPARK-20334: Summary: Return a better error message when correlated predicates contain aggregate expression that has mixture of outer and local references Key: SPARK-20334 URL:

[jira] [Resolved] (SPARK-20293) In the page of 'jobs' or 'stages' of history server web ui,,click the 'Go' button, query paging data, the page error

2017-04-13 Thread Alex Bozarth (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Bozarth resolved SPARK-20293. -- Resolution: Duplicate Already fixed in master and branch-2.1 > In the page of 'jobs' or

[jira] [Created] (SPARK-20332) Avro/Parquet GenericFixed decimal is not read into Spark correctly

2017-04-13 Thread Justin Pihony (JIRA)
Justin Pihony created SPARK-20332: - Summary: Avro/Parquet GenericFixed decimal is not read into Spark correctly Key: SPARK-20332 URL: https://issues.apache.org/jira/browse/SPARK-20332 Project: Spark

[jira] [Created] (SPARK-20333) Fix HashPartitioner in DAGSchedulerSuite

2017-04-13 Thread jin xing (JIRA)
jin xing created SPARK-20333: Summary: Fix HashPartitioner in DAGSchedulerSuite Key: SPARK-20333 URL: https://issues.apache.org/jira/browse/SPARK-20333 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-20333) Fix HashPartitioner in DAGSchedulerSuite

2017-04-13 Thread jin xing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jin xing updated SPARK-20333: - Description: In test "don't submit stage until its dependencies map outputs are registered

[jira] [Created] (SPARK-20335) Children expressions of Hive UDF impacts the determinism of Hive UDF

2017-04-13 Thread Xiao Li (JIRA)
Xiao Li created SPARK-20335: --- Summary: Children expressions of Hive UDF impacts the determinism of Hive UDF Key: SPARK-20335 URL: https://issues.apache.org/jira/browse/SPARK-20335 Project: Spark

[jira] [Assigned] (SPARK-20333) Fix HashPartitioner in DAGSchedulerSuite

2017-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20333: Assignee: Apache Spark > Fix HashPartitioner in DAGSchedulerSuite >

[jira] [Assigned] (SPARK-20333) Fix HashPartitioner in DAGSchedulerSuite

2017-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20333: Assignee: (was: Apache Spark) > Fix HashPartitioner in DAGSchedulerSuite >

[jira] [Commented] (SPARK-20333) Fix HashPartitioner in DAGSchedulerSuite

2017-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968576#comment-15968576 ] Apache Spark commented on SPARK-20333: -- User 'jinxing64' has created a pull request for this issue:

[jira] [Commented] (SPARK-20335) Children expressions of Hive UDF impacts the determinism of Hive UDF

2017-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968639#comment-15968639 ] Apache Spark commented on SPARK-20335: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-20335) Children expressions of Hive UDF impacts the determinism of Hive UDF

2017-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20335: Assignee: Apache Spark (was: Xiao Li) > Children expressions of Hive UDF impacts the

[jira] [Assigned] (SPARK-20335) Children expressions of Hive UDF impacts the determinism of Hive UDF

2017-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20335: Assignee: Xiao Li (was: Apache Spark) > Children expressions of Hive UDF impacts the

[jira] [Assigned] (SPARK-20334) Return a better error message when correlated predicates contain aggregate expression that has mixture of outer and local references

2017-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20334: Assignee: Apache Spark > Return a better error message when correlated predicates contain

[jira] [Commented] (SPARK-20334) Return a better error message when correlated predicates contain aggregate expression that has mixture of outer and local references

2017-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968650#comment-15968650 ] Apache Spark commented on SPARK-20334: -- User 'dilipbiswal' has created a pull request for this

  1   2   >