[jira] [Assigned] (SPARK-18697) Upgrade sbt plugins

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18697: Assignee: (was: Apache Spark) > Upgrade sbt plugins >

[jira] [Assigned] (SPARK-18697) Upgrade sbt plugins

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18697: Assignee: Apache Spark > Upgrade sbt plugins >

[jira] [Assigned] (SPARK-18724) Add TuningSummary for TrainValidationSplit

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18724: Assignee: Apache Spark > Add TuningSummary for TrainValidationSplit >

[jira] [Commented] (SPARK-18724) Add TuningSummary for TrainValidationSplit

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723559#comment-15723559 ] Apache Spark commented on SPARK-18724: -- User 'hhbyyh' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18724) Add TuningSummary for TrainValidationSplit

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18724: Assignee: (was: Apache Spark) > Add TuningSummary for TrainValidationSplit >

[jira] [Commented] (SPARK-18085) Better History Server scalability for many / large applications

2016-12-05 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723547#comment-15723547 ] Marcelo Vanzin commented on SPARK-18085: While the issues you raise are valid, they're also

[jira] [Assigned] (SPARK-18723) Expanded programming guide information on wholeTextFiles

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18723: Assignee: (was: Apache Spark) > Expanded programming guide information on

[jira] [Commented] (SPARK-18723) Expanded programming guide information on wholeTextFiles

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723539#comment-15723539 ] Apache Spark commented on SPARK-18723: -- User 'michalsenkyr' has created a pull request for this

[jira] [Assigned] (SPARK-18723) Expanded programming guide information on wholeTextFiles

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18723: Assignee: Apache Spark > Expanded programming guide information on wholeTextFiles >

[jira] [Commented] (SPARK-15328) Word2Vec import for original binary format

2016-12-05 Thread Robin East (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723536#comment-15723536 ] Robin East commented on SPARK-15328: Any news on the PR for this? There seem to be a few issues with

[jira] [Created] (SPARK-18727) Support schema evolution as new files are inserted into table

2016-12-05 Thread Eric Liang (JIRA)
Eric Liang created SPARK-18727: -- Summary: Support schema evolution as new files are inserted into table Key: SPARK-18727 URL: https://issues.apache.org/jira/browse/SPARK-18727 Project: Spark

[jira] [Updated] (SPARK-18727) Support schema evolution as new files are inserted into table

2016-12-05 Thread Eric Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Liang updated SPARK-18727: --- Component/s: SQL > Support schema evolution as new files are inserted into table >

[jira] [Created] (SPARK-18726) Filesystem unnecessarily scanned twice during creation of non-catalog table

2016-12-05 Thread Eric Liang (JIRA)
Eric Liang created SPARK-18726: -- Summary: Filesystem unnecessarily scanned twice during creation of non-catalog table Key: SPARK-18726 URL: https://issues.apache.org/jira/browse/SPARK-18726 Project:

[jira] [Commented] (SPARK-18717) Datasets - crash (compile exception) when mapping to immutable scala map

2016-12-05 Thread Andrew Ray (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723527#comment-15723527 ] Andrew Ray commented on SPARK-18717: I have a fix for this, will make a PR in a bit > Datasets -

[jira] [Commented] (SPARK-18717) Datasets - crash (compile exception) when mapping to immutable scala map

2016-12-05 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723518#comment-15723518 ] Dongjoon Hyun commented on SPARK-18717: --- +1 > Datasets - crash (compile exception) when mapping to

[jira] [Created] (SPARK-18725) Creating a datasource table with schema should not scan all files for table

2016-12-05 Thread Eric Liang (JIRA)
Eric Liang created SPARK-18725: -- Summary: Creating a datasource table with schema should not scan all files for table Key: SPARK-18725 URL: https://issues.apache.org/jira/browse/SPARK-18725 Project:

[jira] [Updated] (SPARK-18724) Add TuningSummary for TrainValidationSplit

2016-12-05 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-18724: --- Description: Currently TrainValidationSplitModel only provides tuning metrics in the format of

[jira] [Created] (SPARK-18724) Add TuningSummary for TrainValidationSplit

2016-12-05 Thread yuhao yang (JIRA)
yuhao yang created SPARK-18724: -- Summary: Add TuningSummary for TrainValidationSplit Key: SPARK-18724 URL: https://issues.apache.org/jira/browse/SPARK-18724 Project: Spark Issue Type:

[jira] [Commented] (SPARK-18717) Datasets - crash (compile exception) when mapping to immutable scala map

2016-12-05 Thread Andrew Ray (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723499#comment-15723499 ] Andrew Ray commented on SPARK-18717: Use `scala.collection.Map` as the type in your case class

[jira] [Assigned] (SPARK-18539) Cannot filter by nonexisting column in parquet file

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18539: Assignee: Apache Spark > Cannot filter by nonexisting column in parquet file >

[jira] [Commented] (SPARK-18539) Cannot filter by nonexisting column in parquet file

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723473#comment-15723473 ] Apache Spark commented on SPARK-18539: -- User 'xwu0226' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18539) Cannot filter by nonexisting column in parquet file

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18539: Assignee: (was: Apache Spark) > Cannot filter by nonexisting column in parquet file >

[jira] [Updated] (SPARK-18723) Expanded programming guide information on wholeTextFiles

2016-12-05 Thread Michal Šenkýř (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michal Šenkýř updated SPARK-18723: -- Summary: Expanded programming guide information on wholeTextFiles (was: Expanded programming

[jira] [Created] (SPARK-18723) Expanded programming guid information on wholeTextFiles

2016-12-05 Thread Michal Šenkýř (JIRA)
Michal Šenkýř created SPARK-18723: - Summary: Expanded programming guid information on wholeTextFiles Key: SPARK-18723 URL: https://issues.apache.org/jira/browse/SPARK-18723 Project: Spark

[jira] [Commented] (SPARK-18722) Move no data rate limit from StreamExecution to ProgressReporter

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723430#comment-15723430 ] Apache Spark commented on SPARK-18722: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18722) Move no data rate limit from StreamExecution to ProgressReporter

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18722: Assignee: (was: Apache Spark) > Move no data rate limit from StreamExecution to

[jira] [Assigned] (SPARK-18722) Move no data rate limit from StreamExecution to ProgressReporter

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18722: Assignee: Apache Spark > Move no data rate limit from StreamExecution to ProgressReporter

[jira] [Created] (SPARK-18722) Move no data rate limit from StreamExecution to ProgressReporter

2016-12-05 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-18722: Summary: Move no data rate limit from StreamExecution to ProgressReporter Key: SPARK-18722 URL: https://issues.apache.org/jira/browse/SPARK-18722 Project: Spark

[jira] [Created] (SPARK-18721) ForeachSink breaks Watermark in append mode

2016-12-05 Thread Cristian Opris (JIRA)
Cristian Opris created SPARK-18721: -- Summary: ForeachSink breaks Watermark in append mode Key: SPARK-18721 URL: https://issues.apache.org/jira/browse/SPARK-18721 Project: Spark Issue Type:

[jira] [Commented] (SPARK-17822) JVMObjectTracker.objMap may leak JVM objects

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723342#comment-15723342 ] Apache Spark commented on SPARK-17822: -- User 'mengxr' has created a pull request for this issue:

[jira] [Commented] (SPARK-18539) Cannot filter by nonexisting column in parquet file

2016-12-05 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723296#comment-15723296 ] Xin Wu commented on SPARK-18539: Yes. I have the fix and will submit PR and cc everyone for review. >

[jira] [Commented] (SPARK-18539) Cannot filter by nonexisting column in parquet file

2016-12-05 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723259#comment-15723259 ] Xiao Li commented on SPARK-18539: - [~lian cheng] [~rxin] We might be able to capture and process the

[jira] [Commented] (SPARK-18142) Spark Master tries to launch workers 145 times within 1 minute

2016-12-05 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723252#comment-15723252 ] Shixiong Zhu commented on SPARK-18142: -- Looks like we need a blacklist mechanism for workers. >

[jira] [Closed] (SPARK-18709) Automatic null conversion bug (instead of throwing error) when creating a Spark Datarame with incompatible types for fields.

2016-12-05 Thread Amogh Param (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Amogh Param closed SPARK-18709. --- The fix is in 2.0.0. > Automatic null conversion bug (instead of throwing error) when creating a >

[jira] [Commented] (SPARK-18549) Failed to Uncache a View that References a Dropped Table.

2016-12-05 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18549?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723240#comment-15723240 ] Herman van Hovell commented on SPARK-18549: --- Yeah, lets retarget this. > Failed to Uncache a

[jira] [Updated] (SPARK-18549) Failed to Uncache a View that References a Dropped Table.

2016-12-05 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-18549: -- Target Version/s: 2.2.0 (was: 2.1.0) > Failed to Uncache a View that References a

[jira] [Commented] (SPARK-18694) Add StreamingQuery.explain and exception to Python and fix StreamingQueryException

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723239#comment-15723239 ] Apache Spark commented on SPARK-18694: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Updated] (SPARK-18388) Running aggregation on many columns throws SOE

2016-12-05 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-18388: -- Target Version/s: 2.2.0 (was: 2.1.0) > Running aggregation on many columns throws SOE

[jira] [Commented] (SPARK-18709) Automatic null conversion bug (instead of throwing error) when creating a Spark Datarame with incompatible types for fields.

2016-12-05 Thread Amogh Param (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723236#comment-15723236 ] Amogh Param commented on SPARK-18709: - Thanks, I'll close the ticket. > Automatic null conversion

[jira] [Commented] (SPARK-18709) Automatic null conversion bug (instead of throwing error) when creating a Spark Datarame with incompatible types for fields.

2016-12-05 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723201#comment-15723201 ] Dongjoon Hyun commented on SPARK-18709: --- Yes. It will not be in 1.6.4 (if exists). > Automatic

[jira] [Comment Edited] (SPARK-18709) Automatic null conversion bug (instead of throwing error) when creating a Spark Datarame with incompatible types for fields.

2016-12-05 Thread Amogh Param (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723189#comment-15723189 ] Amogh Param edited comment on SPARK-18709 at 12/5/16 7:50 PM: -- [~dongjoon]

[jira] [Commented] (SPARK-18709) Automatic null conversion bug (instead of throwing error) when creating a Spark Datarame with incompatible types for fields.

2016-12-05 Thread Amogh Param (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723189#comment-15723189 ] Amogh Param commented on SPARK-18709: - [~dongjoon] Thanks for the fix. Just to clarify, does this

[jira] [Commented] (SPARK-650) Add a "setup hook" API for running initialization code on each executor

2016-12-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723170#comment-15723170 ] Sean Owen commented on SPARK-650: - Why? info X can be included in the closure, and the executor can call

[jira] [Resolved] (SPARK-18711) NPE in generated SpecificMutableProjection for Aggregator

2016-12-05 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-18711. --- Resolution: Fixed Assignee: Wenchen Fan Fix Version/s: 2.1.0 > NPE

[jira] [Updated] (SPARK-18716) Restrict the disk usage of spark event log.

2016-12-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18716: -- Target Version/s: (was: 2.0.3, 2.1.1) > Restrict the disk usage of spark event log. >

[jira] [Resolved] (SPARK-18709) Automatic null conversion bug (instead of throwing error) when creating a Spark Datarame with incompatible types for fields.

2016-12-05 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-18709. --- Resolution: Fixed Sure. [~zsxwing] I think the issue reporter, [~amogh.91], will agree to

[jira] [Resolved] (SPARK-18531) Apache Spark FPGrowth algorithm implementation fails with java.lang.StackOverflowError

2016-12-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18531. --- Resolution: Not A Problem Fix Version/s: 1.6.3 That suggests the workaround works, and, that

[jira] [Updated] (SPARK-18470) Provide Spark Streaming Monitor Rest Api

2016-12-05 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-18470: - Component/s: (was: Structured Streaming) DStreams > Provide Spark Streaming

[jira] [Reopened] (SPARK-18560) Receiver data can not be dataSerialized properly.

2016-12-05 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu reopened SPARK-18560: -- > Receiver data can not be dataSerialized properly. >

[jira] [Updated] (SPARK-18560) Receiver data can not be dataSerialized properly.

2016-12-05 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-18560: - Assignee: (was: Genmao Yu) > Receiver data can not be dataSerialized properly. >

[jira] [Resolved] (SPARK-18560) Receiver data can not be dataSerialized properly.

2016-12-05 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-18560. -- Resolution: Duplicate > Receiver data can not be dataSerialized properly. >

[jira] [Updated] (SPARK-18560) Receiver data can not be dataSerialized properly.

2016-12-05 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-18560: - Component/s: (was: Spark Core) DStreams > Receiver data can not be

[jira] [Resolved] (SPARK-18560) Receiver data can not be dataSerialized properly.

2016-12-05 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-18560. -- Resolution: Fixed Assignee: Genmao Yu Fix Version/s: 2.1.0

[jira] [Commented] (SPARK-18709) Automatic null conversion bug (instead of throwing error) when creating a Spark Datarame with incompatible types for fields.

2016-12-05 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723100#comment-15723100 ] Shixiong Zhu commented on SPARK-18709: -- [~dongjoon] Is it already resolved? If so, could you close

[jira] [Updated] (SPARK-18709) Automatic null conversion bug (instead of throwing error) when creating a Spark Datarame with incompatible types for fields.

2016-12-05 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-18709: - Component/s: (was: Spark Core) SQL > Automatic null conversion bug (instead

[jira] [Resolved] (SPARK-18470) Provide Spark Streaming Monitor Rest Api

2016-12-05 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-18470. -- Resolution: Duplicate > Provide Spark Streaming Monitor Rest Api >

[jira] [Commented] (SPARK-18720) Code Refactoring of withColumn

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723088#comment-15723088 ] Apache Spark commented on SPARK-18720: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18720) Code Refactoring of withColumn

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18720: Assignee: (was: Apache Spark) > Code Refactoring of withColumn >

[jira] [Assigned] (SPARK-18720) Code Refactoring of withColumn

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18720: Assignee: Apache Spark > Code Refactoring of withColumn >

[jira] [Created] (SPARK-18720) Code Refactoring of withColumn

2016-12-05 Thread Xiao Li (JIRA)
Xiao Li created SPARK-18720: --- Summary: Code Refactoring of withColumn Key: SPARK-18720 URL: https://issues.apache.org/jira/browse/SPARK-18720 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-18719) Document spark.ui.showConsoleProgress

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723057#comment-15723057 ] Apache Spark commented on SPARK-18719: -- User 'nchammas' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18719) Document spark.ui.showConsoleProgress

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18719: Assignee: (was: Apache Spark) > Document spark.ui.showConsoleProgress >

[jira] [Assigned] (SPARK-18719) Document spark.ui.showConsoleProgress

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18719: Assignee: Apache Spark > Document spark.ui.showConsoleProgress >

[jira] [Assigned] (SPARK-18349) Update R API documentation on ml model summary

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18349: Assignee: Apache Spark > Update R API documentation on ml model summary >

[jira] [Assigned] (SPARK-18349) Update R API documentation on ml model summary

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18349: Assignee: (was: Apache Spark) > Update R API documentation on ml model summary >

[jira] [Commented] (SPARK-18349) Update R API documentation on ml model summary

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723044#comment-15723044 ] Apache Spark commented on SPARK-18349: -- User 'wangmiao1981' has created a pull request for this

[jira] [Updated] (SPARK-18284) Scheme of DataFrame generated from RDD is diffrent between master and 2.0

2016-12-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18284: Fix Version/s: (was: 2.1.0) 2.2.0 > Scheme of DataFrame generated from RDD

[jira] [Commented] (SPARK-18284) Scheme of DataFrame generated from RDD is diffrent between master and 2.0

2016-12-05 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15723012#comment-15723012 ] Yin Huai commented on SPARK-18284: -- [~kiszk] btw, do we know what caused the nullable setting change in

[jira] [Updated] (SPARK-18284) Scheme of DataFrame generated from RDD is different between master and 2.0

2016-12-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18284: Summary: Scheme of DataFrame generated from RDD is different between master and 2.0 (was: Scheme

[jira] [Assigned] (SPARK-18715) Fix wrong AIC calculation in Binomial GLM

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18715: Assignee: Apache Spark > Fix wrong AIC calculation in Binomial GLM >

[jira] [Commented] (SPARK-18715) Fix wrong AIC calculation in Binomial GLM

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15722995#comment-15722995 ] Apache Spark commented on SPARK-18715: -- User 'actuaryzhang' has created a pull request for this

[jira] [Assigned] (SPARK-18715) Fix wrong AIC calculation in Binomial GLM

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18715: Assignee: (was: Apache Spark) > Fix wrong AIC calculation in Binomial GLM >

[jira] [Commented] (SPARK-18349) Update R API documentation on ml model summary

2016-12-05 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15722932#comment-15722932 ] Miao Wang commented on SPARK-18349: --- I go through the "summary" methods in mllib.R and have the

[jira] [Commented] (SPARK-18539) Cannot filter by nonexisting column in parquet file

2016-12-05 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15722891#comment-15722891 ] Cheng Lian commented on SPARK-18539: Haven't looked deeply into this issue, but my hunch is that this

[jira] [Commented] (SPARK-17822) JVMObjectTracker.objMap may leak JVM objects

2016-12-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15722886#comment-15722886 ] Xiangrui Meng commented on SPARK-17822: --- The issue comes with multiple RBackend connections. It is

[jira] [Resolved] (SPARK-18713) using SparkR build step wise regression model (glm)

2016-12-05 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-18713. --- Resolution: Not A Problem > using SparkR build step wise regression model

[jira] [Commented] (SPARK-18713) using SparkR build step wise regression model (glm)

2016-12-05 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18713?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15722810#comment-15722810 ] Shivaram Venkataraman commented on SPARK-18713: --- I dont think we support step wise

[jira] [Commented] (SPARK-11215) Add multiple columns support to StringIndexer

2016-12-05 Thread Barry Becker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15722696#comment-15722696 ] Barry Becker commented on SPARK-11215: -- This would be a good feature. It might be nice to add an

[jira] [Created] (SPARK-18719) Document spark.ui.showConsoleProgress

2016-12-05 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-18719: Summary: Document spark.ui.showConsoleProgress Key: SPARK-18719 URL: https://issues.apache.org/jira/browse/SPARK-18719 Project: Spark Issue Type:

[jira] [Commented] (SPARK-18648) spark-shell --jars option does not add jars to classpath on windows

2016-12-05 Thread Michel Lemay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15722546#comment-15722546 ] Michel Lemay commented on SPARK-18648: -- Doing the same in Spark 1.6.2 was working correctly. This

[jira] [Commented] (SPARK-18325) SparkR 2.1 QA: Check for new R APIs requiring example code

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15722493#comment-15722493 ] Apache Spark commented on SPARK-18325: -- User 'yanboliang' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18325) SparkR 2.1 QA: Check for new R APIs requiring example code

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18325: Assignee: Yanbo Liang (was: Apache Spark) > SparkR 2.1 QA: Check for new R APIs

[jira] [Assigned] (SPARK-18325) SparkR 2.1 QA: Check for new R APIs requiring example code

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18325: Assignee: Apache Spark (was: Yanbo Liang) > SparkR 2.1 QA: Check for new R APIs

[jira] [Comment Edited] (SPARK-18085) Better History Server scalability for many / large applications

2016-12-05 Thread Dmitry Buzolin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15722425#comment-15722425 ] Dmitry Buzolin edited comment on SPARK-18085 at 12/5/16 2:45 PM: - I would

[jira] [Commented] (SPARK-18085) Better History Server scalability for many / large applications

2016-12-05 Thread Dmitry Buzolin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15722425#comment-15722425 ] Dmitry Buzolin commented on SPARK-18085: I would like add my observations after working with SHS:

[jira] [Assigned] (SPARK-18325) SparkR 2.1 QA: Check for new R APIs requiring example code

2016-12-05 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang reassigned SPARK-18325: --- Assignee: Yanbo Liang > SparkR 2.1 QA: Check for new R APIs requiring example code >

[jira] [Assigned] (SPARK-18718) Skip some test failures due to path length limitation and fix tests to pass on Windows

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18718: Assignee: Apache Spark > Skip some test failures due to path length limitation and fix

[jira] [Assigned] (SPARK-18718) Skip some test failures due to path length limitation and fix tests to pass on Windows

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18718: Assignee: (was: Apache Spark) > Skip some test failures due to path length limitation

[jira] [Commented] (SPARK-18718) Skip some test failures due to path length limitation and fix tests to pass on Windows

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15722363#comment-15722363 ] Apache Spark commented on SPARK-18718: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-18512) FileNotFoundException on _temporary directory with Spark Streaming 2.0.1 and S3A

2016-12-05 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15722354#comment-15722354 ] Steve Loughran commented on SPARK-18512: of course, if you do switch to EMRFS, you should get

[jira] [Updated] (SPARK-18718) Skip some test failures due to path length limitation and fix tests to pass on Windows

2016-12-05 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-18718: - Summary: Skip some test failures due to path length limitation and fix tests to pass on Windows

[jira] [Updated] (SPARK-18718) Skip some test failures due th path length limitation and fix tests to pass on Windows

2016-12-05 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-18718: - Description: There are some tests failed on Windows due to the wrong format of path and the

[jira] [Created] (SPARK-18718) Skip some test failures due th path length limitation and fix tests to pass on Windows

2016-12-05 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-18718: Summary: Skip some test failures due th path length limitation and fix tests to pass on Windows Key: SPARK-18718 URL: https://issues.apache.org/jira/browse/SPARK-18718

[jira] [Commented] (SPARK-18512) FileNotFoundException on _temporary directory with Spark Streaming 2.0.1 and S3A

2016-12-05 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15722320#comment-15722320 ] Steve Loughran commented on SPARK-18512: Actually, this is the problem which MAPREDUCE-6478 deals

[jira] [Commented] (SPARK-18512) FileNotFoundException on _temporary directory with Spark Streaming 2.0.1 and S3A

2016-12-05 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15722304#comment-15722304 ] Steve Loughran commented on SPARK-18512: no. What you are seeing is an eventual consistency

[jira] [Commented] (SPARK-18091) Deep if expressions cause Generated SpecificUnsafeProjection code to exceed JVM code size limit

2016-12-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15722292#comment-15722292 ] Apache Spark commented on SPARK-18091: -- User 'kapilsingh5050' has created a pull request for this

[jira] [Updated] (SPARK-18512) FileNotFoundException on _temporary directory with Spark Streaming 2.0.1 and S3A

2016-12-05 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran updated SPARK-18512: --- Environment: AWS EMR 5.0.1 Spark 2.0.1 S3 EU-West-1 (S3A) was: AWS EMR 5.0.1 Spark 2.0.1

[jira] [Created] (SPARK-18717) Datasets - crash (compile exception) when mapping to immutable scala map

2016-12-05 Thread Damian Momot (JIRA)
Damian Momot created SPARK-18717: Summary: Datasets - crash (compile exception) when mapping to immutable scala map Key: SPARK-18717 URL: https://issues.apache.org/jira/browse/SPARK-18717 Project:

[jira] [Commented] (SPARK-650) Add a "setup hook" API for running initialization code on each executor

2016-12-05 Thread Michael Schmeißer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15722178#comment-15722178 ] Michael Schmeißer commented on SPARK-650: - Thanks [~robert.neumann]! I am ready to help, if I can.

[jira] [Commented] (SPARK-650) Add a "setup hook" API for running initialization code on each executor

2016-12-05 Thread Michael Schmeißer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15722170#comment-15722170 ] Michael Schmeißer commented on SPARK-650: - A singleton is not really feasible if additional
