[jira] [Commented] (SPARK-18877) Unable to read given csv data. Excepion: java.lang.IllegalArgumentException: requirement failed: Decimal precision 28 exceeds max precision 20

2016-12-15 Thread Navya Krishnappa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753728#comment-15753728 ] Navya Krishnappa commented on SPARK-18877: -- Precision and scale vary depending on the decimal

[jira] [Updated] (SPARK-18845) PageRank has incorrect initialization value that leads to slow convergence

2016-12-15 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur Dave updated SPARK-18845: --- Assignee: Andrew Ray > PageRank has incorrect initialization value that leads to slow convergence >

[jira] [Resolved] (SPARK-18845) PageRank has incorrect initialization value that leads to slow convergence

2016-12-15 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur Dave resolved SPARK-18845. Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 16271

[jira] [Commented] (SPARK-18877) Unable to read given csv data. Excepion: java.lang.IllegalArgumentException: requirement failed: Decimal precision 28 exceeds max precision 20

2016-12-15 Thread Navya Krishnappa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753716#comment-15753716 ] Navya Krishnappa commented on SPARK-18877: -- I'm reading through csvReader (.csv(sourceFile)) and

[jira] [Commented] (SPARK-18895) Fix resource-closing-related and path-related test failures in identified ones on Windows

2016-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753671#comment-15753671 ] Apache Spark commented on SPARK-18895: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-18895) Fix resource-closing-related and path-related test failures in identified ones on Windows

2016-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18895: Assignee: (was: Apache Spark) > Fix resource-closing-related and path-related test

[jira] [Assigned] (SPARK-18895) Fix resource-closing-related and path-related test failures in identified ones on Windows

2016-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18895: Assignee: Apache Spark > Fix resource-closing-related and path-related test failures in

[jira] [Created] (SPARK-18895) Fix resource-closing-related and path-related test failures in identified ones on Windows

2016-12-15 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-18895: Summary: Fix resource-closing-related and path-related test failures in identified ones on Windows Key: SPARK-18895 URL: https://issues.apache.org/jira/browse/SPARK-18895

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753588#comment-15753588 ] Shivaram Venkataraman commented on SPARK-18817: --- Just to check - Is your Spark installation

[jira] [Resolved] (SPARK-18892) Alias percentile_approx approx_percentile

2016-12-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18892. - Resolution: Fixed Fix Version/s: 2.2.0 2.1.1 > Alias percentile_approx

[jira] [Commented] (SPARK-18838) High latency of event processing for large jobs

2016-12-15 Thread Sital Kedia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753535#comment-15753535 ] Sital Kedia commented on SPARK-18838: - cc - [~kayousterhout] > High latency of event processing for

[jira] [Updated] (SPARK-18838) High latency of event processing for large jobs

2016-12-15 Thread Sital Kedia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sital Kedia updated SPARK-18838: Description: Currently we are observing the issue of very high event processing delay in driver's

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753462#comment-15753462 ] Felix Cheung commented on SPARK-18817: -- I ran more of this but wasn't seeinng derby.log or

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753414#comment-15753414 ] Felix Cheung commented on SPARK-18817: -- It looks like javax.jdo.option.ConnectionURL can also be set

[jira] [Commented] (SPARK-18893) Not support "alter table .. add columns .."

2016-12-15 Thread lichenglin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753304#comment-15753304 ] lichenglin commented on SPARK-18893: spark 2.0 has disable "alter table".

[jira] [Assigned] (SPARK-18894) Event time watermark delay threshold specified in months or years gives incorrect results

2016-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18894: Assignee: Apache Spark (was: Tathagata Das) > Event time watermark delay threshold

[jira] [Commented] (SPARK-18894) Event time watermark delay threshold specified in months or years gives incorrect results

2016-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753298#comment-15753298 ] Apache Spark commented on SPARK-18894: -- User 'tdas' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18894) Event time watermark delay threshold specified in months or years gives incorrect results

2016-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18894: Assignee: Tathagata Das (was: Apache Spark) > Event time watermark delay threshold

[jira] [Updated] (SPARK-18894) Event time watermark delay threshold specified in months or years gives incorrect results

2016-12-15 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-18894: -- Priority: Critical (was: Major) > Event time watermark delay threshold specified in months or

[jira] [Updated] (SPARK-18894) Event time watermark delay threshold specified in months or years gives incorrect results

2016-12-15 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-18894: -- Affects Version/s: 2.1.0 Target Version/s: 2.1.0 > Event time watermark delay threshold

[jira] [Created] (SPARK-18894) Event time watermark delay threshold specified in months or years gives incorrect results

2016-12-15 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-18894: - Summary: Event time watermark delay threshold specified in months or years gives incorrect results Key: SPARK-18894 URL: https://issues.apache.org/jira/browse/SPARK-18894

[jira] [Commented] (SPARK-18272) Test topic addition for subscribePattern on Kafka DStream and Structured Stream

2016-12-15 Thread Bravo Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753289#comment-15753289 ] Bravo Zhang commented on SPARK-18272: - Does "subscribing topic by pattern with topic deletions" in

[jira] [Created] (SPARK-18893) Not support "alter table .. add columns .."

2016-12-15 Thread zuotingbing (JIRA)
zuotingbing created SPARK-18893: --- Summary: Not support "alter table .. add columns .." Key: SPARK-18893 URL: https://issues.apache.org/jira/browse/SPARK-18893 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753227#comment-15753227 ] Wenchen Fan commented on SPARK-18817: - the warehouse path will be created no matter hive support is

[jira] [Comment Edited] (SPARK-14130) [Table related commands] Alter column

2016-12-15 Thread lichenglin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753146#comment-15753146 ] lichenglin edited comment on SPARK-14130 at 12/16/16 2:00 AM: --

[jira] [Issue Comment Deleted] (SPARK-14130) [Table related commands] Alter column

2016-12-15 Thread lichenglin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lichenglin updated SPARK-14130: --- Comment: was deleted (was: "TOK_ALTERTABLE_ADDCOLS" is a very important command for data warehouse.

[jira] [Issue Comment Deleted] (SPARK-14130) [Table related commands] Alter column

2016-12-15 Thread lichenglin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lichenglin updated SPARK-14130: --- Comment: was deleted (was: "TOK_ALTERTABLE_ADDCOLS" is a very important command for data warehouse.

[jira] [Commented] (SPARK-14130) [Table related commands] Alter column

2016-12-15 Thread lichenglin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753147#comment-15753147 ] lichenglin commented on SPARK-14130: "TOK_ALTERTABLE_ADDCOLS" is a very important command for data

[jira] [Commented] (SPARK-14130) [Table related commands] Alter column

2016-12-15 Thread lichenglin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753148#comment-15753148 ] lichenglin commented on SPARK-14130: "TOK_ALTERTABLE_ADDCOLS" is a very important command for data

[jira] [Commented] (SPARK-14130) [Table related commands] Alter column

2016-12-15 Thread lichenglin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753146#comment-15753146 ] lichenglin commented on SPARK-14130: "TOK_ALTERTABLE_ADDCOLS" is a very important command for data

[jira] [Closed] (SPARK-18855) Add RDD flatten function

2016-12-15 Thread Linbo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Linbo closed SPARK-18855. - Resolution: Unresolved > Add RDD flatten function > > > Key:

[jira] [Commented] (SPARK-18855) Add RDD flatten function

2016-12-15 Thread Linbo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753120#comment-15753120 ] Linbo commented on SPARK-18855: --- Tried several ways, the more "Spark" way is trying to create

[jira] [Comment Edited] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Brendan Dwyer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753093#comment-15753093 ] Brendan Dwyer edited comment on SPARK-18817 at 12/16/16 1:45 AM: - I'm not

[jira] [Comment Edited] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Brendan Dwyer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753093#comment-15753093 ] Brendan Dwyer edited comment on SPARK-18817 at 12/16/16 1:30 AM: - I'm not

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Brendan Dwyer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753095#comment-15753095 ] Brendan Dwyer commented on SPARK-18817: --- {code} library("SparkR") sparkR.session() df <-

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Brendan Dwyer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753093#comment-15753093 ] Brendan Dwyer commented on SPARK-18817: --- I'm not sure the CRAN people would be okay with that. >

[jira] [Commented] (SPARK-18862) Split SparkR mllib.R into multiple files

2016-12-15 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753089#comment-15753089 ] Yanbo Liang commented on SPARK-18862: - Great! Will send PR soon. > Split SparkR mllib.R into

[jira] [Assigned] (SPARK-17807) Scalatest listed as compile dependency in spark-tags

2016-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17807: Assignee: Apache Spark > Scalatest listed as compile dependency in spark-tags >

[jira] [Assigned] (SPARK-17807) Scalatest listed as compile dependency in spark-tags

2016-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17807: Assignee: (was: Apache Spark) > Scalatest listed as compile dependency in spark-tags

[jira] [Commented] (SPARK-17807) Scalatest listed as compile dependency in spark-tags

2016-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753085#comment-15753085 ] Apache Spark commented on SPARK-17807: -- User 'vanzin' has created a pull request for this issue:

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753064#comment-15753064 ] Felix Cheung commented on SPARK-18817: -- Actually, I'm not seeing derby.log or metastore_db in the

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753048#comment-15753048 ] Felix Cheung commented on SPARK-18817: -- Tested this just now, I still see spark-warehouse when

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753040#comment-15753040 ] Felix Cheung commented on SPARK-18817: -- we could, but we did ship 2.0 with it enabled by default

[jira] [Issue Comment Deleted] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18817: - Comment: was deleted (was: we could, but we did ship 2.0 with it enabled by default though.

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753041#comment-15753041 ] Felix Cheung commented on SPARK-18817: -- we could, but we did ship 2.0 with it enabled by default

[jira] [Comment Edited] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753041#comment-15753041 ] Felix Cheung edited comment on SPARK-18817 at 12/16/16 1:03 AM: we could,

[jira] [Commented] (SPARK-18862) Split SparkR mllib.R into multiple files

2016-12-15 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753032#comment-15753032 ] Felix Cheung commented on SPARK-18862: -- FYI I reorg the vignettes based on what's discussed here.

[jira] [Commented] (SPARK-18849) Vignettes final checks for Spark 2.1

2016-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753027#comment-15753027 ] Apache Spark commented on SPARK-18849: -- User 'felixcheung' has created a pull request for this

[jira] [Assigned] (SPARK-18892) Alias percentile_approx approx_percentile

2016-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18892: Assignee: Reynold Xin (was: Apache Spark) > Alias percentile_approx approx_percentile >

[jira] [Commented] (SPARK-18892) Alias percentile_approx approx_percentile

2016-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753011#comment-15753011 ] Apache Spark commented on SPARK-18892: -- User 'rxin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18892) Alias percentile_approx approx_percentile

2016-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18892: Assignee: Apache Spark (was: Reynold Xin) > Alias percentile_approx approx_percentile >

[jira] [Created] (SPARK-18892) Alias percentile_approx approx_percentile

2016-12-15 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-18892: --- Summary: Alias percentile_approx approx_percentile Key: SPARK-18892 URL: https://issues.apache.org/jira/browse/SPARK-18892 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-17807) Scalatest listed as compile dependency in spark-tags

2016-12-15 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752997#comment-15752997 ] Marcelo Vanzin commented on SPARK-17807: Reopening since this is a real issue (the dependency

[jira] [Reopened] (SPARK-17807) Scalatest listed as compile dependency in spark-tags

2016-12-15 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reopened SPARK-17807: > Scalatest listed as compile dependency in spark-tags >

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752973#comment-15752973 ] Shivaram Venkataraman commented on SPARK-18817: --- In that case an easier fix might be to

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Brendan Dwyer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752951#comment-15752951 ] Brendan Dwyer commented on SPARK-18817: --- [~shivaram] it does not happen if I disable Hive. >

[jira] [Commented] (SPARK-5632) not able to resolve dot('.') in field name

2016-12-15 Thread William Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752948#comment-15752948 ] William Shen commented on SPARK-5632: - Thanks [~marmbrus]. I see that the backtick works in 1.5.0 as

[jira] [Commented] (SPARK-5632) not able to resolve dot('.') in field name

2016-12-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752929#comment-15752929 ] Michael Armbrust commented on SPARK-5632: - Hmm, I agree that error is confusing. It does work if

[jira] [Updated] (SPARK-18084) write.partitionBy() does not recognize nested columns that select() can access

2016-12-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-18084: - Target Version/s: 2.2.0 > write.partitionBy() does not recognize nested columns that

[jira] [Commented] (SPARK-18278) Support native submission of spark jobs to a kubernetes cluster

2016-12-15 Thread Andrew Ash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752907#comment-15752907 ] Andrew Ash commented on SPARK-18278: There are definitely challenges in building features that take

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752902#comment-15752902 ] Shivaram Venkataraman commented on SPARK-18817: --- [~bdwyer] Does this still happen if you

[jira] [Commented] (SPARK-5632) not able to resolve dot('.') in field name

2016-12-15 Thread William Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752896#comment-15752896 ] William Shen commented on SPARK-5632: - Thank you [~marmbrus] for the speedy response! However I ran

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752898#comment-15752898 ] Shivaram Venkataraman commented on SPARK-18817: --- Yeah I dont know how to avoid creating

[jira] [Resolved] (SPARK-18868) Flaky Test: StreamingQueryListenerSuite

2016-12-15 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-18868. -- Resolution: Fixed Assignee: Burak Yavuz Fix Version/s: 2.2.0

[jira] [Commented] (SPARK-18844) Add more binary classification metrics to BinaryClassificationMetrics

2016-12-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752864#comment-15752864 ] Joseph K. Bradley commented on SPARK-18844: --- Note: Please don't set the Target Version or Fix

[jira] [Updated] (SPARK-18844) Add more binary classification metrics to BinaryClassificationMetrics

2016-12-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18844: -- Target Version/s: (was: 2.0.3) > Add more binary classification metrics to

[jira] [Updated] (SPARK-18844) Add more binary classification metrics to BinaryClassificationMetrics

2016-12-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18844: -- Fix Version/s: (was: 2.0.2) > Add more binary classification metrics to

[jira] [Updated] (SPARK-18844) Add more binary classification metrics to BinaryClassificationMetrics

2016-12-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18844: -- Issue Type: New Feature (was: Improvement) > Add more binary classification metrics

[jira] [Created] (SPARK-18891) Support for specific collection types

2016-12-15 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-18891: Summary: Support for specific collection types Key: SPARK-18891 URL: https://issues.apache.org/jira/browse/SPARK-18891 Project: Spark Issue Type:

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Brendan Dwyer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752844#comment-15752844 ] Brendan Dwyer commented on SPARK-18817: --- I'm also seeing _derby.log_ and a folder named

[jira] [Updated] (SPARK-18890) Do all task serialization in CoarseGrainedExecutorBackend thread (rather than TaskSchedulerImpl)

2016-12-15 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout updated SPARK-18890: --- Description: As part of benchmarking this change:

[jira] [Created] (SPARK-18890) Do all task serialization in CoarseGrainedExecutorBackend thread (rather than TaskSchedulerImpl)

2016-12-15 Thread Kay Ousterhout (JIRA)
Kay Ousterhout created SPARK-18890: -- Summary: Do all task serialization in CoarseGrainedExecutorBackend thread (rather than TaskSchedulerImpl) Key: SPARK-18890 URL:

[jira] [Commented] (SPARK-5632) not able to resolve dot('.') in field name

2016-12-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752771#comment-15752771 ] Michael Armbrust commented on SPARK-5632: - If you expand the commit you'll see its included in

[jira] [Commented] (SPARK-8425) Add blacklist mechanism for task scheduling

2016-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752760#comment-15752760 ] Apache Spark commented on SPARK-8425: - User 'squito' has created a pull request for this issue:

[jira] [Commented] (SPARK-5632) not able to resolve dot('.') in field name

2016-12-15 Thread William Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752747#comment-15752747 ] William Shen commented on SPARK-5632: - Is this still targeted for 1.4.0 as indicated in JIRA (or was

[jira] [Updated] (SPARK-17931) taskScheduler has some unneeded serialization

2016-12-15 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout updated SPARK-17931: --- Component/s: (was: Spark Core) Scheduler > taskScheduler has some

[jira] [Updated] (SPARK-17931) taskScheduler has some unneeded serialization

2016-12-15 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout updated SPARK-17931: --- Description: In the existing code, there are three layers of serialization involved in

[jira] [Resolved] (SPARK-12777) Dataset fields can't be Scala tuples

2016-12-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-12777. -- Resolution: Fixed Fix Version/s: 2.1.0 This works in 2.1:

[jira] [Commented] (SPARK-18886) Delay scheduling should not delay some executors indefinitely if one task is scheduled before delay timeout

2016-12-15 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752722#comment-15752722 ] Imran Rashid commented on SPARK-18886: -- [~mridul] sorry if I am being slow here, but do you mind

[jira] [Resolved] (SPARK-18888) partitionBy in DataStreamWriter in Python throws _to_seq not defined

2016-12-15 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-1. --- Resolution: Fixed Assignee: Burak Yavuz Fix Version/s: 2.1.0 > partitionBy

[jira] [Updated] (SPARK-18850) Make StreamExecution and progress classes serializable

2016-12-15 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-18850: - Target Version/s: 2.1.0 > Make StreamExecution and progress classes serializable >

[jira] [Updated] (SPARK-18850) Make StreamExecution and progress classes serializable

2016-12-15 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-18850: - Affects Version/s: 2.1.0 > Make StreamExecution and progress classes serializable >

[jira] [Updated] (SPARK-18850) Make StreamExecution and progress classes serializable

2016-12-15 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-18850: - Summary: Make StreamExecution and progress classes serializable (was: Make StreamExecution

[jira] [Updated] (SPARK-18850) Make StreamExecution and progress classes serializable

2016-12-15 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-18850: - Description: Make StreamExecution and progress classes serializable because it is too easy for

[jira] [Closed] (SPARK-16178) SQL - Hive writer should not require partition names to match table partitions

2016-12-15 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun closed SPARK-16178. - Resolution: Won't Fix > SQL - Hive writer should not require partition names to match table

[jira] [Commented] (SPARK-16178) SQL - Hive writer should not require partition names to match table partitions

2016-12-15 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752636#comment-15752636 ] Dongjoon Hyun commented on SPARK-16178: --- Thank you! Then, I'll close this as Won't Fix. > SQL -

[jira] [Commented] (SPARK-16178) SQL - Hive writer should not require partition names to match table partitions

2016-12-15 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752632#comment-15752632 ] Ryan Blue commented on SPARK-16178: --- Sure. I think the result was Won't Fix. > SQL - Hive writer

[jira] [Comment Edited] (SPARK-18886) Delay scheduling should not delay some executors indefinitely if one task is scheduled before delay timeout

2016-12-15 Thread Mridul Muralidharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752579#comment-15752579 ] Mridul Muralidharan edited comment on SPARK-18886 at 12/15/16 9:35 PM:

[jira] [Commented] (SPARK-18886) Delay scheduling should not delay some executors indefinitely if one task is scheduled before delay timeout

2016-12-15 Thread Mridul Muralidharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752579#comment-15752579 ] Mridul Muralidharan commented on SPARK-18886: - [~imranr] For almost all cases, delay

[jira] [Commented] (SPARK-16178) SQL - Hive writer should not require partition names to match table partitions

2016-12-15 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752548#comment-15752548 ] Dongjoon Hyun commented on SPARK-16178: --- Hi, [~rdblue]. The PR seems to be closed. I'm wondering we

[jira] [Created] (SPARK-18889) Spark incorrectly reads default columns from a Hive view

2016-12-15 Thread Salil Surendran (JIRA)
Salil Surendran created SPARK-18889: --- Summary: Spark incorrectly reads default columns from a Hive view Key: SPARK-18889 URL: https://issues.apache.org/jira/browse/SPARK-18889 Project: Spark

[jira] [Resolved] (SPARK-18826) Make FileStream be able to start with most recent files

2016-12-15 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-18826. --- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 16251

[jira] [Resolved] (SPARK-17119) Add configuration property to allow the history server to delete .inprogress files

2016-12-15 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-17119. Resolution: Duplicate This was actually already implemented (without the need for a config

[jira] [Comment Edited] (SPARK-17493) Spark Job hangs while DataFrame writing to HDFS path with parquet mode

2016-12-15 Thread Anbu Cheeralan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752443#comment-15752443 ] Anbu Cheeralan edited comment on SPARK-17493 at 12/15/16 8:55 PM: --

[jira] [Comment Edited] (SPARK-17493) Spark Job hangs while DataFrame writing to HDFS path with parquet mode

2016-12-15 Thread Anbu Cheeralan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752443#comment-15752443 ] Anbu Cheeralan edited comment on SPARK-17493 at 12/15/16 8:54 PM: --

[jira] [Resolved] (SPARK-8425) Add blacklist mechanism for task scheduling

2016-12-15 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid resolved SPARK-8425. - Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 14079

[jira] [Updated] (SPARK-8425) Add blacklist mechanism for task scheduling

2016-12-15 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid updated SPARK-8425: Assignee: Mao, Wei (was: Imran Rashid) > Add blacklist mechanism for task scheduling >

[jira] [Commented] (SPARK-18886) Delay scheduling should not delay some executors indefinitely if one task is scheduled before delay timeout

2016-12-15 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752462#comment-15752462 ] Imran Rashid commented on SPARK-18886: -- [~mridulm80] good point, perhaps the right answer here is

[jira] [Updated] (SPARK-18823) Assignation by column name variable not available or bug?

2016-12-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18823: -- Fix Version/s: (was: 2.0.2) > Assignation by column name variable not available or

[jira] [Commented] (SPARK-18823) Assignation by column name variable not available or bug?

2016-12-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15752456#comment-15752456 ] Joseph K. Bradley commented on SPARK-18823: --- Note: Please don't set the Target Version or Fix

  1   2   >