[jira] [Updated] (SPARK-18218) Optimize BlockMatrix multiplication, which may cause OOM and low parallelism usage problem in several cases

2016-11-03 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-18218: Shepherd: Yanbo Liang > Optimize BlockMatrix multiplication, which may cause OOM and low

[jira] [Commented] (SPARK-18193) queueStream not updated if rddQueue.add after create queueStream in Java

2016-11-03 Thread Hubert Kang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15635294#comment-15635294 ] Hubert Kang commented on SPARK-18193: - Is it possible to do that in opposite way, which means update

[jira] [Commented] (SPARK-18225) job will miss when driver removed by master in spark streaming

2016-11-03 Thread liujianhui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15635261#comment-15635261 ] liujianhui commented on SPARK-18225: we provide a platform for user to submit their streaming app, at

[jira] [Resolved] (SPARK-18259) QueryExecution should not catch Throwable

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18259. - Resolution: Fixed Fix Version/s: 2.1.0 > QueryExecution should not catch Throwable >

[jira] [Commented] (SPARK-18225) job will miss when driver removed by master in spark streaming

2016-11-03 Thread liujianhui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15635252#comment-15635252 ] liujianhui commented on SPARK-18225: it still doCheckpoint even killed by UI because the

[jira] [Commented] (SPARK-17348) Incorrect results from subquery transformation

2016-11-03 Thread Nattavut Sutyanyong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15635204#comment-15635204 ] Nattavut Sutyanyong commented on SPARK-17348: - [~hvanhovell], would you please review my PR

[jira] [Assigned] (SPARK-18217) Disallow creating permanent views based on temporary views or UDFs

2016-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18217: Assignee: Xiao Li (was: Apache Spark) > Disallow creating permanent views based on

[jira] [Assigned] (SPARK-18217) Disallow creating permanent views based on temporary views or UDFs

2016-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18217: Assignee: Apache Spark (was: Xiao Li) > Disallow creating permanent views based on

[jira] [Commented] (SPARK-18217) Disallow creating permanent views based on temporary views or UDFs

2016-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15635202#comment-15635202 ] Apache Spark commented on SPARK-18217: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Commented] (SPARK-17348) Incorrect results from subquery transformation

2016-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15635199#comment-15635199 ] Apache Spark commented on SPARK-17348: -- User 'nsyca' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17348) Incorrect results from subquery transformation

2016-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17348: Assignee: (was: Apache Spark) > Incorrect results from subquery transformation >

[jira] [Assigned] (SPARK-17348) Incorrect results from subquery transformation

2016-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17348: Assignee: Apache Spark > Incorrect results from subquery transformation >

[jira] [Created] (SPARK-18262) JSON.org license is now CatX

2016-11-03 Thread Sean Busbey (JIRA)
Sean Busbey created SPARK-18262: --- Summary: JSON.org license is now CatX Key: SPARK-18262 URL: https://issues.apache.org/jira/browse/SPARK-18262 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-18261) Add statistics to MemorySink for joining

2016-11-03 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15635014#comment-15635014 ] Burak Yavuz commented on SPARK-18261: - Go for it! > Add statistics to MemorySink for joining >

[jira] [Commented] (SPARK-18261) Add statistics to MemorySink for joining

2016-11-03 Thread Liwei Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634979#comment-15634979 ] Liwei Lin commented on SPARK-18261: --- If no one's working on this, I'd like to take this > Add

[jira] [Updated] (SPARK-18185) Should fix INSERT OVERWRITE TABLE of Datasource tables with dynamic partitions

2016-11-03 Thread Eric Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Liang updated SPARK-18185: --- Description: As of current 2.1, INSERT OVERWRITE with dynamic partitions against a Datasource table

[jira] [Updated] (SPARK-18185) Should fix INSERT OVERWRITE TABLE of Datasource tables with dynamic partitions

2016-11-03 Thread Eric Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Liang updated SPARK-18185: --- Summary: Should fix INSERT OVERWRITE TABLE of Datasource tables with dynamic partitions (was:

[jira] [Commented] (SPARK-18101) ExternalCatalogSuite should test with mixed case fields

2016-11-03 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634816#comment-15634816 ] Wenchen Fan commented on SPARK-18101: - Hi [~ekhliang] , can this newly added

[jira] [Commented] (SPARK-17337) Incomplete algorithm for name resolution in Catalyst parser may lead to incorrect result

2016-11-03 Thread Nattavut Sutyanyong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634741#comment-15634741 ] Nattavut Sutyanyong commented on SPARK-17337: - As commented in the PR, the code was nicely

[jira] [Updated] (SPARK-18260) from_json can throw a better exception when it can't find the column or be nullSafe

2016-11-03 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-18260: - Component/s: SQL > from_json can throw a better exception when it can't find the column or be >

[jira] [Created] (SPARK-18261) Add statistics to MemorySink for joining

2016-11-03 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-18261: --- Summary: Add statistics to MemorySink for joining Key: SPARK-18261 URL: https://issues.apache.org/jira/browse/SPARK-18261 Project: Spark Issue Type: New

[jira] [Updated] (SPARK-18260) from_json can throw a better exception when it can't find the column or be nullSafe

2016-11-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-18260: - Target Version/s: 2.1.0 Priority: Blocker (was: Major) > from_json can

[jira] [Resolved] (SPARK-18138) More officially deprecate support for Python 2.6, Java 7, and Scala 2.10

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18138. - Resolution: Fixed Assignee: Sean Owen Fix Version/s: 2.1.0 > More officially

[jira] [Updated] (SPARK-18138) More officially deprecate support for Python 2.6, Java 7, and Scala 2.10

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18138: Target Version/s: 2.1.0 (was: 2.2.0) > More officially deprecate support for Python 2.6, Java 7,

[jira] [Assigned] (SPARK-18235) ml.ALSModel function parity: ALSModel should support recommendforAll

2016-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18235: Assignee: Apache Spark > ml.ALSModel function parity: ALSModel should support

[jira] [Commented] (SPARK-18260) from_json can throw a better exception when it can't find the column or be nullSafe

2016-11-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634766#comment-15634766 ] Michael Armbrust commented on SPARK-18260: -- We should return null if the input is null. >

[jira] [Assigned] (SPARK-18235) ml.ALSModel function parity: ALSModel should support recommendforAll

2016-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18235: Assignee: (was: Apache Spark) > ml.ALSModel function parity: ALSModel should support

[jira] [Updated] (SPARK-18138) More officially deprecate support for Python 2.6, Java 7, and Scala 2.10

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18138: Summary: More officially deprecate support for Python 2.6, Java 7, and Scala 2.10 (was: Remove

[jira] [Commented] (SPARK-18235) ml.ALSModel function parity: ALSModel should support recommendforAll

2016-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634753#comment-15634753 ] Apache Spark commented on SPARK-18235: -- User 'hhbyyh' has created a pull request for this issue:

[jira] [Updated] (SPARK-14657) RFormula output wrong features when formula w/o intercept

2016-11-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14657: -- Target Version/s: 2.2.0 (was: 2.1.0) > RFormula output wrong features when formula

[jira] [Commented] (SPARK-17337) Incomplete algorithm for name resolution in Catalyst parser may lead to incorrect result

2016-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634694#comment-15634694 ] Apache Spark commented on SPARK-17337: -- User 'hvanhovell' has created a pull request for this issue:

[jira] [Resolved] (SPARK-12488) LDA describeTopics() Generates Invalid Term IDs

2016-11-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-12488. --- Resolution: Fixed Assignee: Xiangrui Meng Fix Version/s:

[jira] [Created] (SPARK-18260) from_json can throw a better exception when it can't find the column or be nullSafe

2016-11-03 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-18260: --- Summary: from_json can throw a better exception when it can't find the column or be nullSafe Key: SPARK-18260 URL: https://issues.apache.org/jira/browse/SPARK-18260

[jira] [Commented] (SPARK-12488) LDA describeTopics() Generates Invalid Term IDs

2016-11-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634683#comment-15634683 ] Joseph K. Bradley commented on SPARK-12488: --- I'm going to close this since it seems like it has

[jira] [Resolved] (SPARK-18254) UDFs don't see aliased column names

2016-11-03 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-18254. Resolution: Fixed Assignee: Eyal Farago (was: Davies Liu) > UDFs don't see aliased column

[jira] [Commented] (SPARK-18086) Regression: Hive variables no longer work in Spark 2.0

2016-11-03 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634605#comment-15634605 ] Ryan Blue commented on SPARK-18086: --- Yeah, I'll update the PR. > Regression: Hive variables no longer

[jira] [Assigned] (SPARK-18259) QueryExecution should not catch Throwable

2016-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18259: Assignee: Apache Spark (was: Herman van Hovell) > QueryExecution should not catch

[jira] [Assigned] (SPARK-18259) QueryExecution should not catch Throwable

2016-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18259: Assignee: Herman van Hovell (was: Apache Spark) > QueryExecution should not catch

[jira] [Commented] (SPARK-18259) QueryExecution should not catch Throwable

2016-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634573#comment-15634573 ] Apache Spark commented on SPARK-18259: -- User 'hvanhovell' has created a pull request for this issue:

[jira] [Commented] (SPARK-15581) MLlib 2.1 Roadmap

2016-11-03 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634568#comment-15634568 ] holdenk commented on SPARK-15581: - This sounds like really good suggestions - I think some of the biggest

[jira] [Resolved] (SPARK-18257) Improve error reporting for FileStressSuite in streaming

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18257. - Resolution: Fixed Fix Version/s: 2.1.0 > Improve error reporting for FileStressSuite in

[jira] [Commented] (SPARK-18254) UDFs don't see aliased column names

2016-11-03 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634455#comment-15634455 ] Nicholas Chammas commented on SPARK-18254: -- So it was specifically some broken

[jira] [Commented] (SPARK-18254) UDFs don't see aliased column names

2016-11-03 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634432#comment-15634432 ] Herman van Hovell commented on SPARK-18254: --- We 'accidentally' fixed this yesterday with

[jira] [Comment Edited] (SPARK-18254) UDFs don't see aliased column names

2016-11-03 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634391#comment-15634391 ] Nicholas Chammas edited comment on SPARK-18254 at 11/3/16 9:58 PM: --- If

[jira] [Commented] (SPARK-18254) UDFs don't see aliased column names

2016-11-03 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634428#comment-15634428 ] Nicholas Chammas commented on SPARK-18254: -- Just tried it. Seems like the fix is only available

[jira] [Created] (SPARK-18259) QueryExecution should not catch Throwable

2016-11-03 Thread Herman van Hovell (JIRA)
Herman van Hovell created SPARK-18259: - Summary: QueryExecution should not catch Throwable Key: SPARK-18259 URL: https://issues.apache.org/jira/browse/SPARK-18259 Project: Spark Issue

[jira] [Commented] (SPARK-18254) UDFs don't see aliased column names

2016-11-03 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634416#comment-15634416 ] Davies Liu commented on SPARK-18254: Could you also try 2.0.2? > UDFs don't see aliased column names

[jira] [Commented] (SPARK-18254) UDFs don't see aliased column names

2016-11-03 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634391#comment-15634391 ] Nicholas Chammas commented on SPARK-18254: -- If I try branch-2.1 on

[jira] [Comment Edited] (SPARK-18254) UDFs don't see aliased column names

2016-11-03 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634391#comment-15634391 ] Nicholas Chammas edited comment on SPARK-18254 at 11/3/16 9:46 PM: --- If

[jira] [Updated] (SPARK-18212) Flaky test: org.apache.spark.sql.kafka010.KafkaSourceSuite.assign from specific offsets

2016-11-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-18212: - Assignee: Cody Koeninger > Flaky test:

[jira] [Resolved] (SPARK-18212) Flaky test: org.apache.spark.sql.kafka010.KafkaSourceSuite.assign from specific offsets

2016-11-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-18212. -- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 15737

[jira] [Commented] (SPARK-15581) MLlib 2.1 Roadmap

2016-11-03 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634354#comment-15634354 ] Seth Hendrickson commented on SPARK-15581: -- I think the points you mention are very important to

[jira] [Comment Edited] (SPARK-15581) MLlib 2.1 Roadmap

2016-11-03 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634354#comment-15634354 ] Seth Hendrickson edited comment on SPARK-15581 at 11/3/16 9:28 PM: --- I

[jira] [Commented] (SPARK-15798) Secondary sort in Dataset/DataFrame

2016-11-03 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634352#comment-15634352 ] koert kuipers commented on SPARK-15798: --- looking at the code for Window operators it seems to me

[jira] [Commented] (SPARK-9487) Use the same num. worker threads in Scala/Python unit tests

2016-11-03 Thread Saikat Kanjilal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634342#comment-15634342 ] Saikat Kanjilal commented on SPARK-9487: added local[4] to repl, sparksql, streaming, all tests

[jira] [Commented] (SPARK-18254) UDFs don't see aliased column names

2016-11-03 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634333#comment-15634333 ] Davies Liu commented on SPARK-18254: I tried the following in master (2.1), it works {code}

[jira] [Resolved] (SPARK-18099) Spark distributed cache should throw exception if same file is specified to dropped in --files --archives

2016-11-03 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-18099. --- Resolution: Fixed Assignee: Kishor Patil Fix Version/s: 2.2.0

[jira] [Commented] (SPARK-18086) Regression: Hive variables no longer work in Spark 2.0

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634172#comment-15634172 ] Reynold Xin commented on SPARK-18086: - [~rdblue] Does my explanation make sense? Can you change the

[jira] [Commented] (SPARK-18230) MatrixFactorizationModel.recommendProducts throws NoSuchElement exception when the user does not exist

2016-11-03 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634257#comment-15634257 ] yuhao yang commented on SPARK-18230: Sorry, I got a little confused between the different recommend

[jira] [Commented] (SPARK-18254) UDFs don't see aliased column names

2016-11-03 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634228#comment-15634228 ] Davies Liu commented on SPARK-18254: I doubt it's a bug in ExtractPythonUDFs, not operator push down,

[jira] [Commented] (SPARK-18210) Pipeline.copy does not create an instance with the same UID

2016-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634240#comment-15634240 ] Apache Spark commented on SPARK-18210: -- User 'wojtek-szymanski' has created a pull request for this

[jira] [Assigned] (SPARK-18210) Pipeline.copy does not create an instance with the same UID

2016-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18210: Assignee: (was: Apache Spark) > Pipeline.copy does not create an instance with the

[jira] [Assigned] (SPARK-18210) Pipeline.copy does not create an instance with the same UID

2016-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18210: Assignee: Apache Spark > Pipeline.copy does not create an instance with the same UID >

[jira] [Updated] (SPARK-18258) Sinks need access to offset representation

2016-11-03 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cody Koeninger updated SPARK-18258: --- Description: Transactional "exactly-once" semantics for output require storing an offset

[jira] [Created] (SPARK-18258) Sinks need access to offset representation

2016-11-03 Thread Cody Koeninger (JIRA)
Cody Koeninger created SPARK-18258: -- Summary: Sinks need access to offset representation Key: SPARK-18258 URL: https://issues.apache.org/jira/browse/SPARK-18258 Project: Spark Issue Type:

[jira] [Commented] (SPARK-18238) WARN Executor: 1 block locks were not released by TID

2016-11-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634156#comment-15634156 ] Sean Owen commented on SPARK-18238: --- Can you say any more about how you make this occur? > WARN

[jira] [Comment Edited] (SPARK-14222) Cross-publish jackson-module-scala for Scala 2.12

2016-11-03 Thread Jakob Odersky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634143#comment-15634143 ] Jakob Odersky edited comment on SPARK-14222 at 11/3/16 8:33 PM: Thanks

[jira] [Commented] (SPARK-18193) queueStream not updated if rddQueue.add after create queueStream in Java

2016-11-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634149#comment-15634149 ] Sean Owen commented on SPARK-18193: --- Oh I see, it's the opposite. The QueueStream example should be

[jira] [Comment Edited] (SPARK-14222) Cross-publish jackson-module-scala for Scala 2.12

2016-11-03 Thread Jakob Odersky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634143#comment-15634143 ] Jakob Odersky edited comment on SPARK-14222 at 11/3/16 8:30 PM: Thanks

[jira] [Commented] (SPARK-14222) Cross-publish jackson-module-scala for Scala 2.12

2016-11-03 Thread Jakob Odersky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634143#comment-15634143 ] Jakob Odersky commented on SPARK-14222: --- Thanks Sean, however I realized that the dependency is in

[jira] [Commented] (SPARK-14222) Cross-publish jackson-module-scala for Scala 2.12

2016-11-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634128#comment-15634128 ] Sean Owen commented on SPARK-14222: --- Probably. The limiting factor is often run-time compatibility with

[jira] [Updated] (SPARK-18254) UDFs don't see aliased column names

2016-11-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-18254: - Target Version/s: 2.1.0 > UDFs don't see aliased column names >

[jira] [Commented] (SPARK-18254) UDFs don't see aliased column names

2016-11-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634122#comment-15634122 ] Michael Armbrust commented on SPARK-18254: -- Is this yet another bug caused by the generic

[jira] [Commented] (SPARK-14222) Cross-publish jackson-module-scala for Scala 2.12

2016-11-03 Thread Jakob Odersky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634117#comment-15634117 ] Jakob Odersky commented on SPARK-14222: --- A newer version of module (version 2.8.4) is available

[jira] [Commented] (SPARK-15377) Enabling SASL Spark 1.6.1

2016-11-03 Thread Shridhar Ramachandran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634088#comment-15634088 ] Shridhar Ramachandran commented on SPARK-15377: --- It is likely that you haven't enabled

[jira] [Updated] (SPARK-17937) Clarify Kafka offset semantics for Structured Streaming

2016-11-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-17937: - Priority: Critical (was: Major) > Clarify Kafka offset semantics for Structured

[jira] [Updated] (SPARK-17937) Clarify Kafka offset semantics for Structured Streaming

2016-11-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-17937: - Issue Type: Improvement (was: Sub-task) Parent: (was: SPARK-15406) >

[jira] [Updated] (SPARK-17937) Clarify Kafka offset semantics for Structured Streaming

2016-11-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-17937: - Target Version/s: 2.1.0 > Clarify Kafka offset semantics for Structured Streaming >

[jira] [Commented] (SPARK-17937) Clarify Kafka offset semantics for Structured Streaming

2016-11-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634061#comment-15634061 ] Michael Armbrust commented on SPARK-17937: -- I'm going to pull this out from the parent JIRA as I

[jira] [Commented] (SPARK-14220) Build and test Spark against Scala 2.12

2016-11-03 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634050#comment-15634050 ] Josh Rosen commented on SPARK-14220: SPARK-14643 is likely to be the hardest task. > Build and test

[jira] [Commented] (SPARK-14220) Build and test Spark against Scala 2.12

2016-11-03 Thread Jakob Odersky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634027#comment-15634027 ] Jakob Odersky commented on SPARK-14220: --- at least most dependencies will probably make 2.12 builds

[jira] [Comment Edited] (SPARK-14220) Build and test Spark against Scala 2.12

2016-11-03 Thread Jakob Odersky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15634027#comment-15634027 ] Jakob Odersky edited comment on SPARK-14220 at 11/3/16 7:54 PM: At least

[jira] [Commented] (SPARK-11914) [SQL] Support coalesce and repartition in Dataset APIs

2016-11-03 Thread Ivan Gozali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15633984#comment-15633984 ] Ivan Gozali commented on SPARK-11914: - Hi, apologies for bringing this up in an old issue. I was

[jira] [Assigned] (SPARK-18257) Improve error reporting for FileStressSuite in streaming

2016-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18257: Assignee: Reynold Xin (was: Apache Spark) > Improve error reporting for FileStressSuite

[jira] [Commented] (SPARK-18257) Improve error reporting for FileStressSuite in streaming

2016-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15633931#comment-15633931 ] Apache Spark commented on SPARK-18257: -- User 'rxin' has created a pull request for this issue:

[jira] [Commented] (SPARK-15799) Release SparkR on CRAN

2016-11-03 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15633932#comment-15633932 ] Shivaram Venkataraman commented on SPARK-15799: --- Yes - I think this is good to go. The only

[jira] [Assigned] (SPARK-18257) Improve error reporting for FileStressSuite in streaming

2016-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18257: Assignee: Apache Spark (was: Reynold Xin) > Improve error reporting for FileStressSuite

[jira] [Commented] (SPARK-9487) Use the same num. worker threads in Scala/Python unit tests

2016-11-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15633923#comment-15633923 ] Sean Owen commented on SPARK-9487: -- Yes, keep going, why not? > Use the same num. worker threads in

[jira] [Commented] (SPARK-18230) MatrixFactorizationModel.recommendProducts throws NoSuchElement exception when the user does not exist

2016-11-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15633917#comment-15633917 ] Sean Owen commented on SPARK-18230: --- Agree, I can't see how you'd return anything in this case. A

[jira] [Created] (SPARK-18257) Improve error reporting for FileStressSuite in streaming

2016-11-03 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-18257: --- Summary: Improve error reporting for FileStressSuite in streaming Key: SPARK-18257 URL: https://issues.apache.org/jira/browse/SPARK-18257 Project: Spark Issue

[jira] [Commented] (SPARK-18256) Improve performance of event log replay in HistoryServer based on profiler results

2016-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15633888#comment-15633888 ] Apache Spark commented on SPARK-18256: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18256) Improve performance of event log replay in HistoryServer based on profiler results

2016-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18256: Assignee: Apache Spark (was: Josh Rosen) > Improve performance of event log replay in

[jira] [Assigned] (SPARK-18256) Improve performance of event log replay in HistoryServer based on profiler results

2016-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18256: Assignee: Josh Rosen (was: Apache Spark) > Improve performance of event log replay in

[jira] [Created] (SPARK-18256) Improve performance of event log replay in HistoryServer based on profiler results

2016-11-03 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-18256: -- Summary: Improve performance of event log replay in HistoryServer based on profiler results Key: SPARK-18256 URL: https://issues.apache.org/jira/browse/SPARK-18256

[jira] [Updated] (SPARK-18256) Improve performance of event log replay in HistoryServer based on profiler results

2016-11-03 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-18256: --- Issue Type: Improvement (was: Bug) > Improve performance of event log replay in HistoryServer based

[jira] [Updated] (SPARK-18237) hive.exec.stagingdir have no effect in spark2.0.1

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18237: Fix Version/s: (was: 2.0.3) > hive.exec.stagingdir have no effect in spark2.0.1 >

[jira] [Resolved] (SPARK-18237) hive.exec.stagingdir have no effect in spark2.0.1

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18237. - Resolution: Fixed Assignee: ClassNotFoundExp Fix Version/s: 2.1.0

[jira] [Resolved] (SPARK-18244) Rename partitionProviderIsHive -> tracksPartitionsInCatalog

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18244. - Resolution: Fixed Fix Version/s: 2.1.0 > Rename partitionProviderIsHive ->

[jira] [Updated] (SPARK-14220) Build and test Spark against Scala 2.12

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-14220: Target Version/s: (was: 2.2.0) > Build and test Spark against Scala 2.12 >

[jira] [Commented] (SPARK-14220) Build and test Spark against Scala 2.12

2016-11-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15633773#comment-15633773 ] Reynold Xin commented on SPARK-14220: - Yea in reality it's going to be really painful to upgrade. >
