[jira] [Updated] (SPARK-4672) Cut off the super long serialization chain in GraphX to avoid the StackOverflow error

2014-12-01 Thread Lijie Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lijie Xu updated SPARK-4672: Description: While running iterative algorithms in GraphX, a StackOverflow error will stably occur in the

[jira] [Updated] (SPARK-4672) Cut off the super long serialization chain in GraphX to avoid the StackOverflow error

2014-12-01 Thread Lijie Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lijie Xu updated SPARK-4672: Description: While running iterative algorithms in GraphX, a StackOverflow error will stably occur in the

[jira] [Updated] (SPARK-1112) When spark.akka.frameSize 10, task results bigger than 10MiB block execution

2014-12-01 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-1112: -- Fix Version/s: 1.1.0 1.0.3 When spark.akka.frameSize 10, task results bigger than

[jira] [Created] (SPARK-4674) Refactor getCallSite in Utils

2014-12-01 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-4674: -- Summary: Refactor getCallSite in Utils Key: SPARK-4674 URL: https://issues.apache.org/jira/browse/SPARK-4674 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-4674) Refactor getCallSite in Utils

2014-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14229529#comment-14229529 ] Apache Spark commented on SPARK-4674: - User 'viirya' has created a pull request for

[jira] [Updated] (SPARK-1112) When spark.akka.frameSize 10, task results bigger than 10MiB block execution

2014-12-01 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-1112: -- Fix Version/s: (was: 1.0.3) 1.0.1 When spark.akka.frameSize 10, task results

[jira] [Commented] (SPARK-1112) When spark.akka.frameSize 10, task results bigger than 10MiB block execution

2014-12-01 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14229531#comment-14229531 ] Josh Rosen commented on SPARK-1112: --- Looks like the Fix Versions accidentally got

[jira] [Resolved] (SPARK-4664) Overflow of `maxFrameSizeBytes`

2014-12-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-4664. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Shixiong Zhu Overflow of

[jira] [Resolved] (SPARK-2192) Examples Data Not in Binary Distribution

2014-12-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-2192. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3480

[jira] [Updated] (SPARK-2192) Examples Data Not in Binary Distribution

2014-12-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2192: - Assignee: Sean Owen Examples Data Not in Binary Distribution

[jira] [Updated] (SPARK-2192) Examples Data Not in Binary Distribution

2014-12-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2192: - Component/s: MLlib Examples Data Not in Binary Distribution

[jira] [Resolved] (SPARK-4661) Minor code and docs cleanup

2014-12-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-4661. Resolution: Fixed Fix Version/s: 1.2.0 Assignee: Shixiong Zhu Minor code and docs

[jira] [Commented] (SPARK-4671) Streaming block need not to replicate 2 copies when WAL is enabled

2014-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4671?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14229547#comment-14229547 ] Apache Spark commented on SPARK-4671: - User 'jerryshao' has created a pull request for

[jira] [Commented] (SPARK-3926) result of JavaRDD collectAsMap() is not serializable

2014-12-01 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14229553#comment-14229553 ] Shixiong Zhu commented on SPARK-3926: - {noformat} scala :javap -private -c

[jira] [Updated] (SPARK-4672) Cut off the super long serialization chain in GraphX to avoid the StackOverflow error

2014-12-01 Thread Lijie Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lijie Xu updated SPARK-4672: Description: While running iterative algorithms in GraphX, a StackOverflow error will stably occur in the

[jira] [Commented] (SPARK-3926) result of JavaRDD collectAsMap() is not serializable

2014-12-01 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14229599#comment-14229599 ] Shixiong Zhu commented on SPARK-3926: - {quote} To allow subtypes of non-serializable

[jira] [Issue Comment Deleted] (SPARK-3926) result of JavaRDD collectAsMap() is not serializable

2014-12-01 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-3926: Comment: was deleted (was: {noformat} scala :javap -private -c

[jira] [Created] (SPARK-4675) Find similar products and similar users in MatrixFactorizationModel

2014-12-01 Thread Steven Bourke (JIRA)
Steven Bourke created SPARK-4675: Summary: Find similar products and similar users in MatrixFactorizationModel Key: SPARK-4675 URL: https://issues.apache.org/jira/browse/SPARK-4675 Project: Spark

[jira] [Commented] (SPARK-4675) Find similar products and similar users in MatrixFactorizationModel

2014-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14229610#comment-14229610 ] Apache Spark commented on SPARK-4675: - User 'sbourke' has created a pull request for

[jira] [Issue Comment Deleted] (SPARK-3926) result of JavaRDD collectAsMap() is not serializable

2014-12-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-3926: - Comment: was deleted (was: [~zsxwing] The issue is about {{SerializableMapWrapper}}; you're analyzing

[jira] [Commented] (SPARK-3926) result of JavaRDD collectAsMap() is not serializable

2014-12-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14229628#comment-14229628 ] Sean Owen commented on SPARK-3926: -- I see, there may be even two issues here --

[jira] [Commented] (SPARK-4672) Cut off the super long serialization chain in GraphX to avoid the StackOverflow error

2014-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14229711#comment-14229711 ] Apache Spark commented on SPARK-4672: - User 'JerryLead' has created a pull request for

[jira] [Updated] (SPARK-4672) Cut off the super long serialization chain in GraphX to avoid the StackOverflow error

2014-12-01 Thread Lijie Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lijie Xu updated SPARK-4672: Description: While running iterative algorithms in GraphX, a StackOverflow error will stably occur in the

[jira] [Commented] (SPARK-4672) Cut off the super long serialization chain in GraphX to avoid the StackOverflow error

2014-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14229763#comment-14229763 ] Apache Spark commented on SPARK-4672: - User 'JerryLead' has created a pull request for

[jira] [Created] (SPARK-4676) JavaSchemaRDD.schema may throw NullType MatchError if sql has null

2014-12-01 Thread YanTang Zhai (JIRA)
YanTang Zhai created SPARK-4676: --- Summary: JavaSchemaRDD.schema may throw NullType MatchError if sql has null Key: SPARK-4676 URL: https://issues.apache.org/jira/browse/SPARK-4676 Project: Spark

[jira] [Commented] (SPARK-4676) JavaSchemaRDD.schema may throw NullType MatchError if sql has null

2014-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14229776#comment-14229776 ] Apache Spark commented on SPARK-4676: - User 'YanTangZhai' has created a pull request

[jira] [Created] (SPARK-4677) Add hadoop input time in task webui

2014-12-01 Thread YanTang Zhai (JIRA)
YanTang Zhai created SPARK-4677: --- Summary: Add hadoop input time in task webui Key: SPARK-4677 URL: https://issues.apache.org/jira/browse/SPARK-4677 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-4677) Add hadoop input time in task webui

2014-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14229826#comment-14229826 ] Apache Spark commented on SPARK-4677: - User 'YanTangZhai' has created a pull request

[jira] [Updated] (SPARK-4101) [MLLIB] Improve API in Word2Vec model

2014-12-01 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-4101: - Description: 1) Would be nice to be able to retrieve underlying model map, to be able to work

[jira] [Updated] (SPARK-4101) [MLLIB] Improve API in Word2Vec model

2014-12-01 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-4101: - Description: 1) Would be nice to be able to retrieve underlying model map, to be able to work

[jira] [Commented] (SPARK-4101) [MLLIB] Improve API in Word2Vec model

2014-12-01 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14229916#comment-14229916 ] Ilya Ganelin commented on SPARK-4101: - Hu Peter - did you have an algorithm in mind

[jira] [Comment Edited] (SPARK-4101) [MLLIB] Improve API in Word2Vec model

2014-12-01 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14229916#comment-14229916 ] Ilya Ganelin edited comment on SPARK-4101 at 12/1/14 3:48 PM: --

[jira] [Commented] (SPARK-4101) [MLLIB] Improve API in Word2Vec model

2014-12-01 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14229921#comment-14229921 ] Peter Rudenko commented on SPARK-4101: -- Here's an interactive example:

[jira] [Commented] (SPARK-4101) [MLLIB] Improve API in Word2Vec model

2014-12-01 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14229943#comment-14229943 ] Peter Rudenko commented on SPARK-4101: -- But i want to be able to extend it further

[jira] [Closed] (SPARK-4101) [MLLIB] Improve API in Word2Vec model

2014-12-01 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko closed SPARK-4101. Resolution: Fixed The main feature fixed by SPARK-4582, other functionality is not critical.

[jira] [Commented] (SPARK-3926) result of JavaRDD collectAsMap() is not serializable

2014-12-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230042#comment-14230042 ] Sean Owen commented on SPARK-3926: -- Yep, you're right, it's the constructor. I think

[jira] [Commented] (SPARK-4609) Job can not finish if there is one bad slave in clusters

2014-12-01 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230209#comment-14230209 ] Kay Ousterhout commented on SPARK-4609: --- I agree that (b) seems like a good approach

[jira] [Created] (SPARK-4678) A SQL query with subquery fails

2014-12-01 Thread Tsuyoshi OZAWA (JIRA)
Tsuyoshi OZAWA created SPARK-4678: - Summary: A SQL query with subquery fails Key: SPARK-4678 URL: https://issues.apache.org/jira/browse/SPARK-4678 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-4678) A SQL query with subquery fails with TreeNodeException

2014-12-01 Thread Tsuyoshi OZAWA (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4678?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tsuyoshi OZAWA updated SPARK-4678: -- Summary: A SQL query with subquery fails with TreeNodeException (was: A SQL query with

[jira] [Commented] (SPARK-4349) Spark driver hangs on sc.parallelize() if exception is thrown during serialization

2014-12-01 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230277#comment-14230277 ] Matt Cheah commented on SPARK-4349: --- I agree that's a better solution. I fell back to

[jira] [Commented] (SPARK-4298) The spark-submit cannot read Main-Class from Manifest.

2014-12-01 Thread Brennon York (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230367#comment-14230367 ] Brennon York commented on SPARK-4298: - I'll take this one. Can someone assign to me?

[jira] [Commented] (SPARK-4616) SPARK_CONF_DIR is not effective in spark-submit

2014-12-01 Thread Brennon York (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230368#comment-14230368 ] Brennon York commented on SPARK-4616: - I can finish this. Can someone assign this to

[jira] [Created] (SPARK-4679) Race condition in querying the Spark UI JSON endpoint when Jetty context handlers are added and removed

2014-12-01 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-4679: - Summary: Race condition in querying the Spark UI JSON endpoint when Jetty context handlers are added and removed Key: SPARK-4679 URL: https://issues.apache.org/jira/browse/SPARK-4679

[jira] [Updated] (SPARK-4679) Race condition in querying the Spark UI JSON endpoint when Jetty context handlers are added and removed

2014-12-01 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-4679: -- Description: We started seeing some strange behavior when we were querying the Spark UI JSON endpoint

[jira] [Updated] (SPARK-4679) Race condition in querying the Spark UI JSON endpoint when Jetty context handlers are added and removed

2014-12-01 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-4679: -- Description: We started seeing some strange behavior when we were querying the Spark UI JSON endpoint

[jira] [Commented] (SPARK-4349) Spark driver hangs on sc.parallelize() if exception is thrown during serialization

2014-12-01 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230379#comment-14230379 ] Patrick Wendell commented on SPARK-4349: Sounds good Matt. I think it's worth

[jira] [Commented] (SPARK-4349) Spark driver hangs on sc.parallelize() if exception is thrown during serialization

2014-12-01 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230384#comment-14230384 ] Matt Cheah commented on SPARK-4349: --- Priorities shifted since I wrote that comment and I

[jira] [Commented] (SPARK-4644) Implement skewed join

2014-12-01 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230390#comment-14230390 ] Patrick Wendell commented on SPARK-4644: I would push back a bit on what you said

[jira] [Created] (SPARK-4680) Add support for no-op compression

2014-12-01 Thread Victor Tso (JIRA)
Victor Tso created SPARK-4680: - Summary: Add support for no-op compression Key: SPARK-4680 URL: https://issues.apache.org/jira/browse/SPARK-4680 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-4680) Add support for no-op compression

2014-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230405#comment-14230405 ] Apache Spark commented on SPARK-4680: - User 'roxchkplusony' has created a pull request

[jira] [Comment Edited] (SPARK-4644) Implement skewed join

2014-12-01 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230390#comment-14230390 ] Patrick Wendell edited comment on SPARK-4644 at 12/1/14 8:37 PM:

[jira] [Commented] (SPARK-4609) Job can not finish if there is one bad slave in clusters

2014-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230456#comment-14230456 ] Apache Spark commented on SPARK-4609: - User 'davies' has created a pull request for

[jira] [Resolved] (SPARK-4258) NPE with new Parquet Filters

2014-12-01 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4258. - Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3440

[jira] [Reopened] (SPARK-4258) NPE with new Parquet Filters

2014-12-01 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust reopened SPARK-4258: - NPE with new Parquet Filters Key: SPARK-4258

[jira] [Issue Comment Deleted] (SPARK-4258) NPE with new Parquet Filters

2014-12-01 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4258: Comment: was deleted (was: Issue resolved by pull request 3440

[jira] [Resolved] (SPARK-4358) Parsing NumericLit with more specified types

2014-12-01 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4358. - Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3208

[jira] [Resolved] (SPARK-4650) Supporting multi column support in countDistinct function like count(distinct c1,c2..) in Spark SQL

2014-12-01 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4650. - Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3511

[jira] [Resolved] (SPARK-4658) Code documentation issue in DDL of datasource

2014-12-01 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4658. - Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3516

[jira] [Created] (SPARK-4681) Turn on host level blacklisting by default

2014-12-01 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-4681: -- Summary: Turn on host level blacklisting by default Key: SPARK-4681 URL: https://issues.apache.org/jira/browse/SPARK-4681 Project: Spark Issue Type:

[jira] [Updated] (SPARK-4679) Race condition in querying the Spark UI JSON endpoint when Jetty context handlers are added and removed

2014-12-01 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4679: --- Component/s: Web UI Race condition in querying the Spark UI JSON endpoint when Jetty context

[jira] [Updated] (SPARK-4679) Race condition in querying the Spark UI JSON endpoint when Jetty context handlers are added and removed

2014-12-01 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4679: --- Labels: (was: UI) Race condition in querying the Spark UI JSON endpoint when Jetty context

[jira] [Created] (SPARK-4682) Consolidate various 'Clock' classes

2014-12-01 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-4682: - Summary: Consolidate various 'Clock' classes Key: SPARK-4682 URL: https://issues.apache.org/jira/browse/SPARK-4682 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-4189) FileSegmentManagedBuffer should have a configurable memory map threshold

2014-12-01 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230579#comment-14230579 ] Ilya Ganelin commented on SPARK-4189: - Looking at the code I see // Just copy the

[jira] [Created] (SPARK-4683) Add a beeline.cmd to run on Windows

2014-12-01 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-4683: Summary: Add a beeline.cmd to run on Windows Key: SPARK-4683 URL: https://issues.apache.org/jira/browse/SPARK-4683 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-4684) Add a script to run JDBC server on Windows

2014-12-01 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-4684: Summary: Add a script to run JDBC server on Windows Key: SPARK-4684 URL: https://issues.apache.org/jira/browse/SPARK-4684 Project: Spark Issue Type: New

[jira] [Updated] (SPARK-4685) Update JavaDoc settings to include spark.ml and all spark.mllib subpackages in the right sections

2014-12-01 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-4685: - Priority: Trivial (was: Major) Update JavaDoc settings to include spark.ml and all spark.mllib

[jira] [Created] (SPARK-4685) Update JavaDoc settings to include spark.ml and all spark.mllib subpackages in the right sections

2014-12-01 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-4685: Summary: Update JavaDoc settings to include spark.ml and all spark.mllib subpackages in the right sections Key: SPARK-4685 URL: https://issues.apache.org/jira/browse/SPARK-4685

[jira] [Updated] (SPARK-4685) Update JavaDoc settings to include spark.ml and all spark.mllib subpackages in the right sections

2014-12-01 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-4685: - Target Version/s: 1.2.1 (was: 1.2.0) Update JavaDoc settings to include spark.ml and all

[jira] [Updated] (SPARK-4684) Add a script to run JDBC server on Windows

2014-12-01 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4684: Assignee: Cheng Lian Add a script to run JDBC server on Windows

[jira] [Updated] (SPARK-4683) Add a beeline.cmd to run on Windows

2014-12-01 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4683: Assignee: Cheng Lian Add a beeline.cmd to run on Windows

[jira] [Updated] (SPARK-4666) executor.memoryOverhead config should take a memory string

2014-12-01 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4666: --- Component/s: Spark Core executor.memoryOverhead config should take a memory string

[jira] [Resolved] (SPARK-4529) support view with column alias specified

2014-12-01 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4529. - Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3396

[jira] [Resolved] (SPARK-4082) Show Waiting/Queued Stages in Spark UI

2014-12-01 Thread Pat McDonough (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pat McDonough resolved SPARK-4082. -- Resolution: Duplicate Show Waiting/Queued Stages in Spark UI

[jira] [Created] (SPARK-4686) Link to allowed master URLs is broken in configuration documentation

2014-12-01 Thread Kay Ousterhout (JIRA)
Kay Ousterhout created SPARK-4686: - Summary: Link to allowed master URLs is broken in configuration documentation Key: SPARK-4686 URL: https://issues.apache.org/jira/browse/SPARK-4686 Project: Spark

[jira] [Commented] (SPARK-4686) Link to allowed master URLs is broken in configuration documentation

2014-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230761#comment-14230761 ] Apache Spark commented on SPARK-4686: - User 'kayousterhout' has created a pull request

[jira] [Resolved] (SPARK-4268) Use #::: to get benefit from Stream in SqlLexical.allCaseVersions

2014-12-01 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4268. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3132

[jira] [Created] (SPARK-4687) SparkContext#addFile doesn't keep file folder information

2014-12-01 Thread Jimmy Xiang (JIRA)
Jimmy Xiang created SPARK-4687: -- Summary: SparkContext#addFile doesn't keep file folder information Key: SPARK-4687 URL: https://issues.apache.org/jira/browse/SPARK-4687 Project: Spark Issue

[jira] [Updated] (SPARK-4688) Have a single shared network timeout in Spark

2014-12-01 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4688: --- Summary: Have a single shared network timeout in Spark (was: Consolidate timeout defaults in

[jira] [Created] (SPARK-4688) Consolidate timeout defaults in Spark

2014-12-01 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-4688: -- Summary: Consolidate timeout defaults in Spark Key: SPARK-4688 URL: https://issues.apache.org/jira/browse/SPARK-4688 Project: Spark Issue Type:

[jira] [Updated] (SPARK-4688) Have a single shared network timeout in Spark

2014-12-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-4688: --- Description: We have several different timeouts, but in most cases users just want to set something

[jira] [Commented] (SPARK-4642) Documents about running-on-YARN needs update

2014-12-01 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230805#comment-14230805 ] Masayoshi TSUZUKI commented on SPARK-4642: -- As discussed in

[jira] [Commented] (SPARK-4498) Standalone Master can fail to recognize completed/failed applications

2014-12-01 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230807#comment-14230807 ] Josh Rosen commented on SPARK-4498: --- [~andrewor14] and I just had a long discussion

[jira] [Created] (SPARK-4689) Unioning 2 SchemaRDDs should return a SchemaRDD in Python, Scala, and Java

2014-12-01 Thread Chris Fregly (JIRA)
Chris Fregly created SPARK-4689: --- Summary: Unioning 2 SchemaRDDs should return a SchemaRDD in Python, Scala, and Java Key: SPARK-4689 URL: https://issues.apache.org/jira/browse/SPARK-4689 Project:

[jira] [Commented] (SPARK-4687) SparkContext#addFile doesn't keep file folder information

2014-12-01 Thread Xuefu Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230873#comment-14230873 ] Xuefu Zhang commented on SPARK-4687: [~jxiang], alternatively, would a new method,

[jira] [Commented] (SPARK-3963) Support getting task-scoped properties from TaskContext

2014-12-01 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230878#comment-14230878 ] Shivaram Venkataraman commented on SPARK-3963: -- [~pwendell] This looks pretty

[jira] [Updated] (SPARK-3963) Support getting task-scoped properties from TaskContext

2014-12-01 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3963: --- Target Version/s: 1.3.0 (was: 1.2.0) Support getting task-scoped properties from

[jira] [Commented] (SPARK-3963) Support getting task-scoped properties from TaskContext

2014-12-01 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230887#comment-14230887 ] Patrick Wendell commented on SPARK-3963: [~shivaram] - I think HadoopRDD has a

[jira] [Commented] (SPARK-4229) Create hadoop configuration in a consistent way

2014-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230888#comment-14230888 ] Apache Spark commented on SPARK-4229: - User 'koeninger' has created a pull request for

[jira] [Commented] (SPARK-4498) Standalone Master can fail to recognize completed/failed applications

2014-12-01 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230895#comment-14230895 ] Patrick Wendell commented on SPARK-4498: Hey Josh, The proposal you gave here

[jira] [Commented] (SPARK-3963) Support getting task-scoped properties from TaskContext

2014-12-01 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230900#comment-14230900 ] Shivaram Venkataraman commented on SPARK-3963: -- Thanks. I somehow missed

[jira] [Commented] (SPARK-4687) SparkContext#addFile doesn't keep file folder information

2014-12-01 Thread Jimmy Xiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230901#comment-14230901 ] Jimmy Xiang commented on SPARK-4687: If it can add files under the folder recursively,

[jira] [Commented] (SPARK-4498) Standalone Master can fail to recognize completed/failed applications

2014-12-01 Thread Mark Hamstra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230910#comment-14230910 ] Mark Hamstra commented on SPARK-4498: - I'd argue against reverting 2425 on the grounds

[jira] [Comment Edited] (SPARK-1962) Add RDD cache reference counting

2014-12-01 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230908#comment-14230908 ] Ilya Ganelin edited comment on SPARK-1962 at 12/2/14 3:16 AM: --

[jira] [Commented] (SPARK-4644) Implement skewed join

2014-12-01 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230920#comment-14230920 ] Shixiong Zhu commented on SPARK-4644: - [~pwendell] Fair enough. I'll see what I can do

[jira] [Commented] (SPARK-4672) Cut off the super long serialization chain in GraphX to avoid the StackOverflow error

2014-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230922#comment-14230922 ] Apache Spark commented on SPARK-4672: - User 'JerryLead' has created a pull request for

[jira] [Resolved] (SPARK-4611) Implement the efficient vector norm

2014-12-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4611. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3462

[jira] [Commented] (SPARK-4498) Standalone Master can fail to recognize completed/failed applications

2014-12-01 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230926#comment-14230926 ] Andrew Or commented on SPARK-4498: -- Ok, [~joshrosen] and I talked about this more

[jira] [Commented] (SPARK-4498) Standalone Master can fail to recognize completed/failed applications

2014-12-01 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230928#comment-14230928 ] Andrew Or commented on SPARK-4498: -- Yes [~markhamstra] SPARK-2424 will actually be

[jira] [Commented] (SPARK-4672) Cut off the super long serialization chain in GraphX to avoid the StackOverflow error

2014-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230931#comment-14230931 ] Apache Spark commented on SPARK-4672: - User 'JerryLead' has created a pull request for

[jira] [Commented] (SPARK-4498) Standalone Master can fail to recognize completed/failed applications

2014-12-01 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14230938#comment-14230938 ] Patrick Wendell commented on SPARK-4498: Going back to the original comment by

  1   2   >