[jira] [Commented] (SPARK-4630) Dynamically determine optimal number of partitions

2015-01-19 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283225#comment-14283225 ] Sandy Ryza commented on SPARK-4630: --- One way I was thinking it might make sense to

[jira] [Closed] (SPARK-5282) RowMatrix easily gets int overflow in the memory size warning

2015-01-19 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang closed SPARK-5282. - fixed RowMatrix easily gets int overflow in the memory size warning

[jira] [Resolved] (SPARK-5214) Add EventLoop and change DAGScheduler to an EventLoop

2015-01-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5214. Resolution: Fixed Fix Version/s: 1.3.0 Add EventLoop and change DAGScheduler to an

[jira] [Commented] (SPARK-5329) UIWorkloadGenerator should stop SparkContext.

2015-01-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283374#comment-14283374 ] Apache Spark commented on SPARK-5329: - User 'sarutak' has created a pull request for

[jira] [Commented] (SPARK-5325) Simplifying Hive shim implementation

2015-01-19 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283181#comment-14283181 ] Cheng Lian commented on SPARK-5325: --- Some test cases in {{HiveCompatibilitySuite}} fail

[jira] [Commented] (SPARK-5321) Add transpose() method to Matrix

2015-01-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283182#comment-14283182 ] Apache Spark commented on SPARK-5321: - User 'brkyvz' has created a pull request for

[jira] [Updated] (SPARK-5327) HiveCompatibilitySuite fails when executed against Hive 0.12.0

2015-01-19 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-5327: -- Description: Git commit: e7884bc950a374408959b6118efe2c62fbe50608 Run the following SBT session to

[jira] [Updated] (SPARK-5327) HiveCompatibilitySuite fails when executed against Hive 0.12.0

2015-01-19 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-5327: -- Description: Git commit: e7884bc950a374408959b6118efe2c62fbe50608 Run the following SBT session to

[jira] [Updated] (SPARK-5327) HiveCompatibilitySuite fails when executed against Hive 0.12.0

2015-01-19 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5327: Assignee: Adrian Wang HiveCompatibilitySuite fails when executed against Hive 0.12.0

[jira] [Created] (SPARK-5328) Update PySpark MLlib NaiveBayes API to take model type parameter for Bernoulli fit

2015-01-19 Thread Leah McGuire (JIRA)
Leah McGuire created SPARK-5328: --- Summary: Update PySpark MLlib NaiveBayes API to take model type parameter for Bernoulli fit Key: SPARK-5328 URL: https://issues.apache.org/jira/browse/SPARK-5328

[jira] [Created] (SPARK-5326) Show fetch wait time as optional metric in the UI

2015-01-19 Thread Kay Ousterhout (JIRA)
Kay Ousterhout created SPARK-5326: - Summary: Show fetch wait time as optional metric in the UI Key: SPARK-5326 URL: https://issues.apache.org/jira/browse/SPARK-5326 Project: Spark Issue

[jira] [Commented] (SPARK-4630) Dynamically determine optimal number of partitions

2015-01-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283161#comment-14283161 ] Reynold Xin commented on SPARK-4630: The problem is that as the stage if producing its

[jira] [Created] (SPARK-5327) HiveCompatibilitySuite fails when executed against Hive 0.12.0

2015-01-19 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-5327: - Summary: HiveCompatibilitySuite fails when executed against Hive 0.12.0 Key: SPARK-5327 URL: https://issues.apache.org/jira/browse/SPARK-5327 Project: Spark

[jira] [Commented] (SPARK-5326) Show fetch wait time as optional metric in the UI

2015-01-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283204#comment-14283204 ] Apache Spark commented on SPARK-5326: - User 'kayousterhout' has created a pull request

[jira] [Comment Edited] (SPARK-2823) GraphX jobs throw IllegalArgumentException

2015-01-19 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283331#comment-14283331 ] Pedro Rodriguez edited comment on SPARK-2823 at 1/20/15 2:46 AM:

[jira] [Commented] (SPARK-4630) Dynamically determine optimal number of partitions

2015-01-19 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283146#comment-14283146 ] Sandy Ryza commented on SPARK-4630: --- [~rxin] I agree that there are probably a ton of

[jira] [Created] (SPARK-5329) UIWorkloadGenerator should stop SparkContext.

2015-01-19 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-5329: - Summary: UIWorkloadGenerator should stop SparkContext. Key: SPARK-5329 URL: https://issues.apache.org/jira/browse/SPARK-5329 Project: Spark Issue Type:

[jira] [Commented] (SPARK-2823) GraphX jobs throw IllegalArgumentException

2015-01-19 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283331#comment-14283331 ] Pedro Rodriguez commented on SPARK-2823: I just ran into this bug while testing

[jira] [Commented] (SPARK-5190) Allow spark listeners to be added before spark context gets initialized.

2015-01-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283361#comment-14283361 ] Apache Spark commented on SPARK-5190: - User 'JoshRosen' has created a pull request for

[jira] [Commented] (SPARK-5327) HiveCompatibilitySuite fails when executed against Hive 0.12.0

2015-01-19 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283190#comment-14283190 ] Cheng Lian commented on SPARK-5327: --- Should add dedicated Jenkins builder for Hive

[jira] [Commented] (SPARK-5327) HiveCompatibilitySuite fails when executed against Hive 0.12.0

2015-01-19 Thread Adrian Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283308#comment-14283308 ] Adrian Wang commented on SPARK-5327: [~lian cheng]Do you mean we have to test both

[jira] [Updated] (SPARK-5328) Update PySpark MLlib NaiveBayes API to take model type parameter for Bernoulli fit

2015-01-19 Thread Leah McGuire (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Leah McGuire updated SPARK-5328: Description: [SPARK-4894] Adds Bernoulli-variant of Naive Bayes adds Bernoulli fitting to

[jira] [Comment Edited] (SPARK-5309) Reduce Binary/String conversion overhead when reading/writing Parquet files

2015-01-19 Thread MIchael Davies (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282263#comment-14282263 ] MIchael Davies edited comment on SPARK-5309 at 1/19/15 8:41 AM:

[jira] [Comment Edited] (SPARK-5309) Reduce Binary/String conversion overhead when reading/writing Parquet files

2015-01-19 Thread MIchael Davies (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282263#comment-14282263 ] MIchael Davies edited comment on SPARK-5309 at 1/19/15 8:39 AM:

[jira] [Comment Edited] (SPARK-5309) Reduce Binary/String conversion overhead when reading/writing Parquet files

2015-01-19 Thread MIchael Davies (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282263#comment-14282263 ] MIchael Davies edited comment on SPARK-5309 at 1/19/15 8:35 AM:

[jira] [Created] (SPARK-5315) reduceByWindow returns Scala DStream not JavaDStream

2015-01-19 Thread Saisai Shao (JIRA)
Saisai Shao created SPARK-5315: -- Summary: reduceByWindow returns Scala DStream not JavaDStream Key: SPARK-5315 URL: https://issues.apache.org/jira/browse/SPARK-5315 Project: Spark Issue Type:

[jira] [Commented] (SPARK-5309) Reduce Binary/String conversion overhead when reading/writing Parquet files

2015-01-19 Thread MIchael Davies (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282263#comment-14282263 ] MIchael Davies commented on SPARK-5309: --- Additionally noticed that predicates that

[jira] [Comment Edited] (SPARK-5309) Reduce Binary/String conversion overhead when reading/writing Parquet files

2015-01-19 Thread MIchael Davies (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282263#comment-14282263 ] MIchael Davies edited comment on SPARK-5309 at 1/19/15 8:35 AM:

[jira] [Commented] (SPARK-4630) Dynamically determine optimal number of partitions

2015-01-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282273#comment-14282273 ] Reynold Xin commented on SPARK-4630: This seems like something better suited for

[jira] [Updated] (SPARK-5297) File Streams do not work with custom key/values

2015-01-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5297: --- Fix Version/s: (was: 1.2.0) File Streams do not work with custom key/values

[jira] [Created] (SPARK-5330) Core | Scala 2.11 | Transitive dependency on com.fasterxml.jackson.core :jackson-core:2.3.1 causes compatibility issues

2015-01-19 Thread Aniket Bhatnagar (JIRA)
Aniket Bhatnagar created SPARK-5330: --- Summary: Core | Scala 2.11 | Transitive dependency on com.fasterxml.jackson.core :jackson-core:2.3.1 causes compatibility issues Key: SPARK-5330 URL:

[jira] [Commented] (SPARK-5327) HiveCompatibilitySuite fails when executed against Hive 0.12.0

2015-01-19 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283471#comment-14283471 ] Cheng Lian commented on SPARK-5327: --- Good point, I'll add that. Thanks for reminding!

[jira] [Commented] (SPARK-5327) HiveCompatibilitySuite fails when executed against Hive 0.12.0

2015-01-19 Thread Adrian Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283466#comment-14283466 ] Adrian Wang commented on SPARK-5327: Yes, I see that. But these cases are fine if we

[jira] [Updated] (SPARK-5297) File Streams do not work with custom key/values

2015-01-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5297: --- Description: The following code: {code}

[jira] [Updated] (SPARK-5270) Provide isEmpty utility function in RDD API

2015-01-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5270: --- Summary: Provide isEmpty utility function in RDD API (was: Elegantly check if RDD is empty)

[jira] [Updated] (SPARK-5270) Provide isEmpty() function in RDD API

2015-01-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5270: --- Summary: Provide isEmpty() function in RDD API (was: Provide isEmpty utility function in RDD

[jira] [Updated] (SPARK-5270) Provide isEmpty utility function in RDD API

2015-01-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5270: --- Assignee: Sean Owen Provide isEmpty utility function in RDD API

[jira] [Commented] (SPARK-5325) Simplifying Hive shim implementation

2015-01-19 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283520#comment-14283520 ] Cheng Lian commented on SPARK-5325: --- Fixed in PR #4107 Simplifying Hive shim

[jira] [Updated] (SPARK-5288) Stabilize Spark SQL data type API followup

2015-01-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5288: --- Assignee: Yin Huai Stabilize Spark SQL data type API followup

[jira] [Commented] (SPARK-3185) SPARK launch on Hadoop 2 in EC2 throws Tachyon exception when Formatting JOURNAL_FOLDER

2015-01-19 Thread Florian Verhein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283493#comment-14283493 ] Florian Verhein commented on SPARK-3185: I built tachyon with the correct hadoop

[jira] [Updated] (SPARK-5327) HiveCompatibilitySuite fails when executed against Hive 0.12.0

2015-01-19 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-5327: -- Description: Git commit: e7884bc950a374408959b6118efe2c62fbe50608 Run the following SBT session to

[jira] [Commented] (SPARK-5327) HiveCompatibilitySuite fails when executed against Hive 0.12.0

2015-01-19 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283436#comment-14283436 ] Cheng Lian commented on SPARK-5327: --- Would like to add that the reason why I'd like to

[jira] [Updated] (SPARK-5295) Stabilize data types

2015-01-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5295: --- Assignee: Yin Huai (was: Reynold Xin) Stabilize data types

[jira] [Updated] (SPARK-5295) Stabilize data types

2015-01-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5295: --- Description: 1. We expose all the stuff in data types right now, including NumericTypes, etc. These

[jira] [Commented] (SPARK-5323) Row shouldn't extend Seq

2015-01-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283511#comment-14283511 ] Apache Spark commented on SPARK-5323: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-5327) HiveCompatibilitySuite fails when executed against Hive 0.12.0

2015-01-19 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283434#comment-14283434 ] Cheng Lian commented on SPARK-5327: --- [~adrian-wang] Thanks for pointing out the {{.q}}

[jira] [Commented] (SPARK-5330) Core | Scala 2.11 | Transitive dependency on com.fasterxml.jackson.core :jackson-core:2.3.1 causes compatibility issues

2015-01-19 Thread Aniket Bhatnagar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283432#comment-14283432 ] Aniket Bhatnagar commented on SPARK-5330: - One possible workaround is do define

[jira] [Commented] (SPARK-5327) HiveCompatibilitySuite fails when executed against Hive 0.12.0

2015-01-19 Thread Adrian Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283457#comment-14283457 ] Adrian Wang commented on SPARK-5327: So I think we'd better build whitelist and

[jira] [Commented] (SPARK-5327) HiveCompatibilitySuite fails when executed against Hive 0.12.0

2015-01-19 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283458#comment-14283458 ] Cheng Lian commented on SPARK-5327: --- We've already had separate blacklist in the Hive

[jira] [Created] (SPARK-5331) Tachyon workers seem to ignore tachyon.master.hostname and use localhost instead

2015-01-19 Thread Florian Verhein (JIRA)
Florian Verhein created SPARK-5331: -- Summary: Tachyon workers seem to ignore tachyon.master.hostname and use localhost instead Key: SPARK-5331 URL: https://issues.apache.org/jira/browse/SPARK-5331

[jira] [Commented] (SPARK-4660) JavaSerializer uses wrong classloader

2015-01-19 Thread Jacek Lewandowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283454#comment-14283454 ] Jacek Lewandowski commented on SPARK-4660: -- Pull requests for 1.2 and master

[jira] [Commented] (SPARK-4660) JavaSerializer uses wrong classloader

2015-01-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283453#comment-14283453 ] Apache Spark commented on SPARK-4660: - User 'jacek-lewandowski' has created a pull

[jira] [Commented] (SPARK-4660) JavaSerializer uses wrong classloader

2015-01-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283452#comment-14283452 ] Apache Spark commented on SPARK-4660: - User 'jacek-lewandowski' has created a pull

[jira] [Resolved] (SPARK-5270) Provide isEmpty() function in RDD API

2015-01-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-5270. Resolution: Fixed Fix Version/s: 1.3.0 Provide isEmpty() function in RDD API

[jira] [Commented] (SPARK-4630) Dynamically determine optimal number of partitions

2015-01-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283539#comment-14283539 ] Reynold Xin commented on SPARK-4630: Maybe the first step towards this is to find what

[jira] [Resolved] (SPARK-4504) run-example fails if multiple example jars present in target folder

2015-01-19 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4504. --- Resolution: Fixed Fix Version/s: 1.2.1 1.3.0 1.1.2 Issue

[jira] [Updated] (SPARK-4504) run-example fails if multiple example jars present in target folder

2015-01-19 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4504: -- Assignee: Venkata Ramana G run-example fails if multiple example jars present in target folder

[jira] [Created] (SPARK-5325) Simplifying Hive shim implementation

2015-01-19 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-5325: - Summary: Simplifying Hive shim implementation Key: SPARK-5325 URL: https://issues.apache.org/jira/browse/SPARK-5325 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-5325) Simplifying Hive shim implementation

2015-01-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283073#comment-14283073 ] Apache Spark commented on SPARK-5325: - User 'liancheng' has created a pull request for

[jira] [Created] (SPARK-5317) Another straightforward way to set BoostingStrategy.defaultParams

2015-01-19 Thread peishenjia (JIRA)
peishenjia created SPARK-5317: - Summary: Another straightforward way to set BoostingStrategy.defaultParams Key: SPARK-5317 URL: https://issues.apache.org/jira/browse/SPARK-5317 Project: Spark

[jira] [Resolved] (SPARK-2595) The driver run garbage collection, when the executor throws OutOfMemoryError exception

2015-01-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2595. Resolution: Won't Fix Per PR comment, closing this for now. The driver run garbage

[jira] [Commented] (SPARK-3924) Upgrade to Akka version 2.3.7

2015-01-19 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-3924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282408#comment-14282408 ] François Garillot commented on SPARK-3924: -- [~helena_e] Please bump to 2.3.8.

[jira] [Updated] (SPARK-5212) Add support of schema-less transformation

2015-01-19 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-5212: --- Description: This pr adds the support of schema-less syntax, custom field delimiter and SerDe

[jira] [Updated] (SPARK-5212) Add support of schema-less, custom field delimiter and SerDe for HiveQL transform

2015-01-19 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-5212: --- Summary: Add support of schema-less, custom field delimiter and SerDe for HiveQL transform

[jira] [Resolved] (SPARK-5088) Use spark-class for running executors directly on mesos

2015-01-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-5088. Resolution: Fixed Assignee: Jongyoul Lee Use spark-class for running executors

[jira] [Commented] (SPARK-4697) System properties should override environment variables

2015-01-19 Thread WangTaoTheTonic (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282397#comment-14282397 ] WangTaoTheTonic commented on SPARK-4697: Should this be closed? System

[jira] [Commented] (SPARK-2620) case class cannot be used as key for reduce

2015-01-19 Thread Derrick Burns (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282608#comment-14282608 ] Derrick Burns commented on SPARK-2620: -- I think we need a solution not a workaround

[jira] [Commented] (SPARK-2222) Add multiclass evaluation metrics

2015-01-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282592#comment-14282592 ] Apache Spark commented on SPARK-: - User 'avulanov' has created a pull request for

[jira] [Commented] (SPARK-5309) Reduce Binary/String conversion overhead when reading/writing Parquet files

2015-01-19 Thread MIchael Davies (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282565#comment-14282565 ] MIchael Davies commented on SPARK-5309: --- Here are some timings showing effect of

[jira] [Resolved] (SPARK-5217) Spark UI should report pending stages during job execution on AllStagesPage.

2015-01-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-5217. Resolution: Fixed Fix Version/s: 1.3.0 Spark UI should report pending stages during

[jira] [Commented] (SPARK-5315) reduceByWindow returns Scala DStream not JavaDStream

2015-01-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282310#comment-14282310 ] Sean Owen commented on SPARK-5315: -- Agree, but someone may be calling this method and

[jira] [Comment Edited] (SPARK-5315) reduceByWindow returns Scala DStream not JavaDStream

2015-01-19 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282317#comment-14282317 ] Saisai Shao edited comment on SPARK-5315 at 1/19/15 9:53 AM: -

[jira] [Resolved] (SPARK-4417) New API: sample RDD to fixed number of items

2015-01-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4417. Resolution: Won't Fix Assignee: Ilya Ganelin [~ilganeli] ended up taking a crack a

[jira] [Created] (SPARK-5320) Joins on simple table created using select

2015-01-19 Thread Kuldeep (JIRA)
Kuldeep created SPARK-5320: -- Summary: Joins on simple table created using select Key: SPARK-5320 URL: https://issues.apache.org/jira/browse/SPARK-5320 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-5315) reduceByWindow returns Scala DStream not JavaDStream

2015-01-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282358#comment-14282358 ] Apache Spark commented on SPARK-5315: - User 'jerryshao' has created a pull request for

[jira] [Commented] (SPARK-5222) YARN client and cluster modes have different app name behaviors

2015-01-19 Thread WangTaoTheTonic (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282396#comment-14282396 ] WangTaoTheTonic commented on SPARK-5222: If I didn't miss something, the issue

[jira] [Commented] (SPARK-3439) Add Canopy Clustering Algorithm

2015-01-19 Thread Muhammad-Ali A'rabi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282460#comment-14282460 ] Muhammad-Ali A'rabi commented on SPARK-3439: We are using k-means with canopy

[jira] [Commented] (SPARK-2620) case class cannot be used as key for reduce

2015-01-19 Thread Frank Rosner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282323#comment-14282323 ] Frank Rosner commented on SPARK-2620: - The issue is caused by the fact that pattern

[jira] [Created] (SPARK-5319) Choosing partition size instead of count

2015-01-19 Thread Idan Zalzberg (JIRA)
Idan Zalzberg created SPARK-5319: Summary: Choosing partition size instead of count Key: SPARK-5319 URL: https://issues.apache.org/jira/browse/SPARK-5319 Project: Spark Issue Type:

[jira] [Updated] (SPARK-5320) Joins on simple table created using select gives error

2015-01-19 Thread Kuldeep (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kuldeep updated SPARK-5320: --- Summary: Joins on simple table created using select gives error (was: Joins on simple table created using

[jira] [Commented] (SPARK-5315) reduceByWindow returns Scala DStream not JavaDStream

2015-01-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282287#comment-14282287 ] Sean Owen commented on SPARK-5315: -- Ah, yeah that looks like a typo for sure. The other

[jira] [Resolved] (SPARK-3758) Script style checking

2015-01-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3758. Resolution: Won't Fix This patch ended up being so large, I think we're gonna pass on it.

[jira] [Commented] (SPARK-5315) reduceByWindow returns Scala DStream not JavaDStream

2015-01-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282320#comment-14282320 ] Sean Owen commented on SPARK-5315: -- Oh, that's a good point and good news then. We can

[jira] [Commented] (SPARK-5317) Another straightforward way to set BoostingStrategy.defaultParams

2015-01-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282332#comment-14282332 ] Apache Spark commented on SPARK-5317: - User 'Peishen-Jia' has created a pull request

[jira] [Created] (SPARK-5318) Add ability to control partition count in SparkSql

2015-01-19 Thread Idan Zalzberg (JIRA)
Idan Zalzberg created SPARK-5318: Summary: Add ability to control partition count in SparkSql Key: SPARK-5318 URL: https://issues.apache.org/jira/browse/SPARK-5318 Project: Spark Issue Type:

[jira] [Commented] (SPARK-5176) Thrift server fails with confusing error message when deploy-mode is cluster

2015-01-19 Thread Adam Gutglick (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282402#comment-14282402 ] Adam Gutglick commented on SPARK-5176: -- Should it be fixed in 1.2.0/1.3.0 or straight

[jira] [Commented] (SPARK-5316) DAGScheduler may make shuffleToMapStage leak if getParentStages failes

2015-01-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5316?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282416#comment-14282416 ] Apache Spark commented on SPARK-5316: - User 'YanTangZhai' has created a pull request

[jira] [Created] (SPARK-5316) DAGScheduler may make shuffleToMapStage leak if getParentStages failes

2015-01-19 Thread YanTang Zhai (JIRA)
YanTang Zhai created SPARK-5316: --- Summary: DAGScheduler may make shuffleToMapStage leak if getParentStages failes Key: SPARK-5316 URL: https://issues.apache.org/jira/browse/SPARK-5316 Project: Spark

[jira] [Commented] (SPARK-5315) reduceByWindow returns Scala DStream not JavaDStream

2015-01-19 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282299#comment-14282299 ] Saisai Shao commented on SPARK-5315: Yes, user can wrap with JavaDStream, but I think

[jira] [Resolved] (SPARK-3288) All fields in TaskMetrics should be private and use getters/setters

2015-01-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3288. Resolution: Fixed Fix Version/s: 1.3.0 Target Version/s: 1.3.0 (was:

[jira] [Updated] (SPARK-3288) All fields in TaskMetrics should be private and use getters/setters

2015-01-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3288: --- Assignee: Ilya Ganelin (was: Dale Richardson) All fields in TaskMetrics should be private

[jira] [Comment Edited] (SPARK-3439) Add Canopy Clustering Algorithm

2015-01-19 Thread Muhammad-Ali A'rabi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282460#comment-14282460 ] Muhammad-Ali A'rabi edited comment on SPARK-3439 at 1/19/15 12:32 PM:

[jira] [Commented] (SPARK-5182) Partitioning support for tables created by the data source API

2015-01-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282682#comment-14282682 ] Yin Huai commented on SPARK-5182: - [~btiernay] This feature is targeted for both.

[jira] [Commented] (SPARK-5246) spark/spark-ec2.py cannot start Spark master in VPC if local DNS name does not resolve

2015-01-19 Thread Vladimir Grigor (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282701#comment-14282701 ] Vladimir Grigor commented on SPARK-5246: please see another pull request for this

[jira] [Closed] (SPARK-3924) Upgrade to Akka version 2.3.7

2015-01-19 Thread Helena Edelson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Helena Edelson closed SPARK-3924. - Resolution: Fixed Upgrade to Akka version 2.3.7 -

[jira] [Commented] (SPARK-5288) Stabilize Spark SQL data type API followup

2015-01-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282675#comment-14282675 ] Yin Huai commented on SPARK-5288: - Yeah, that sounds good. Stabilize Spark SQL data type

[jira] [Commented] (SPARK-5315) reduceByWindow returns Scala DStream not JavaDStream

2015-01-19 Thread Jeff Nadler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282703#comment-14282703 ] Jeff Nadler commented on SPARK-5315: Thanks Jerry! I was just about to file this and

[jira] [Commented] (SPARK-3924) Upgrade to Akka version 2.3.7

2015-01-19 Thread Helena Edelson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282763#comment-14282763 ] Helena Edelson commented on SPARK-3924: --- I wrote this ticket against the previous,

[jira] [Updated] (SPARK-4206) BlockManager warnings in local mode: Block $blockId already exists on this machine; not re-adding it

2015-01-19 Thread Tony Stevenson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tony Stevenson updated SPARK-4206: -- Reporter: Imran Rashid (was: Imran Rashid) BlockManager warnings in local mode: Block

[jira] [Comment Edited] (SPARK-5309) Reduce Binary/String conversion overhead when reading/writing Parquet files

2015-01-19 Thread MIchael Davies (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14282790#comment-14282790 ] MIchael Davies edited comment on SPARK-5309 at 1/19/15 6:11 PM:

  1   2   >