[jira] [Created] (SPARK-2548) JavaRecoverableWordCount is missing

2014-07-16 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-2548: Summary: JavaRecoverableWordCount is missing Key: SPARK-2548 URL: https://issues.apache.org/jira/browse/SPARK-2548 Project: Spark Issue Type: Bug C

[jira] [Updated] (SPARK-2548) JavaRecoverableWordCount is missing

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2548: - Priority: Minor (was: Major) > JavaRecoverableWordCount is missing > ---

[jira] [Commented] (SPARK-2492) KafkaReceiver minor changes to align with Kafka 0.8

2014-07-16 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064636#comment-14064636 ] Saisai Shao commented on SPARK-2492: Hi TD, I revisit the Kafka's ConsoleConsumer ca

[jira] [Issue Comment Deleted] (SPARK-2521) Broadcast RDD object once per TaskSet (instead of sending it for every task)

2014-07-16 Thread Andrew Ash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Ash updated SPARK-2521: -- Comment: was deleted (was: Reynold's PR: https://github.com/apache/spark/pull/1452) > Broadcast RDD ob

[jira] [Commented] (SPARK-2521) Broadcast RDD object once per TaskSet (instead of sending it for every task)

2014-07-16 Thread Andrew Ash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064626#comment-14064626 ] Andrew Ash commented on SPARK-2521: --- Reynold's PR: https://github.com/apache/spark/pull/

[jira] [Commented] (SPARK-2546) Configuration object thread safety issue

2014-07-16 Thread Andrew Ash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064625#comment-14064625 ] Andrew Ash commented on SPARK-2546: --- On the thread: Me: {quote} Reynold's recent annou

[jira] [Commented] (SPARK-2048) Optimizations to CPU usage of external spilling code

2014-07-16 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064602#comment-14064602 ] Matei Zaharia commented on SPARK-2048: -- I added one more issue to this BTW, about EAO

[jira] [Updated] (SPARK-2048) Optimizations to CPU usage of external spilling code

2014-07-16 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2048?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2048: - Description: In the external spilling code in ExternalAppendOnlyMap and CoGroupedRDD, there are

[jira] [Commented] (SPARK-2433) In MLlib, implementation for Naive Bayes in Spark 0.9.1 is having an implementation bug.

2014-07-16 Thread Rahul K Bhojwani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064590#comment-14064590 ] Rahul K Bhojwani commented on SPARK-2433: - There is another small error in the doc

[jira] [Created] (SPARK-2547) The clustering documentaion example provided for spark 0.9.1/docs is having a error

2014-07-16 Thread Rahul K Bhojwani (JIRA)
Rahul K Bhojwani created SPARK-2547: --- Summary: The clustering documentaion example provided for spark 0.9.1/docs is having a error Key: SPARK-2547 URL: https://issues.apache.org/jira/browse/SPARK-2547

[jira] [Created] (SPARK-2546) Configuration object thread safety issue

2014-07-16 Thread Andrew Ash (JIRA)
Andrew Ash created SPARK-2546: - Summary: Configuration object thread safety issue Key: SPARK-2546 URL: https://issues.apache.org/jira/browse/SPARK-2546 Project: Spark Issue Type: Bug Co

[jira] [Closed] (SPARK-2466) Got two different block manager registrations

2014-07-16 Thread Alex Gaudio (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Gaudio closed SPARK-2466. -- Resolution: Duplicate dup of 2445: https://issues.apache.org/jira/browse/SPARK-2445 > Got two differen

[jira] [Commented] (SPARK-2466) Got two different block manager registrations

2014-07-16 Thread Alex Gaudio (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064586#comment-14064586 ] Alex Gaudio commented on SPARK-2466: 2466 is a duplicate of 2445. > Got two different

[jira] [Commented] (SPARK-2433) In MLlib, implementation for Naive Bayes in Spark 0.9.1 is having an implementation bug.

2014-07-16 Thread Rahul K Bhojwani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064583#comment-14064583 ] Rahul K Bhojwani commented on SPARK-2433: - Apologies for no response and not corre

[jira] [Commented] (SPARK-2523) Potential Bugs if SerDe is not the identical among partitions and table

2014-07-16 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064576#comment-14064576 ] Cheng Hao commented on SPARK-2523: -- I think the root cause is the when ALTER table with d

[jira] [Commented] (SPARK-2523) Potential Bugs if SerDe is not the identical among partitions and table

2014-07-16 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064570#comment-14064570 ] Cheng Hao commented on SPARK-2523: -- sbt/sbt hive/console {code:title=prepare.scala|border

[jira] [Resolved] (SPARK-2156) When the size of serialized results for one partition is slightly smaller than 10MB (the default akka.frameSize), the execution blocks

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-2156. -- Resolution: Fixed > When the size of serialized results for one partition is slightly smaller

[jira] [Updated] (SPARK-2156) When the size of serialized results for one partition is slightly smaller than 10MB (the default akka.frameSize), the execution blocks

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2156: - Fix Version/s: 0.9.2 > When the size of serialized results for one partition is slightly smaller

[jira] [Resolved] (SPARK-1112) When spark.akka.frameSize > 10, task results bigger than 10MiB block execution

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-1112. -- Resolution: Fixed Fix Version/s: (was: 1.0.1) (was: 1.1.0)

[jira] [Updated] (SPARK-1112) When spark.akka.frameSize > 10, task results bigger than 10MiB block execution

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1112: - Target Version/s: 0.9.2, 1.0.1, 1.1.0 (was: 1.0.1, 1.1.0, 0.9.3) > When spark.akka.frameSize > 1

[jira] [Commented] (SPARK-1112) When spark.akka.frameSize > 10, task results bigger than 10MiB block execution

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064552#comment-14064552 ] Xiangrui Meng commented on SPARK-1112: -- PR for branch-0.9: https://github.com/apache/

[jira] [Commented] (SPARK-2156) When the size of serialized results for one partition is slightly smaller than 10MB (the default akka.frameSize), the execution blocks

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064551#comment-14064551 ] Xiangrui Meng commented on SPARK-2156: -- PR for branch-0.9: https://github.com/apache/

[jira] [Created] (SPARK-2545) Add a diagnosis mode for closures to figure out what they're bringing in

2014-07-16 Thread Aaron Davidson (JIRA)
Aaron Davidson created SPARK-2545: - Summary: Add a diagnosis mode for closures to figure out what they're bringing in Key: SPARK-2545 URL: https://issues.apache.org/jira/browse/SPARK-2545 Project: Spa

[jira] [Updated] (SPARK-1112) When spark.akka.frameSize > 10, task results bigger than 10MiB block execution

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1112: - Target Version/s: 1.0.1, 1.1.0, 0.9.3 (was: 0.9.2, 1.0.1, 1.1.0) > When spark.akka.frameSize > 1

[jira] [Updated] (SPARK-1576) Passing of JAVA_OPTS to YARN on command line

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1576: - Fix Version/s: (was: 0.9.2) (was: 0.9.0) > Passing of JAVA_OPTS to YAR

[jira] [Updated] (SPARK-1576) Passing of JAVA_OPTS to YARN on command line

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1576: - Target Version/s: 0.9.3 Affects Version/s: 0.9.2 > Passing of JAVA_OPTS to YARN on command l

[jira] [Updated] (SPARK-1346) Backport SPARK-1210 into 0.9 branch

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1346: - Fix Version/s: (was: 0.9.2) > Backport SPARK-1210 into 0.9 branch > -

[jira] [Resolved] (SPARK-1759) sbt/sbt package fail cause by directory

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-1759. -- Resolution: Not a Problem Target Version/s: (was: 0.9.3) > sbt/sbt package fail cau

[jira] [Updated] (SPARK-1346) Backport SPARK-1210 into 0.9 branch

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1346: - Target Version/s: 0.9.3 > Backport SPARK-1210 into 0.9 branch > -

[jira] [Updated] (SPARK-1849) Broken UTF-8 encoded data gets character replacements and thus can't be "fixed"

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1849: - Fix Version/s: (was: 0.9.2) (was: 1.0.0) > Broken UTF-8 encoded data g

[jira] [Updated] (SPARK-1444) Update branch-0.9's SBT to 0.13.1 so that it works with Java 8

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1444: - Target Version/s: 0.9.3 > Update branch-0.9's SBT to 0.13.1 so that it works with Java 8 > --

[jira] [Updated] (SPARK-1759) sbt/sbt package fail cause by directory

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1759: - Target Version/s: 0.9.3 > sbt/sbt package fail cause by directory > -

[jira] [Updated] (SPARK-1498) Spark can hang if pyspark tasks fail

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1498: - Target Version/s: 0.9.3 > Spark can hang if pyspark tasks fail >

[jira] [Updated] (SPARK-1444) Update branch-0.9's SBT to 0.13.1 so that it works with Java 8

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1444: - Fix Version/s: (was: 0.9.2) > Update branch-0.9's SBT to 0.13.1 so that it works with Java 8

[jira] [Updated] (SPARK-1759) sbt/sbt package fail cause by directory

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1759: - Fix Version/s: (was: 0.9.2) > sbt/sbt package fail cause by directory > -

[jira] [Updated] (SPARK-1498) Spark can hang if pyspark tasks fail

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1498: - Fix Version/s: (was: 0.9.2) > Spark can hang if pyspark tasks fail >

[jira] [Resolved] (SPARK-2433) In MLlib, implementation for Naive Bayes in Spark 0.9.1 is having an implementation bug.

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-2433. -- Resolution: Fixed Fix Version/s: (was: 1.0.0) 0.9.2 Issue resolve

[jira] [Commented] (SPARK-2463) Creating multiple StreamingContexts from shell generates duplicate Streaming tabs in UI

2014-07-16 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064501#comment-14064501 ] Nicholas Chammas commented on SPARK-2463: - Ah OK, that's good to hear! Sorry, rere

[jira] [Created] (SPARK-2544) Improve ALS algorithm resource usage

2014-07-16 Thread Guoqiang Li (JIRA)
Guoqiang Li created SPARK-2544: -- Summary: Improve ALS algorithm resource usage Key: SPARK-2544 URL: https://issues.apache.org/jira/browse/SPARK-2544 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-2534) Avoid pulling in the entire RDD or PairRDDFunctions in various operators

2014-07-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-2534: --- Summary: Avoid pulling in the entire RDD or PairRDDFunctions in various operators (was: Avoid pullin

[jira] [Created] (SPARK-2543) Resizable serialization buffers for kryo

2014-07-16 Thread koert kuipers (JIRA)
koert kuipers created SPARK-2543: Summary: Resizable serialization buffers for kryo Key: SPARK-2543 URL: https://issues.apache.org/jira/browse/SPARK-2543 Project: Spark Issue Type: Improvemen

[jira] [Commented] (SPARK-2463) Creating multiple StreamingContexts from shell generates duplicate Streaming tabs in UI

2014-07-16 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064486#comment-14064486 ] Tathagata Das commented on SPARK-2463: -- No no, you can stop one StreamingContext (wit

[jira] [Created] (SPARK-2542) Exit Code Class should be renamed and placed package properly

2014-07-16 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-2542: - Summary: Exit Code Class should be renamed and placed package properly Key: SPARK-2542 URL: https://issues.apache.org/jira/browse/SPARK-2542 Project: Spark

[jira] [Commented] (SPARK-2463) Creating multiple StreamingContexts from shell generates duplicate Streaming tabs in UI

2014-07-16 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064478#comment-14064478 ] Nicholas Chammas commented on SPARK-2463: - {quote} Yeah, as a first step, we shoul

[jira] [Updated] (SPARK-2411) Standalone Master - direct users to turn on event logs

2014-07-16 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2411: - Attachment: (was: Master history not found.png) > Standalone Master - direct users to turn on event l

[jira] [Updated] (SPARK-2411) Standalone Master - direct users to turn on event logs

2014-07-16 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2411: - Attachment: Event logging not enabled.png Application history not found.png

[jira] [Commented] (SPARK-2463) Creating multiple StreamingContexts from shell generates duplicate Streaming tabs in UI

2014-07-16 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064474#comment-14064474 ] Tathagata Das commented on SPARK-2463: -- Yeah, as a first step, we should detect that

[jira] [Commented] (SPARK-2481) The environment variables SPARK_HISTORY_OPTS is covered in start-history-server.sh

2014-07-16 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064471#comment-14064471 ] Masayoshi TSUZUKI commented on SPARK-2481: -- Ah, I understand what you meant and I

[jira] [Commented] (SPARK-2406) Partitioned Parquet Support

2014-07-16 Thread Pat McDonough (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064450#comment-14064450 ] Pat McDonough commented on SPARK-2406: -- Okay, so it sounds like these are completely

[jira] [Resolved] (SPARK-2322) Exception in resultHandler could crash DAGScheduler and shutdown SparkContext

2014-07-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-2322. Resolution: Fixed Fix Version/s: 1.1.0 1.0.1 > Exception in resultHandler

[jira] [Created] (SPARK-2541) Standalone mode can't access secure HDFS anymore

2014-07-16 Thread Thomas Graves (JIRA)
Thomas Graves created SPARK-2541: Summary: Standalone mode can't access secure HDFS anymore Key: SPARK-2541 URL: https://issues.apache.org/jira/browse/SPARK-2541 Project: Spark Issue Type: Bu

[jira] [Updated] (SPARK-2438) Streaming + MLLib

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2438: - Assignee: Jeremy Freeman > Streaming + MLLib > - > > Key: SPARK-2

[jira] [Updated] (SPARK-2433) In MLlib, implementation for Naive Bayes in Spark 0.9.1 is having an implementation bug.

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2433: - Fix Version/s: 1.0.0 > In MLlib, implementation for Naive Bayes in Spark 0.9.1 is having an > im

[jira] [Assigned] (SPARK-2433) In MLlib, implementation for Naive Bayes in Spark 0.9.1 is having an implementation bug.

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-2433: Assignee: Xiangrui Meng > In MLlib, implementation for Naive Bayes in Spark 0.9.1 is having

[jira] [Commented] (SPARK-2433) In MLlib, implementation for Naive Bayes in Spark 0.9.1 is having an implementation bug.

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064421#comment-14064421 ] Xiangrui Meng commented on SPARK-2433: -- PR for branch-0.9: https://github.com/apache/

[jira] [Commented] (SPARK-2540) Add More Types Support for unwarpData of HiveUDF

2014-07-16 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064411#comment-14064411 ] Cheng Hao commented on SPARK-2540: -- https://github.com/apache/spark/pull/1436 > Add More

[jira] [Created] (SPARK-2540) Add More Types Support for unwarpData of HiveUDF

2014-07-16 Thread Cheng Hao (JIRA)
Cheng Hao created SPARK-2540: Summary: Add More Types Support for unwarpData of HiveUDF Key: SPARK-2540 URL: https://issues.apache.org/jira/browse/SPARK-2540 Project: Spark Issue Type: Improvemen

[jira] [Created] (SPARK-2539) ConnectionManager should handle Uncaught Exception

2014-07-16 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-2539: - Summary: ConnectionManager should handle Uncaught Exception Key: SPARK-2539 URL: https://issues.apache.org/jira/browse/SPARK-2539 Project: Spark Issue Type

[jira] [Commented] (SPARK-2535) Add StringComparison case to NullPropagation.

2014-07-16 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064381#comment-14064381 ] Takuya Ueshin commented on SPARK-2535: -- PR: https://github.com/apache/spark/pull/1451

[jira] [Updated] (SPARK-2537) Workaround Timezone specific Hive tests

2014-07-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-2537: -- Description: Several Hive tests in {{HiveCompatibilitySuite}} are timezone sensitive: - {{timestamp_1}

[jira] [Commented] (SPARK-2495) Ability to re-create ML models

2014-07-16 Thread Alexander Albul (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064374#comment-14064374 ] Alexander Albul commented on SPARK-2495: Hi Meng, Here is the list of models that

[jira] [Created] (SPARK-2538) External aggregation in Python

2014-07-16 Thread Davies Liu (JIRA)
Davies Liu created SPARK-2538: - Summary: External aggregation in Python Key: SPARK-2538 URL: https://issues.apache.org/jira/browse/SPARK-2538 Project: Spark Issue Type: Improvement Comp

[jira] [Created] (SPARK-2537) Workaround Timezone specific Hive tests

2014-07-16 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-2537: - Summary: Workaround Timezone specific Hive tests Key: SPARK-2537 URL: https://issues.apache.org/jira/browse/SPARK-2537 Project: Spark Issue Type: Bug Com

[jira] [Updated] (SPARK-2536) Update the MLlib page of Spark website

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2536: - Description: It still shows v0.9. (was: It stills shows v0.9.) > Update the MLlib page of Spark

[jira] [Comment Edited] (SPARK-2501) Handle stage re-submissions properly in the UI

2014-07-16 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064350#comment-14064350 ] Masayoshi TSUZUKI edited comment on SPARK-2501 at 7/16/14 11:58 PM:

[jira] [Created] (SPARK-2536) Update the MLlib page of Spark website

2014-07-16 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-2536: Summary: Update the MLlib page of Spark website Key: SPARK-2536 URL: https://issues.apache.org/jira/browse/SPARK-2536 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-2535) Add StringComparison case to NullPropagation.

2014-07-16 Thread Takuya Ueshin (JIRA)
Takuya Ueshin created SPARK-2535: Summary: Add StringComparison case to NullPropagation. Key: SPARK-2535 URL: https://issues.apache.org/jira/browse/SPARK-2535 Project: Spark Issue Type: Impro

[jira] [Commented] (SPARK-2501) Handle stage re-submissions properly in the UI

2014-07-16 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064350#comment-14064350 ] Masayoshi TSUZUKI commented on SPARK-2501: -- [SPARK-2299] seems to include the pro

[jira] [Updated] (SPARK-2534) Avoid pulling in the entire RDD in groupByKey

2014-07-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-2534: --- Target Version/s: 1.1.0, 1.0.2 (was: 1.1.0) > Avoid pulling in the entire RDD in groupByKey > --

[jira] [Commented] (SPARK-2534) Avoid pulling in the entire RDD in groupByKey

2014-07-16 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064333#comment-14064333 ] Sandy Ryza commented on SPARK-2534: --- Yowza > Avoid pulling in the entire RDD in groupBy

[jira] [Created] (SPARK-2534) Avoid pulling in the entire RDD in groupByKey

2014-07-16 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-2534: -- Summary: Avoid pulling in the entire RDD in groupByKey Key: SPARK-2534 URL: https://issues.apache.org/jira/browse/SPARK-2534 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-2533) ---- Show summary of locality level of completed tasks in the each stage page of web UI

2014-07-16 Thread Masayoshi TSUZUKI (JIRA)
Masayoshi TSUZUKI created SPARK-2533: Summary: Show summary of locality level of completed tasks in the each stage page of web UI Key: SPARK-2533 URL: https://issues.apache.org/jira/browse/SPARK-2533

[jira] [Updated] (SPARK-2533) Show summary of locality level of completed tasks in the each stage page of web UI

2014-07-16 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Masayoshi TSUZUKI updated SPARK-2533: - Summary: Show summary of locality level of completed tasks in the each stage page of web

[jira] [Updated] (SPARK-1997) Update breeze to version 0.8.1

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1997: - Target Version/s: 1.1.0 > Update breeze to version 0.8.1 > -- > >

[jira] [Updated] (SPARK-2495) Ability to re-create ML models

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2495: - Assignee: Alexander Albul > Ability to re-create ML models > -- > >

[jira] [Commented] (SPARK-2495) Ability to re-create ML models

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064251#comment-14064251 ] Xiangrui Meng commented on SPARK-2495: -- They were not by mistake but we were not sure

[jira] [Commented] (SPARK-2154) Worker goes down.

2014-07-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064172#comment-14064172 ] Patrick Wendell commented on SPARK-2154: [~talk2siva8] Yes, that's correct. > Wor

[jira] [Commented] (SPARK-2420) Change Spark build to minimize library conflicts

2014-07-16 Thread Xuefu Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064168#comment-14064168 ] Xuefu Zhang commented on SPARK-2420: As to guava conflict, HIVE-7387 has more details

[jira] [Updated] (SPARK-2434) Generate runtime warnings for naive implementations

2014-07-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2434: - Assignee: Burak Yavuz > Generate runtime warnings for naive implementations > ---

[jira] [Created] (SPARK-2532) Fix issues with consolidated shuffle

2014-07-16 Thread Mridul Muralidharan (JIRA)
Mridul Muralidharan created SPARK-2532: -- Summary: Fix issues with consolidated shuffle Key: SPARK-2532 URL: https://issues.apache.org/jira/browse/SPARK-2532 Project: Spark Issue Type: Bu

[jira] [Closed] (SPARK-2154) Worker goes down.

2014-07-16 Thread siva venkat gogineni (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] siva venkat gogineni closed SPARK-2154. --- Fixed in the future releases > Worker goes down. > - > >

[jira] [Resolved] (SPARK-2154) Worker goes down.

2014-07-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2154. Resolution: Fixed Fix Version/s: 1.1.0 1.0.2 Issue resolved by pu

[jira] [Updated] (SPARK-2154) Worker goes down.

2014-07-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2154: --- Assignee: Aaron Davidson > Worker goes down. > - > > Key: SPA

[jira] [Commented] (SPARK-2531) Make BroadcastNestedLoopJoin take into account a BuildSide

2014-07-16 Thread Zongheng Yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2531?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064106#comment-14064106 ] Zongheng Yang commented on SPARK-2531: -- Github PR: https://github.com/apache/spark/pu

[jira] [Commented] (SPARK-2519) Eliminate pattern-matching on Tuple2 in performance-critical aggregation code

2014-07-16 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064102#comment-14064102 ] Sandy Ryza commented on SPARK-2519: --- I looked in ShuffledRDD, ExternalAppendOnlyMap, App

[jira] [Updated] (SPARK-2411) Standalone Master - direct users to turn on event logs

2014-07-16 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2411: - Attachment: Master history not found.png > Standalone Master - direct users to turn on event logs > -

[jira] [Updated] (SPARK-2411) Standalone Master - direct users to turn on event logs

2014-07-16 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2411: - Attachment: (was: Master event logs.png) > Standalone Master - direct users to turn on event logs > -

[jira] [Commented] (SPARK-2454) Separate driver spark home from executor spark home

2014-07-16 Thread Nan Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064085#comment-14064085 ] Nan Zhu commented on SPARK-2454: I see, it makes sense to me... > Separate driver spark h

[jira] [Updated] (SPARK-2531) Make BroadcastNestedLoopJoin take into account a BuildSide

2014-07-16 Thread Zongheng Yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zongheng Yang updated SPARK-2531: - Component/s: SQL Priority: Minor (was: Major) Target Version/s: 1.1.0, 1.

[jira] [Created] (SPARK-2531) Make BroadcastNestedLoopJoin take into account a BuildSide

2014-07-16 Thread Zongheng Yang (JIRA)
Zongheng Yang created SPARK-2531: Summary: Make BroadcastNestedLoopJoin take into account a BuildSide Key: SPARK-2531 URL: https://issues.apache.org/jira/browse/SPARK-2531 Project: Spark Issu

[jira] [Closed] (SPARK-2465) Use long as user / item ID for ALS

2014-07-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen closed SPARK-2465. Resolution: Won't Fix Will possibly revisit this in the long term, or look at creating a parallel LongALS

[jira] [Updated] (SPARK-2454) Separate driver spark home from executor spark home

2014-07-16 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2454: - Description: The driver may not always share the same directory structure as the executors. It makes lit

[jira] [Commented] (SPARK-2454) Separate driver spark home from executor spark home

2014-07-16 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064036#comment-14064036 ] Andrew Or commented on SPARK-2454: -- There may be multiple installations of Spark on the e

[jira] [Commented] (SPARK-2420) Change Spark build to minimize library conflicts

2014-07-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064019#comment-14064019 ] Sean Owen commented on SPARK-2420: -- It'd be best to say what problem you are seeing with

[jira] [Commented] (SPARK-2492) KafkaReceiver minor changes to align with Kafka 0.8

2014-07-16 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064003#comment-14064003 ] Tathagata Das commented on SPARK-2492: -- I think I am starting to get it. But why does

[jira] [Resolved] (SPARK-2504) Fix nullability of Substring expression.

2014-07-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2504. - Resolution: Fixed Fix Version/s: 1.0.2 1.1.0 Assignee:

[jira] [Issue Comment Deleted] (SPARK-1215) Clustering: Index out of bounds error

2014-07-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-1215: - Comment: was deleted (was: Just to let you know, I'll give the go-ahead for this tomorrow

[jira] [Commented] (SPARK-2420) Change Spark build to minimize library conflicts

2014-07-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064000#comment-14064000 ] Reynold Xin commented on SPARK-2420: Thanks for looking into this. > Change Spark bui

[jira] [Commented] (SPARK-2420) Change Spark build to minimize library conflicts

2014-07-16 Thread Xuefu Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14063998#comment-14063998 ] Xuefu Zhang commented on SPARK-2420: Thanks for your comments, [~srowen]. I mostly agr

[jira] [Created] (SPARK-2530) Relax incorrect assumption of one ExternalAppendOnlyMap per thread

2014-07-16 Thread Andrew Or (JIRA)
Andrew Or created SPARK-2530: Summary: Relax incorrect assumption of one ExternalAppendOnlyMap per thread Key: SPARK-2530 URL: https://issues.apache.org/jira/browse/SPARK-2530 Project: Spark Iss
