[jira] [Commented] (SPARK-2612) ALS has data skew for popular product

2014-07-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2612?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069883#comment-14069883 ] Apache Spark commented on SPARK-2612: - User 'renozhang' has created a pull request for

[jira] [Commented] (SPARK-2421) Spark should treat writable as serializable for keys

2014-07-21 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069880#comment-14069880 ] Sandy Ryza commented on SPARK-2421: --- It should be relatively straightforward to add a Wr

[jira] [Created] (SPARK-2613) CLONE - word2vec: Distributed Representation of Words

2014-07-21 Thread Yifan Yang (JIRA)
Yifan Yang created SPARK-2613: - Summary: CLONE - word2vec: Distributed Representation of Words Key: SPARK-2613 URL: https://issues.apache.org/jira/browse/SPARK-2613 Project: Spark Issue Type: New

[jira] [Updated] (SPARK-2612) ALS has data skew for popular product

2014-07-21 Thread Peng Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peng Zhang updated SPARK-2612: -- Description: Usually there are some popular products which are related with many users in Rating input

[jira] [Created] (SPARK-2612) ALS has data skew for popular product

2014-07-21 Thread Peng Zhang (JIRA)
Peng Zhang created SPARK-2612: - Summary: ALS has data skew for popular product Key: SPARK-2612 URL: https://issues.apache.org/jira/browse/SPARK-2612 Project: Spark Issue Type: Bug Compo

[jira] [Resolved] (SPARK-2020) Spark 1.0.0 fails to run in coarse-grained mesos mode

2014-07-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2020. Resolution: Duplicate > Spark 1.0.0 fails to run in coarse-grained mesos mode > ---

[jira] [Updated] (SPARK-2022) Spark 1.0.0 is failing if mesos.coarse set to true

2014-07-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2022: --- Assignee: Tim Chen > Spark 1.0.0 is failing if mesos.coarse set to true > ---

[jira] [Commented] (SPARK-2576) slave node throws NoClassDefFoundError $line11.$read$ when executing a Spark QL query on HDFS CSV file

2014-07-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069852#comment-14069852 ] Yin Huai commented on SPARK-2576: - I will try to add a Spark REPL test. > slave node thr

[jira] [Updated] (SPARK-2470) Fix PEP 8 violations

2014-07-21 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-2470: --- Assignee: Nicholas Chammas (was: Prashant Sharma) > Fix PEP 8 violations > > >

[jira] [Resolved] (SPARK-2470) Fix PEP 8 violations

2014-07-21 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-2470. Resolution: Fixed Fix Version/s: 1.1.0 > Fix PEP 8 violations > > >

[jira] [Commented] (SPARK-2492) KafkaReceiver minor changes to align with Kafka 0.8

2014-07-21 Thread Tobias Pfeiffer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069795#comment-14069795 ] Tobias Pfeiffer commented on SPARK-2492: Maybe we should just add a helper functio

[jira] [Commented] (SPARK-2514) Random RDD generator

2014-07-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069778#comment-14069778 ] Apache Spark commented on SPARK-2514: - User 'dorx' has created a pull request for this

[jira] [Created] (SPARK-2611) VPC Issue while creating an ec2 cluster

2014-07-21 Thread Anass BENSRHIR (JIRA)
Anass BENSRHIR created SPARK-2611: - Summary: VPC Issue while creating an ec2 cluster Key: SPARK-2611 URL: https://issues.apache.org/jira/browse/SPARK-2611 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-1166) leftover vpc_id may block the creation of new ec2 cluster

2014-07-21 Thread Anass BENSRHIR (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069766#comment-14069766 ] Anass BENSRHIR commented on SPARK-1166: --- Actually i've got the same error today , a

[jira] [Comment Edited] (SPARK-2282) PySpark crashes if too many tasks complete quickly

2014-07-21 Thread Jeremy Freeman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069758#comment-14069758 ] Jeremy Freeman edited comment on SPARK-2282 at 7/22/14 3:18 AM:

[jira] [Commented] (SPARK-2282) PySpark crashes if too many tasks complete quickly

2014-07-21 Thread Jeremy Freeman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069758#comment-14069758 ] Jeremy Freeman commented on SPARK-2282: --- Hi all, I'm "the scientist", a couple updat

[jira] [Commented] (SPARK-2492) KafkaReceiver minor changes to align with Kafka 0.8

2014-07-21 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069727#comment-14069727 ] Saisai Shao commented on SPARK-2492: Hi Tobias, I agree with you. Though I do not kn

[jira] [Comment Edited] (SPARK-2492) KafkaReceiver minor changes to align with Kafka 0.8

2014-07-21 Thread Tobias Pfeiffer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069708#comment-14069708 ] Tobias Pfeiffer edited comment on SPARK-2492 at 7/22/14 2:23 AM: ---

[jira] [Comment Edited] (SPARK-2492) KafkaReceiver minor changes to align with Kafka 0.8

2014-07-21 Thread Tobias Pfeiffer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069708#comment-14069708 ] Tobias Pfeiffer edited comment on SPARK-2492 at 7/22/14 2:23 AM: ---

[jira] [Commented] (SPARK-2492) KafkaReceiver minor changes to align with Kafka 0.8

2014-07-21 Thread Tobias Pfeiffer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069708#comment-14069708 ] Tobias Pfeiffer commented on SPARK-2492: http://kafka.apache.org/08/configuration.

[jira] [Commented] (SPARK-2383) With auto.offset.reset, KafkaReceiver potentially deletes Consumer nodes from Zookeeper

2014-07-21 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2383?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069692#comment-14069692 ] Saisai Shao commented on SPARK-2383: Hi Tobias, I've also noticed this problem, seem

[jira] [Resolved] (SPARK-2086) Improve output of toDebugString to make shuffle boundaries more clear

2014-07-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2086. Resolution: Fixed Fix Version/s: 1.1.0 Issue resolved by pull request 1364 [https://

[jira] [Commented] (SPARK-2335) k-Nearest Neighbor classification and regression for MLLib

2014-07-21 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069676#comment-14069676 ] Yu Ishikawa commented on SPARK-2335: Hi Braian, I am implementing a approximate kNN-

[jira] [Commented] (SPARK-1767) Prefer HDFS-cached replicas when scheduling data-local tasks

2014-07-21 Thread Colin Patrick McCabe (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069653#comment-14069653 ] Colin Patrick McCabe commented on SPARK-1767: - [~pwendell], [~kayousterhout]:

[jira] [Updated] (SPARK-2479) Comparing floating-point numbers using relative error in UnitTests

2014-07-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2479: - Assignee: DB Tsai > Comparing floating-point numbers using relative error in UnitTests >

[jira] [Closed] (SPARK-2599) almostEquals mllib.util.TestingUtils does not behave as expected when comparing against 0.0

2014-07-21 Thread Doris Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doris Xin closed SPARK-2599. Resolution: Duplicate Refer to this issue: https://issues.apache.org/jira/browse/SPARK-2479 > almostEquals

[jira] [Resolved] (SPARK-2434) Generate runtime warnings for naive implementations

2014-07-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-2434. -- Resolution: Fixed Fix Version/s: 1.1.0 Issue resolved by pull request 1515 [https://gith

[jira] [Updated] (SPARK-2610) When spark.serializer is set as org.apache.spark.serializer.KryoSerializer, importing a method causes multiple spark applications creations

2014-07-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-2610: Priority: Minor (was: Major) > When spark.serializer is set as org.apache.spark.serializer.KryoSerializer,

[jira] [Commented] (SPARK-2610) When spark.serializer is set as org.apache.spark.serializer.KryoSerializer, importing a method causes multiple spark applications creations

2014-07-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069513#comment-14069513 ] Yin Huai commented on SPARK-2610: - For JavaSerializer... {code} class X() { println("What!

[jira] [Commented] (SPARK-2610) When spark.serializer is set as org.apache.spark.serializer.KryoSerializer, importing a method causes multiple spark applications creations

2014-07-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069510#comment-14069510 ] Yin Huai commented on SPARK-2610: - Thanks [~ilikerps] for helping me narrowing down the pr

[jira] [Updated] (SPARK-2610) When spark.serializer is set as org.apache.spark.serializer.KryoSerializer, importing a method causes multiple spark applications creations

2014-07-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-2610: Description: To reproduce, set {code} spark.serializerorg.apache.spark.serializer.KryoSerializer {c

[jira] [Commented] (SPARK-2022) Spark 1.0.0 is failing if mesos.coarse set to true

2014-07-21 Thread Timothy Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069508#comment-14069508 ] Timothy Chen commented on SPARK-2022: - this seems duplicate of SPARK-2020 > Spark 1.0

[jira] [Updated] (SPARK-2610) When spark.serializer is set as org.apache.spark.serializer.KryoSerializer, importing a method causes multiple spark applications creations

2014-07-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-2610: Summary: When spark.serializer is set as org.apache.spark.serializer.KryoSerializer, importing a method cau

[jira] [Commented] (SPARK-2454) Separate driver spark home from executor spark home

2014-07-21 Thread Nan Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069479#comment-14069479 ] Nan Zhu commented on SPARK-2454: there is a related issue and fix https://issues.apache.

[jira] [Updated] (SPARK-2420) Change Spark build to minimize library conflicts

2014-07-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2420: --- Target Version/s: 1.1.0 > Change Spark build to minimize library conflicts >

[jira] [Commented] (SPARK-2602) sbt/sbt test steals window focus on OS X

2014-07-21 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069404#comment-14069404 ] Nicholas Chammas commented on SPARK-2602: - I upgraded to Java 7 and that resolved

[jira] [Commented] (SPARK-2505) Weighted Regularizer

2014-07-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069354#comment-14069354 ] Apache Spark commented on SPARK-2505: - User 'dbtsai' has created a pull request for th

[jira] [Created] (SPARK-2610) When spark.serializer is set as org.apache.spark.serializer.KryoSerializer, importing sqlContext.createSchemaRDD causes multiple spark applications creations

2014-07-21 Thread Yin Huai (JIRA)
Yin Huai created SPARK-2610: --- Summary: When spark.serializer is set as org.apache.spark.serializer.KryoSerializer, importing sqlContext.createSchemaRDD causes multiple spark applications creations Key: SPARK-2610 URL

[jira] [Commented] (SPARK-2609) Log thread ID when spilling ExternalAppendOnlyMap

2014-07-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069327#comment-14069327 ] Apache Spark commented on SPARK-2609: - User 'andrewor14' has created a pull request fo

[jira] [Updated] (SPARK-2259) Spark submit documentation for --deploy-mode is highly misleading

2014-07-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2259: - Affects Version/s: (was: 1.1.0) 1.0.1 > Spark submit documentation for --deplo

[jira] [Updated] (SPARK-2266) Log page on Worker UI displays "Some(app-id)"

2014-07-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2266: - Affects Version/s: (was: 1.1.0) 1.0.0 > Log page on Worker UI displays "Some(a

[jira] [Updated] (SPARK-2260) Spark submit standalone-cluster mode is broken

2014-07-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2260: - Affects Version/s: (was: 1.1.0) 1.0.1 > Spark submit standalone-cluster mode i

[jira] [Updated] (SPARK-2258) Worker UI displays zombie executors

2014-07-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2258: - Affects Version/s: (was: 1.1.0) 1.0.0 > Worker UI displays zombie executors >

[jira] [Updated] (SPARK-2349) Fix NPE in ExternalAppendOnlyMap

2014-07-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2349: - Affects Version/s: (was: 1.1.0) 1.0.0 > Fix NPE in ExternalAppendOnlyMap > ---

[jira] [Updated] (SPARK-2392) Executors should not start their own HTTP servers

2014-07-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2392: - Affects Version/s: (was: 1.1.0) 1.0.0 > Executors should not start their own H

[jira] [Updated] (SPARK-2454) Separate driver spark home from executor spark home

2014-07-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2454: - Affects Version/s: (was: 1.1.0) 1.0.1 > Separate driver spark home from execut

[jira] [Commented] (SPARK-2567) Resubmitted stage sometimes remains as active stage in the web UI

2014-07-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069311#comment-14069311 ] Apache Spark commented on SPARK-2567: - User 'tsudukim' has created a pull request for

[jira] [Updated] (SPARK-2307) SparkUI Storage page cached statuses incorrect

2014-07-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2307: - Affects Version/s: (was: 1.1.0) 1.0.1 > SparkUI Storage page cached statuses i

[jira] [Updated] (SPARK-2435) Add shutdown hook to bin/pyspark

2014-07-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2435: - Affects Version/s: (was: 1.1.0) 1.0.1 > Add shutdown hook to bin/pyspark > ---

[jira] [Updated] (SPARK-2411) Standalone Master - direct users to turn on event logs

2014-07-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2411: - Affects Version/s: (was: 1.1.0) 1.0.1 > Standalone Master - direct users to tu

[jira] [Updated] (SPARK-2423) Clean up SparkSubmit for readability

2014-07-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2423: - Affects Version/s: (was: 1.1.0) 1.0.1 > Clean up SparkSubmit for readability >

[jira] [Updated] (SPARK-2296) Refactor util.JsonProtocol for evolvability

2014-07-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2296: - Affects Version/s: (was: 1.1.0) 1.0.0 > Refactor util.JsonProtocol for evolvab

[jira] [Updated] (SPARK-2340) Resolve paths properly for event logging / history server

2014-07-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2340: - Affects Version/s: (was: 1.0.0) 1.0.1 > Resolve paths properly for event loggi

[jira] [Updated] (SPARK-2307) SparkUI Storage page cached statuses incorrect

2014-07-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2307: - Affects Version/s: (was: 1.0.1) 1.0.0 > SparkUI Storage page cached statuses i

[jira] [Updated] (SPARK-2350) Master throws NPE

2014-07-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2350: - Affects Version/s: (was: 1.1.0) 1.0.0 > Master throws NPE > -

[jira] [Updated] (SPARK-2300) PySpark shell hides stderr output

2014-07-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2300: - Affects Version/s: (was: 1.1.0) 1.0.0 > PySpark shell hides stderr output > --

[jira] [Updated] (SPARK-2530) Relax incorrect assumption of one ExternalAppendOnlyMap per thread

2014-07-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2530: - Affects Version/s: (was: 1.1.0) 1.0.1 > Relax incorrect assumption of one Exte

[jira] [Created] (SPARK-2609) Log thread ID when spilling ExternalAppendOnlyMap

2014-07-21 Thread Andrew Or (JIRA)
Andrew Or created SPARK-2609: Summary: Log thread ID when spilling ExternalAppendOnlyMap Key: SPARK-2609 URL: https://issues.apache.org/jira/browse/SPARK-2609 Project: Spark Issue Type: Improveme

[jira] [Updated] (SPARK-2584) Do not mutate block storage level on the UI

2014-07-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2584: - Affects Version/s: (was: 1.1.0) 1.0.1 > Do not mutate block storage level on t

[jira] [Commented] (SPARK-2567) Resubmitted stage sometimes remains as active stage in the web UI

2014-07-21 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069306#comment-14069306 ] Masayoshi TSUZUKI commented on SPARK-2567: -- PRed: https://github.com/apache/spark

[jira] [Updated] (SPARK-2099) Report TaskMetrics for running tasks

2014-07-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2099: --- Component/s: Spark Core > Report TaskMetrics for running tasks >

[jira] [Updated] (SPARK-2585) Remove special handling of Hadoop JobConf

2014-07-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2585: --- Component/s: Spark Core > Remove special handling of Hadoop JobConf > ---

[jira] [Updated] (SPARK-2583) ConnectionManager cannot distinguish whether error occurred or not

2014-07-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2583: --- Component/s: Spark Core > ConnectionManager cannot distinguish whether error occurred or not

[jira] [Commented] (SPARK-2567) Resubmitted stage sometimes remains as active stage in the web UI

2014-07-21 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069278#comment-14069278 ] Masayoshi TSUZUKI commented on SPARK-2567: -- in submitMissingTasks of DAGScheduler

[jira] [Resolved] (SPARK-2605) Drop Hive Table If Exists Throw out Error by Spark

2014-07-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-2605. - Resolution: Not a Problem I am marking this as "Not a Problem". > Drop Hive Table If Exists Throw out Er

[jira] [Commented] (SPARK-2605) Drop Hive Table If Exists Throw out Error by Spark

2014-07-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069275#comment-14069275 ] Yin Huai commented on SPARK-2605: - It is log entry because hivetesting does not exist in y

[jira] [Commented] (SPARK-2576) slave node throws NoClassDefFoundError $line11.$read$ when executing a Spark QL query on HDFS CSV file

2014-07-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069248#comment-14069248 ] Yin Huai commented on SPARK-2576: - After SPARK-1199 has been reverted, my test works now (

[jira] [Comment Edited] (SPARK-1767) Prefer HDFS-cached replicas when scheduling data-local tasks

2014-07-21 Thread Aaron Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13997761#comment-13997761 ] Aaron Davidson edited comment on SPARK-1767 at 7/21/14 7:46 PM:

[jira] [Updated] (SPARK-2550) Support regularization and intercept in pyspark's linear methods

2014-07-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2550: - Assignee: Michael Yannakopoulos > Support regularization and intercept in pyspark's linear method

[jira] [Commented] (SPARK-2550) Support regularization and intercept in pyspark's linear methods

2014-07-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069107#comment-14069107 ] Xiangrui Meng commented on SPARK-2550: -- Great! Please feel free to submit a PR. Thank

[jira] [Updated] (SPARK-2494) Hash of None is different cross machines in CPython

2014-07-21 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2494: - Assignee: Davies Liu > Hash of None is different cross machines in CPython >

[jira] [Commented] (SPARK-2599) almostEquals mllib.util.TestingUtils does not behave as expected when comparing against 0.0

2014-07-21 Thread Doris Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069052#comment-14069052 ] Doris Xin commented on SPARK-2599: -- Did some digging through the codebase, and it seems l

[jira] [Updated] (SPARK-2494) Hash of None is different cross machines in CPython

2014-07-21 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2494: - Target Version/s: 1.1.0, 1.0.2, 0.9.3 (was: 1.1.0, 1.0.2) > Hash of None is different cross mach

[jira] [Updated] (SPARK-2494) Hash of None is different cross machines in CPython

2014-07-21 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2494: - Fix Version/s: (was: 1.0.1) (was: 1.0.0) 0.9.3

[jira] [Resolved] (SPARK-2494) Hash of None is different cross machines in CPython

2014-07-21 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-2494. -- Resolution: Fixed Merged this; thanks! > Hash of None is different cross machines in CPython >

[jira] [Updated] (SPARK-2494) Hash of None is different cross machines in CPython

2014-07-21 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2494: - Affects Version/s: 0.9.2 0.9.0 0.9.1 > Hash of None

[jira] [Updated] (SPARK-2494) Hash of None is different cross machines in CPython

2014-07-21 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-2494: - Priority: Major (was: Blocker) > Hash of None is different cross machines in CPython > -

[jira] [Updated] (SPARK-1199) Type mismatch in Spark shell when using case class defined in shell

2014-07-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1199: --- Description: *NOTE: This issue was fixed in 1.0.1, but the fix was reverted in Spark 1.0.2 p

[jira] [Updated] (SPARK-1199) Type mismatch in Spark shell when using case class defined in shell

2014-07-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1199: --- Description: NOTE: This issue was fixed in 1.0.1, but the fix was reverted in Spark 1.0.2 pe

[jira] [Updated] (SPARK-1199) Type mismatch in Spark shell when using case class defined in shell

2014-07-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1199: --- Fix Version/s: (was: 1.0.1) > Type mismatch in Spark shell when using case class defined

[jira] [Commented] (SPARK-1199) Type mismatch in Spark shell when using case class defined in shell

2014-07-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069038#comment-14069038 ] Patrick Wendell commented on SPARK-1199: Just a note, I've reverted this fix in br

[jira] [Commented] (SPARK-2576) slave node throws NoClassDefFoundError $line11.$read$ when executing a Spark QL query on HDFS CSV file

2014-07-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069002#comment-14069002 ] Yin Huai commented on SPARK-2576: - [~svend] Let me see what I can do about the 1.0.1 doc.

[jira] [Commented] (SPARK-2576) slave node throws NoClassDefFoundError $line11.$read$ when executing a Spark QL query on HDFS CSV file

2014-07-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069008#comment-14069008 ] Yin Huai commented on SPARK-2576: - I tried a standalone application (org.apache.spark.exam

[jira] [Commented] (SPARK-2434) Generate runtime warnings for naive implementations

2014-07-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069010#comment-14069010 ] Apache Spark commented on SPARK-2434: - User 'brkyvz' has created a pull request for th

[jira] [Resolved] (SPARK-1707) Remove 3 second sleep before starting app on YARN

2014-07-21 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-1707. -- Resolution: Fixed Fix Version/s: 1.1.0 Target Version/s: 1.1.0 > Remove 3 seco

[jira] [Updated] (SPARK-2576) slave node throws NoClassDefFoundError $line11.$read$ when executing a Spark QL query on HDFS CSV file

2014-07-21 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2576: Assignee: Yin Huai > slave node throws NoClassDefFoundError $line11.$read$ when executing a

[jira] [Commented] (SPARK-2576) slave node throws NoClassDefFoundError $line11.$read$ when executing a Spark QL query on HDFS CSV file

2014-07-21 Thread Svend Vanderveken (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068881#comment-14068881 ] Svend Vanderveken commented on SPARK-2576: -- Maybe this tuto should be updated unt

[jira] [Comment Edited] (SPARK-2583) ConnectionManager cannot distinguish whether error occurred or not

2014-07-21 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2583?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068814#comment-14068814 ] Kousuke Saruta edited comment on SPARK-2583 at 7/21/14 5:46 PM:

[jira] [Commented] (SPARK-2576) slave node throws NoClassDefFoundError $line11.$read$ when executing a Spark QL query on HDFS CSV file

2014-07-21 Thread Svend Vanderveken (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068852#comment-14068852 ] Svend Vanderveken commented on SPARK-2576: -- Great Yin! I confirm this workaround

[jira] [Commented] (SPARK-2583) ConnectionManager cannot distinguish whether error occurred or not

2014-07-21 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2583?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068814#comment-14068814 ] Kousuke Saruta commented on SPARK-2583: --- Hi [~pwendell], When I simulate disk fault

[jira] [Comment Edited] (SPARK-2576) slave node throws NoClassDefFoundError $line11.$read$ when executing a Spark QL query on HDFS CSV file

2014-07-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068807#comment-14068807 ] Yin Huai edited comment on SPARK-2576 at 7/21/14 5:11 PM: -- Forgot

[jira] [Commented] (SPARK-2576) slave node throws NoClassDefFoundError $line11.$read$ when executing a Spark QL query on HDFS CSV file

2014-07-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068807#comment-14068807 ] Yin Huai commented on SPARK-2576: - Forgot to mention, I used a local cluster and spark she

[jira] [Commented] (SPARK-2434) Generate runtime warnings for naive implementations

2014-07-21 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068771#comment-14068771 ] Burak Yavuz commented on SPARK-2434: Hi Michael, I did what was required already. Wil

[jira] [Commented] (SPARK-2576) slave node throws NoClassDefFoundError $line11.$read$ when executing a Spark QL query on HDFS CSV file

2014-07-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068726#comment-14068726 ] Yin Huai commented on SPARK-2576: - I did a quick test. Seems "import sqlContext.createSche

[jira] [Commented] (SPARK-2608) scheduler backend create executor launch command not correctly

2014-07-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068710#comment-14068710 ] Apache Spark commented on SPARK-2608: - User 'scwf' has created a pull request for this

[jira] [Created] (SPARK-2608) scheduler backend create executor launch command not correctly

2014-07-21 Thread wangfei (JIRA)
wangfei created SPARK-2608: -- Summary: scheduler backend create executor launch command not correctly Key: SPARK-2608 URL: https://issues.apache.org/jira/browse/SPARK-2608 Project: Spark Issue Type:

[jira] [Created] (SPARK-2607) SchemaRDD unionall prevents caching

2014-07-21 Thread Thierry Herrmann (JIRA)
Thierry Herrmann created SPARK-2607: --- Summary: SchemaRDD unionall prevents caching Key: SPARK-2607 URL: https://issues.apache.org/jira/browse/SPARK-2607 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-2604) Spark Application hangs on yarn in edge case scenario of executor memory requirement

2014-07-21 Thread Twinkle Sachdeva (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068648#comment-14068648 ] Twinkle Sachdeva commented on SPARK-2604: - For Executors, In verifyClusterResource

[jira] [Comment Edited] (SPARK-2604) Spark Application hangs on yarn in edge case scenario of executor memory requirement

2014-07-21 Thread Twinkle Sachdeva (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068648#comment-14068648 ] Twinkle Sachdeva edited comment on SPARK-2604 at 7/21/14 3:44 PM: --

[jira] [Commented] (SPARK-1680) Clean up use of setExecutorEnvs in SparkConf

2014-07-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068584#comment-14068584 ] Apache Spark commented on SPARK-1680: - User 'tgravescs' has created a pull request for

  1   2   >