Matei Zaharia created SPARK-2013:
Summary: Add Python pickleFile to programming guide
Key: SPARK-2013
URL: https://issues.apache.org/jira/browse/SPARK-2013
Project: Spark
Issue Type:
Matei Zaharia created SPARK-2014:
Summary: Make PySpark store RDDs in MEMORY_ONLY_SER with
compression by default
Key: SPARK-2014
URL: https://issues.apache.org/jira/browse/SPARK-2014
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-1977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14017477#comment-14017477
]
Xiangrui Meng commented on SPARK-1977:
--
This is more likely a version conflict in
Reynold Xin created SPARK-2016:
--
Summary: rdd in-memory storage UI becomes unresponsive when the
number of RDD partitions is large
Key: SPARK-2016
URL: https://issues.apache.org/jira/browse/SPARK-2016
Reynold Xin created SPARK-2017:
--
Summary: web ui stage page becomes unresponsive when the number of
tasks is large
Key: SPARK-2017
URL: https://issues.apache.org/jira/browse/SPARK-2017
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-2016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Reynold Xin updated SPARK-2016:
---
Labels: starter (was: )
rdd in-memory storage UI becomes unresponsive when the number of RDD
[
https://issues.apache.org/jira/browse/SPARK-1977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14017508#comment-14017508
]
Neville Li edited comment on SPARK-1977 at 6/4/14 8:45 AM:
---
We
[
https://issues.apache.org/jira/browse/SPARK-1977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14017508#comment-14017508
]
Neville Li commented on SPARK-1977:
---
We submit 1 spark-assembly and 1 job assembly jar
[
https://issues.apache.org/jira/browse/SPARK-1999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chen Chao updated SPARK-1999:
-
Comment: was deleted
(was: https://github.com/apache/spark/pull/950
Sorry, I will repost soon, the above
[
https://issues.apache.org/jira/browse/SPARK-1999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14017519#comment-14017519
]
Chen Chao commented on SPARK-1999:
--
PR: https://github.com/apache/spark/pull/968
UI :
[
https://issues.apache.org/jira/browse/SPARK-1999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chen Chao updated SPARK-1999:
-
Comment: was deleted
(was: I have fixed this and tested it. Please assign it to me; I will post a PR
[
https://issues.apache.org/jira/browse/SPARK-2018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14017528#comment-14017528
]
Sean Owen commented on SPARK-2018:
--
The meaning of the error is that Java thinks two
sam created SPARK-2019:
--
Summary: Spark workers die/disappear when job fails for nearly any
reason
Key: SPARK-2019
URL: https://issues.apache.org/jira/browse/SPARK-2019
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-1520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14017753#comment-14017753
]
Qiuzhuang Lian commented on SPARK-1520:
---
I can run the assembly jar via
[
https://issues.apache.org/jira/browse/SPARK-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14017776#comment-14017776
]
Mark Hamstra commented on SPARK-2019:
-
Please don't leave the Affects Version/s
[
https://issues.apache.org/jira/browse/SPARK-1817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14017812#comment-14017812
]
Kan Zhang commented on SPARK-1817:
--
There are 2 issues related to this bug. One is that
[
https://issues.apache.org/jira/browse/SPARK-1817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kan Zhang updated SPARK-1817:
-
Comment: was deleted
(was: PR: https://github.com/apache/spark/pull/760)
RDD zip erroneous when
[
https://issues.apache.org/jira/browse/SPARK-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Matei Zaharia updated SPARK-2013:
-
Assignee: Kan Zhang
Add Python pickleFile to programming guide
[
https://issues.apache.org/jira/browse/SPARK-1973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng resolved SPARK-1973.
--
Resolution: Implemented
PR: https://github.com/apache/spark/pull/919
Add randomSplit to
[
https://issues.apache.org/jira/browse/SPARK-1973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-1973:
-
Assignee: Sean Owen
Add randomSplit to JavaRDD (with tests, and tidy Java tests)
[
https://issues.apache.org/jira/browse/SPARK-1704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yin Huai updated SPARK-1704:
Comment: was deleted
(was: [~marmbrus] I am attaching the link to the PR.)
java.lang.AssertionError:
Ajay Viswanathan created SPARK-2020:
---
Summary: Spark 1.0.0 fails to run in coarse-grained mesos mode
Key: SPARK-2020
URL: https://issues.apache.org/jira/browse/SPARK-2020
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-2020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14018022#comment-14018022
]
Ajay Viswanathan commented on SPARK-2020:
-
Do I have to use Java 8 to rectify this
[
https://issues.apache.org/jira/browse/SPARK-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14018048#comment-14018048
]
sam commented on SPARK-2019:
Sorry. It's 0.9.1
Spark workers die/disappear when job fails for
[
https://issues.apache.org/jira/browse/SPARK-1508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14018043#comment-14018043
]
Zongheng Yang commented on SPARK-1508:
--
WIP PR:
[
https://issues.apache.org/jira/browse/SPARK-1508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14018053#comment-14018053
]
Michael Armbrust commented on SPARK-1508:
-
It is likely we will fix this issue
[
https://issues.apache.org/jira/browse/SPARK-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Mark Hamstra updated SPARK-2019:
Affects Version/s: 0.9.1
Spark workers die/disappear when job fails for nearly any reason
[
https://issues.apache.org/jira/browse/SPARK-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Patrick Wendell updated SPARK-2019:
---
Fix Version/s: 0.9.2
Spark workers die/disappear when job fails for nearly any reason
[
https://issues.apache.org/jira/browse/SPARK-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14018086#comment-14018086
]
Patrick Wendell commented on SPARK-2019:
We should dig into this and figure out
[
https://issues.apache.org/jira/browse/SPARK-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Patrick Wendell updated SPARK-2019:
---
Priority: Critical (was: Major)
Spark workers die/disappear when job fails for nearly any
[
https://issues.apache.org/jira/browse/SPARK-1977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14018155#comment-14018155
]
Xiangrui Meng commented on SPARK-1977:
--
In our example code, we only register
[
https://issues.apache.org/jira/browse/SPARK-1912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Matei Zaharia updated SPARK-1912:
-
Target Version/s: 0.9.2, 1.0.1, 1.1.0 (was: 0.9.2, 1.0.1)
Compression memory issue during
[
https://issues.apache.org/jira/browse/SPARK-1912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Matei Zaharia updated SPARK-1912:
-
Target Version/s: 0.9.2, 1.0.1
Compression memory issue during reduce
[
https://issues.apache.org/jira/browse/SPARK-1977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14018254#comment-14018254
]
Neville Li commented on SPARK-1977:
---
Yes we did register 'Rating'. And we had to
holdenk created SPARK-2023:
--
Summary: PySpark reduce does a map side reduce and then sends the
results to the driver for final reduce, instead do this more like Scala Spark.
Key: SPARK-2023
URL:
Matei Zaharia created SPARK-2024:
Summary: Add saveAsSequenceFile to PySpark
Key: SPARK-2024
URL: https://issues.apache.org/jira/browse/SPARK-2024
Project: Spark
Issue Type: New Feature
[
https://issues.apache.org/jira/browse/SPARK-1790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Patrick Wendell updated SPARK-1790:
---
Fix Version/s: 1.1.0
0.9.2
Update EC2 scripts to support r3 instance
[
https://issues.apache.org/jira/browse/SPARK-1790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Patrick Wendell resolved SPARK-1790.
Resolution: Fixed
Update EC2 scripts to support r3 instance types
[
https://issues.apache.org/jira/browse/SPARK-2011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14018327#comment-14018327
]
Tim Weninger commented on SPARK-2011:
-
I also think that there is a memory leak
[
https://issues.apache.org/jira/browse/SPARK-1977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14018328#comment-14018328
]
Shuo Xiang commented on SPARK-1977:
---
Hi [~neville], I just ran the MovieLens example on
Tim Weninger created SPARK-2025:
---
Summary: EdgeRDD persists after pregel iteration
Key: SPARK-2025
URL: https://issues.apache.org/jira/browse/SPARK-2025
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-2025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Weninger updated SPARK-2025:
Description:
Symptoms: During execution of a pregel script/function a copy of an
intermediate
Bernardo Gomez Palacio created SPARK-2026:
-
Summary: Maven hadoop* Profiles Should Set the expected Hadoop
Version.
Key: SPARK-2026
URL: https://issues.apache.org/jira/browse/SPARK-2026
Aaron Davidson created SPARK-2027:
-
Summary: spark-ec2 puts Hadoop's log4j ahead of Spark's in
classpath
Key: SPARK-2027
URL: https://issues.apache.org/jira/browse/SPARK-2027
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-2025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ankur Dave reassigned SPARK-2025:
-
Assignee: Ankur Dave
EdgeRDD persists after pregel iteration
[
https://issues.apache.org/jira/browse/SPARK-2025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14018389#comment-14018389
]
Tim Weninger commented on SPARK-2025:
-
adding
[
https://issues.apache.org/jira/browse/SPARK-1977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14018390#comment-14018390
]
Neville Li commented on SPARK-1977:
---
Our YARN cluster runs 2.2.0. We built
[
https://issues.apache.org/jira/browse/SPARK-2025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14018391#comment-14018391
]
Tim Weninger commented on SPARK-2025:
-
I'll leave it to you to make the bug fix. You
[
https://issues.apache.org/jira/browse/SPARK-2025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14018402#comment-14018402
]
Ankur Dave commented on SPARK-2025:
---
Proposed fix:
[
https://issues.apache.org/jira/browse/SPARK-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ankur Dave updated SPARK-1988:
--
Priority: Minor (was: Major)
Enable storing edges out-of-core
[
https://issues.apache.org/jira/browse/SPARK-2018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14018409#comment-14018409
]
Yanjie Gao commented on SPARK-2018:
---
Thanks for your quick reply!
I believe they use
Aaron Davidson created SPARK-2028:
-
Summary: Users of HadoopRDD cannot access the partition InputSplits
Key: SPARK-2028
URL: https://issues.apache.org/jira/browse/SPARK-2028
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-2028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Patrick Wendell updated SPARK-2028:
---
Issue Type: New Feature (was: Bug)
Users of HadoopRDD cannot access the partition
[
https://issues.apache.org/jira/browse/SPARK-2028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14018474#comment-14018474
]
Patrick Wendell commented on SPARK-2028:
I wantonly changed this from a Bug to a
[
https://issues.apache.org/jira/browse/SPARK-2027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Patrick Wendell updated SPARK-2027:
---
Component/s: EC2
spark-ec2 puts Hadoop's log4j ahead of Spark's in classpath
[
https://issues.apache.org/jira/browse/SPARK-2028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Patrick Wendell updated SPARK-2028:
---
Summary: Let users of HadoopRDD access the partition InputSplits (was:
Users of HadoopRDD
[
https://issues.apache.org/jira/browse/SPARK-2024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14018485#comment-14018485
]
Kan Zhang commented on SPARK-2024:
--
You meant SPARK-1416?
Add saveAsSequenceFile to
Takuya Ueshin created SPARK-2029:
Summary: Bump pom.xml version number of master branch to
1.1.0-SNAPSHOT.
Key: SPARK-2029
URL: https://issues.apache.org/jira/browse/SPARK-2029
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-2029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14018492#comment-14018492
]
Takuya Ueshin commented on SPARK-2029:
--
PRed:
Takuya Ueshin created SPARK-2030:
Summary: Bump SparkBuild.scala version number of branch-1.0 to
1.0.1-SNAPSHOT.
Key: SPARK-2030
URL: https://issues.apache.org/jira/browse/SPARK-2030
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-2030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14018493#comment-14018493
]
Takuya Ueshin commented on SPARK-2030:
--
PRed: