[
https://issues.apache.org/jira/browse/SPARK-3577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14138354#comment-14138354
]
Sandy Ryza commented on SPARK-3577:
---
In the old code, the ShuffleWriteMetrics didn't get
Sandy Ryza created SPARK-3560:
-
Summary: In yarn-cluster mode, jars are distributed through
multiple mechanisms.
Key: SPARK-3560
URL: https://issues.apache.org/jira/browse/SPARK-3560
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-3560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza updated SPARK-3560:
--
Component/s: YARN
In yarn-cluster mode, jars are distributed through multiple mechanisms.
[
https://issues.apache.org/jira/browse/SPARK-3172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14130475#comment-14130475
]
Sandy Ryza commented on SPARK-3172:
---
I mean in the web UI (which will require
Sandy Ryza created SPARK-3497:
-
Summary: Report serialized size of task binary
Key: SPARK-3497
URL: https://issues.apache.org/jira/browse/SPARK-3497
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-3464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza updated SPARK-3464:
--
Description: In most cases, even when an application is utilizing only a
small fraction of its
[
https://issues.apache.org/jira/browse/SPARK-3441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14125936#comment-14125936
]
Sandy Ryza edited comment on SPARK-3441 at 9/8/14 7:09 PM:
---
[
https://issues.apache.org/jira/browse/SPARK-3441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14125936#comment-14125936
]
Sandy Ryza commented on SPARK-3441:
---
I'll add mention that this can be used to get
[
https://issues.apache.org/jira/browse/SPARK-3441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14126192#comment-14126192
]
Sandy Ryza commented on SPARK-3441:
---
bq. One case where you may not care about giving a
[
https://issues.apache.org/jira/browse/SPARK-3441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14126454#comment-14126454
]
Sandy Ryza commented on SPARK-3441:
---
Right. It's not much work, but there are some
[
https://issues.apache.org/jira/browse/SPARK-3174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14125235#comment-14125235
]
Sandy Ryza commented on SPARK-3174:
---
To be clear, by YARN shuffle you mean the MR2
[
https://issues.apache.org/jira/browse/SPARK-3174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123238#comment-14123238
]
Sandy Ryza commented on SPARK-3174:
---
I've been putting a little bit of thought into this
Sandy Ryza created SPARK-3419:
-
Summary: Scheduler shouldn't delay running a task when executors
don't reside at any of its preferred locations
Key: SPARK-3419
URL: https://issues.apache.org/jira/browse/SPARK-3419
[
https://issues.apache.org/jira/browse/SPARK-3174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza updated SPARK-3174:
--
Attachment: SPARK-3174design.pdf
Under YARN, add and remove executors based on load
[
https://issues.apache.org/jira/browse/SPARK-3174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123504#comment-14123504
]
Sandy Ryza commented on SPARK-3174:
---
Posted a high-level design doc.
Under YARN, add
[
https://issues.apache.org/jira/browse/SPARK-2099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14123587#comment-14123587
]
Sandy Ryza commented on SPARK-2099:
---
Yeah, unfortunately I haven't had the chance to add
[
https://issues.apache.org/jira/browse/SPARK-3082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza resolved SPARK-3082.
---
Resolution: Fixed
Fix Version/s: 1.1.0
yarn.Client.logClusterResourceDetails throws NPE if
[
https://issues.apache.org/jira/browse/SPARK-2978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14118286#comment-14118286
]
Sandy Ryza commented on SPARK-2978:
---
IIUC, that would require using ShuffledRDD
[
https://issues.apache.org/jira/browse/SPARK-2978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14119080#comment-14119080
]
Sandy Ryza commented on SPARK-2978:
---
What's the thinking behind adding
[
https://issues.apache.org/jira/browse/SPARK-2978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14119091#comment-14119091
]
Sandy Ryza commented on SPARK-2978:
---
Ah ok, sounds good.
Provide an MR-style shuffle
Sandy Ryza created SPARK-3360:
-
Summary: Add RowMatrix.multiply(Vector)
Key: SPARK-3360
URL: https://issues.apache.org/jira/browse/SPARK-3360
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-3179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14117825#comment-14117825
]
Sandy Ryza commented on SPARK-3179:
---
Hi Michael,
Happy to help review your code or
[
https://issues.apache.org/jira/browse/SPARK-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza updated SPARK-1239:
--
Summary: Don't fetch all map output statuses at each reducer during
shuffles (was: Don't fetch all map
Sandy Ryza created SPARK-3183:
-
Summary: Add option for requesting full YARN cluster
Key: SPARK-3183
URL: https://issues.apache.org/jira/browse/SPARK-3183
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-2978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14105126#comment-14105126
]
Sandy Ryza commented on SPARK-2978:
---
So I started looking into this a little more and
[
https://issues.apache.org/jira/browse/SPARK-2978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14105128#comment-14105128
]
Sandy Ryza commented on SPARK-2978:
---
[~jerryshao], if I understand correctly, ShuffleRDD
Sandy Ryza created SPARK-3172:
-
Summary: Distinguish between shuffle spill on the map and reduce
side
Key: SPARK-3172
URL: https://issues.apache.org/jira/browse/SPARK-3172
Project: Spark
Issue
Sandy Ryza created SPARK-3174:
-
Summary: Under YARN, add and remove executors based on load
Key: SPARK-3174
URL: https://issues.apache.org/jira/browse/SPARK-3174
Project: Spark
Issue Type:
Sandy Ryza created SPARK-3179:
-
Summary: Add task OutputMetrics
Key: SPARK-3179
URL: https://issues.apache.org/jira/browse/SPARK-3179
Project: Spark
Issue Type: Improvement
Components:
[
https://issues.apache.org/jira/browse/SPARK-3019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14099888#comment-14099888
]
Sandy Ryza commented on SPARK-3019:
---
I agree that it's not typically a problem, but I
[
https://issues.apache.org/jira/browse/SPARK-3019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14099983#comment-14099983
]
Sandy Ryza commented on SPARK-3019:
---
Thanks for the info Mridul. A few extra
[
https://issues.apache.org/jira/browse/SPARK-2089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14099533#comment-14099533
]
Sandy Ryza commented on SPARK-2089:
---
These customizations should only come from Hadoop
[
https://issues.apache.org/jira/browse/SPARK-3019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14099537#comment-14099537
]
Sandy Ryza commented on SPARK-3019:
---
Just scanned this, so apologies if the answer is
Sandy Ryza created SPARK-3082:
-
Summary: yarn.Client.logClusterResourceDetails throws NPE if
YARN's getQueueInfo returns null
Key: SPARK-3082
URL: https://issues.apache.org/jira/browse/SPARK-3082
[
https://issues.apache.org/jira/browse/SPARK-3082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza updated SPARK-3082:
--
Summary: yarn.Client.logClusterResourceDetails throws NPE if requested
queue doesn't exist (was:
[
https://issues.apache.org/jira/browse/SPARK-3028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14097193#comment-14097193
]
Sandy Ryza commented on SPARK-3028:
---
+1 to what Patrick said. I'll post a patch along
[
https://issues.apache.org/jira/browse/SPARK-2089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14097826#comment-14097826
]
Sandy Ryza commented on SPARK-2089:
---
Hmm, it's true that my suggestion would require
[
https://issues.apache.org/jira/browse/SPARK-2089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14097826#comment-14097826
]
Sandy Ryza edited comment on SPARK-2089 at 8/14/14 10:41 PM:
-
Sandy Ryza created SPARK-3052:
-
Summary: Misleading and spurious FileSystem closed errors whenever
a job fails while reading from Hadoop
Key: SPARK-3052
URL: https://issues.apache.org/jira/browse/SPARK-3052
Sandy Ryza created SPARK-3053:
-
Summary: Reconcile spark.files.userClassPathFirst with
spark.yarn.user.classpath.first
Key: SPARK-3053
URL: https://issues.apache.org/jira/browse/SPARK-3053
Project: Spark
Sandy Ryza created SPARK-3055:
-
Summary: Stack trace logged in driver on job failure is usually
uninformative
Key: SPARK-3055
URL: https://issues.apache.org/jira/browse/SPARK-3055
Project: Spark
Sandy Ryza created SPARK-3014:
-
Summary: Log a more informative message when yarn-cluster app
fails because SparkContext wasn't initialized
Key: SPARK-3014
URL: https://issues.apache.org/jira/browse/SPARK-3014
[
https://issues.apache.org/jira/browse/SPARK-3014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza updated SPARK-3014:
--
Summary: Log more informative messages in a couple failure scenarios
(was: Log a more informative
[
https://issues.apache.org/jira/browse/SPARK-3014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza updated SPARK-3014:
--
Description:
This is what shows up currently when the user code fails to initialize a
SparkContext
[
https://issues.apache.org/jira/browse/SPARK-2089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14094872#comment-14094872
]
Sandy Ryza commented on SPARK-2089:
---
My opinion is that we should have a narrower API
Sandy Ryza created SPARK-2978:
-
Summary: Provide an MR-style shuffle transformation
Key: SPARK-2978
URL: https://issues.apache.org/jira/browse/SPARK-2978
Project: Spark
Issue Type: New Feature
[
https://issues.apache.org/jira/browse/SPARK-2978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza updated SPARK-2978:
--
Description:
For Hive on Spark joins in particular, and for running legacy MR code in
general, I
[
https://issues.apache.org/jira/browse/SPARK-2978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza updated SPARK-2978:
--
Description:
For Hive on Spark joins in particular, and for running legacy MR code in
general, I
[
https://issues.apache.org/jira/browse/SPARK-2978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza updated SPARK-2978:
--
Description:
For Hive on Spark joins in particular, and for running legacy MR code in
general, I
[
https://issues.apache.org/jira/browse/SPARK-2945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14091817#comment-14091817
]
Sandy Ryza commented on SPARK-2945:
---
spark.executor.instances apparently isn't used for
[
https://issues.apache.org/jira/browse/SPARK-2926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14091489#comment-14091489
]
Sandy Ryza commented on SPARK-2926:
---
Hi Saisai,
This seems like a very useful addition.
[
https://issues.apache.org/jira/browse/SPARK-1683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza resolved SPARK-1683.
---
Resolution: Fixed
Display filesystem read statistics with each task
[
https://issues.apache.org/jira/browse/SPARK-1683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14088900#comment-14088900
]
Sandy Ryza commented on SPARK-1683:
---
https://github.com/apache/spark/pull/962
Display
Sandy Ryza created SPARK-2900:
-
Summary: inputBytes aren't aggregated for stages like other task
metrics
Key: SPARK-2900
URL: https://issues.apache.org/jira/browse/SPARK-2900
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-2564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza resolved SPARK-2564.
---
Resolution: Fixed
Fix Version/s: 1.1.0
ShuffleReadMetrics.totalBlocksFetched is redundant
Sandy Ryza created SPARK-2894:
-
Summary: spark-shell doesn't accept flags
Key: SPARK-2894
URL: https://issues.apache.org/jira/browse/SPARK-2894
Project: Spark
Issue Type: Bug
Sandy Ryza created SPARK-2819:
-
Summary: Difficult to turn on intercept with linear models
Key: SPARK-2819
URL: https://issues.apache.org/jira/browse/SPARK-2819
Project: Spark
Issue Type:
Sandy Ryza created SPARK-2738:
-
Summary: Remove redundant imports in BlockManagerSuite
Key: SPARK-2738
URL: https://issues.apache.org/jira/browse/SPARK-2738
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-2664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072925#comment-14072925
]
Sandy Ryza commented on SPARK-2664:
---
I think the right behavior here is worth a little
[
https://issues.apache.org/jira/browse/SPARK-2664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072925#comment-14072925
]
Sandy Ryza edited comment on SPARK-2664 at 7/24/14 7:18 AM:
I
[
https://issues.apache.org/jira/browse/SPARK-2421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069880#comment-14069880
]
Sandy Ryza commented on SPARK-2421:
---
It should be relatively straightforward to add a
Sandy Ryza created SPARK-2621:
-
Summary: Update task InputMetrics incrementally
Key: SPARK-2621
URL: https://issues.apache.org/jira/browse/SPARK-2621
Project: Spark
Issue Type: Improvement
Sandy Ryza created SPARK-2625:
-
Summary: Fix ShuffleReadMetrics for NettyBlockFetcherIterator
Key: SPARK-2625
URL: https://issues.apache.org/jira/browse/SPARK-2625
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza resolved SPARK-2519.
---
Resolution: Fixed
Fix Version/s: 1.1.0
Eliminate pattern-matching on Tuple2 in
Sandy Ryza created SPARK-2574:
-
Summary: Avoid allocating new ArrayBuffer in groupByKey's
mergeCombiner
Key: SPARK-2574
URL: https://issues.apache.org/jira/browse/SPARK-2574
Project: Spark
Sandy Ryza created SPARK-2553:
-
Summary: CoGroupedRDD unnecessarily allocates a Tuple2 per dep per
key
Key: SPARK-2553
URL: https://issues.apache.org/jira/browse/SPARK-2553
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-2553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064709#comment-14064709
]
Sandy Ryza commented on SPARK-2553:
---
https://github.com/apache/spark/pull/1461
Sandy Ryza created SPARK-2564:
-
Summary: ShuffleReadMetrics.totalBlocksFetched is redundant
Key: SPARK-2564
URL: https://issues.apache.org/jira/browse/SPARK-2564
Project: Spark
Issue Type:
Sandy Ryza created SPARK-2565:
-
Summary: Update ShuffleReadMetrics as blocks are fetched
Key: SPARK-2565
URL: https://issues.apache.org/jira/browse/SPARK-2565
Project: Spark
Issue Type:
Sandy Ryza created SPARK-2566:
-
Summary: Update ShuffleWriteMetrics as data is written
Key: SPARK-2566
URL: https://issues.apache.org/jira/browse/SPARK-2566
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-2564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14065826#comment-14065826
]
Sandy Ryza commented on SPARK-2564:
---
https://github.com/apache/spark/pull/1474
[
https://issues.apache.org/jira/browse/SPARK-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14063251#comment-14063251
]
Sandy Ryza commented on SPARK-2519:
---
https://github.com/apache/spark/pull/1435
[
https://issues.apache.org/jira/browse/SPARK-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064102#comment-14064102
]
Sandy Ryza commented on SPARK-2519:
---
I looked in ShuffledRDD, ExternalAppendOnlyMap,
[
https://issues.apache.org/jira/browse/SPARK-2534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064333#comment-14064333
]
Sandy Ryza commented on SPARK-2534:
---
Yowza
Avoid pulling in the entire RDD in
Sandy Ryza created SPARK-2461:
-
Summary: Add a toString method to GeneralizedLinearModel
Key: SPARK-2461
URL: https://issues.apache.org/jira/browse/SPARK-2461
Project: Spark
Issue Type:
Sandy Ryza created SPARK-2462:
-
Summary: Make Vector.apply public
Key: SPARK-2462
URL: https://issues.apache.org/jira/browse/SPARK-2462
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-2384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14053834#comment-14053834
]
Sandy Ryza commented on SPARK-2384:
---
This is a great idea
Add tooltips for shuffle
[
https://issues.apache.org/jira/browse/SPARK-2310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza updated SPARK-2310:
--
Summary: Support arbitrary options on the command line with spark-submit
(was: Allow giving arbitrary
[
https://issues.apache.org/jira/browse/SPARK-1767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14041503#comment-14041503
]
Sandy Ryza commented on SPARK-1767:
---
It will be in the Hadoop 2.5 release
Prefer
[
https://issues.apache.org/jira/browse/SPARK-1675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza updated SPARK-1675:
--
Summary: Make clear whether computePrincipalComponents requires centered
data (was: Make clear
[
https://issues.apache.org/jira/browse/SPARK-1675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14039952#comment-14039952
]
Sandy Ryza commented on SPARK-1675:
---
I think it still wouldn't hurt to add a remark that
[
https://issues.apache.org/jira/browse/SPARK-1675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza updated SPARK-1675:
--
Priority: Trivial (was: Major)
Make clear whether computePrincipalComponents requires centered data
[
https://issues.apache.org/jira/browse/SPARK-1209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza reopened SPARK-1209:
---
It doesn't look like this was actually fixed.
SparkHadoopUtil should not use package org.apache.hadoop
Sandy Ryza created SPARK-2149:
-
Summary: [MLLIB] Kernel density estimation
Key: SPARK-2149
URL: https://issues.apache.org/jira/browse/SPARK-2149
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-2149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza updated SPARK-2149:
--
Summary: [MLLIB] Univariate kernel density estimation (was: [MLLIB] Kernel
density estimation)
[
https://issues.apache.org/jira/browse/SPARK-2149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14032079#comment-14032079
]
Sandy Ryza commented on SPARK-2149:
---
https://github.com/apache/spark/pull/1093
[MLLIB]
Sandy Ryza created SPARK-2146:
-
Summary: Fix the takeOrdered doc
Key: SPARK-2146
URL: https://issues.apache.org/jira/browse/SPARK-2146
Project: Spark
Issue Type: Bug
Affects Versions: 1.0.0
Sandy Ryza created SPARK-2142:
-
Summary: Give better indicator of how GC cuts into task time
Key: SPARK-2142
URL: https://issues.apache.org/jira/browse/SPARK-2142
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-1954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza resolved SPARK-1954.
---
Resolution: Duplicate
Make it easier to get Spark on YARN code to compile in IntelliJ
[
https://issues.apache.org/jira/browse/SPARK-2089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14028937#comment-14028937
]
Sandy Ryza commented on SPARK-2089:
---
I'll take this up.
It seems like our options are:
Sandy Ryza created SPARK-2131:
-
Summary: Collect per-task hdfs-bytes-written metrics
Key: SPARK-2131
URL: https://issues.apache.org/jira/browse/SPARK-2131
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-2131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza updated SPARK-2131:
--
Summary: Collect per-task filesystem-bytes-read/written metrics (was:
Collect per-task
[
https://issues.apache.org/jira/browse/SPARK-2131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza updated SPARK-2131:
--
Summary: Collect per-task filesystem-bytes-written metrics (was: Collect
per-task hdfs-bytes-written
Sandy Ryza created SPARK-2114:
-
Summary: Aggregations on raw data
Key: SPARK-2114
URL: https://issues.apache.org/jira/browse/SPARK-2114
Project: Spark
Issue Type: New Feature
[
https://issues.apache.org/jira/browse/SPARK-2114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza updated SPARK-2114:
--
Description:
For groupByKey and join transformations, Spark tasks on the reduce side
deserialize
[
https://issues.apache.org/jira/browse/SPARK-2114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza updated SPARK-2114:
--
Summary: groupByKey and joins on raw data (was: Aggregations on raw data)
groupByKey and joins on
[
https://issues.apache.org/jira/browse/SPARK-2099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14028681#comment-14028681
]
Sandy Ryza commented on SPARK-2099:
---
https://github.com/apache/spark/pull/1056
Report
Sandy Ryza created SPARK-2099:
-
Summary: Report metrics for running tasks
Key: SPARK-2099
URL: https://issues.apache.org/jira/browse/SPARK-2099
Project: Spark
Issue Type: Improvement
Affects
[
https://issues.apache.org/jira/browse/SPARK-2099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sandy Ryza updated SPARK-2099:
--
Description:
Spark currently collects a set of helpful task metrics, like shuffle bytes
written, GC
Sandy Ryza created SPARK-2084:
-
Summary: Mention SPARK_JAR in env var section on configuration page
Key: SPARK-2084
URL: https://issues.apache.org/jira/browse/SPARK-2084
Project: Spark
Issue