[
https://issues.apache.org/jira/browse/SPARK-8597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Matt Cheah updated SPARK-8597:
--
Attachment: table.csv
> DataFrame partitionBy memory pressure scales extremely poorly
>
[
https://issues.apache.org/jira/browse/SPARK-8597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Matt Cheah updated SPARK-8597:
--
Summary: DataFrame partitionBy memory pressure scales extremely poorly
(was: DataFrame partitionBy scal
Matt Cheah created SPARK-8597:
-
Summary: DataFrame partitionBy scales extremely poorly
Key: SPARK-8597
URL: https://issues.apache.org/jira/browse/SPARK-8597
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-8167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14597956#comment-14597956
]
Matt Cheah commented on SPARK-8167:
---
I'm starting to work on this now, sorry for the del
[
https://issues.apache.org/jira/browse/SPARK-8167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14577727#comment-14577727
]
Matt Cheah commented on SPARK-8167:
---
To be clear this is independent of SPARK-7451. SPAR
[
https://issues.apache.org/jira/browse/SPARK-7611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553448#comment-14553448
]
Matt Cheah edited comment on SPARK-7611 at 5/21/15 12:54 AM:
-
[
https://issues.apache.org/jira/browse/SPARK-7611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553448#comment-14553448
]
Matt Cheah commented on SPARK-7611:
---
For context, was there a specific decision to avoid
[
https://issues.apache.org/jira/browse/SPARK-7108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552756#comment-14552756
]
Matt Cheah commented on SPARK-7108:
---
Just wanted to add my two cents here. I've had seve
[
https://issues.apache.org/jira/browse/SPARK-6950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14498555#comment-14498555
]
Matt Cheah edited comment on SPARK-6950 at 4/16/15 7:32 PM:
Th
[
https://issues.apache.org/jira/browse/SPARK-6950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14498555#comment-14498555
]
Matt Cheah edited comment on SPARK-6950 at 4/16/15 7:31 PM:
Th
[
https://issues.apache.org/jira/browse/SPARK-6950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Matt Cheah resolved SPARK-6950.
---
Resolution: Cannot Reproduce
Fix Version/s: 1.3.1
> Spark master UI believes some applications
[
https://issues.apache.org/jira/browse/SPARK-6950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14498555#comment-14498555
]
Matt Cheah commented on SPARK-6950:
---
This is no longer an issue on the tip of branch-1.3
[
https://issues.apache.org/jira/browse/SPARK-6950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14498390#comment-14498390
]
Matt Cheah edited comment on SPARK-6950 at 4/16/15 5:57 PM:
Th
[
https://issues.apache.org/jira/browse/SPARK-6950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14498390#comment-14498390
]
Matt Cheah commented on SPARK-6950:
---
There's one way I could reproduce this locally, but
[
https://issues.apache.org/jira/browse/SPARK-6950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14497273#comment-14497273
]
Matt Cheah edited comment on SPARK-6950 at 4/15/15 11:18 PM:
-
[
https://issues.apache.org/jira/browse/SPARK-6950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14497273#comment-14497273
]
Matt Cheah edited comment on SPARK-6950 at 4/15/15 11:16 PM:
-
[
https://issues.apache.org/jira/browse/SPARK-6950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14497273#comment-14497273
]
Matt Cheah commented on SPARK-6950:
---
It is not the same as SPARK-6107. I'm not sure if S
Matt Cheah created SPARK-6951:
-
Summary: History server slow startup if the event log directory is
large
Key: SPARK-6951
URL: https://issues.apache.org/jira/browse/SPARK-6951
Project: Spark
Issu
Matt Cheah created SPARK-6950:
-
Summary: Spark master UI believes some applications are in
progress when they are actually completed
Key: SPARK-6950
URL: https://issues.apache.org/jira/browse/SPARK-6950
P
[
https://issues.apache.org/jira/browse/SPARK-6950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Matt Cheah updated SPARK-6950:
--
Description:
In Spark 1.2.x, I was able to set my spark event log directory to be a
different location
[
https://issues.apache.org/jira/browse/SPARK-5697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Matt Cheah resolved SPARK-5697.
---
Resolution: Won't Fix
> Allow Spark driver to wait longer before giving up connecting to the master
>
Matt Cheah created SPARK-6405:
-
Summary: Spark Kryo buffer should be forced to be max. 2GB
Key: SPARK-6405
URL: https://issues.apache.org/jira/browse/SPARK-6405
Project: Spark
Issue Type: Improve
[
https://issues.apache.org/jira/browse/SPARK-6269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14363632#comment-14363632
]
Matt Cheah commented on SPARK-6269:
---
I updated the code since then so the updated micro
[
https://issues.apache.org/jira/browse/SPARK-6269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14363632#comment-14363632
]
Matt Cheah edited comment on SPARK-6269 at 3/16/15 6:12 PM:
I
Matt Cheah created SPARK-6269:
-
Summary: Using a different implementation of java array reflection
for size estimation
Key: SPARK-6269
URL: https://issues.apache.org/jira/browse/SPARK-6269
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-4879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349188#comment-14349188
]
Matt Cheah commented on SPARK-4879:
---
Can you perhaps jstack or profile the driver and th
Matt Cheah created SPARK-6044:
-
Summary: RDD.aggregate() should not use the closure serializer on
the zero value
Key: SPARK-6044
URL: https://issues.apache.org/jira/browse/SPARK-6044
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-5843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Matt Cheah updated SPARK-5843:
--
Description: It would be nice if users of the Java API could specify the
map-side-combine and serializer
[
https://issues.apache.org/jira/browse/SPARK-5843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14323259#comment-14323259
]
Matt Cheah commented on SPARK-5843:
---
Code's on my screen right now and will ship shortly
Matt Cheah created SPARK-5843:
-
Summary: Expose Map-Side-Combine Setting in
JavaPairRDD.combineByKey()
Key: SPARK-5843
URL: https://issues.apache.org/jira/browse/SPARK-5843
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-4879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14315918#comment-14315918
]
Matt Cheah edited comment on SPARK-4879 at 2/11/15 10:00 AM:
-
[
https://issues.apache.org/jira/browse/SPARK-4879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14315918#comment-14315918
]
Matt Cheah commented on SPARK-4879:
---
Thanks for picking this up @JoshRosen, I see that t
[
https://issues.apache.org/jira/browse/SPARK-4906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14313947#comment-14313947
]
Matt Cheah edited comment on SPARK-4906 at 2/10/15 10:15 AM:
-
[
https://issues.apache.org/jira/browse/SPARK-4906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14313947#comment-14313947
]
Matt Cheah commented on SPARK-4906:
---
Spark has logic for failing a stage if there are to
[
https://issues.apache.org/jira/browse/SPARK-5697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14312938#comment-14312938
]
Matt Cheah commented on SPARK-5697:
---
I'll submit a pull request for this shortly.
> All
Matt Cheah created SPARK-5697:
-
Summary: Allow Spark driver to wait longer before giving up
connecting to the master
Key: SPARK-5697
URL: https://issues.apache.org/jira/browse/SPARK-5697
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-5691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14312632#comment-14312632
]
Matt Cheah commented on SPARK-5691:
---
I've determined that this is a pretty simple bug in
Matt Cheah created SPARK-5691:
-
Summary: Preventing duplicate registering of an application has
incorrect logic
Key: SPARK-5691
URL: https://issues.apache.org/jira/browse/SPARK-5691
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-4349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14298017#comment-14298017
]
Matt Cheah edited comment on SPARK-4349 at 1/30/15 1:12 AM:
Wh
[
https://issues.apache.org/jira/browse/SPARK-4349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Matt Cheah closed SPARK-4349.
-
Resolution: Fixed
> Spark driver hangs on sc.parallelize() if exception is thrown during
> serialization
[
https://issues.apache.org/jira/browse/SPARK-4349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14298017#comment-14298017
]
Matt Cheah commented on SPARK-4349:
---
Whoops, this was fixed by SPARK-4737. Someone want
[
https://issues.apache.org/jira/browse/SPARK-4737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Matt Cheah updated SPARK-4737:
--
Comment: was deleted
(was: I will be out of the office with limited access to e-mail from January 05
to
[
https://issues.apache.org/jira/browse/SPARK-4737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14265267#comment-14265267
]
Matt Cheah commented on SPARK-4737:
---
I will be out of the office with limited access to
[
https://issues.apache.org/jira/browse/SPARK-4737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14236494#comment-14236494
]
Matt Cheah commented on SPARK-4737:
---
Forgot to mention that I'm actively working on this
[
https://issues.apache.org/jira/browse/SPARK-4349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14230384#comment-14230384
]
Matt Cheah commented on SPARK-4349:
---
Priorities shifted since I wrote that comment and I
[
https://issues.apache.org/jira/browse/SPARK-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Matt Cheah updated SPARK-4679:
--
Description:
We started seeing some strange behavior when we were querying the Spark UI JSON
endpoint f
[
https://issues.apache.org/jira/browse/SPARK-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Matt Cheah updated SPARK-4679:
--
Description:
We started seeing some strange behavior when we were querying the Spark UI JSON
endpoint f
Matt Cheah created SPARK-4679:
-
Summary: Race condition in querying the Spark UI JSON endpoint
when Jetty context handlers are added and removed
Key: SPARK-4679
URL: https://issues.apache.org/jira/browse/SPARK-4679
[
https://issues.apache.org/jira/browse/SPARK-4349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14230277#comment-14230277
]
Matt Cheah commented on SPARK-4349:
---
I agree that's a better solution. I fell back to th
[
https://issues.apache.org/jira/browse/SPARK-4349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14207392#comment-14207392
]
Matt Cheah commented on SPARK-4349:
---
Investigation showed that the DAGScheduler may not
[
https://issues.apache.org/jira/browse/SPARK-4349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14207185#comment-14207185
]
Matt Cheah commented on SPARK-4349:
---
I'm investigating this now. Someone can assign to m
Matt Cheah created SPARK-4349:
-
Summary: Spark driver hangs on sc.parallelize() if exception is
thrown during serialization
Key: SPARK-4349
URL: https://issues.apache.org/jira/browse/SPARK-4349
Project: S
[
https://issues.apache.org/jira/browse/SPARK-3819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Matt Cheah closed SPARK-3819.
-
Resolution: Won't Fix
Not much activity for awhile. Doesn't seem that important anyways, the cases
when w
[
https://issues.apache.org/jira/browse/SPARK-3466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14177826#comment-14177826
]
Matt Cheah commented on SPARK-3466:
---
I got caught up in some other things, so I haven't
[
https://issues.apache.org/jira/browse/SPARK-3466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14174148#comment-14174148
]
Matt Cheah commented on SPARK-3466:
---
I'll look into this. Someone please assign to me!
[
https://issues.apache.org/jira/browse/SPARK-3736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14171331#comment-14171331
]
Matt Cheah commented on SPARK-3736:
---
I was curious if anyone had any feedback on my abov
[
https://issues.apache.org/jira/browse/SPARK-3835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14165785#comment-14165785
]
Matt Cheah commented on SPARK-3835:
---
This is the opposite problem, actually - a Spark co
[
https://issues.apache.org/jira/browse/SPARK-3835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14165722#comment-14165722
]
Matt Cheah edited comment on SPARK-3835 at 10/9/14 8:48 PM:
An
[
https://issues.apache.org/jira/browse/SPARK-3835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14165722#comment-14165722
]
Matt Cheah commented on SPARK-3835:
---
Any updates on this?
> Spark applications that are
[
https://issues.apache.org/jira/browse/SPARK-3736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14165513#comment-14165513
]
Matt Cheah edited comment on SPARK-3736 at 10/9/14 6:42 PM:
Ar
[
https://issues.apache.org/jira/browse/SPARK-3736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14165513#comment-14165513
]
Matt Cheah commented on SPARK-3736:
---
Are the two linked cases above different though?
(
Matt Cheah created SPARK-3835:
-
Summary: Spark applications that are killed should show up as
"KILLED" or "CANCELLED" in the Spark UI
Key: SPARK-3835
URL: https://issues.apache.org/jira/browse/SPARK-3835
[
https://issues.apache.org/jira/browse/SPARK-3819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14162195#comment-14162195
]
Matt Cheah commented on SPARK-3819:
---
Can you elaborate as to why it is not feasible to b
Matt Cheah created SPARK-3819:
-
Summary: Jenkins should compile Spark against multiple versions of
Hadoop
Key: SPARK-3819
URL: https://issues.apache.org/jira/browse/SPARK-3819
Project: Spark
Iss
[
https://issues.apache.org/jira/browse/SPARK-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14155097#comment-14155097
]
Matt Cheah commented on SPARK-1860:
---
This might be a silly question, but are we guarante
[
https://issues.apache.org/jira/browse/SPARK-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14153893#comment-14153893
]
Matt Cheah commented on SPARK-1860:
---
Would like confirmation from [~adav] for the propos
[
https://issues.apache.org/jira/browse/SPARK-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14153514#comment-14153514
]
Matt Cheah commented on SPARK-1860:
---
The change I am going to make is that when the clea
[
https://issues.apache.org/jira/browse/SPARK-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14153382#comment-14153382
]
Matt Cheah edited comment on SPARK-1860 at 9/30/14 4:48 PM:
Co
[
https://issues.apache.org/jira/browse/SPARK-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14153382#comment-14153382
]
Matt Cheah commented on SPARK-1860:
---
Cool, I see where you're coming from now. I'll whip
[
https://issues.apache.org/jira/browse/SPARK-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14152758#comment-14152758
]
Matt Cheah edited comment on SPARK-1860 at 9/30/14 3:21 AM:
I
[
https://issues.apache.org/jira/browse/SPARK-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14152758#comment-14152758
]
Matt Cheah edited comment on SPARK-1860 at 9/30/14 3:14 AM:
I
[
https://issues.apache.org/jira/browse/SPARK-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14152758#comment-14152758
]
Matt Cheah commented on SPARK-1860:
---
I agree we should focus the scope on cleaning up th
[
https://issues.apache.org/jira/browse/SPARK-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14152621#comment-14152621
]
Matt Cheah commented on SPARK-1860:
---
ExecutorRunner seems to have various cases correspo
[
https://issues.apache.org/jira/browse/SPARK-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14152421#comment-14152421
]
Matt Cheah commented on SPARK-1860:
---
Apologies for any naivety - this will be the first
201 - 274 of 274 matches
Mail list logo