[
https://issues.apache.org/jira/browse/SPARK-6664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14394147#comment-14394147
]
Florian Verhein commented on SPARK-6664:
I guess the other thing is - we can union
[
https://issues.apache.org/jira/browse/SPARK-6664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14394141#comment-14394141
]
Florian Verhein commented on SPARK-6664:
Thanks [~sowen]. I disagree :-)
...If
[
https://issues.apache.org/jira/browse/SPARK-6665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14394291#comment-14394291
]
Florian Verhein commented on SPARK-6665:
Fair enough. I'll have to implement it
[
https://issues.apache.org/jira/browse/SPARK-6665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14394089#comment-14394089
]
Florian Verhein commented on SPARK-6665:
Thanks for the quick response [~sowen].
[
https://issues.apache.org/jira/browse/SPARK-6664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Florian Verhein updated SPARK-6664:
---
Description:
I can't find this functionality (if I missed something, apologies!), but it
Florian Verhein created SPARK-6665:
--
Summary: Randomly Shuffle an RDD
Key: SPARK-6665
URL: https://issues.apache.org/jira/browse/SPARK-6665
Project: Spark
Issue Type: New Feature
[
https://issues.apache.org/jira/browse/SPARK-6664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14391950#comment-14391950
]
Florian Verhein commented on SPARK-6664:
The closest approach I've found that
Florian Verhein created SPARK-6664:
--
Summary: Split Ordered RDD into multiple RDDs by keys (boundaries
or intervals)
Key: SPARK-6664
URL: https://issues.apache.org/jira/browse/SPARK-6664
Project:
[
https://issues.apache.org/jira/browse/SPARK-6601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Florian Verhein updated SPARK-6601:
---
Description:
Add module hdfs-nfs-gateway, which sets up the gateway for (say,
Florian Verhein created SPARK-6601:
--
Summary: Add HDFS NFS gateway module to spark-ec2
Key: SPARK-6601
URL: https://issues.apache.org/jira/browse/SPARK-6601
Project: Spark
Issue Type: New
[
https://issues.apache.org/jira/browse/SPARK-6600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Florian Verhein updated SPARK-6600:
---
Description:
Use case: User has set up the hadoop hdfs nfs gateway service on their
[
https://issues.apache.org/jira/browse/SPARK-6600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Florian Verhein updated SPARK-6600:
---
Summary: Open ports in ec2/spark_ec2.py to allow HDFS NFS gateway(was:
Open ports in
[
https://issues.apache.org/jira/browse/SPARK-6600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Florian Verhein updated SPARK-6600:
---
Description:
Use case: User has set up the hadoop hdfs nfs gateway service on their
[
https://issues.apache.org/jira/browse/SPARK-6601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Florian Verhein updated SPARK-6601:
---
Description:
Add module hdfs-nfs-gateway, which sets up the gateway for (say,
[
https://issues.apache.org/jira/browse/SPARK-5879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14328612#comment-14328612
]
Florian Verhein commented on SPARK-5879:
cc [~shivaram], any opinions on how to
Florian Verhein created SPARK-5879:
--
Summary: spary_ec2.py should expose/return master and slave lists
(e.g. write to file)
Key: SPARK-5879
URL: https://issues.apache.org/jira/browse/SPARK-5879
[
https://issues.apache.org/jira/browse/SPARK-5851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14324986#comment-14324986
]
Florian Verhein commented on SPARK-5851:
That makes sense.
Yeah, I ran into it
Florian Verhein created SPARK-5851:
--
Summary: spark_ec2.py ssh failure retry handling not always
appropriate
Key: SPARK-5851
URL: https://issues.apache.org/jira/browse/SPARK-5851
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-5813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14322611#comment-14322611
]
Florian Verhein commented on SPARK-5813:
I think it's a good idea to stick to
[
https://issues.apache.org/jira/browse/SPARK-5813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Florian Verhein closed SPARK-5813.
--
Resolution: Won't Fix
Spark-ec2: Switch to OracleJDK
--
[
https://issues.apache.org/jira/browse/SPARK-5813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14321764#comment-14321764
]
Florian Verhein commented on SPARK-5813:
INAL but here are my thoughts:
The user
[
https://issues.apache.org/jira/browse/SPARK-5813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14322208#comment-14322208
]
Florian Verhein commented on SPARK-5813:
Good point. I think you're right re:
[
https://issues.apache.org/jira/browse/SPARK-5813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14321748#comment-14321748
]
Florian Verhein commented on SPARK-5813:
No specific technical reason esp WRT
[
https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320995#comment-14320995
]
Florian Verhein commented on SPARK-3821:
RE: Java, that reminds me... We should
Florian Verhein created SPARK-5813:
--
Summary: Spark-ec2: Switch to OracleJDK
Key: SPARK-5813
URL: https://issues.apache.org/jira/browse/SPARK-5813
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-5641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Florian Verhein updated SPARK-5641:
---
Description:
*Updated - no longer via deploy.generic, no substitutions*
Essentially, give
[
https://issues.apache.org/jira/browse/SPARK-5641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Florian Verhein updated SPARK-5641:
---
Description:
Useful if binary files need to be uploaded. E.g. I use this for rpm transfer to
[
https://issues.apache.org/jira/browse/SPARK-5676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14313102#comment-14313102
]
Florian Verhein edited comment on SPARK-5676 at 2/9/15 11:06 PM:
[
https://issues.apache.org/jira/browse/SPARK-5676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14313102#comment-14313102
]
Florian Verhein commented on SPARK-5676:
[~srowen] Yep, that's the one.
True.
[
https://issues.apache.org/jira/browse/SPARK-5676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14313135#comment-14313135
]
Florian Verhein commented on SPARK-5676:
Makes sense. Thanks.
License missing
Florian Verhein created SPARK-5676:
--
Summary: License missing from spark-ec2 repo
Key: SPARK-5676
URL: https://issues.apache.org/jira/browse/SPARK-5676
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-3185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308644#comment-14308644
]
Florian Verhein commented on SPARK-3185:
[~dvohra] Sure, but the exception is
Florian Verhein created SPARK-5641:
--
Summary: Allow spark_ec2.py to copy arbitrary files to cluster via
deploy.generic
Key: SPARK-5641
URL: https://issues.apache.org/jira/browse/SPARK-5641
Project:
[
https://issues.apache.org/jira/browse/SPARK-5552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14304412#comment-14304412
]
Florian Verhein commented on SPARK-5552:
Thanks [~sowen].
So it wouldn't fit in
Florian Verhein created SPARK-5552:
--
Summary: Automated data science AMIs creation and cluster
deployment on EC2
Key: SPARK-5552
URL: https://issues.apache.org/jira/browse/SPARK-5552
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-5552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Florian Verhein updated SPARK-5552:
---
Summary: Automated data science AMI creation and data science cluster
deployment on EC2
[
https://issues.apache.org/jira/browse/SPARK-3185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14290923#comment-14290923
]
Florian Verhein commented on SPARK-3185:
Sure [~grzegorz-dubicki]. You need to
[
https://issues.apache.org/jira/browse/SPARK-5331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Florian Verhein updated SPARK-5331:
---
Component/s: EC2
Description:
ps -ef | grep Tachyon
shows Tachyon running on the master
[
https://issues.apache.org/jira/browse/SPARK-3185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283493#comment-14283493
]
Florian Verhein commented on SPARK-3185:
I built tachyon with the correct hadoop
Florian Verhein created SPARK-5331:
--
Summary: Tachyon workers seem to ignore tachyon.master.hostname
and use localhost instead
Key: SPARK-5331
URL: https://issues.apache.org/jira/browse/SPARK-5331
[
https://issues.apache.org/jira/browse/SPARK-3185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14276436#comment-14276436
]
Florian Verhein commented on SPARK-3185:
I'm also getting this, though with Server
[
https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14276572#comment-14276572
]
Florian Verhein commented on SPARK-3821:
Thanks [~nchammas], that makes sense.
Florian Verhein created SPARK-5241:
--
Summary: spark-ec2 spark init scripts do not handle all hadoop (or
tachyon?) dependencies correctly
Key: SPARK-5241
URL: https://issues.apache.org/jira/browse/SPARK-5241
[
https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14276263#comment-14276263
]
Florian Verhein commented on SPARK-3821:
This is great stuff! It'll also help
44 matches
Mail list logo