[jira] [Commented] (SPARK-5390) Encourage users to post on Stack Overflow in Community Docs

2015-01-23 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290187#comment-14290187 ] Nicholas Chammas commented on SPARK-5390: - cc [~pwendell] > Encourage u

[jira] [Commented] (SPARK-5390) Encourage users to post on Stack Overflow in Community Docs

2015-01-23 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290185#comment-14290185 ] Nicholas Chammas commented on SPARK-5390: - Updated accordingly. > En

[jira] [Updated] (SPARK-5390) Encourage users to post on Stack Overflow in Community Docs

2015-01-23 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-5390: Description: As [discussed extensively on the user list|http://apache-spark-user-list

[jira] [Updated] (SPARK-5390) Encourage users to post on Stack Overflow in Community Docs

2015-01-23 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-5390: Description: As [discussed extensively on the user list|http://apache-spark-user-list

[jira] [Updated] (SPARK-5390) Encourage users to post on Stack Overflow in Community Docs

2015-01-23 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-5390: Description: As [discussed extensively on the user list|http://apache-spark-user-list

[jira] [Created] (SPARK-5390) Encourage users to post on Stack Overflow in Community Docs

2015-01-23 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-5390: --- Summary: Encourage users to post on Stack Overflow in Community Docs Key: SPARK-5390 URL: https://issues.apache.org/jira/browse/SPARK-5390 Project: Spark

Re: Discourse: A proposed alternative to the Spark User list

2015-01-23 Thread Nicholas Chammas
> allowing the use of third party lists or communication fora - provided > that they allow exporting the conversation if those sites were to > change course. However, the state of the art stands as such. > > - Patrick > > On Wed, Jan 21, 2015 at 8:43 AM, Nicholas Chammas > wro

[jira] [Commented] (SPARK-5366) check for mode of private key file

2015-01-22 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14288480#comment-14288480 ] Nicholas Chammas commented on SPARK-5366: - If the problem is no warnings an

Re: Discourse: A proposed alternative to the Spark User list

2015-01-22 Thread Nicholas Chammas
I agree with Sean that a Spark-specific Stack Exchange likely won't help and almost certainly won't make it out of Area 51. The idea certainly sounds nice from our perspective as Spark users, but it doesn't mesh with the structure of Stack Exchange or the criteria for creating new sites. On Thu Ja

Re: Discourse: A proposed alternative to the Spark User list

2015-01-22 Thread Nicholas Chammas
pecific lists? That might also help tune in/out the subset of >> conversations of interest. >> On Jan 22, 2015 10:30 AM, "Petar Zecevic" >> wrote: >> >>> >>> Ok, thanks for the clarifications. I didn't know this list has to remain >>&

Re: Discourse: A proposed alternative to the Spark User list

2015-01-21 Thread Nicholas Chammas
I think a few things need to be laid out clearly: 1. This mailing list is the “official” user discussion platform. That is, it is sponsored and managed by the ASF. 2. Users are free to organize independent discussion platforms focusing on Spark, and there is already one such platform i

Re: Discourse: A proposed alternative to the Spark User list

2015-01-21 Thread Nicholas Chammas
Josh / Patrick, What do y’all think of the idea of promoting Stack Overflow as a place to ask questions over this list, as long as the questions fit SO’s guidelines ( how-to-ask , dont-ask )? The apache-spark

Re: pyspark sc.textFile uses only 4 out of 32 threads per node

2015-01-20 Thread Nicholas Chammas
t know if this issue is noticed by other people. Can anyone > reproduce it with v1.1? > > > On Wed, Dec 17, 2014 at 2:14 AM, Nicholas Chammas > wrote: > > Rui is correct. > > > > Check how many partitions your RDD has after loading the gzipped files. > e.g.

Re: Standardized Spark dev environment

2015-01-20 Thread Nicholas Chammas
t; > > best, > wb > > > - Original Message - > > From: "Nicholas Chammas" > > To: "Spark dev list" > > Sent: Tuesday, January 20, 2015 6:13:31 PM > > Subject: Standardized Spark dev environment > > > > What do y'all

Standardized Spark dev environment

2015-01-20 Thread Nicholas Chammas
What do y'all think of creating a standardized Spark development environment, perhaps encoded as a Vagrantfile, and publishing it under `dev/`? The goal would be to make it easier for new developers to get started with all the right configs and tools pre-installed. If we use something like Vagran

[jira] [Commented] (SPARK-5246) spark/spark-ec2.py cannot start Spark master in VPC if local DNS name does not resolve

2015-01-18 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14282107#comment-14282107 ] Nicholas Chammas commented on SPARK-5246: - [~shivaram] - Should this issu

[jira] [Commented] (SPARK-3244) Add fate sharing across related files in Jenkins

2015-01-18 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14282105#comment-14282105 ] Nicholas Chammas commented on SPARK-3244: - [~andrewor14] - I updated

[jira] [Updated] (SPARK-3244) Add fate sharing across related files in Jenkins

2015-01-18 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-3244: Component/s: (was: Spark Core) Project Infra > Add fate sharing acr

[jira] [Resolved] (SPARK-2396) Spark EC2 scripts fail when trying to log in to EC2 instances

2015-01-18 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas resolved SPARK-2396. - Resolution: Cannot Reproduce Resolving this issue as "Cannot Reproduce". Fe

[jira] [Resolved] (SPARK-1532) provide option for more restrictive firewall rule in ec2/spark_ec2.py

2015-01-18 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas resolved SPARK-1532. - Resolution: Fixed I'm resolving this as Fixed since, as far as I can tell, the requ

[jira] [Comment Edited] (SPARK-1532) provide option for more restrictive firewall rule in ec2/spark_ec2.py

2015-01-18 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14282103#comment-14282103 ] Nicholas Chammas edited comment on SPARK-1532 at 1/19/15 3:1

[jira] [Created] (SPARK-5313) Create simple framework for highlighting changes introduced in a PR

2015-01-18 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-5313: --- Summary: Create simple framework for highlighting changes introduced in a PR Key: SPARK-5313 URL: https://issues.apache.org/jira/browse/SPARK-5313 Project

[jira] [Created] (SPARK-5312) Use sbt to detect new or changed public classes in PRs

2015-01-18 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-5312: --- Summary: Use sbt to detect new or changed public classes in PRs Key: SPARK-5312 URL: https://issues.apache.org/jira/browse/SPARK-5312 Project: Spark

Re: Cluster hangs in 'ssh-ready' state using Spark 1.2 EC2 launch script

2015-01-18 Thread Nicholas Chammas
Nathan, I posted a bunch of questions for you as a comment on your question on Stack Overflow. If you answer them (don't forget to @ping me) I may be able to help you. Nick On Sat Jan 17 2015 at 3:49:54 PM gen tang wrote: > Hi, > > This is because "

[jira] [Commented] (SPARK-5298) Spark not starting on EC2 using spark-ec2

2015-01-17 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14281575#comment-14281575 ] Nicholas Chammas commented on SPARK-5298: - Yes, {{mesos/spark-ec2}} is

[jira] [Resolved] (SPARK-5298) Spark not starting on EC2 using spark-ec2

2015-01-17 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas resolved SPARK-5298. - Resolution: Invalid I'm resolving this as invalid. If you believe this is inco

[jira] [Commented] (SPARK-5298) Spark not starting on EC2 using spark-ec2

2015-01-17 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14281540#comment-14281540 ] Nicholas Chammas commented on SPARK-5298: - Ah, I found the issue. You hav

[jira] [Commented] (SPARK-5298) Spark not starting on EC2 using spark-ec2

2015-01-17 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14281514#comment-14281514 ] Nicholas Chammas commented on SPARK-5298: - A few questions for you: 1.

[jira] [Commented] (SPARK-5299) Is http://www.apache.org/dist/spark/KEYS out of date?

2015-01-17 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14281511#comment-14281511 ] Nicholas Chammas commented on SPARK-5299: - cc [~pwendell] >

Re: Discourse: A proposed alternative to the Spark User list

2015-01-17 Thread Nicholas Chammas
The Stack Exchange community will not support creating a whole new site just for Spark (otherwise you’d see dedicated sites for much larger topics like “Python”). Their tagging system works well enough to separate questions about different topics, and the apache-spark

Re: dockerized spark executor on mesos?

2015-01-15 Thread Nicholas Chammas
The AMPLab maintains a bunch of Docker files for Spark here: https://github.com/amplab/docker-scripts Hasn't been updated since 1.0.0, but might be a good starting point. On Wed Jan 14 2015 at 12:14:13 PM Josh J wrote: > We have dockerized Spark Master and worker(s) separately and are using it

[jira] [Commented] (SPARK-3821) Develop an automated way of creating Spark images (AMI, Docker, and others)

2015-01-14 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277104#comment-14277104 ] Nicholas Chammas commented on SPARK-3821: - Hmm, I doubt that was intenti

[jira] [Updated] (SPARK-1805) Error launching cluster when master and slave machines are of different virtualization types

2015-01-13 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1805?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-1805: Description: In the current EC2 script, the AMI image object is loaded only once. This

[jira] [Updated] (SPARK-1805) Error launching cluster when master and slave machines are of different virtualization types

2015-01-13 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1805?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-1805: Affects Version/s: 1.1.1 1.2.0 > Error launching cluster when mas

[jira] [Updated] (SPARK-1805) Error launching cluster when master and slaves machines are of different visualization types

2015-01-13 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1805?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-1805: Issue Type: Bug (was: Improvement) > Error launching cluster when master and sla

[jira] [Commented] (SPARK-3821) Develop an automated way of creating Spark images (AMI, Docker, and others)

2015-01-13 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14276471#comment-14276471 ] Nicholas Chammas commented on SPARK-3821: - [~shivaram] Are we ready to open

[jira] [Commented] (SPARK-3821) Develop an automated way of creating Spark images (AMI, Docker, and others)

2015-01-13 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14276411#comment-14276411 ] Nicholas Chammas commented on SPARK-3821: - Hi [~florianverhein] and thanks

[jira] [Updated] (SPARK-3185) SPARK launch on Hadoop 2 in EC2 throws Tachyon exception when Formatting JOURNAL_FOLDER

2015-01-13 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-3185: Description: {code} org.apache.hadoop.ipc.RemoteException: Server IPC version 7 cannot

[jira] [Commented] (SPARK-5008) Persistent HDFS does not recognize EBS Volumes

2015-01-13 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14275656#comment-14275656 ] Nicholas Chammas commented on SPARK-5008: - [~brdwrd] - Thank you for documen

[jira] [Commented] (SPARK-3821) Develop an automated way of creating Spark images (AMI, Docker, and others)

2015-01-12 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14274535#comment-14274535 ] Nicholas Chammas commented on SPARK-3821: - That's correct. All those

[jira] [Commented] (SPARK-1422) Add scripts for launching Spark on Google Compute Engine

2015-01-11 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14273197#comment-14273197 ] Nicholas Chammas commented on SPARK-1422: - [~pwendell] - I would consider d

[jira] [Commented] (SPARK-3821) Develop an automated way of creating Spark images (AMI, Docker, and others)

2015-01-11 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14273187#comment-14273187 ] Nicholas Chammas commented on SPARK-3821: - Updated launch stats: * Launc

[jira] [Commented] (SPARK-5008) Persistent HDFS does not recognize EBS Volumes

2015-01-11 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14273007#comment-14273007 ] Nicholas Chammas commented on SPARK-5008: - Use [{{copy-dir}}|https://github

[jira] [Commented] (SPARK-5008) Persistent HDFS does not recognize EBS Volumes

2015-01-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14272846#comment-14272846 ] Nicholas Chammas commented on SPARK-5008: - Though I'm not too familiar

[jira] [Resolved] (SPARK-1204) EC2 scripts upload private key

2015-01-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas resolved SPARK-1204. - Resolution: Fixed Fix Version/s: 0.9.0 Resolving this issue per Shivaram's co

[jira] [Commented] (SPARK-5189) Reorganize EC2 scripts so that nodes can be provisioned independent of Spark master

2015-01-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14272660#comment-14272660 ] Nicholas Chammas commented on SPARK-5189: - cc [~joshrosen] and [~shivaram] -

[jira] [Created] (SPARK-5189) Reorganize EC2 scripts so that nodes can be provisioned independent of Spark master

2015-01-10 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-5189: --- Summary: Reorganize EC2 scripts so that nodes can be provisioned independent of Spark master Key: SPARK-5189 URL: https://issues.apache.org/jira/browse/SPARK-5189

[jira] [Commented] (SPARK-1555) enable ec2/spark_ec2.py to stop/delete cluster non-interactively

2015-01-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14272615#comment-14272615 ] Nicholas Chammas commented on SPARK-1555: - The other thing to consider, i

[jira] [Resolved] (SPARK-4778) PySpark Json and groupByKey broken

2015-01-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas resolved SPARK-4778. - Resolution: Cannot Reproduce > PySpark Json and groupByKey bro

[jira] [Commented] (SPARK-1302) httpd doesn't start in spark-ec2 (cc2.8xlarge)

2015-01-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14272605#comment-14272605 ] Nicholas Chammas commented on SPARK-1302: - There is an open PR by [~fred

[jira] [Resolved] (SPARK-2649) EC2: Ganglia-httpd broken on hvm based machines like r3.4xlarge

2015-01-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas resolved SPARK-2649. - Resolution: Duplicate This issue appears to be a duplicate of [SPARK-1302]. Feel free to

[jira] [Resolved] (SPARK-2528) Default spark-ec2 security group permissions are too open

2015-01-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas resolved SPARK-2528. - Resolution: Duplicate > Default spark-ec2 security group permissions are too o

[jira] [Commented] (SPARK-1532) provide option for more restrictive firewall rule in ec2/spark_ec2.py

2015-01-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14272601#comment-14272601 ] Nicholas Chammas commented on SPARK-1532: - I believe this capability is

[jira] [Commented] (SPARK-1204) EC2 scripts upload private key

2015-01-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14272600#comment-14272600 ] Nicholas Chammas commented on SPARK-1204: - cc [~shivaram] Do you know off the

[jira] [Resolved] (SPARK-5086) Specifying the master instance type

2015-01-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas resolved SPARK-5086. - Resolution: Duplicate Thanks for reporting this. I believe this is a duplicate of [SPARK

[jira] [Commented] (SPARK-1555) enable ec2/spark_ec2.py to stop/delete cluster non-interactively

2015-01-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14272597#comment-14272597 ] Nicholas Chammas commented on SPARK-1555: - [~joshrosen]'s workaround is

[jira] [Commented] (SPARK-2396) Spark EC2 scripts fail when trying to log in to EC2 instances

2015-01-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14272596#comment-14272596 ] Nicholas Chammas commented on SPARK-2396: - [~enraged_ginger] - Can you con

[jira] [Commented] (SPARK-1422) Add scripts for launching Spark on Google Compute Engine

2015-01-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14272590#comment-14272590 ] Nicholas Chammas commented on SPARK-1422: - We have [a package for launching S

[jira] [Commented] (SPARK-4399) Support multiple cloud providers

2015-01-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14272589#comment-14272589 ] Nicholas Chammas commented on SPARK-4399: - Now that [Spark Packages|http://s

[jira] [Updated] (SPARK-4778) PySpark Json and groupByKey broken

2015-01-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-4778: Component/s: (was: EC2) > PySpark Json and groupByKey bro

[jira] [Commented] (SPARK-4778) PySpark Json and groupByKey broken

2015-01-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14272587#comment-14272587 ] Nicholas Chammas commented on SPARK-4778: - Also, though the environment is on

[jira] [Commented] (SPARK-5008) Persistent HDFS does not recognize EBS Volumes

2015-01-10 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14272585#comment-14272585 ] Nicholas Chammas commented on SPARK-5008: - cc [~shivaram] [~brdwrd] - What

[jira] [Commented] (SPARK-2004) QA Automation

2015-01-09 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14272373#comment-14272373 ] Nicholas Chammas commented on SPARK-2004: - I recently had a [related discus

[jira] [Commented] (SPARK-4924) Factor out code to launch Spark applications into a separate library

2015-01-09 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14272338#comment-14272338 ] Nicholas Chammas commented on SPARK-4924: - Okie doke. I ask because I

[jira] [Commented] (SPARK-4924) Factor out code to launch Spark applications into a separate library

2015-01-09 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14272315#comment-14272315 ] Nicholas Chammas commented on SPARK-4924: - [~vanzin] - How does this prop

Re: Accidental kill in UI

2015-01-09 Thread Nicholas Chammas
As Sean said, this definitely sounds like something worth a JIRA issue (and PR). On Fri Jan 09 2015 at 8:17:34 AM Sean Owen wrote: > (FWIW yes I think this should certainly be a POST. The link can become > a miniature form to achieve this and then the endpoint just needs to > accept POST only. Y

[jira] [Commented] (SPARK-5178) Integrate Python unit tests into Jenkins

2015-01-09 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14271858#comment-14271858 ] Nicholas Chammas commented on SPARK-5178: - Relevant links: *

Re: Results of tests

2015-01-09 Thread Nicholas Chammas
Just created: "Integrate Python unit tests into Jenkins" https://issues.apache.org/jira/browse/SPARK-5178 Nick On Fri Jan 09 2015 at 2:48:48 PM Josh Rosen wrote: > The "Test Result" pages for Jenkins builds shows some nice statistics for > the test run, including individual test times: > > ht

[jira] [Created] (SPARK-5178) Integrate Python unit tests into Jenkins

2015-01-09 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-5178: --- Summary: Integrate Python unit tests into Jenkins Key: SPARK-5178 URL: https://issues.apache.org/jira/browse/SPARK-5178 Project: Spark Issue Type

[jira] [Commented] (SPARK-3431) Parallelize Scala/Java test execution

2015-01-08 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14270324#comment-14270324 ] Nicholas Chammas commented on SPARK-3431: - Generic update: * For those

[jira] [Created] (SPARK-5161) Parallelize Python test execution

2015-01-08 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-5161: --- Summary: Parallelize Python test execution Key: SPARK-5161 URL: https://issues.apache.org/jira/browse/SPARK-5161 Project: Spark Issue Type

[jira] [Updated] (SPARK-3431) Parallelize Scala/Java test execution

2015-01-08 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-3431: Summary: Parallelize Scala/Java test execution (was: Parallelize execution of tests

Re: Spark development with IntelliJ

2015-01-08 Thread Nicholas Chammas
Side question: Should this section in the wiki link to Useful Developer Tools ? On Thu Jan 08 2015 at 6:19:55 PM Sean Owe

[jira] [Commented] (SPARK-4983) Tag EC2 instances in the same call that launches them

2015-01-08 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14270147#comment-14270147 ] Nicholas Chammas commented on SPARK-4983: - Yeah, I took a quick look at the

[jira] [Commented] (SPARK-2541) Standalone mode can't access secure HDFS anymore

2015-01-07 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14268475#comment-14268475 ] Nicholas Chammas commented on SPARK-2541: - By the way, should this issu

[jira] [Updated] (SPARK-4983) Tag EC2 instances in the same call that launches them

2015-01-07 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-4983: Labels: starter (was: ) > Tag EC2 instances in the same call that launches t

[jira] [Resolved] (SPARK-5125) Time major blocks of code in spark-ec2/setup.sh

2015-01-07 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas resolved SPARK-5125. - Resolution: Fixed Fix Version/s: 1.3.0 Target Version/s: 1.3.0 Resolved

[jira] [Commented] (SPARK-5125) Time major blocks of code in spark-ec2/setup.sh

2015-01-06 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14267333#comment-14267333 ] Nicholas Chammas commented on SPARK-5125: - [~shivaram] Please assign this i

[jira] [Created] (SPARK-5125) Time major blocks of code in spark-ec2/setup.sh

2015-01-06 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-5125: --- Summary: Time major blocks of code in spark-ec2/setup.sh Key: SPARK-5125 URL: https://issues.apache.org/jira/browse/SPARK-5125 Project: Spark Issue

[jira] [Commented] (SPARK-4948) Use pssh instead of bash-isms and remove unnecessary operations

2015-01-06 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14267331#comment-14267331 ] Nicholas Chammas commented on SPARK-4948: - [~shivaram] Could you assign

[jira] [Updated] (SPARK-4948) Use pssh instead of bash-isms and remove unnecessary operations

2015-01-06 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-4948: Target Version/s: 1.3.0 > Use pssh instead of bash-isms and remove unnecessary operati

[jira] [Resolved] (SPARK-4948) Use pssh instead of bash-isms and remove unnecessary operations

2015-01-06 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas resolved SPARK-4948. - Resolution: Fixed Resolved by: https://github.com/mesos/spark-ec2/pull/86 > Use p

[jira] [Commented] (SPARK-926) spark_ec2 script when ssh/scp-ing should pipe UserknowHostFile to /dev/null

2015-01-06 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14267327#comment-14267327 ] Nicholas Chammas commented on SPARK-926: [~shay] / [~shayping] (pinging both n

[jira] [Updated] (SPARK-5122) Remove Shark from spark-ec2

2015-01-06 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-5122: Description: Since Shark has been replaced by Spark SQL, we don't need it in {{spar

[jira] [Updated] (SPARK-5122) Remove Shark from spark-ec2

2015-01-06 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-5122: Description: Since Shark has been replaced by Spark SQL, we don't need it in {{spar

[jira] [Commented] (SPARK-5122) Remove Shark from spark-ec2

2015-01-06 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14267281#comment-14267281 ] Nicholas Chammas commented on SPARK-5122: - cc [~shivaram] - Is it appropriat

[jira] [Updated] (SPARK-5122) Remove Shark from spark-ec2

2015-01-06 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-5122: Summary: Remove Shark from spark-ec2 (was: Remove Shark from spark-ec2 modules) > Rem

[jira] [Created] (SPARK-5122) Remove Shark from spark-ec2 modules

2015-01-06 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-5122: --- Summary: Remove Shark from spark-ec2 modules Key: SPARK-5122 URL: https://issues.apache.org/jira/browse/SPARK-5122 Project: Spark Issue Type

[jira] [Commented] (SPARK-4898) Replace cloudpickle with Dill

2015-01-05 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14264801#comment-14264801 ] Nicholas Chammas commented on SPARK-4898: - cc [~davies] > Replace clou

[jira] [Commented] (SPARK-3821) Develop an automated way of creating Spark images (AMI, Docker, and others)

2015-01-02 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14263206#comment-14263206 ] Nicholas Chammas commented on SPARK-3821: - I have Packer configured to

[jira] [Commented] (SPARK-3821) Develop an automated way of creating Spark images (AMI, Docker, and others)

2015-01-02 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14263187#comment-14263187 ] Nicholas Chammas commented on SPARK-3821: - I need to brush up on my statis

[jira] [Commented] (SPARK-3821) Develop an automated way of creating Spark images (AMI, Docker, and others)

2015-01-01 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14262720#comment-14262720 ] Nicholas Chammas commented on SPARK-3821: - For lulz, I've benchmarked

Re: Sample Spark Program Error

2014-12-30 Thread Nicholas Chammas
You sent this to the dev list. Please send it instead to the user list. We use the dev list to discuss development on Spark itself, new features, fixes to known bugs, and so forth. The user list is to discuss issues using Spark, which I believe is what you are looking for. Nick On Tue Dec 30 2

[jira] [Resolved] (SPARK-2394) Make it easier to read LZO-compressed files from EC2 clusters

2014-12-30 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas resolved SPARK-2394. - Resolution: Won't Fix Resolving as "Won't Fix" since this seems lik

[jira] [Commented] (SPARK-2394) Make it easier to read LZO-compressed files from EC2 clusters

2014-12-30 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14261663#comment-14261663 ] Nicholas Chammas commented on SPARK-2394: - [~joshrosen] Put together a scrip

[jira] [Commented] (SPARK-1010) Update all unit tests to use SparkConf instead of system properties

2014-12-30 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14261394#comment-14261394 ] Nicholas Chammas commented on SPARK-1010: - [~joshrosen] Haven't you b

[jira] [Closed] (SPARK-4997) Check if Spark's conf needs to be put ahead of Hadoop's (for log4j purposes)

2014-12-30 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas closed SPARK-4997. --- Resolution: Duplicate > Check if Spark's conf needs to be put ahead of Hadoop'

[jira] [Commented] (SPARK-4997) Check if Spark's conf needs to be put ahead of Hadoop's (for log4j purposes)

2014-12-30 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14261308#comment-14261308 ] Nicholas Chammas commented on SPARK-4997: - OK, I'll close this i

[jira] [Commented] (SPARK-4997) Check if Spark's conf needs to be put ahead of Hadoop's (for log4j purposes)

2014-12-30 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14261234#comment-14261234 ] Nicholas Chammas commented on SPARK-4997: - Could be, though this issu

[jira] [Created] (SPARK-4997) Check if Spark's conf needs to be put ahead of Hadoop's (for log4j purposes)

2014-12-29 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-4997: --- Summary: Check if Spark's conf needs to be put ahead of Hadoop's (for log4j purposes) Key: SPARK-4997 URL: https://issues.apache.org/jira/browse/

<    10   11   12   13   14   15   16   17   18   19   >