Exception closing file

2012-05-10 Thread Aleksandr Elbakyan
Hello, I am running the job, when in two similar size clusters, in one it is running fine on other gives fallowing exceptions. I checked the safe mode is off. Not sure what do I need fix to avoid this? 2012-05-10 00:38:02,637 ERROR org.apache.hadoop.hdfs.DFSClient: Exception closing file /us

Re: Pig question

2012-05-03 Thread Aleksandr Elbakyan
Thanks for help - Original Message - From: Mathias Herberts To: common-user@hadoop.apache.org; Aleksandr Elbakyan Cc: Sent: Thursday, May 3, 2012 12:04 PM Subject: Re: Pig question B = GROUP A BY x; C = FOREACH B GENERATE group,SIZE(B),B; D = FILTER C BY $1 == N; On Thu, May 3

Pig question

2012-05-03 Thread Aleksandr Elbakyan
Hello All, I was wandering if it is possible to filter all groups in pig which have size N. This sounds like something common but can not find the way to do it Please help :) Thanks,

Re: Kill Task Programmatically

2011-08-03 Thread Aleksandr Elbakyan
Hello, You can just throw run time exception. In that case it will fail :) Regards, Aleksandr --- On Wed, 8/3/11, Adam Shook wrote: From: Adam Shook Subject: Kill Task Programmatically To: "common-user@hadoop.apache.org" Date: Wednesday, August 3, 2011, 3:33 PM Is there any way I can progr

Re: Processing xml files

2011-05-24 Thread Aleksandr Elbakyan
some more questions :) On Tue, May 24, 2011 at 4:54 PM, Aleksandr Elbakyan wrote: > Can you please give more info? >>> We currently have off hadoop process which uses java xml parser to convert >>> it to flat file. We have files from couple kb to 10of GB. Do you convert it

Re: EC2 cloudera cc1.4xlarge

2011-05-24 Thread Aleksandr Elbakyan
--- On Tue, 5/24/11, Aleksandr Elbakyan wrote: From: Aleksandr Elbakyan Subject: EC2 cloudera cc1.4xlarge To: common-user@hadoop.apache.org Date: Tuesday, May 24, 2011, 4:23 PM Hello, I am want to use cc1.4xlarge cluster for some data processing, to spin clusters I am using cloudera scripts

Re: Processing xml files

2011-05-24 Thread Aleksandr Elbakyan
er, did you look into word count example from hadoop? Regards, Aleksandr --- On Tue, 5/24/11, Mohit Anchlia wrote: From: Mohit Anchlia Subject: Re: Processing xml files To: common-user@hadoop.apache.org Date: Tuesday, May 24, 2011, 4:41 PM On Tue, May 24, 2011 at 4:25 PM, Aleksandr Elbaky

Re: Processing xml files

2011-05-24 Thread Aleksandr Elbakyan
Hello,  We have the same type of data, we currently convert it to tab delimited file and use it as input for streaming Regards, Aleksandr --- On Tue, 5/24/11, Mohit Anchlia wrote: From: Mohit Anchlia Subject: Processing xml files To: common-user@hadoop.apache.org Date: Tuesday, May 24, 2011,

EC2 cloudera cc1.4xlarge

2011-05-24 Thread Aleksandr Elbakyan
Hello, I am want to use cc1.4xlarge cluster for some data processing, to spin clusters I am using cloudera scripts. hadoop-ec2-init-remote.sh has default configuration until c1.xlarge but not configuration for cc1.4xlarge, can someone give formula how does this values calculated based on hardwa