Re: Issue with Hadoop Streaming

2012-08-03 Thread Subir S
In streaming contents of the file will be streamed to mapper through STDIN, not the file names. Fix the perl script accordingly. Thanks, Subir On 8/3/12, Devi Kumarappan wrote: > > > After specifying NLineInputFormat option, streaming job fails with > > Error from attempt_201205171448_0092_m_00

Re: Basic question on how reducer works

2012-07-16 Thread Subir S
Just for reference of others who might see this thread. Jira corresponding to parameter on reduce input limit is MAPREDUCE-2324 On 7/14/12, Harsh J wrote: > Subir, > > On Sat, Jul 14, 2012 at 5:30 PM, Subir S wrote: >> Harsh, Thanks I think this is what I was looking for. I

Re: Basic question on how reducer works

2012-07-14 Thread Subir S
ists. > > On Tue, Jul 10, 2012 at 8:59 PM, Subir S wrote: >> Is there any property to convey the maximum amount of data each >> reducer/partition may take for processing. Like the bytes_per_reducer >> of pig, so that the count of reducers can be controlled based on size >

Re: Basic question on how reducer works

2012-07-13 Thread Subir S
Probably a wrong question in a wrong thread and wrong mailing list :) On 7/10/12, Subir S wrote: > Is there any property to convey the maximum amount of data each > reducer/partition may take for processing. Like the bytes_per_reducer > of pig, so that the count of reducers can be c

Re: Basic question on how reducer works

2012-07-10 Thread Subir S
Is there any property to convey the maximum amount of data each reducer/partition may take for processing. Like the bytes_per_reducer of pig, so that the count of reducers can be controlled based on size of intermediate map output data size? On 7/10/12, Karthik Kambatla wrote: > The partitioner i

Re: Error in pig execution when loading data from HBase

2012-06-05 Thread Subir S
This is a pig mailing list question. +pig list for better response. Which version of Pig you are using. You miss the Hbase jars in PIG_CLASSPATH as per logs below. Try: export HBASE_CONF_DIR=$HBASE_HOME/conf export HBASE_CLASSPATH="`/usr/bin/hbase classpath`:$HBASE_HOME/conf/hbase-site.xml:$HBA

Re: I cannot post message

2012-03-30 Thread Subir S
You won't receive ur post, others will. You are able to post; Proof is my reply. On 3/31/12, Fang Xin wrote: > all > > sorry to bother, as a new user, it seems that I cannot post anything. > I've tried twice yesterday, but I didn't receive my own post... > > can anyone enlighten me? thanks >

Integration of Hadoop Streaming with Ruby and HBase

2012-03-16 Thread Subir S
Hi, Is there any way i can dump the output of my Ruby Map Reduce jobs into HBase directly? In other words does Hadoop Streaming with Ruby integrate with HBase? Like Pig has HBaseStorage etc. Thanks in advance! Regards Subir

Comparison of Apache Pig Vs. Hadoop Streaming M/R

2012-03-01 Thread Subir S
Hello Folks, Are there any pointers to such comparisons between Apache Pig and Hadoop Streaming Map Reduce jobs? Also there was a claim in our company that Pig performs better than Map Reduce jobs? Is this true? Are there any such benchmarks available Thanks, Subir