I'm trying to figure out how to achieve the following from a Java client,
1. My app (which is a web server) starts up
2. As part of startup my jar file, which includes my map reduce classes are
distributed to hadoop nodes
3. My web app uses map reduce to extract data without the performance overh
ROOM CHANGE TO 211 (one floor up from usual)
Hello Fellow Hadoopists,
We are meeting at 7:15 pm on September 16th at the
University Heights Community Center
5031 University Way NE
Seattle WA 98105
Room #211 note room change
The meetings are informal and highly conversational. If you have
questi
Hi
I am using Pig jobs to run on Hadoop but always it runs 4 mappers
simultaneously.
How can I increase the number of simultaneous mappers to run ?
What config do I have to change ?
Thanks,
Rahul
On Tue, Sep 14, 2010 at 10:10 AM, Alex Baranau wrote:
>Is the only way is to enhance TableInputFormat for me?
>
Currently, yes, you must enhance TIF or use an alternate TIF.
St.Ack
Hello,
As far as I know, the number of map tasks for "scan-based" mapreduce job is
equal (not more than) number of underlying regions (for scan). Of course, if
the max map task capacity is big enough.
I have a situation, when map-side processing is very heavy but uses quite
small amount of records
On Tuesday 14 September 2010 02:00 AM, Harold Lim wrote:
Is there an instruction on how to use rumen?
Yes, the documentation was recently added to trunk via MAPREDUCE-1918:
https://issues.apache.org/jira/browse/MAPREDUCE-1918
I tried the following command:
java -cp
hadoop-mapred-0.21.0.