Room 209!
Hello Fellow Hadoopists,
We are meeting at 7:15 PM April 21st at the
University Heights Community Center
5031 University Way NE
Seattle WA 98105
Room #209
Seattle Hadoop Distributing Computing User Meeting
==
Bringing Hadoopists Tog
Hi All,
I am trying to run a map reduce job and it is running perfectly from cmd
using following command
hadoop jar Processor.jar arg1 arg2
but when i schedule the same job in oozie, it is giving me following
exception.
Apr 20, 2011 12:49:34 PM NightProcessor.ProcessorDriver main
SEVERE: null
Hi,
I tried to find the class you said and it is only available in 0.21.0 version
which is not suitable for me because it doesn't work with HBase.
However I manager to make it work creating MyMultipleOutputs.class which
doesn't need a reporter. The deprecated classes still work so it works now.
Gah, that sucks. I'm using 0.20.1-169 from Cloudera CDH2 and assumed it would
be there in 0.20.2 as well.
Sorry, I have no idea what happened to MultipleOutputs in 0.20.2.
Regards,
Christoph
-Ursprüngliche Nachricht-
Von: Panayotis Antonopoulos [mailto:antonopoulos...@hotmail.com]
Ges
Hi,
You are right I am using the MultipleOutputs from the
org.apache.hadoop.mapred.lib package. But there is no MultipleOutputs in the
package you said.
Can you please tell me which version of hadoop you are using? I use the stable
version 0.20.2.
Regards,
Panagiotis
> From: christoph.schm..
Hi,
you are probably using the wrong MultipleOutputs from the
org.apache.hadoop.mapred.lib package. There is another one in
org.apache.hadoop.mapreduce.lib.output, which fits into the new 0.20 API.
Regards,
Christoph
-Ursprüngliche Nachricht-
Von: Panayotis Antonopoulos [mailto:antonop
I am trying to use the MultipleOutputs class using hadoop 0.20.2 and I have the
following problems:
1) The JobConf and org.apache.hadoop.mapred.TextOutputFormat classes which are
needed to call "MultipleOutputs.addNamedOutput( (JobConf) configuration,
"text", TextOutputFormat.class, LongWritabl
Hello again,
On Wed, Apr 20, 2011 at 3:00 PM, Christoph Schmitz
wrote:
> PS. As far as I understand, MultipleOutputs would be used in the reducer,
> right? (Which I wanted to avoid for the bulk of my data.)
It isn't restricted to reducer phases (and can be used in both as well
-- 'Tasks' are ver
Hello Harsh,
thanks for your help!
I've found that this works for me in my Mapper's setup() method:
---
FileSystem fs = FileSystem.get(context.getConfiguration());
// get the attempt directory
Path outputDir = FileOutputFormat.getWorkOutputPath(context);
Path outputPath = new Path(o
Hello Christoph,
On Wed, Apr 20, 2011 at 2:12 PM, Christoph Schmitz
wrote:
> My question is: is there any mechanism to assist me in writing to some
> designated place in the HDFS from the mapper, in a way that is recognized by
> the framework (i.e. dealing with aborted tasks, speculative execut
Hi,
I need to process data in a Java MR job (using 0.20.1) in a way such that the
largest part of the data is manipulated in the mapper only (i.e. some simple
per-record transformation without the need for sort + shuffle), and some small
pieces have to be passed on to the reducer. The mapper-on
11 matches
Mail list logo