Re: Text Iterable modifying other variable

2012-03-14 Thread Brock Noland
Writable's are reused. You need to deep copy the "value" into "lastValue". On Wed, Mar 14, 2012 at 6:49 PM, Steven Willis wrote: > I seem to have made a few typos in the code. This is how it should read: > > /*/ > public class MyReducer extends Reducer { >    @Override >    publi

RE: Text Iterable modifying other variable

2012-03-14 Thread Steven Willis
I seem to have made a few typos in the code. This is how it should read: /*/ public class MyReducer extends Reducer { @Override public void reduce(Text key, Iterable values, Reducer.Context context) throws IOException, InterruptedException { Text value = new Text();

Query regarding Hadoop version 0.20.203

2012-03-14 Thread Piyush Kansal
Hi, Since MultipleOutputs is not supported in version 0.20.203, so while using Partitioner class, key-value pairs belonging to partition 1 may end up in file part-r-0 or part-r-2. So, to handle this, I am currently *prefixing all the records* in a file with a "*partition number*". So, lets

Text Iterable modifying other variable

2012-03-14 Thread Steven Willis
I'm trying to write a Reducer which will eliminate duplicates from the list of values before writing them out. I have the following code for my Reducer: /*/ public class ClickStreamIndexerReducer extends Reducer { @Override public void reduce(Text dirName, Iterable values,

A doubt about integration with "tools" like Pig, Hive or H-Base

2012-03-14 Thread Luiz Antonio Falaguasta Barbosa
Hi people, Please, I would like ask something a bit more high level than programing for Hadoop. I will have some students working with Hive, Pig or H-Base (I don't which of them yet) and I would like to know if somebody here has already use Hadoop from Amazon EC2 integrated to one ot these other