So after I created the RawComparator for the key it worked as expected.
Thanks.

-Trevor

On Wed, Jun 29, 2011 at 2:47 PM, Aaron Baff <aaron.b...@telescope.tv> wrote:

> I dunno, I just know that when I use a separate comparator for my custom
> key (does something similar to yours, although 2 or 3 additional secondary
> fields to group on) it works as it should.
>
> --Aaron
>
>
> -----------------------------------------------------------------------------
> From: Trevor Adams [mailto:trevorad...@gmail.com]
> Sent: Wednesday, June 29, 2011 11:34 AM
> To: mapreduce-user@hadoop.apache.org
> Subject: Re: Reduce method called same key twice
>
> So, that kind of makes sense but why would it not group the other values
> then? There are a bunch of the exact same key (only 1 primary record, so
> only 1 that is different per set) and it is my understanding that they would
> be grouped together (without the primary key) if I didn't do anything
> different.
>
> -Trevor
> On Wed, Jun 29, 2011 at 2:07 PM, Aaron Baff <aaron.b...@telescope.tv>
> wrote:
> You probably need to implement a custom comparator that you use as the
> grouping comparator that compares the primary key, and then if they are the
> same compares the int part of the key.
>
> --Aaron
>
>
> -----------------------------------------------------------------------------
> From: Trevor Adams [mailto:trevorad...@gmail.com]
> Sent: Wednesday, June 29, 2011 10:00 AM
> To: mapreduce-user@hadoop.apache.org
> Subject: Reduce method called same key twice
>
> So I have a custom Key which is used for a join. It contains two fields, a
> boolean (is primary key) and an int (key). Hashcode only looks at the key
> field, so that it gets sent to the same reducer. Compare places the pkey at
> the top of the list (if sorted using compare). This works nicely, except
> that the reduce method is called with Key: 1 -> a single value, Key: 1 ->
> another value etc. One for each value, so instead of bucketing the values to
> a key (and some of the keys are identical, in every way) it sends 1 key and
> 1 value to the reducer at a time. How do I get it to bucket or why isn't it
> bucketing?
>
> -Trevor
>
>

Reply via email to