Re: Re: Filtering by value in Reducer

2015-05-12 Thread Drake민영근
Hi, Peter The missing records, they are just gone without no logs? How about your reduce tasks logs? Thanks Drake 민영근 Ph.D kt NexR On Tue, May 12, 2015 at 5:18 AM, Peter Ruch rutschifen...@gmail.com wrote: Hello, sum and threshold are both Integers. for the threshold variable I first add

Re: Re: Re: Filtering by value in Reducer

2015-05-12 Thread Peter Ruch
Hi, I already skimmed through the logs but I could not find anything special. I am just really confused why I am having this problem. If the Iterable... for a specific key contains all of the observed values - and it seems to do so otherwise the program wouldn't work correctly in the standard

Re: Re: Re: Filtering by value in Reducer

2015-05-12 Thread Shahab Yunus
Have you tried explicitly printing or logging in you reducer around the code that compares and then outputs the values? Maybe that will give you a clue that what is happening? Debug the threshold value that you get in the reducer and whether that is what you have set or not (in case of when you

Re: Re: Re: Re: Filtering by value in Reducer

2015-05-12 Thread Peter Ruch
Hi, No, I did not create any custom logs, I was only looking through the standard logs. I just started out with Hadoop and did not think of explicitly logging that part of the code, as I thought that I am simply missing a small detail that someone of you might spot. But I will definitely

Re: Filtering by value in Reducer

2015-05-11 Thread Shahab Yunus
What is the type of the threshold variable? sum I believe is a Java int. Regards, Shahab On Mon, May 11, 2015 at 1:08 PM, Peter Ruch rutschifen...@gmail.com wrote: Hi, I am currently playing around with Hadoop and have some problems when trying to filter in the Reducer. I extended the

Re: Re: Filtering by value in Reducer

2015-05-11 Thread Peter Ruch
Hello, sum and threshold are both Integers. for the threshold variable I first add a new resource to the configuration - conf.addResource( ... ); later I get the threshold value from the configuration. Code # private int threshold; public void setup(

Filtering by value in Reducer

2015-05-11 Thread Peter Ruch
Hi, I am currently playing around with Hadoop and have some problems when trying to filter in the Reducer. I extended the WordCount v1.0 example from the 2.7 MapReduce Tutorial with some additional functionality and added the possibility to filter by the specific value of each key - e.g. only