Re: Comparing two logs, finding missing records

2011-06-27 Thread Rajesh Balamohan
From: Mark Kerzner markkerz...@gmail.com To: common-user@hadoop.apache.org; Bharath Mundlapudi bharathw...@yahoo.com Sent: Sunday, June 26, 2011 5:50 PM Subject: Re: Comparing two logs, finding missing records Bharath, what would a Pig query look like

Re: Comparing two logs, finding missing records

2011-06-26 Thread Bharath Mundlapudi
Subject: Comparing two logs, finding missing records Hi, I have two logs which should have all the records for the same record_id, in other words, if this record_id is found in the first log, it should also be found in the second one. However, I suspect that the second log is filtered out, and I

Re: Comparing two logs, finding missing records

2011-06-26 Thread Mark Kerzner
From: Mark Kerzner markkerz...@gmail.com To: Hadoop Discussion Group core-u...@hadoop.apache.org Sent: Saturday, June 25, 2011 9:39 PM Subject: Comparing two logs, finding missing records Hi, I have two logs which should have all the records for the same record_id, in other words

Re: Comparing two logs, finding missing records

2011-06-26 Thread Bharath Mundlapudi
...@gmail.com To: common-user@hadoop.apache.org; Bharath Mundlapudi bharathw...@yahoo.com Sent: Sunday, June 26, 2011 5:50 PM Subject: Re: Comparing two logs, finding missing records Bharath, what would a Pig query look like? Thank you, Mark On Sun, Jun 26, 2011 at 5:12 PM, Bharath Mundlapudi bharathw

Comparing two logs, finding missing records

2011-06-25 Thread Mark Kerzner
Hi, I have two logs which should have all the records for the same record_id, in other words, if this record_id is found in the first log, it should also be found in the second one. However, I suspect that the second log is filtered out, and I need to find the missing records. Anything is
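For small inputs, the question as posed (which record_ids appear in the first log but not in the second) can be sketched as a set difference. This is only an illustration, not something taken from the thread: the tab-separated layout, the position of record_id in the first column, and the helper names `read_ids` and `missing_ids` are all assumptions.

```python
def read_ids(lines, id_column=0, delimiter="\t"):
    """Collect the record_id field from each non-empty line.

    Assumes (hypothetically) delimiter-separated lines with the
    record_id in column `id_column`.
    """
    ids = set()
    for line in lines:
        line = line.rstrip("\n")
        if line:
            ids.add(line.split(delimiter)[id_column])
    return ids


def missing_ids(first_log, second_log):
    """Return record_ids present in first_log but absent from second_log."""
    return read_ids(first_log) - read_ids(second_log)


if __name__ == "__main__":
    # Made-up sample data, standing in for two open log files.
    first = ["a1\tstart", "a2\tstart", "a3\tstart"]
    second = ["a1\tdone", "a3\tdone"]
    print(sorted(missing_ids(first, second)))
```

Any iterable of lines works here, including open file objects. Both sets of ids are held in memory, so for logs at Hadoop scale the same idea would have to be expressed as a join or grouping on record_id rather than as an in-memory set.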

Re: Comparing two logs, finding missing records

2011-06-25 Thread Mark Kerzner
Kumar, thank you, that is the exact solution to my problem as I have formulated it. That's valid and it stands, but I should have added that the two logs each have time stamps and that we are looking for missing records with time stamps in reasonable proximity. I have come up with a solution
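One possible reading of the time-stamp refinement is sketched below: a record from the first log counts as missing unless the second log has the same record_id with a time stamp within some tolerance. This is not the solution Mark refers to, which the snippet does not show. The numeric time stamps, the `(record_id, timestamp)` tuples, the `tolerance` parameter, and the function names are assumptions made for illustration.

```python
from bisect import bisect_left
from collections import defaultdict


def index_by_id(records):
    """Map each record_id to a sorted list of its time stamps."""
    index = defaultdict(list)
    for record_id, ts in records:
        index[record_id].append(ts)
    for ts_list in index.values():
        ts_list.sort()
    return index


def missing_within(first, second, tolerance):
    """Return (record_id, ts) pairs from `first` that have no entry in
    `second` with the same record_id and a time stamp within
    `tolerance` of ts (inclusive on both sides)."""
    index = index_by_id(second)
    missing = []
    for record_id, ts in first:
        candidates = index.get(record_id, [])
        # Position of the first time stamp that is >= ts - tolerance.
        pos = bisect_left(candidates, ts - tolerance)
        if pos == len(candidates) or candidates[pos] > ts + tolerance:
            missing.append((record_id, ts))
    return missing


if __name__ == "__main__":
    # Made-up data: time stamps are plain numbers (e.g. epoch seconds).
    first = [("a1", 100), ("a2", 105), ("a1", 500)]
    second = [("a1", 102), ("a2", 300)]
    print(missing_within(first, second, tolerance=10))
```

Sorting the time stamps per record_id keeps each lookup to a binary search. What counts as "reasonable proximity" is left to the caller through `tolerance`.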