From: Mark Kerzner markkerz...@gmail.com
To: Hadoop Discussion Group core-u...@hadoop.apache.org
Sent: Saturday, June 25, 2011 9:39 PM
Subject: Comparing two logs, finding missing records
Hi,
I have two logs which should have all the records for the same record_id, in
other words
From: Mark Kerzner markkerz...@gmail.com
To: common-user@hadoop.apache.org; Bharath Mundlapudi bharathw...@yahoo.com
Sent: Sunday, June 26, 2011 5:50 PM
Subject: Re: Comparing two logs, finding missing records
Bharath,
what would a Pig query look like?
Thank you,
Mark
On Sun, Jun 26, 2011 at 5:12 PM, Bharath Mundlapudi bharathw
Hi,
I have two logs which should have all the records for the same record_id, in
other words, if this record_id is found in the first log, it should also be
found in the second one. However, I suspect that the second log is filtered
out, and I need to find the missing records. Anything is
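
For illustration only: the check described above amounts to an anti-join on record_id, that is, keeping the ids that appear in the first log but have no partner in the second. Below is a minimal Python sketch, assuming each log has already been parsed into (record_id, payload) pairs. The function name find_missing_ids and the sample data are hypothetical. At Hadoop scale the same idea would more likely be written as an outer join in Pig or as a MapReduce job than with in-memory sets.

```python
def find_missing_ids(first_log, second_log):
    """Return record_ids present in first_log but absent from second_log.

    Both arguments are iterables of (record_id, payload) tuples; this
    layout is an assumption, not something stated in the thread.
    """
    # Collect every id seen in the second log for O(1) membership tests.
    seen_in_second = {record_id for record_id, _ in second_log}

    missing = []
    already_reported = set()
    for record_id, _ in first_log:
        # Report each missing id once, in first-log order.
        if record_id not in seen_in_second and record_id not in already_reported:
            already_reported.add(record_id)
            missing.append(record_id)
    return missing


# Hypothetical sample data: "b2" never reaches the second log.
first = [("a1", "login"), ("b2", "click"), ("c3", "logout")]
second = [("a1", "login"), ("c3", "logout")]
print(find_missing_ids(first, second))  # ['b2']
```

The set of second-log ids has to fit in memory here, which is the main reason a join-based job is the better fit for large logs.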
Kumar,
thank you, that is the exact solution to my problem as I have formulated it.
That's valid and it stands, but I should have added that the two logs each
have time stamps and that we are looking for missing records with time
stamps in reasonable proximity.
I have come up with a solution
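
One possible reading of the timestamp requirement, offered only as a sketch and not as the solution mentioned above: a record from the first log counts as missing when the second log has no record with the same record_id within some window of its timestamp. The function name find_missing_near, the window_seconds parameter, and the use of numeric timestamps in seconds are all assumptions made for this example.

```python
from bisect import bisect_left
from collections import defaultdict


def find_missing_near(first_log, second_log, window_seconds):
    """Return (record_id, timestamp) pairs from first_log that have no
    record with the same id in second_log within window_seconds.

    Both logs are iterables of (record_id, timestamp) tuples with numeric
    timestamps; this layout is an assumption for the sketch.
    """
    # Index the second log: record_id -> sorted list of timestamps.
    times_by_id = defaultdict(list)
    for record_id, ts in second_log:
        times_by_id[record_id].append(ts)
    for ts_list in times_by_id.values():
        ts_list.sort()

    missing = []
    for record_id, ts in first_log:
        candidates = times_by_id.get(record_id, [])
        # Find the first timestamp >= ts - window; it is a match only
        # if it is also <= ts + window.
        i = bisect_left(candidates, ts - window_seconds)
        if i == len(candidates) or candidates[i] > ts + window_seconds:
            missing.append((record_id, ts))
    return missing


# Hypothetical sample data: ("a1", 100) has a partner at 103;
# ("a1", 500) and ("b2", 120) have none within 10 seconds.
first = [("a1", 100), ("a1", 500), ("b2", 120)]
second = [("a1", 103), ("b2", 900)]
print(find_missing_near(first, second, window_seconds=10))
# [('a1', 500), ('b2', 120)]
```

What counts as "reasonable proximity" depends on how far the two logs' clocks and write delays can drift, so the window would need to be tuned against real data.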