Hi,
Have you checked Hive? Seems to fit your needs perfectly.
Thanks and Regards,
Sonal
www.meghsoft.com
http://in.linkedin.com/in/sonalgoyal
On Sun, Aug 1, 2010 at 1:40 AM, Bright D L wrote:
> Hi all,
>I am doing a simple project to analyze http proxy server logs by
> hadoop mapreduc
Hi all,
I am doing a simple project to analyze http proxy server logs by hadoop
mapreduce approach (in Java). The log file contains logs for a week or some
times more than that.
I have following requirements:
1) Find the top 50 bandwidth consumers (IPs) for each