Hi,
you might find some code in katta.sourceforge.net very helpful.
Stefan
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Hadoop training and consulting
http://www.scaleunlimited.com
http://www.101tec.com
On Jun 10, 2009, at 5:49 AM, kartik saxena wrote:
Hi,
I have a huge LDIF file in order of GBs spanning some million user
records.
I am running the example "Grep" job on that file. The search results
have
not really been
upto expectations because of it being a basic per line , brute force.
I was thinking of building some indexes inside HDFS for that file ,
so that
the search results could improve. What could I possibly try to
achieve this?
Secura