Yes, it needs a lot of refactoring. Actually, I am looking to contribute by writing things from scratch, because this code is a year old, from when I studied Information Retrieval as an undergraduate, and it could not meet the standards at Apache. I have now spent more than a year in industry and have good hands-on experience with different things.
Thanks
