Hello,

I need to develop a "french" parser. Google index french documents parsing "�" (HTML : e´) and "�" characters to "e". I think there's is already french parser for Lucene, so this is not really a problem.

Problem is : can it be created as a nutch plugin ? where should I put it ? Is there any started project about it ?

Thanks

Christophe.


------------------------------------------------------- This SF.net email is sponsored by Microsoft Mobile & Embedded DevCon 2005 Attend MEDC 2005 May 9-12 in Vegas. Learn more about the latest Windows Embedded(r) & Windows Mobile(tm) platforms, applications & content. Register by 3/29 & save $300 http://ads.osdn.com/?ad_id=6883&alloc_id=15149&op=click _______________________________________________ Nutch-developers mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to