Stefan Groschupf wrote:
I think it would be better to have an indexed metadata field "mime type". If I understand John right, he has done this with the patch(?). Also allowing people to search specific dbs/segments.
+1 on storing more metadata in the index, especially the MIME type and character encoding. I was bitten by this when displaying documents from the cache, because most of the code assumes either UTF-8 or the default platform encoding. It would be nice not to discard this information during indexing...
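To illustrate the cache display problem, here is a minimal sketch of decoding cached bytes with an encoding name that was kept alongside the document, instead of assuming UTF-8 or the platform default. The class and method names (CachedContentDecoder, resolveCharset, decode) are hypothetical and not part of the Nutch code base. The fallback to UTF-8 when the stored value is missing or unknown is also just an assumption.

```java
import java.nio.charset.Charset;
import java.nio.charset.IllegalCharsetNameException;
import java.nio.charset.StandardCharsets;
import java.nio.charset.UnsupportedCharsetException;

// Hypothetical helper, not a Nutch class: decodes cached content using
// the character encoding recorded for the document at indexing time.
final class CachedContentDecoder {

    // Resolve the stored encoding name; fall back to UTF-8 (an assumed
    // default) if nothing was stored or the name is not usable.
    static Charset resolveCharset(String storedEncoding) {
        if (storedEncoding == null || storedEncoding.trim().isEmpty()) {
            return StandardCharsets.UTF_8;
        }
        try {
            return Charset.forName(storedEncoding.trim());
        } catch (IllegalCharsetNameException | UnsupportedCharsetException e) {
            return StandardCharsets.UTF_8;
        }
    }

    // Decode the raw cached bytes with the resolved charset rather than
    // the platform default.
    static String decode(byte[] cachedBytes, String storedEncoding) {
        return new String(cachedBytes, resolveCharset(storedEncoding));
    }
}
```

A caller would pass whatever encoding value was stored with the document, for example CachedContentDecoder.decode(bytes, "ISO-8859-1"). Where that value would actually be stored (index field, segment data, or elsewhere) is left open here.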
-- Best regards, Andrzej Bialecki
-------------------------------------------------
Software Architect, System Integration Specialist
CEN/ISSS EC Workshop, ECIMF project chair
EU FP6 E-Commerce Expert/Evaluator
-------------------------------------------------
FreeBSD developer (http://www.freebsd.org)
_______________________________________________
Nutch-developers mailing list
https://lists.sourceforge.net/lists/listinfo/nutch-developers
