--- Andrzej Bialecki <[EMAIL PROTECTED]> wrote:
> Stefan Groschupf wrote:
> >> Also allowing people to search the specific
> >> dbs/segments.
> >
> > I think it would be better to have a indexed meta data "mime type" if i
> > understand John right he had done this with the patch(?)
>
> +1 on storing more metadata in the index. Especially the mime-type, and
> character encoding. I was bitten by this when displaying documents from
> cache, because most of the code assumes either UTF-8 or the default
> platform encoding - it would be nice not to discard this information
> during indexing...
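
To make the cache-display problem concrete, here is a minimal sketch of decoding cached bytes with a charset name that was kept at indexing time, instead of assuming UTF-8 or the platform default. The class name, the `decode` helper and the idea of passing the stored charset as a plain string are all assumptions for illustration; none of this is existing Nutch code.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of Nutch: shows why keeping the
// detected charset alongside the cached content matters.
class CachedContentDecoder {

    // Decode raw cached bytes using the charset name stored at index time.
    // Falls back to UTF-8 when the name is missing, malformed or unsupported.
    static String decode(byte[] raw, String storedCharset) {
        Charset cs = StandardCharsets.UTF_8;
        if (storedCharset != null) {
            try {
                cs = Charset.forName(storedCharset.trim());
            } catch (IllegalArgumentException e) {
                // Covers both IllegalCharsetNameException and
                // UnsupportedCharsetException; keep the UTF-8 fallback.
            }
        }
        return new String(raw, cs);
    }

    public static void main(String[] args) {
        // "caf" followed by 0xE9, which is e-acute in ISO-8859-1
        // but an invalid byte sequence in UTF-8.
        byte[] raw = { 'c', 'a', 'f', (byte) 0xE9 };
        System.out.println(decode(raw, "ISO-8859-1")); // decoded correctly
        System.out.println(decode(raw, null));         // last char becomes U+FFFD
    }
}
```

Without the stored charset, the fallback silently turns the last byte into the replacement character, which is the kind of damage described above.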

I agree, that would be nice. You could also do better regional searches
based on the detected character set, which would help limit queries to
show only language-relevant results.

-byron

_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers
