--- Andrzej Bialecki <[EMAIL PROTECTED]> wrote:
> Stefan Groschupf wrote:
> 
> > 
> >> Also allowing people to search the specific
> >> dbs/segments.
> >>
> > I think it would be better to have a indexed meta
> data "mime type" if i 
> > understand John right he had done this with the
> patch(?)
> 
> +1 on storing more metadata in the index. Especially
> the mime-type, and 
> character encoding. I was bitten by this when
> displaying documents from 
> cache, because most of the code assumes either UTF-8
> or the default 
> platform encoding - it would be nice not to discard
> this information 
> during indexing...


I agree, that would be nice as you could do better
regional searches based on the character set detected
as well - help limit queries to only show language
relevent results as well.

-byron


-------------------------------------------------------
This SF.Net email is sponsored by: SourceForge.net Broadband
Sign-up now for SourceForge Broadband and get the fastest
6.0/768 connection for only $19.95/mo for the first 3 months!
http://ads.osdn.com/?ad_id=2562&alloc_id=6184&op=click
_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to