Hi,

 

I'm having issues with the language identifier plugin, in the specific
scenario where no language attribute is set on the html tag and no
language metadata is available. I understood there are two more steps
then, being http header extraction and then statistical analysis. I'd
like to skip the http header check, because I suspect my http server
sends back a default value for content-language, being the the system
language, and this is not correct. I'd like to directly proceed to
statistical analysis.

 

Is it possible to do this?

 

Dylan Honorez
R & D Consultant
4C Technologies / kZen
+32 (0)485 / 69.28.12
[EMAIL PROTECTED]

 

-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to