[Nutch-general] indexing only special documents

Martin Kammerlander Wed, 06 Jun 2007 11:30:13 -0700

hi!


I have a question. If I have for example the seed urls and do a crawl based o
that seeds. If I want to index then only pages that contain for example pdf
documents, how can I do that?

cheers
martin



-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

[Nutch-general] indexing only special documents

Reply via email to