I'm working on a search engine for my university and they want me to do that to create a repository of scientific articles on the web :D
I red something about xpath for extracting exact parts from a document, once done this building the query is very easy but my doubts are about how to insert all of this in the nutch crawler... Thank you -- View this message in context: http://www.nabble.com/Extract+infos+from+documents+and+query+external+sites-t1675003.html#a4624272 Sent from the Nutch - Dev forum at Nabble.com. ------------------------------------------------------- All the advantages of Linux Managed Hosting--Without the Cost and Risk! Fully trained technicians. The highest number of Red Hat certifications in the hosting industry. Fanatical Support. Click to learn more http://sel.as-us.falkag.net/sel?cmd=lnk&kid=107521&bid=248729&dat=121642 _______________________________________________ Nutch-developers mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-developers
