Good evening,

I am new to this mailing list and very pleased to join you. I am a computer engineer, currently working on a project for the Tunisian government. For this project I have been asked to integrate a search engine, and I chose Nutch. I have a few questions and hope you can help:

1 - How can I get Nutch to work with ARC files?
2 - Is it necessary to use Hadoop or NutchWAX in order to use the ARC files?
3 - I am currently developing a few Nutch dorks (like Google dorks), and I would like to know whether a search can be restricted to more than one site. I tried the query:

*site:first_site.com site:second_site.com searched_word*

but apparently it doesn't work.

PS: I'm using Heritrix to crawl the sites.

Thank you for your help.

--
______________________________________________
Mohamed Ben Bouzid,
- Computer engineering student at the Faculté des Sciences de Tunis.
- Member of the DFSA (Digital Free Software Association).
- Member of the CLLFST (Club du Logiciel Libre de Tunis).
- Member of Ubuntu-tn.
-=MBB=-
