Hi.

I need to configure Nutch to crawl images only, but I am not getting good 
results with the URL filter configuration, especially the suffix and regex 
filters.

Is there a good example configuration for crawling only images (png, gif, jpg 
and others) with Nutch?

The problem with the filters is that Nutch must fetch and parse HTML and other 
documents to discover new links, but I do not want those documents indexed in 
Solr. If I exclude HTML with these filters, Nutch reports that there are no 
URLs to fetch.
Any help will be appreciated.
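
To make the problem concrete, here is a toy Python model of what I think is 
happening. This is not Nutch code: the site map, the URLs and the function 
names are all made up for illustration, and it only assumes that the same 
filter is applied to the seeds and to every discovered link.

```python
from urllib.parse import urlparse

IMAGE_SUFFIXES = (".png", ".gif", ".jpg", ".jpeg")

def is_image(url):
    """True if the URL path ends with one of the image suffixes."""
    return urlparse(url).path.lower().endswith(IMAGE_SUFFIXES)

# Hypothetical site: page URL -> outlinks found when the page is parsed.
SITE = {
    "http://example.com/": [
        "http://example.com/gallery.html",
        "http://example.com/logo.png",
    ],
    "http://example.com/gallery.html": [
        "http://example.com/a.jpg",
        "http://example.com/b.gif",
    ],
}

def crawl(seeds, fetch_filter):
    """Breadth-first crawl; a URL is fetched only if fetch_filter accepts it."""
    queue = [u for u in seeds if fetch_filter(u)]
    seen = set(queue)
    fetched = []
    while queue:
        url = queue.pop(0)
        fetched.append(url)
        for link in SITE.get(url, []):
            if fetch_filter(link) and link not in seen:
                seen.add(link)
                queue.append(link)
    return fetched

seeds = ["http://example.com/"]

# Image-only filter at fetch time: the HTML seed is rejected, nothing to fetch.
strict = crawl(seeds, is_image)

# Accept everything at fetch time, keep only images at index time.
relaxed = crawl(seeds, lambda url: True)
indexed = [u for u in relaxed if is_image(u)]

print(strict)
print(indexed)
```

In this model the first crawl fetches nothing, while the second one reaches the 
images because the HTML pages are still fetched and are only dropped before 
indexing. What I am looking for is the Nutch configuration that behaves like 
the second case.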

I'm using Nutch 1.5.1 and Solr 3.6.