HI 

I am new to using Nutch. I'm not good with English, so the help of a 
translator. 

My question focuses on the need to know how nutch can collect and process for 
future indexing on solr server , all meta tags of a html document. I am also 
interested in knowing how to collect the ALT attribute of the img tag in html. 
Well and if there is a filter where I can set up labels I collect, the better. 
I'll be very grateful for your help. 


MANP 

10mo. ANIVERSARIO DE LA CREACION DE LA UNIVERSIDAD DE LAS CIENCIAS 
INFORMATICAS...
CONECTADOS AL FUTURO, CONECTADOS A LA REVOLUCION

http://www.uci.cu
http://www.facebook.com/universidad.uci
http://www.flickr.com/photos/universidad_uci

Reply via email to