Re: [MASSMAIL]Crawling focused only over seed file

2015-11-27 Thread Julien Nioche
> > > > - Original message - > > > From: "Paul Escobar" > > > To: user@nutch.apache.org > > > Sent: Wednesday, 18 November 2015 22:33:50 > > > Subject: Re: [MASSMAIL]Crawling focused only over seed file > > > > > > Hi Roa

Re: [MASSMAIL]Crawling focused only over seed file

2015-11-20 Thread Paul Escobar
wse/NUTCH-1331 for more > information. > > Regards. > > - Original message - > > From: "Paul Escobar" > > To: user@nutch.apache.org > > Sent: Wednesday, 18 November 2015 22:33:50 > > Subject: Re: [MASSMAIL]Crawling focused only over s

Re: [MASSMAIL]Crawling focused only over seed file

2015-11-19 Thread Roannel Fernández Hernández
> To: user@nutch.apache.org > Sent: Wednesday, 18 November 2015 22:33:50 > Subject: Re: [MASSMAIL]Crawling focused only over seed file > > Hi Roannel, the new URLs aren't from other domains, they are in the same > domain, we want updatedb command avoid the update crawl

Re: [MASSMAIL]Crawling focused only over seed file

2015-11-18 Thread Paul Escobar
> > > Change in your nutch-site.xml the property db.ignore.external.links to > > true. > > > > Regards > > > > - Original message - > > > From: "Andrés Rincón Pacheco" > > > To: user@nutch.apache.org > > > Sent: Saturday, 14

Re: [MASSMAIL]Crawling focused only over seed file

2015-11-18 Thread Andrés Rincón Pacheco
> Change in your nutch-site.xml the property db.ignore.external.links to > true. > > Regards > > - Original message - > > From: "Andrés Rincón Pacheco" > > To: user@nutch.apache.org > > Sent: Saturday, 14 November 2015 19:51:54 >

Re: [MASSMAIL]Crawling focused only over seed file

2015-11-18 Thread Roannel Fernández Hernández
Hi Andrés, Change in your nutch-site.xml the property db.ignore.external.links to true. Regards - Original message - > From: "Andrés Rincón Pacheco" > To: user@nutch.apache.org > Sent: Saturday, 14 November 2015 19:51:54 > Subject: [MASSMAIL]Crawling focus
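
For reference, the change suggested in this thread would look roughly like the fragment below. This is a minimal sketch, assuming the usual conf/nutch-site.xml layout where overrides sit inside the root configuration element; only the property name and the value true come from the thread, and the comment text is illustrative.

```xml
<?xml version="1.0"?>
<!-- conf/nutch-site.xml: local overrides of the defaults (assumed location) -->
<configuration>
  <property>
    <!-- Property named in the thread: with true, outlinks that lead to
         external hosts are ignored when the crawl db is updated -->
    <name>db.ignore.external.links</name>
    <value>true</value>
  </property>
</configuration>
```

This setting only filters links to other hosts. As Paul Escobar's reply in this thread notes, the new URLs in his case are in the same domain, so this property alone may not be enough there.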