Hello Friends,
I am experiencing this error message fetcher:no agents listed in
'http.agent.name' property when I am trying to crawl with Nutch 1.3
I referred other mails regarding the same error message and tried to change
the nutch-default.xml and nutch-site.xml file details with
property
-default.xml_and_nutch-site.xml.2C_why.3F
[2] http://wiki.apache.org/nutch/NutchConfigurationFiles
On Thu, Jul 7, 2011 at 4:45 PM, serenity serenitykenings...@gmail.comwrote:
Hello Friends,
I am experiencing this error message fetcher:no agents listed in
'http.agent.name' property when I am trying
:
Hello Friends,
I am experiencing this error message fetcher:no agents listed in
'http.agent.name' property when I am trying to crawl with Nutch 1.3
I referred other mails regarding the same error message and tried to change
the nutch-default.xml and nutch-site.xml file details with
property
I fixed it with the help of *Way cool*. I never tried to update the
nutch-site.xml file in the NUTCH_HOME/runtime/local/conf directory, I was
updating the nutch-site.xml file from NUTCH_HOME/conf directory.
--
View this message in context:
Thanks Julien, I just did it and it's crawling fine without any issue
--
View this message in context:
http://lucene.472066.n3.nabble.com/Re-no-agents-listed-in-http-agent-name-property-tp3148743p3148940.html
Sent from the Nutch - User mailing list archive at Nabble.com.
crawl/segments/20110613103305 -threads 8
I get an error message: No agents listed in 'http.agent.name' property
I noticed there are now 2 conf files, one at trunk/conf and the other at
trunk/runtime/local/conf, and hae updated both of them with my
nutch-site.xml file, both have a properly
/20110613103305 -threads 8
I get an error message: No agents listed in 'http.agent.name' property
I noticed there are now 2 conf files, one at trunk/conf and the other at
trunk/runtime/local/conf, and hae updated both of them with my
nutch-site.xml file, both have a properly configured
listed in 'http.agent.name' property
I noticed there are now 2 conf files, one at trunk/conf and the other
at
trunk/runtime/local/conf, and hae updated both of them with my
nutch-site.xml file, both have a properly configured http.agent.name.
Do I need to explicitly declare the conf
and
generated a sement, but when I run:
runtime/deploy/bin/nutch fetch crawl/segments/20110613103305 -threads
8
I get an error message: No agents listed in 'http.agent.name'
property
I noticed there are now 2 conf files, one at trunk/conf and the other
at
trunk/runtime
Try prefixing your script /crawling command with:
$ xmlstarlet edit -L -u /configuration/property[name='']/value -v 'test'
conf/nutch-default.xml
$ xmlstarlet sel -t -c /configuration/property[name='http.agent.name']/value
conf/nutch-default.xml
After the second you should see printed:
hi,
I am facing the same problem as well.I tried with copying property tag
containing http.agent.name into conf/nutch-site.xml. But i get the same
problem. Can u help me out?
--
View this message in context:
!
From: fayazvf [fai...@gmail.com]
Sent: 03 May 2011 06:54
To: nutch-u...@lucene.apache.org
Subject: Re: Re: Error: No agents listed in 'http.agent.name' property.
hi,
I am facing the same problem as well.I tried with copying property tag
containing
Running the following command:
./bin/nutch crawl urls -dir crawl.test -depth 3 crawl.log
Generates the following text in crawl.log
Fetcher: No agents listed in 'http.agent.name' property.
Exception in thread main java.lang.IllegalArgumentException: Fetcher: No
agents listed
13 matches
Mail list logo