CLASSIFICATION: UNCLASSIFIED

Following https://wiki.apache.org/nutch/NutchTutorial, I downloaded
apache-nutch-1.12.src.zip, unzipped it into a directory called
apache-nutch-1.12, and used option 2 in the tutorial to set up Nutch from source.
I ran ant from the command line.
The next step says a folder should have been created:
/apache-nutch-1.12/runtime/local/
It was not created.


Thanks,
Kris

~~~~~~~~~~~~~~~~~~~~~~~~~~
Kris T. Musshorn
FileMaker Developer - Contractor - Catapult Technology Inc.      
US Army Research Lab 
Aberdeen Proving Ground 
Application Management & Development Branch 
410-278-7251
kris.t.musshorn....@mail.mil
~~~~~~~~~~~~~~~~~~~~~~~~~~

-----Original Message-----
From: Jamal, Sarfaraz [mailto:sarfaraz.ja...@verizonwireless.com.INVALID] 
Sent: Tuesday, July 19, 2016 11:03 AM
To: user@nutch.apache.org
Subject: [Non-DoD Source] RE: tutorial help (UNCLASSIFIED)

What does the hadoop.log file state? It might provide insight..
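
In case it helps, here is a rough Python sketch for pulling just the error entries (with their stack traces) out of that log. It assumes the log uses log4j-style lines that begin with a timestamp followed by the level, and that the file sits at runtime/local/logs/hadoop.log; both are assumptions about your setup, and the function name error_blocks is made up for this example.

```python
import re
import sys
from pathlib import Path

# Assumption: entries look like
# "2016-07-19 09:34:29,123 ERROR crawl.Generator - message"
TIMESTAMP_RE = re.compile(r"^\d{4}-\d{2}-\d{2} ")
LEVEL_RE = re.compile(r"^\d{4}-\d{2}-\d{2} \S+ +(ERROR|FATAL)\b")


def error_blocks(lines):
    """Return each ERROR/FATAL entry together with its continuation lines."""
    blocks, current = [], None
    for line in lines:
        line = line.rstrip("\n")
        if TIMESTAMP_RE.match(line):
            # A new log entry starts here, so close any block in progress.
            if current is not None:
                blocks.append(current)
            current = [line] if LEVEL_RE.match(line) else None
        elif current is not None:
            # Lines without a timestamp (e.g. stack frames) belong to the open entry.
            current.append(line)
    if current is not None:
        blocks.append(current)
    return blocks


if __name__ == "__main__":
    # Hypothetical default path; pass the real location as the first argument.
    default = "runtime/local/logs/hadoop.log"
    log_path = Path(sys.argv[1] if len(sys.argv) > 1 else default)
    with log_path.open(encoding="utf-8", errors="replace") as handle:
        for block in error_blocks(handle):
            print("\n".join(block))
            print("-" * 40)
```

Run it with the path to your hadoop.log as the only argument. If the log format differs from the assumption above, the two regular expressions are the part to adjust.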

Sas

-----Original Message-----
From: Musshorn, Kris T CTR USARMY RDECOM ARL (US) 
[mailto:kris.t.musshorn....@mail.mil] 
Sent: Tuesday, July 19, 2016 9:37 AM
To: user@nutch.apache.org
Subject: tutorial help (UNCLASSIFIED)

I am working through the tutorial found here:
http://wiki.apache.org/nutch/NutchTutorial#A3._Crawl_your_first_website

At "Step-by-Step: Fetching" I get the following error:

Generator: starting at 2016-07-19 09:34:29
Generator: Selecting best-scoring urls due for fetch.
Generator: filtering: true
Generator: normalizing: true
Generator: running in local mode, generating exactly one partition.
Generator: java.io.IOException: Job failed!
        at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:836)
        at org.apache.nutch.crawl.Generator.generate(Generator.java:589)
        at org.apache.nutch.crawl.Generator.run(Generator.java:764)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
        at org.apache.nutch.crawl.Generator.main(Generator.java:717)


What should I do to correct this?

Thanks,
Kris

~~~~~~~~~~~~~~~~~~~~~~~~~~
Kris T. Musshorn
FileMaker Developer - Contractor - Catapult Technology Inc.      
US Army Research Lab 
Aberdeen Proving Ground 
Application Management & Development Branch 
410-278-7251
kris.t.musshorn....@mail.mil
~~~~~~~~~~~~~~~~~~~~~~~~~~
