Hi Lewis,
Just a quick question - I'm having a slight problem with the NUTCH-828v3 patch.
I check out Nutch trunk, make sure it runs OK, then apply the patch. After that
I get:
Fetcher: finished at 2013-11-04 10:37:15, elapsed: 00:00:04
Parsing : 20131104103709
ParseSegment: starting at 2013-11-04 10:37:16
ParseSegment: segment: DevCrawl/segments/20131104103709
2013-11-04 10:37:16.495 java[53035:1203] Unable to load realm info from SCDynamicStore
Exception in thread "main" java.io.IOException: Job failed!
    at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1357)
    at org.apache.nutch.parse.ParseSegment.parse(ParseSegment.java:213)
    at org.apache.nutch.parse.ParseSegment.run(ParseSegment.java:247)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
    at org.apache.nutch.parse.ParseSegment.main(ParseSegment.java:220)
Is this something obvious and simple or… ? I'd much appreciate your thoughts.
Best,
Olle
On Nov 3, 2013, at 2:29 PM, Lewis John Mcgibbney <[email protected]>
wrote:
> Hi Olle,
>
> On Sun, Nov 3, 2013 at 9:56 AM, <[email protected]> wrote:
>
>>
>> user Digest 3 Nov 2013 09:56:44 -0000 Issue 2096
>>
>> Re: user Digest 30 Oct 2013 00:57:14 -0000 Issue 2094
>> 28926 by: Lewis John Mcgibbney
>> 28929 by: Olle Romo
>>
>> Thanks for the reply :)
>> I just tried applying the patch to 1.8 trunk and it didn't go through without
>> problems. I just thought I'd ask if anyone had done this before.
>>
>
> Please try the patch I just posted on the issue. This will work with Nutch
> 1.8-SNAPSHOT (trunk).
> If you could use it and provide your comments on the Jira issue then it would
> be most helpful.
>
>
>> I'm curious about the NUTCH-828 Fetch Filter. It seems it's been on the
>> cards for some time. Do you or anyone know if it's imminent or… ?
>>
>
> No, it's certainly not imminent. We can change that of course if it will
> prove useful to folks. Please comment on the issue if you are able and we
> will take it from there. :)
> Thank you
> Lewis