Need Tutorial on Nutch

2018-03-06 Thread Eric Valencia
I'm a beginner in Nutch and need the best tutorials to get started. Can you guys let me know how you would advise yourselves if starting today (like me)? Eric

Re: Need Tutorial on Nutch

2018-03-06 Thread Eric Valencia
ing helps. > > On 7 Mar 2018 00:01, "Eric Valencia" wrote: > > I'm a beginner in Nutch and need the best tutorials to get started. Can > you guys let me know how you would advise yourselves if starting today > (like me)? > > Eric >

Re: Need Tutorial on Nutch

2018-03-06 Thread Eric Valencia
g to get the full basic understanding of the > system. It gets even worse if you don't know Hadoop. If you dont I do > recomend to read "Hadoop. The definitive guide", because, well, Nutch is > Hadoop. > > Here we are, no pain, no gain. > > > > Sent: Tu

Re: Need Tutorial on Nutch

2018-03-06 Thread Eric Valencia
erience with java will help you to fulfil your personal > requirements. > > On 7 Mar 2018 01:42, "Eric Valencia" wrote: > > > Does this require knowing Java proficiently? > > > > On Tue, Mar 6, 2018 at 10:51 AM Semyon Semyonov < > semyon.semyo...@mail.com

Re: Need Tutorial on Nutch

2018-03-07 Thread Eric Valencia
two hours, 24/7/365. Java is needed? > > > > On Tue, Mar 6, 2018 at 12:15 PM, Yash Thenuan Thenuan < > > rit2014...@iiita.ac.in> wrote: > > > > > If you want simple crawlung then Not at all. > > > But having experience with java will help you to fulfil

Re: Need Tutorial on Nutch

2018-03-07 Thread Eric Valencia
March 2018 21:17 > > > > To: user@nutch.apache.org > > > > Subject: Re: Need Tutorial on Nutch > > > > > > > > Yash, well, I want to monitor the price for every item in the top 500 > > > > retail websites every two hours, 24/7/365. Jav

BinaryContent or Base64 Options

2018-03-24 Thread Eric Valencia
Hello guys, I was able to get nutch 1.4 in the most basic of basic setups - local and default options for the most part. While I am getting some results in Solr, it's not getting all the prices and variations from the pages. Previously, I learned nutch could get all this information and the expor

Nutch 1.19 Getting Error: 'boolean org.apache.hadoop.io.nativeio.NativeIO$Windows.access0(java.lang.String, int)'

2023-05-14 Thread Eric Valencia
Hello everyone, So, I set up Nutch 1.19, Solr 8.11.2, and hadoop 3.3.5, to the best of my knowledge. After, I went into the nutch directory and ran this command: *bin/nutch generate crawl/crawldb crawl/segments* Then, I got an error: *Exception in thread "main" java.lang.UnsatisfiedLinkError: 'b