I have never tried ES with Nutch 2.3, but the setup should be similar to 1.x:

- enable the plugin "indexer-elastic" in plugin.includes
  (upgrade and rename to "indexer-elastic2" in 2.4)

- the plugin expects ES 1.4.1

- available/required options are found in the log file (hadoop.log):
   ElasticIndexWriter
        elastic.cluster : elastic prefix cluster
        elastic.host : hostname
        elastic.port : port  (default 9300)
        elastic.index : elastic index command
        elastic.max.bulk.docs : elastic bulk index doc counts. (default 250)
        elastic.max.bulk.size : elastic bulk index length. (default 2500500 ~2.5MB)
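
As a rough sketch (untested on 2.3), the corresponding entries in
nutch-site.xml might look like the following. The property names are the
ones listed above. The values for cluster, host and index are placeholders
I made up, and the plugin.includes value is only an illustrative plugin
list: keep whatever you already have there and just add indexer-elastic.

```xml
<?xml version="1.0"?>
<configuration>
  <!-- Illustrative plugin list (assumption): keep your own value and
       append indexer-elastic to it. -->
  <property>
    <name>plugin.includes</name>
    <value>protocol-http|urlfilter-regex|parse-(html|tika)|index-(basic|anchor)|urlnormalizer-(pass|regex|basic)|scoring-opic|indexer-elastic</value>
  </property>

  <!-- Placeholder values (assumptions): adjust to your ES cluster. -->
  <property>
    <name>elastic.cluster</name>
    <value>elasticsearch</value>
  </property>
  <property>
    <name>elastic.host</name>
    <value>localhost</value>
  </property>
  <property>
    <name>elastic.port</name>
    <value>9300</value>
  </property>
  <property>
    <name>elastic.index</name>
    <value>nutch</value>
  </property>

  <!-- Defaults as reported in hadoop.log. -->
  <property>
    <name>elastic.max.bulk.docs</name>
    <value>250</value>
  </property>
  <property>
    <name>elastic.max.bulk.size</name>
    <value>2500500</value>
  </property>
</configuration>
```

If the plugin is picked up, hadoop.log should show the ElasticIndexWriter
option list above the next time the index job runs.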

Sebastian

On 02/28/2018 01:26 PM, Yash Thenuan Thenuan wrote:
> Yeah
> I was also thinking that
> Can somebody help me with nutch 2.3?
> 
> On 28 Feb 2018 17:53, "Yossi Tamari" <yossi.tam...@pipl.com> wrote:
> 
>> Sorry, I just realized that you're using Nutch 2.x and I'm answering for
>> Nutch 1.x. I'm afraid I can't help you.
>>
>>> -----Original Message-----
>>> From: Yash Thenuan Thenuan [mailto:rit2014...@iiita.ac.in]
>>> Sent: 28 February 2018 14:20
>>> To: user@nutch.apache.org
>>> Subject: RE: Regarding Indexing to elasticsearch
>>>
>>> IndexingJob (<batchId> | -all |-reindex) [-crawlId <id>]
>>> This is the output of nutch index. I have already configured
>>> nutch-site.xml.
>>>
>>> On 28 Feb 2018 17:41, "Yossi Tamari" <yossi.tam...@pipl.com> wrote:
>>>
>>>> I suggest you run "nutch index", take a look at the returned help
>>>> message, and continue from there.
>>>> Broadly, first of all you need to configure your elasticsearch
>>>> environment in nutch-site.xml, and then you need to run nutch index
>>>> with the location of your CrawlDB and either the segment you want to
>>>> index or the directory that contains all the segments you want to
>>>> index.
>>>>
>>>>> -----Original Message-----
>>>>> From: Yash Thenuan Thenuan [mailto:rit2014...@iiita.ac.in]
>>>>> Sent: 28 February 2018 14:06
>>>>> To: user@nutch.apache.org
>>>>> Subject: RE: Regarding Indexing to elasticsearch
>>>>>
>>>>> All I want is to index my parsed data to elasticsearch.
>>>>>
>>>>>
>>>>> On 28 Feb 2018 17:34, "Yossi Tamari" <yossi.tam...@pipl.com> wrote:
>>>>>
>>>>> Hi Yash,
>>>>>
>>>>> The nutch index command does not have a -all flag, so I'm not sure
>>>>> what you're trying to achieve here.
>>>>>
>>>>>         Yossi.
>>>>>
>>>>>> -----Original Message-----
>>>>>> From: Yash Thenuan Thenuan [mailto:rit2014...@iiita.ac.in]
>>>>>> Sent: 28 February 2018 13:55
>>>>>> To: user@nutch.apache.org
>>>>>> Subject: Regarding Indexing to elasticsearch
>>>>>>
>>>>>> Can somebody please tell me what happens when we run the
>>>>>> bin/nutch index -all command?
>>>>>> I can't figure out why the write function inside the
>>>>>> elastic-indexer is not getting executed.
>>>>
>>>>
>>
>>
> 
