[jira] [Updated] (NUTCH-1987) Make bin/crawl indexer agnostic

Michael Joyce (JIRA) Thu, 16 Apr 2015 12:04:22 -0700

     [ 
https://issues.apache.org/jira/browse/NUTCH-1987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Michael Joyce updated NUTCH-1987:
---------------------------------
    Labels: memex  (was: )

> Make bin/crawl indexer agnostic
> -------------------------------
>
>                 Key: NUTCH-1987
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1987
>             Project: Nutch
>          Issue Type: Improvement
>    Affects Versions: 1.9
>            Reporter: Michael Joyce
>              Labels: memex
>             Fix For: 1.10
>
>
> The crawl script makes it difficult to use an indexer other than Solr. For 
> instance, to use the indexer-elastic plugin I still need to call the crawl 
> script with a fake Solr URL; otherwise it skips the indexing step 
> altogether.
> {code}
> bin/crawl urls/ crawl/ "http://fakeurl.com:9200" 1
> {code}
> It would be nice to keep the Solr indexer configuration in the conf files 
> (mirroring the Elasticsearch indexer conf and others) and to make the 
> indexing parameter simply toggle whether indexing occurs, rather than also 
> configuring the indexer at the same time.
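
The toggle proposed above could look roughly like the following. This is only a minimal sketch, not the actual bin/crawl code: the function names index_enabled and plan_index_step and the accepted toggle values are hypothetical, and it assumes the indexer settings already live in the conf files.

```shell
#!/usr/bin/env bash
# Hypothetical sketch: the indexing argument is a plain on/off toggle,
# not a Solr URL. Indexer settings are assumed to come from conf/.

# Returns 0 if indexing should run, 1 otherwise.
# The accepted values are an assumption made for this example.
index_enabled() {
  case "$1" in
    true|yes|1|on) return 0 ;;
    *)             return 1 ;;
  esac
}

# Prints the step the script would take. A real script would invoke
# the indexing job here instead of echoing.
plan_index_step() {
  if index_enabled "$1"; then
    echo "index"
  else
    echo "skip"
  fi
}

plan_index_step true
plan_index_step false
```

With a toggle like this, the same invocation would work for any indexer plugin, since the script no longer needs to know which backend is configured.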



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
