Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The "IndexWriters" page has been changed by RoannelFernandez:
https://wiki.apache.org/nutch/IndexWriters?action=diff&rev1=8&rev2=9

Comment:
Elasticsearch rest indexer properties

  
  ||'''Parameter Name''' ||'''Description''' ||'''Default value''' ||
  || host || The hostname or a list of comma-separated hostnames to send 
documents to using Elasticsearch Jest. Both host and port must be defined. ||  
||
- || port || The port to connect to using Elasticsearch Jest. || 9200 Check 
this number||
+ || port || The port to connect to using Elasticsearch Jest. || 9200 ||
  || index || Default index to send documents to. || nutch ||
  || max.bulk.docs || Maximum size of the bulk in number of documents. || 250 ||
- || max.bulk.size || Maximum size of the bulk in bytes. || 2500500 Check this 
number||
+ || max.bulk.size || Maximum size of the bulk in bytes. || 2500500 ||
  || user || Username for auth credentials (only used when https is enabled) || 
user ||
  || password || Password for auth credentials (only used when https is 
enabled) || password ||
  || type || Default type to send documents to. || doc ||
- || https || "true" to enable https, "false" to disable https If you've 
disabled http access (by forcing https), be sure to set this to true, otherwise 
you might get "connection reset by peer". || false ||
+ || https || '''true''' to enable https, '''false''' to disable https. If 
you've disabled http access (by forcing https), be sure to set this to true; 
otherwise you might get "connection reset by peer". || false ||
- || trustallhostnames || "true" to trust elasticsearch server's certificate 
even if its listed domain name does not match the domain they are hosted on 
"false" to check if the elasticsearch server's certificate's listed domain is 
the same domain that it is hosted on, and if it doesn't, then fail to index 
(only used when https is enabled) || false ||
+ || trustallhostnames || '''true''' to trust the Elasticsearch server's 
certificate even if its listed domain name does not match the domain it is 
hosted on; '''false''' to check that the certificate's listed domain matches 
the domain it is hosted on, and fail to index if it does not (only used when 
https is enabled). || false ||
- || languages || A list of strings denoting the supported languages (e.g. 
`en,de,fr,it`). If this value is empty all documents will be sent to index 
${elastic.rest.index}. If not empty the Rest client will distribute documents 
in different indices based on their `lang` property. Indices are named with the 
following schema: ${elastic.rest.index}${elastic.rest.index.separator}${lang} 
(e.g. `nutch_de`). Entries with an unsupported `lang` value will be added to 
index 
${elastic.rest.index}${elastic.rest.index.separator}${elastic.rest.index.sink} 
(e.g. `nutch_others`). ||  ||
- || separator || Default value is `_`. Is used only if 
`elastic.rest.index.languages` is defined to build the index name (i.e. 
${elastic.rest.index}${elastic.rest.index.separator}${lang}).  || _ ||
- || sink || Default value is `others`. Is used only if 
`elastic.rest.index.languages` is defined to build the index name where to 
store documents with unsupported languages (i.e. 
${elastic.rest.index}${elastic.rest.index.separator}${elastic.rest.index.sink}).
 || others ||
+ || languages || A list of strings denoting the supported languages (e.g. `en, 
de, fr, it`). If this value is empty, all documents are sent to the index named 
by the `index` property. If not empty, the Rest client distributes documents 
across different indices based on their `lang` property. Indices are named with 
the following schema: `index` `separator` `lang` (e.g. `nutch_de`). Entries 
with an unsupported `lang` value are added to the index `index` `separator` 
`sink` (e.g. `nutch_others`). ||  ||
+ || separator || Used only if the `languages` property is defined, to build 
the index name (i.e. `index` `separator` `lang`). || _ ||
+ || sink || Used only if the `languages` property is defined, to build the 
name of the index that stores documents with unsupported languages (i.e. 
`index` `separator` `sink`). || others ||
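
The index naming rule described by the `languages`, `separator` and `sink` 
rows can be sketched as follows. This is only an illustration of the rule as 
stated in the table, not the code of the Nutch plugin: the function name 
`resolve_index` is hypothetical, and trimming whitespace around the entries of 
`languages` is an assumption.

```python
def resolve_index(index, lang, languages="", separator="_", sink="others"):
    """Return the index name for a document with the given `lang` value.

    Hypothetical helper that mirrors the table's description; the parameter
    names follow the table rows, the defaults are the table's default values.
    """
    # Parse the comma-separated `languages` value, skipping empty entries.
    # Stripping whitespace is an assumption made for this sketch.
    supported = [code.strip() for code in languages.split(",") if code.strip()]

    if not supported:
        # `languages` is empty: every document goes to the default index.
        return index

    if lang in supported:
        # Supported language: index + separator + lang, e.g. "nutch_de".
        return f"{index}{separator}{lang}"

    # Unsupported (or missing) language: index + separator + sink.
    return f"{index}{separator}{sink}"
```

With the defaults from the table, `resolve_index("nutch", "de", "en, de, fr, 
it")` gives `nutch_de`, an unsupported language such as `es` gives 
`nutch_others`, and an empty `languages` value always gives `nutch`.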
  
