Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The "IndexStructure" page has been changed by LewisJohnMcgibbney:
http://wiki.apache.org/nutch/IndexStructure?action=diff&rev1=8&rev2=9

  ||    host    ||      NO      ||      Tokenized       ||      index-basic     
|| Adds basic searchable '''hostname field''' to a document. ||
  ||    url     ||      YES     ||      Tokenized       ||      index-basic || 
Adds basic searchable '''URL field''' to a document. ||
  ||    content         ||      NO      ||      Tokenized       ||      
index-basic     || Adds basic searchable '''content field''' to a document. ||
- ||    lastModified    ||      YES     ||      NotIndexed      ||      
index-more || ||
+ ||    lastModified    ||      NO      ||      Indexed, Un-Tokenized   ||      
index-more || Adds some time related meta info in the form of 
'''last-modified''' if present.  Index date as last-modified, or, if that's not 
present, uses fetch time. ||
  ||    date    ||      NO      ||      Un-Tokenized    ||      index-more || ||
  ||    contentLength   ||      YES     ||      Not Indexed     ||      
index-more || ||
- ||    type    ||      NO      ||      Un-Tokenized    ||      index-more      
||      contentType,primaryType,subType (all mime-types) ||
+ ||    type    ||      NO      ||      Indexed, Un-Tokenized   ||      
index-more      || Adds contentType, primaryType, subType (all mime-types) ||
  ||    primaryType     ||      YES     ||      Un-Tokenized    ||      
index-more      ||      primaryType (mime-type) ||
  ||    subType         ||      YES     ||      Un-Tokenized    ||      
index-more      ||      subType (mime-type) ||
  ||      tld             ||     YES      || Un-Tokenized / NotStored(based on 
conf) || tld || see http://issues.apache.org/jira/browse/NUTCH-439 ||

Reply via email to