Sebastian Nagel created NUTCH-1820: -------------------------------------- Summary: remove field "orig" which duplicates "id" Key: NUTCH-1820 URL: https://issues.apache.org/jira/browse/NUTCH-1820 Project: Nutch Issue Type: Bug Components: indexer Affects Versions: 2.2.1 Reporter: Sebastian Nagel Priority: Trivial Fix For: 2.3
The indexing filter plugin index-basic (2.x only) adds a field "orig" which contains the "real" URL (not the reprUrl) and duplicates the field "id" (also regarding the field params: stored="true" indexed="true"). The field "orig" should be removed from index-basic and schema.xml. -- This message was sent by Atlassian JIRA (v6.2#6252)