Kranthi created LUCENE-8558: ------------------------------- Summary: Adding NumericDocValuesFields is slowing down the indexing process significantly Key: LUCENE-8558 URL: https://issues.apache.org/jira/browse/LUCENE-8558 Project: Lucene - Core Issue Type: Improvement Components: core/index Affects Versions: 7.5, 7.4 Reporter: Kranthi Fix For: 7.5, 7.4
The indexing time for my ~2M documents has gone up significantly when I started adding fields of type NumericDocValuesField. Upon debugging found the bottleneck to be in the PerFieldMergeState#FilterFieldInfos constructor. The contains check in the below code snippet was the culprit. {code:java} this.filteredNames = new HashSet<>(filterFields); this.filtered = new ArrayList<>(filterFields.size()); for (FieldInfo fi : src) { if (filterFields.contains(fi.name)) { {code} A simple change to the following seems to have fixed my issue {code:java} this.filteredNames = new HashSet<>(filterFields); this.filtered = new ArrayList<>(filterFields.size()); for (FieldInfo fi : src) { if (this.filteredNames.contains(fi.name)) { {code} -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org