jtibshirani opened a new pull request #601:
URL: https://github.com/apache/lucene/pull/601


   When merging segments together, the `KnnVectorsWriter` creates a 
`VectorValues`
   instance with a merged view of all the segments' vectors. This merged 
instance
   is used when constructing the new HNSW graph. Graph building needs random
   access, and the merged VectorValues support this by mapping from merged
   ordinals to segments and segment ordinals. This mapping can add significant
   overhead when building the graph.
   
   This change updates the HNSW merging logic to first write the combined 
segment
   vectors to a file, then use that the file to build the graph. This helps 
speed
   up segment merging, and also lets us simplify `VectorValuesMerger`, which
   provides the merged view of vector values.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to