rmuir commented on PR #11905: URL: https://github.com/apache/lucene/pull/11905#issuecomment-1307289709
> Yeah, we can probably trigger this overflow by using 16268815 byte vectors of few dimensions. Something as small as 2 dimensions could work. > One issue with HNSW is that completely random vectors can make it run dog-slow on index. Maybe having few dimensions could alleviate this. We may need to modify the draft test then to trigger the bug. I used only one dimension and simple `docid % 256` to assign vector value. I also only used `16268814` documents so it may need another one. I also am unsure if CheckIndex at the end will trigger the issue you describe, maybe it only calls `next` and not `advance` or something like that. In such a case, we may need to fix CheckIndex to do some "advancing", too. It does a similar thing already for the postings, see `// Test skipping` section of the `checkFields()` method. So its definitely all a WIP -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
