vigyasharma commented on PR #13525:
URL: https://github.com/apache/lucene/pull/13525#issuecomment-2237194980

   > The pattern doesn't work well with ColBERT esque models.
   
   +1.. Good question, @navneet1v. I had the same doubts before starting this 
effort. There is some discussion in 
[12313](https://github.com/apache/lucene/issues/12313#issuecomment-2044076948). 
   
   Basically, Multi-vectors allow for ColBERT style late interaction models, 
where queries and documents can both be represented by multiple vector values, 
and you do a similarity measure using all those values (e.g. 
sum(max(similarity))). They gather contextual interaction across different 
parts of a query and a document, without incurring the query-time computational 
overheads of cross-encoder models.
   
   Nested block joins on the other hand compute distance with a single vector 
value and then aggregate. That doesn't work well here, esp. with both query and 
doc multi-vectors.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to