Re: Any recommended issues to work on for a newcomer?

2024-05-10 Thread Chang Hank
Thank you for these useful resources. Please allow me to spend some time looking into them. I’ll let you know ASAP! Thanks, Hank > On May 10, 2024, at 12:34 PM, Michael Wechner > wrote: > > also we might want to consider how this relates to > > https://lucene.apache.org/core/9_10_0/core/org/apa

Re: Maximum score estimation

2024-05-10 Thread Mikhail Khludnev
Hello Alessandro. Glad to hear! There's not much of an update since the previously published link: just a tiny test. Guessing max tf doesn't seem really reliable. However, I've got another idea: Can't Impacts give us an exact max score like https://lucene.apache.org/core/9_9_1/core/org/apache/lucene/searc

Re: Any recommended issues to work on for a newcomer?

2024-05-10 Thread Michael Wechner
Also, we might want to consider how this relates to https://lucene.apache.org/core/9_10_0/core/org/apache/lucene/search/Rescorer.html In vector search, reranking has become quite popular, e.g. https://docs.cohere.com/docs/reranking IIUC, LangChain (Python), for example, adds the reranker as an argu

Re: Any recommended issues to work on for a newcomer?

2024-05-10 Thread Michael Wechner
Great, yes, let's get started :-) What about the following pseudo code, assuming that there might be alternative ranking algorithms to RRF:
    StoredFieldsKeyword storedFieldsKeyword = indexReaderKeyword.storedFields();
    StoredFieldsVector storedFieldsVector = indexReaderKeyword.storedFields();
    To
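
For readers new to RRF (Reciprocal Rank Fusion), here is a minimal sketch of the fusion step on its own. It is not the pseudo code from this thread and not a Lucene API: the class name `RrfSketch`, the method `fuse`, and the use of plain integer doc IDs are all assumptions made for illustration. Each document gets the score sum of 1 / (k + rank) over every ranked list it appears in, with rank starting at 1. The value k = 60 used in the usage note is a commonly cited default, not something decided in this thread.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper for illustration only; not part of Lucene.
final class RrfSketch {

    // Fuses several ranked lists of doc IDs (best hit first) into one list
    // of (docId, fusedScore) entries, sorted by descending fused score.
    static List<Map.Entry<Integer, Double>> fuse(
            List<List<Integer>> rankings, int k, int topN) {
        Map<Integer, Double> scores = new HashMap<>();
        for (List<Integer> ranking : rankings) {
            for (int i = 0; i < ranking.size(); i++) {
                int rank = i + 1; // ranks are 1-based
                scores.merge(ranking.get(i), 1.0 / (k + rank), Double::sum);
            }
        }
        List<Map.Entry<Integer, Double>> fused = new ArrayList<>(scores.entrySet());
        // Higher fused score first; ties broken by lower doc ID for determinism.
        fused.sort((a, b) -> {
            int byScore = Double.compare(b.getValue(), a.getValue());
            return byScore != 0 ? byScore : Integer.compare(a.getKey(), b.getKey());
        });
        return new ArrayList<>(fused.subList(0, Math.min(topN, fused.size())));
    }
}
```

As a usage example, `RrfSketch.fuse(List.of(keywordHits, vectorHits), 60, 10)` would combine a keyword ranking and a vector ranking, where `keywordHits` and `vectorHits` are hypothetical lists of doc IDs. Because only ranks are used, the raw keyword and vector scores never need to be on the same scale.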

Re: Any recommended issues to work on for a newcomer?

2024-05-10 Thread Chang Hank
Hi Michael, Sounds good to me. Let’s do it!! Cheers, Hank > On May 10, 2024, at 10:50 AM, Michael Wechner > wrote: > > Hi Hank > > Very cool! > > Adrien Grand suggested to implement it as a utility method on the TopDocs > class, and since Adrien worked for a decade on Lucene > https://www

Re: Any recommended issues to work on for a newcomer?

2024-05-10 Thread Michael Wechner
Hi Hank, very cool! Adrien Grand suggested implementing it as a utility method on the TopDocs class, and since Adrien has worked on Lucene for a decade https://www.elastic.co/de/blog/author/adrien-grand I guess it makes sense to follow his advice :-) We could create a PR and work together on it,

Re: Any recommended issues to work on for a newcomer?

2024-05-10 Thread Chang Hank
Hi Michael, Thank you for the reply. This is a really cool issue to work on, and I’m happy to work on it with you. I’ll do some research on RRF first. Also, are we going to implement this on the TopDocs class? Best, Hank > On May 9, 2024, at 11:08 PM, Michael Wechner > wrote: > > Hi Hank