[ https://issues.apache.org/jira/browse/LUCENE-7498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15976654#comment-15976654 ]
Alessandro Benedetti commented on LUCENE-7498: ---------------------------------------------- Hi [~rcmuir], thank you for your feedback, I agree on your points, let's identify the next steps : 1) I will split up the patch - MLT internal broken up in more maintainable classes, as much as back compatible as possible ( scoring refactor allow to introduce new scorers in the future with ease) - introduction of BM25 term scorer ( possibly involving BM25 to become public or partial duplicated code) -> this will remain in this JIRA - Changes of Solr cloud classes, classification module, etc (maybe because of api changes) 2) generate related JIRAs 3) proceed with review in order to reduce complexity ( first review will be big, but much smaller and cohesive than this one I will keep you updated, > More Like This to Use BM25 > -------------------------- > > Key: LUCENE-7498 > URL: https://issues.apache.org/jira/browse/LUCENE-7498 > Project: Lucene - Core > Issue Type: Improvement > Components: modules/other > Reporter: Alessandro Benedetti > Assignee: Tommaso Teofili > > BM25 is now the default similarity, but the more like this is still using the > old TF/IDF . > > This issue is to move to BM25 and refactor the MLT to be more organised, > extensible and maintainable. > Few extensions will follow later, but the focus of this issue will be : > - BM25 > - code refactor + tests -- This message was sent by Atlassian JIRA (v6.3.15#6346) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org