Hello Tom.
It's not clear which kind of MLT you are referring to: handler, queryparser
or component .
Generally there are two options for deduplication:
- query time: filed grouping or field collapsing
- index time:
  - mlt query might be limited to parents with titles and children might
carry editions with dates and so one
  - or mlt query can be filtered to the recent edition only for every
title, thus recent-flag should be set during indexing and then used by
filter.

On Wed, Apr 12, 2023 at 1:22 PM Tom Tailor <[email protected]> wrote:

> Hi all
>
>
>
> I want to build a recommender using Solr MoreLikeThis. I work on
> bibliographic data I.e. books. I have multiple records of different
> editions of the same book.  For a given book MLT returns all different
> editions of the book this is not new content from the users point of view.
> I can not deduplicate the records because the different editions are
> relevant for other applications.
>
>
>
> Is it possible to circumvent this? I could use the books title which is the
> same across all editions to filter duplicates from the MLT results
>
>
>
> Thanks for your help
>


-- 
Sincerely yours
Mikhail Khludnev
https://t.me/MUST_SEARCH
A caveat: Cyrillic!

Reply via email to