Hello Tom. It's not clear which kind of MLT you are referring to: handler, queryparser or component . Generally there are two options for deduplication: - query time: filed grouping or field collapsing - index time: - mlt query might be limited to parents with titles and children might carry editions with dates and so one - or mlt query can be filtered to the recent edition only for every title, thus recent-flag should be set during indexing and then used by filter.
On Wed, Apr 12, 2023 at 1:22 PM Tom Tailor <[email protected]> wrote: > Hi all > > > > I want to build a recommender using Solr MoreLikeThis. I work on > bibliographic data I.e. books. I have multiple records of different > editions of the same book. For a given book MLT returns all different > editions of the book this is not new content from the users point of view. > I can not deduplicate the records because the different editions are > relevant for other applications. > > > > Is it possible to circumvent this? I could use the books title which is the > same across all editions to filter duplicates from the MLT results > > > > Thanks for your help > -- Sincerely yours Mikhail Khludnev https://t.me/MUST_SEARCH A caveat: Cyrillic!
