[ https://issues.apache.org/jira/browse/SOLR-16857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Joel Bernstein updated SOLR-16857:
----------------------------------
    Description:
E-commerce catalogs often use collapse to collapse product records within a group. For example a particular couch might come in different colors or fabrics. So a single couch might have a large number of slightly different records within the group.

When reranking a collapsed query with a vector query the vector query will select the top K matches based on the vector. The top K could include multiple records from within the same product group although only one group head was selected from the group. This will pollute the top K results with lots of duplicate records of no value.

The solution is to devise a filter that limits the vector query to searching only the selected group heads from the collapse.

  was:
E-commerce catalogs often use collapse to collapse product records within a group. For example a particular couch might come in different colors or fabrics. So a single couch might have a large number of slightly different records within the group.

When reranking a collapsed query with a vector query the vector query will select the top K matches based on the vector. The top K could include multiple records from within the same product group although only one group head was selected from the group. This will pollute the top K results with lots duplicate records of no value.

The solution is to devise a filter that limits the vector query to searching on the selected group heads from the collapse.

> Efficiently rerank collapsed queries with vector queries
> --------------------------------------------------------
>
>                 Key: SOLR-16857
>                 URL: https://issues.apache.org/jira/browse/SOLR-16857
>             Project: Solr
>          Issue Type: New Feature
>      Security Level: Public(Default Security Level. Issues are Public)
>            Reporter: Joel Bernstein
>            Priority: Major
>
> E-commerce catalogs often use collapse to collapse product records within a
> group. For example a particular couch might come in different colors or
> fabrics. So a single couch might have a large number of slightly different
> records within the group.
> When reranking a collapsed query with a vector query the vector query will
> select the top K matches based on the vector. The top K could include
> multiple records from within the same product group although only one group
> head was selected from the group. This will pollute the top K results with
> lots of duplicate records of no value.
> The solution is to devise a filter that limits the vector query to searching
> only the selected group heads from the collapse.

--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@solr.apache.org
For additional commands, e-mail: issues-h...@solr.apache.org
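
The filtering idea in the description can be illustrated with a small, self-contained sketch. This is plain Python over a made-up in-memory catalog, not Solr code and not the implementation proposed in the issue. The names `DOCS`, `collapse`, `top_k_by_vector`, and `allowed_ids`, along with the scores and embeddings, are hypothetical and exist only to show how restricting the vector search to group heads removes duplicate variants from the top K.

```python
from math import sqrt

# Hypothetical toy catalog: (doc_id, group_id, relevance_score, embedding).
# The three couch variants stand in for near-duplicate product records.
DOCS = [
    ("couch-red", "couch", 3.0, [1.0, 0.0]),
    ("couch-blue", "couch", 2.5, [0.9, 0.1]),
    ("couch-green", "couch", 2.0, [0.95, 0.05]),
    ("chair-oak", "chair", 1.5, [0.6, 0.4]),
    ("lamp-brass", "lamp", 1.0, [0.0, 1.0]),
]


def cosine(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b)))


def collapse(docs):
    """Keep only the highest-scoring document of each group (the group head)."""
    heads = {}
    for doc in docs:
        _, group, score, _ = doc
        if group not in heads or score > heads[group][2]:
            heads[group] = doc
    return list(heads.values())


def top_k_by_vector(docs, query_vec, k, allowed_ids=None):
    """Return the ids of the k documents most similar to query_vec.

    If allowed_ids is given, only those documents are searched. This is
    the stand-in for the filter described in the issue.
    """
    candidates = [d for d in docs if allowed_ids is None or d[0] in allowed_ids]
    ranked = sorted(candidates, key=lambda d: cosine(d[3], query_vec), reverse=True)
    return [d[0] for d in ranked[:k]]


QUERY = [1.0, 0.0]  # assumed query embedding

head_ids = {doc[0] for doc in collapse(DOCS)}

# Without the filter, the top K fills up with variants of the same couch.
unfiltered = top_k_by_vector(DOCS, QUERY, k=3)

# With the filter, the vector search only sees one record per group.
filtered = top_k_by_vector(DOCS, QUERY, k=3, allowed_ids=head_ids)

print("unfiltered:", unfiltered)
print("filtered:  ", filtered)
```

In this toy data the unfiltered search returns the three couch variants, while the filtered search returns one record per product group. A real implementation would need to pass the set of group heads to the vector query as a document filter inside the search engine rather than filtering a Python list.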