[ 
https://issues.apache.org/jira/browse/LUCENE-4503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13483322#comment-13483322
 ] 

Ying Andrews commented on LUCENE-4503:
--------------------------------------

 * Added support for multiple index readers so More Like This can generate a 
similary query based on multiple indexes.
 * This extends the MoreLikeThis feature to work with lucene MultSsearcher.
 * 
 * For example: 
 * Due to large size we may want to divide all sales index into: sales_1, 
sale_2, sales_3, ..., sales_n.
 * In this case we would best use parallel multi-searcher to do the search. Old 
MoreLikeThis.java doesn't support
 * this scenario.  If the current document of interest comes from index 
sales_1, then the query returned from
 * like(int) and like(Reader, String) will only be based on index sales_1, 
which apparently does not reflect the
 * entirety of the whole document population.
 * 
 * Modified:
 * constructors   - MoreLikeThis(IndexReader), 
 *                                  MoreLikeThis(IndexReader, Similarity)
 * private method - createQueue(Map<String, Int>)
 * 
 * Added:
 * constructors   - MoreLikeThis(IndexReader, IndexReader[]), 
 *                                      MoreLikeThis(IndexReader, 
IndexReader[], Similarity)
 *                              
 * Notes: 
 * When invoking method like(int) of this class, you have to pass in the 
NORMALIZED document number.
 * You can use the same algorithm used in lucene MultiSearcher class, 
specifically seen in 
 * subSearcher(int) and subDoc(int) methods.

                
> MoreLikeThis supports multiple index readers.
> ---------------------------------------------
>
>                 Key: LUCENE-4503
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4503
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Ying Andrews
>            Priority: Minor
>              Labels: patch
>         Attachments: MoreLikeThis.java.patch
>
>   Original Estimate: 72h
>  Remaining Estimate: 72h
>


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to