[ 
https://issues.apache.org/jira/browse/LUCENE-1910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12790889#action_12790889
 ] 

Otis Gospodnetic commented on LUCENE-1910:
------------------------------------------

* I'll second Mark's suggestion to extract the Information Gain piece of the 
patch into separate class(es), so we can reuse it in other places.  It looks 
like it's currently an integral part of MoreLikeThisUsingTags class.  Would 
that be possible?

* I noticed the code needs ASL (the Apache Software License) added.

* Also, could you please use the Lucene code format? (Eclipse/IntelliJ 
templates are at the bottom of 
http://wiki.apache.org/lucene-java/HowToContribute )


> Extension to MoreLikeThis to use tag information
> ------------------------------------------------
>
>                 Key: LUCENE-1910
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1910
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: contrib/*
>            Reporter: Thomas D'Silva
>            Priority: Minor
>         Attachments: LUCENE-1910.patch
>
>
> I would like to contribute a class based on the MoreLikeThis class in
> contrib/queries that generates a query based on the tags associated
> with a document. The class assumes that documents are tagged with a
> set of tags (which are stored in the index in a seperate Field). The
> class determines the top document terms associated with a given tag
> using the information gain metric.
> While generating a MoreLikeThis query for a document the tags
> associated with document are used to determine the terms in the query.
> This class is useful for finding similar documents to a document that
> does not have many relevant terms but was tagged.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org

Reply via email to