Re: Information on classifier based key word suggestion
Hi Shamik, for classification you can take a look to the Lucene module and the Solr integration ( through UpdateRequestProcessor [1] ) . Unfortunately I didn't have the time to work on the request handler version [2], anyway you are free to contribute ! Related the extraction of interesting terms from text or a set of documents is still work in progress. But you can potentially play a bit with faceting to achieve something similar. Cheers [1] http://www.slideshare.net/AlessandroBenedetti/lucene-and-solr-document-classification , https://issues.apache.org/jira/browse/SOLR-7739, https://issues.apache.org/jira/browse/SOLR-8871 [2] https://issues.apache.org/jira/browse/SOLR-7738 -- View this message in context: http://lucene.472066.n3.nabble.com/Information-on-classifier-based-key-word-suggestion-tp4314942p4315510.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: Information on classifier based key word suggestion
Anyone ? -- View this message in context: http://lucene.472066.n3.nabble.com/Information-on-classifier-based-key-word-suggestion-tp4314942p4315492.html Sent from the Solr - User mailing list archive at Nabble.com.
Information on classifier based key word suggestion
Hi, I'm exploring a way to suggest keywords/tags based on a text snippet. I have a fairly small set of the taxonomy of product, release, category, type, etc. stored in an in-memory database. What I'm looking at is a tool which will analyze a given text, suggest not only the fields associated with taxonomy but keywords which it might feel relevant to the text. The keywords can be leveraged as a mechanism for findability of the document. As a newbie in this area, I'm a tad overwhelmed at different options and struggling to find the right approach.To start with I tried GATE, but it seems to be limited only providing taxonomy data which needs to be provided as a flat text. Few people suggested using classifiers like Naive Bayes classifier or other machine learning tools. I'll appreciate if anyone can provide some direction in this regard. Thanks, Shamik