Re: Information on classifier based key word suggestion

2017-01-24 Thread alessandro.benedetti
Hi Shamik,
for classification you can take a look at the Lucene classification module
and its Solr integration (through an UpdateRequestProcessor [1]).

Unfortunately, I haven't had the time to work on the request handler
version [2], but you are free to contribute!

As for the extraction of interesting terms from a text or a set of
documents, that is still a work in progress.
But you can potentially play a bit with faceting to achieve something
similar.
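
To make the faceting idea concrete, here is a minimal sketch in Python. It
only builds the request URL and parses a response, so nothing here talks to
a live Solr. The base URL, the collection name "docs" and the field name
"body_terms" are assumptions for illustration; the request parameters
(facet, facet.field, facet.limit, facet.mincount, rows, wt) are standard
Solr ones.

```python
from urllib.parse import urlencode


def build_facet_url(base_url, query, field, limit=10):
    """Build a Solr /select URL that returns only facet counts for `field`.

    `base_url` and `field` are hypothetical; adapt them to your collection.
    """
    params = {
        "q": query,            # restrict to the set of documents of interest
        "rows": 0,             # we only want the facet counts, not the docs
        "facet": "true",
        "facet.field": field,  # an indexed, tokenized text field
        "facet.limit": limit,  # keep only the top-N terms
        "facet.mincount": 1,   # skip terms that do not occur in the result set
        "wt": "json",
    }
    return f"{base_url}/select?{urlencode(params)}"


def top_terms(response, field):
    """Turn Solr's flat [term, count, term, count, ...] list into pairs."""
    flat = response["facet_counts"]["facet_fields"][field]
    return list(zip(flat[0::2], flat[1::2]))


# Hypothetical usage: the URL and field name are placeholders.
print(build_facet_url("http://localhost:8983/solr/docs",
                      "category:install", "body_terms", limit=5))
```

Keep in mind that the counts are document frequencies within the result
set, so very common terms will dominate unless the field's analysis chain
removes stop words.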

Cheers

[1] http://www.slideshare.net/AlessandroBenedetti/lucene-and-solr-document-classification
    https://issues.apache.org/jira/browse/SOLR-7739
    https://issues.apache.org/jira/browse/SOLR-8871

[2] https://issues.apache.org/jira/browse/SOLR-7738



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Information-on-classifier-based-key-word-suggestion-tp4314942p4315510.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Information on classifier based key word suggestion

2017-01-23 Thread shamik
Anyone?





Information on classifier based key word suggestion

2017-01-19 Thread Shamik Bandopadhyay
Hi,

  I'm exploring ways to suggest keywords/tags based on a text snippet. I
have a fairly small taxonomy of product, release, category, type, etc.
stored in an in-memory database. What I'm looking for is a tool that will
analyze a given text and suggest not only the taxonomy fields associated
with it but also keywords that it considers relevant to the text. The
keywords can then be leveraged to improve the findability of the document.
As a newbie in this area, I'm a tad overwhelmed by the different options
and am struggling to find the right approach. To start with, I tried GATE,
but it seems to be limited to providing taxonomy data, which has to be
supplied as flat text. A few people suggested using classifiers such as
Naive Bayes or other machine learning tools.
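
For reference, the Naive Bayes suggestion can be sketched in a few lines of
plain Python. This is only an illustration of the technique (multinomial
Naive Bayes with add-one smoothing), not what GATE or Solr does, and the
class name, the tags and the training snippets are all made up.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


class NaiveBayesTagger:
    """Hypothetical multinomial Naive Bayes tagger with add-one smoothing."""

    def __init__(self):
        self.doc_counts = Counter()              # tag -> training documents
        self.term_counts = defaultdict(Counter)  # tag -> term -> occurrences
        self.vocab = set()

    def train(self, text, tag):
        tokens = tokenize(text)
        self.doc_counts[tag] += 1
        self.term_counts[tag].update(tokens)
        self.vocab.update(tokens)

    def suggest(self, text, top_n=3):
        """Return up to `top_n` tags, most probable first."""
        # Terms never seen in training carry no evidence, so drop them.
        tokens = [t for t in tokenize(text) if t in self.vocab]
        total_docs = sum(self.doc_counts.values())
        scores = {}
        for tag, n_docs in self.doc_counts.items():
            counts = self.term_counts[tag]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n_docs / total_docs)  # log prior
            for tok in tokens:
                score += math.log((counts[tok] + 1) / denom)  # log likelihood
            scores[tag] = score
        return sorted(scores, key=scores.get, reverse=True)[:top_n]


# Made-up training data, one snippet per taxonomy value.
tagger = NaiveBayesTagger()
tagger.train("install the product on windows setup wizard", "installation")
tagger.train("license key activation and renewal", "licensing")
tagger.train("render scene with materials and lighting", "rendering")
print(tagger.suggest("setup wizard fails during install", top_n=2))
```

With a real taxonomy, each existing tagged document would be one training
example, and the ranked tags would be the suggestions for a new snippet.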

I'd appreciate it if anyone could provide some direction in this regard.

Thanks,
Shamik