[
https://issues.apache.org/jira/browse/STANBOL-1151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rupert Westenthaler resolved STANBOL-1151.
------------------------------------------
Resolution: Fixed
Implemented with http://svn.apache.org/r1518948
Now the SentimentClassifier uses LexicalCategories (Noun, Verb, Adverb,
Adjective ...) when classifying words with sentiments.
This gives implementations the change to distinguish between a Noun and Verb
that so have the same lexical form.
In addition the existing classifier implementation for English and German where
adapted to support the new interface. They do also support the use of lexical
categories as luckily the sentiment dictionaries included the necessary
information.
The new interface is no longer compatible with the old version. However it is
possible to implement the new interface without support for lexical categories.
Note also the addition of the WordSentimentDictionary class. This utility class
allows to manage sentiment dictionaries in a thread save manner and will save
WordClassifier implementation a lot of code.
> SentimentClassifier needs the PosTag when classifiering a word
> --------------------------------------------------------------
>
> Key: STANBOL-1151
> URL: https://issues.apache.org/jira/browse/STANBOL-1151
> Project: Stanbol
> Issue Type: Bug
> Reporter: Rupert Westenthaler
> Assignee: Rupert Westenthaler
>
> The SentimentClassifier interface needs to be changed in a way that the
> PosTag for the current word is also parsed to the
> public double classifyWord(String word);
> method. The reason for that is that the same word might have a different
> meaning for different POS tags.
> An example is the German word
> * gefahren (past for to drive)
> * Gefahren (plural for Danger)
> While the verb does not have an sentiment assigned the Noun has the following
> entry
> Gefahr|NN -1.0 Gefahren
> because of that the SentimentClassifier currently incorrectly assigns a
> sentiment value of '-1' even if the word is used as verb in the text.
> This will also require to adapt the implementation to support using the POS
> tag when looking up the words in the vocabulary
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira