[jira] [Resolved] (STANBOL-1151) SentimentClassifier needs the PosTag when classifiering a word

Rupert Westenthaler (JIRA) Fri, 30 Aug 2013 05:19:07 -0700

     [ 
https://issues.apache.org/jira/browse/STANBOL-1151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Rupert Westenthaler resolved STANBOL-1151.
------------------------------------------

    Resolution: Fixed

Implemented with http://svn.apache.org/r1518948

Now the SentimentClassifier uses LexicalCategories (Noun, Verb, Adverb, 
Adjective ...) when classifying words with sentiments.

This gives implementations the change to distinguish between a Noun and Verb 
that so have the same lexical form.

In addition the existing classifier implementation for English and German where 
adapted to support the new interface. They do also support the use of lexical 
categories as luckily the sentiment dictionaries included the necessary 
information.

The new interface is no longer compatible with the old version. However it is 
possible to implement the new interface without support for lexical categories.

Note also the addition of the WordSentimentDictionary class. This utility class 
allows to manage sentiment dictionaries in a thread save manner and will save 
WordClassifier implementation a lot of code.
                
> SentimentClassifier needs the PosTag when classifiering a word
> --------------------------------------------------------------
>
>                 Key: STANBOL-1151
>                 URL: https://issues.apache.org/jira/browse/STANBOL-1151
>             Project: Stanbol
>          Issue Type: Bug
>            Reporter: Rupert Westenthaler
>            Assignee: Rupert Westenthaler
>
> The SentimentClassifier interface needs to be changed in a way that the 
> PosTag for the current word is also parsed to the 
>     public double classifyWord(String word);
> method. The reason for that is that the same word might have a different 
> meaning for different POS tags.
> An example is the German word 
> * gefahren (past for to drive)
> * Gefahren (plural for Danger)
> While the verb does not have an sentiment assigned the Noun has the following 
> entry
>     Gefahr|NN -1.0    Gefahren
> because of that the SentimentClassifier currently incorrectly assigns a 
> sentiment value of '-1' even if the word is used as verb in the text.
> This will also require to adapt the implementation to support using the POS 
> tag when looking up the words in the vocabulary

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (STANBOL-1151) SentimentClassifier needs the PosTag when classifiering a word

Reply via email to