Rupert Westenthaler created STANBOL-1151:
--------------------------------------------

             Summary: SentimentClassifier needs the PosTag when classifying a word
                 Key: STANBOL-1151
                 URL: https://issues.apache.org/jira/browse/STANBOL-1151
             Project: Stanbol
          Issue Type: Bug
            Reporter: Rupert Westenthaler
            Assignee: Rupert Westenthaler


The SentimentClassifier interface needs to be changed so that the PosTag 
for the current word is also passed to the 

    public double classifyWord(String word);

method. The reason for that is that the same word might have a different 
meaning for different POS tags.
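
A minimal sketch of what such a signature could look like. The names Pos and 
PosAwareSentimentClassifier are hypothetical stand-ins used only for 
illustration; an actual change would use the project's own POS tag type, and 
the toy classifier below is not the real implementation.

```java
public class ClassifyWordSketch {

    /** Hypothetical coarse POS category; a real change would use the project's own POS type. */
    public enum Pos { NOUN, VERB, OTHER }

    /** Hypothetical revised interface: the POS of the current word is an explicit argument. */
    public interface PosAwareSentimentClassifier {
        double classifyWord(Pos pos, String word);
    }

    /** Toy classifier: only the noun reading of "Gefahren" carries a sentiment. */
    public static final PosAwareSentimentClassifier TOY = (pos, word) ->
            pos == Pos.NOUN && word.equalsIgnoreCase("Gefahren") ? -1.0 : 0.0;

    public static void main(String[] args) {
        System.out.println(TOY.classifyWord(Pos.NOUN, "Gefahren")); // noun reading
        System.out.println(TOY.classifyWord(Pos.VERB, "gefahren")); // verb reading
    }
}
```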

An example is the German word form "gefahren":

* gefahren (past participle of "fahren", to drive)
* Gefahren (plural of "Gefahr", danger)

While the verb does not have a sentiment assigned, the noun has the following 
entry:

    Gefahr|NN   -1.0    Gefahren

Because of that, the SentimentClassifier currently assigns an incorrect 
sentiment value of '-1' even if the word is used as a verb in the text.

This will also require adapting the implementation so that it uses the POS tag 
when looking up words in the vocabulary.
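
One possible way to make the lookup POS-aware is to key the vocabulary by 
word form and POS category together. The sketch below assumes a 
case-insensitive lookup and a default value of 0.0 for words without an 
entry; the class PosAwareVocabularySketch, the Pos enum and the key format 
are all hypothetical and not taken from the existing code.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

public class PosAwareVocabularySketch {

    /** Hypothetical coarse POS category standing in for the real POS tag type. */
    public enum Pos { NOUN, VERB, OTHER }

    // Key: lower-cased word form + '|' + POS category (assumed format).
    private final Map<String, Double> weights = new HashMap<>();

    private static String key(Pos pos, String word) {
        return word.toLowerCase(Locale.GERMAN) + "|" + pos;
    }

    /** Registers a sentiment weight for one word form under one POS category. */
    public void add(Pos pos, String word, double weight) {
        weights.put(key(pos, word), weight);
    }

    /** Returns the weight for the word under the given POS, or 0.0 if there is no entry (assumed default). */
    public double classifyWord(Pos pos, String word) {
        return weights.getOrDefault(key(pos, word), 0.0);
    }

    public static void main(String[] args) {
        PosAwareVocabularySketch vocab = new PosAwareVocabularySketch();
        // Lemma and inflected form from the entry quoted above, noun only.
        vocab.add(Pos.NOUN, "Gefahr", -1.0);
        vocab.add(Pos.NOUN, "Gefahren", -1.0);

        System.out.println(vocab.classifyWord(Pos.NOUN, "Gefahren")); // noun entry is found
        System.out.println(vocab.classifyWord(Pos.VERB, "gefahren")); // no verb entry, default applies
    }
}
```

With the POS category in the key, the verb "gefahren" no longer matches the 
noun entry even though the lower-cased word forms are identical.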



