[ 
https://issues.apache.org/jira/browse/UIMA-1764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thilo Goetz closed UIMA-1764.
-----------------------------

    Resolution: Not A Problem

This does in fact work as designed.  The crucial piece of information that you 
didn't mention in your quote from the docs is that we must also have annot < b. 
 For your example, our b, the token "UIMA-framework" precedes the named entity 
"UIMA" by virtue of the fact that it starts at the same position, and is 
longer.  This is true independently of the type priorities.  The type 
priorities only come into play when two annotations start and end at the same 
position.

When you change the named entity to " UIMA", it now precedes "UIMA-framework", 
annot < b is true, and we see the expected result.

Please let us know if you still have questions, or if you have any suggestions 
on how to improve the documentation.


> Wrong behavior of the annotation index subiterator
> --------------------------------------------------
>
>                 Key: UIMA-1764
>                 URL: https://issues.apache.org/jira/browse/UIMA-1764
>             Project: UIMA
>          Issue Type: Bug
>          Components: Core Java Framework
>    Affects Versions: 2.2.2, 2.3
>            Reporter: Hannes Korte
>            Assignee: Thilo Goetz
>         Attachments: files.zip
>
>
> I noticed a strange behavior of the annotation index subiterator in
> uimaj 2.2.2 and 2.3.0.
> Consider the sentence: 'Testing the UIMA-Framework'
> with tokens: 'Testing' 'the' 'UIMA-Framework'
> and the named entity: 'UIMA'
> The type priorities list NamedEntity on top of the Token type.
> If I call the Token subiterator for the NamedEntity 'UIMA' with
> strict=false, I get an empty result. According to the docs, the
> definition of Tokens contained in the NamendEntity is in the
> strict=false setting defined as:
>   annot.getBegin() <= b.getBegin() <= annot.getEnd()
> for NamedEntity annot and Token b. This is true for 'UIMA' and
> 'UIMA-Framework', but the subiterator is empty.
> If I change the NamedEntity to ' UIMA' (including the preceeding space),
> then it works correctly, and the Token 'UIMA-Framework' is contained in
> the subiterator.
> I appended a simple java class with all needed files to demonstrate the
> problem. Any ideas?

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to