[ 
https://issues.apache.org/jira/browse/UIMA-6194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17066167#comment-17066167
 ] 

Peter Klügl commented on UIMA-6194:
-----------------------------------

What do you mean exactly by that you tried and that they pass the Matcher? With 
a script or with a direct call to the matcher?
The basic annotations are RutaBasic which are automatically created and managed 
to represent a complete disjunct partitioning. So you can modify the matching 
behavior of the literal string matches or also the dictionary lookup by adding 
your own annotations, e.g., decompounding.

If you have prepended other analysis engines or if you used some simple regex 
rules or if you modified offsets manually, there could be RutaBasics smaller 
than TokenSeeds.

(This may sound strange but I think it's a cool feature)


> Ruta: RutaLiteralMatcher throws exception for special choice of string
> ----------------------------------------------------------------------
>
>                 Key: UIMA-6194
>                 URL: https://issues.apache.org/jira/browse/UIMA-6194
>             Project: UIMA
>          Issue Type: Bug
>          Components: Ruta
>    Affects Versions: 2.8.0ruta
>            Reporter: Michael Stenger
>            Assignee: Peter Klügl
>            Priority: Minor
>             Fix For: 2.8.1ruta, 3.0.1ruta
>
>
> For certain combinations of document text and RuleElementLiteral in the 
> script, method getAnnotation of class RutaLiteralMatcher throws a 
> NullPointerException.  This seems to be the case whenever the used string is 
> a postfix or infix of a word in the document, but itself doesn't occur.
> h4. Example
> Script
>  
> {code:java}
> DECLARE testType;
> "est" {-> testType};
> "est te"{-> testType};
> {code}
> Document
>  
> {code:java}
> test test{code}
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to