[
https://issues.apache.org/jira/browse/UIMA-3775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Peter Klügl reassigned UIMA-3775:
---------------------------------
Assignee: Peter Klügl
> Ruta: Fast multi token dictionary matching on feature values
> ------------------------------------------------------------
>
> Key: UIMA-3775
> URL: https://issues.apache.org/jira/browse/UIMA-3775
> Project: UIMA
> Issue Type: New Feature
> Components: ruta
> Affects Versions: 2.2.0ruta
> Reporter: Peter Klügl
> Assignee: Peter Klügl
> Fix For: 2.2.1ruta
>
>
> Ruta: Fast multi token dictionary matching on feature values.
> Something like checking not the covered text of the RutaBasics, but the
> string values of some features when a dictionary is applied with MARKFAST.
> The current use case is to apply a Lemmatizer and find all token sequences
> whose lemmas are listed as a sequence in the dictionary. Should also be
> useful for other tasks. For single tokens it is already possible with the
> INLIST condition:
> {noformat}
> (pos.N{INLIST(FirstNameList, Lemma.value)}){-> MARK(FirstName)};
> {noformat}
--
This message was sent by Atlassian JIRA
(v6.2#6252)