Contribution of Lucene CAS Indexer
----------------------------------

                 Key: UIMA-1299
                 URL: https://issues.apache.org/jira/browse/UIMA-1299
             Project: UIMA
          Issue Type: New Feature
          Components: Sandbox
            Reporter: Rico Landefeld


Lucas is a UIMA CAS consumer component which writes CAS data into a Lucene 
index. It is based on a XML-based "mapping configuration  file" in which the 
user can determine which UIMA annotations should be put into which Lucene 
field, and how this field is set up (e.g. indexed and/or stored). In addition, 
some basic functionality for (ontolgical) hypernym indexing is provided.

Additionally, Lucas is able to perform offset-based token stream alignment and 
merging of UIMA annotations (via token position increment) in the same Lucene 
field (e.g. "documenttext" or "title")


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to