Contribution of Lucene CAS Indexer
----------------------------------

                 Key: UIMA-1299
                 URL: https://issues.apache.org/jira/browse/UIMA-1299
             Project: UIMA
          Issue Type: New Feature
          Components: Sandbox
            Reporter: Rico Landefeld

Lucas is a UIMA CAS consumer component that writes CAS data into a Lucene index. It is driven by an XML-based "mapping configuration file" in which the user specifies which UIMA annotations go into which Lucene field and how each field is set up (e.g. indexed and/or stored). Some basic functionality for (ontological) hypernym indexing is also provided. In addition, Lucas can perform offset-based token stream alignment and merge UIMA annotations (via token position increment) into the same Lucene field (e.g. "documenttext" or "title").
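
To illustrate what merging via token position increment means, here is a minimal sketch in plain Java. It is not Lucas code and uses neither the Lucene nor the UIMA API: the class TokenAlignmentSketch, its Token type and the merge method are hypothetical names introduced only for this example. It also assumes that two annotations count as aligned when they share the same begin offset, which may differ from the rule Lucas actually applies. Aligned tokens get a position increment of 0, so they occupy the same position in the field; every other token advances the position by 1.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical illustration only; not part of Lucas, Lucene or UIMA.
final class TokenAlignmentSketch {

    /** Stand-in for a UIMA annotation that has been turned into an index token. */
    public static final class Token {
        public final String text;
        public final int begin;             // character offset where the token starts
        public final int end;               // character offset where the token ends
        public final int positionIncrement; // 0 = same position as the previous token

        public Token(String text, int begin, int end, int positionIncrement) {
            this.text = text;
            this.begin = begin;
            this.end = end;
            this.positionIncrement = positionIncrement;
        }
    }

    /**
     * Merges two token streams by offset. A token that starts at the same
     * offset as the token before it gets increment 0 (assumed alignment rule);
     * all other tokens get increment 1.
     */
    public static List<Token> merge(List<Token> first, List<Token> second) {
        List<Token> all = new ArrayList<>(first);
        all.addAll(second);
        // List.sort is stable, so tokens from 'first' precede equal tokens from 'second'.
        all.sort(Comparator.comparingInt((Token t) -> t.begin)
                           .thenComparingInt(t -> t.end));

        List<Token> merged = new ArrayList<>();
        int previousBegin = -1; // no token seen yet
        for (Token t : all) {
            int increment = (t.begin == previousBegin) ? 0 : 1;
            merged.add(new Token(t.text, t.begin, t.end, increment));
            previousBegin = t.begin;
        }
        return merged;
    }

    public static void main(String[] args) {
        // Text: "aspirin helps"; one stream of word tokens, one of entity labels.
        List<Token> words = List.of(new Token("aspirin", 0, 7, 1),
                                    new Token("helps", 8, 13, 1));
        List<Token> entities = List.of(new Token("DRUG", 0, 7, 1));

        for (Token t : merge(words, entities)) {
            System.out.println(t.text + " [" + t.begin + "," + t.end + ") +"
                    + t.positionIncrement);
        }
    }
}
```

In this example "aspirin" and "DRUG" end up at the same position (increments 1 and 0), and "helps" follows one position later, so a phrase query could match either the word or the label at that position. In a real Lucene analysis chain the increment would be carried by the token stream's position increment attribute rather than by a field on a custom class.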