The Apache UIMA team is pleased to announce the release of Apache UIMA Ruta (Rule-based Text Annotation), version 2.8.0.
The Unstructured Information Management Architecture (UIMA) is a component framework supporting development, discovery, composition, and deployment of multi-modal analytics tasked with the analysis of unstructured information. Apache UIMA is an Apache licensed open source implementation of the UIMA specification which is being developed by a technical committee within OASIS, a standards organization. The implementation comprises an SDK and tooling for composing and running analytic components written in Java and C++, with some support for Perl, Python and TCL. Apache UIMA Ruta is a rule-based script language supported by Eclipse-based tooling. The language is designed to enable rapid development of text processing applications within UIMA. A special focus lies on the intuitive and flexible domain specific language for defining patterns of annotations. The Eclipse-based tooling, called the Apache UIMA Ruta Workbench, supports the user and facilitates every step when writing rules. Both the rule language and the workbench integrate smoothly with Apache UIMA. Major Changes in this Release This release contains changes in the analysis engine descriptor and the type system descriptors and therefore requires an update of Ruta projects. This can be accomplished, for example, by right-clicking on the Ruta project, selecting "UIMA Ruta" and then "Update Project". UIMA Ruta Language and Analysis Engine: - The analysis engine is able to generate debug information about inlined rules which includes also an extension of the ruta type system. - Type expressions in dot notation for annotation expressions a new supported: a1:ANY a2:ANY{a1.type==a2.type -> Type}; - Matching on string literals is no more restricted to single RutaBasic annotations, e.g., it is now possible to write: "This is a test"{-> Test}; - Dictionary matching is now more robust concerning white spaces in the word list. The parameter dictRemoveWS is now also set to true by default. - Fixed anchors at composed rule elements. - Labels at macro actions are supported now. - Fixed several bugs. UIMA Ruta Workbench: - New view for visualizing the explanation of inlined rules. - Fixed problem with blocked build processes in Ruta projects with many scripts. - Fixed bugs. For a full list of the changes, please refer to Jira: http://uima.apache.org/d/ruta-2.8.0/issuesFixed/jira-report.html More information about UIMA Ruta can be found here: http://uima.apache.org/ruta.html - Peter Klügl, for the Apache UIMA development team
signature.asc
Description: OpenPGP digital signature