The Apache UIMA team is pleased to announce the release of the Apache UIMA-DUCC version 2.2.2.

The Unstructured Information Management Architecture (UIMA) is a component framework supporting development, discovery, composition, and deployment of multi-modal analytics tasked with the analysis of unstructured information.

Apache UIMA is an Apache licensed open source implementation of the UIMA specification which is being developed by a technical committee within OASIS, a standards organization. The implementation comprises an SDK and tooling for composing and running analytic components written in Java and
C++, with some support for Perl, Python and TCL.

DUCC stands for Distributed UIMA Cluster Computing. DUCC is a cluster management system providing tooling,management, and scheduling facilities to automate the scale-out of applications written to the UIMA framework. Core UIMA provides a generalized framework for applications that process unstructured information such as human language, but does not provide a scale-out mechanism. UIMA-AS provides a scale-out mechanism to distribute UIMA pipelines over a cluster of computing resources, but does not provide job or cluster management of the resources.

DUCC defines a formal job model that closely maps to a standard UIMA pipeline. Around this job model DUCC provides cluster management services to automate the scale-out of UIMA pipelines over computing clusters. It is primarily intended to be used for research and development activities where multiple users need to efficiently share cluster resources for a wide variety of computational activities. Because DUCC is built for UIMA-based analytics from the ground up it automatically makes available such details as what annotators are currently initializing as well as the timing breakdown for each primitive annotator in a pipeline.

More information about DUCC is here: https://uima.apache.org/d/uima-ducc-2.2.2/duccbook.html

This release contains a number of improvements and bug fixes. Notable updates in this release include:

- Upgraded UIMA (2.10.2), UIMA-AS (2.10.2), ActiveMQ (5.15.2), Spring Framework (4.3.9), Camel (2.19.3), XStream (1.4.10) - Requires java 8 for builds and runtime. User jobs and services must use java 8.
- Fixed rogue process detector to prevent truncation of user names
- Reduced size of publications between the PM and Agents
- Improved Agent's detection of defunct processes
- Improved logging of system events ducc_watcher improvements
- Job & managed reservation specs now saved in DB
- DUCC Web Server (WS) queries DB too often
- DUCC Web Server (WS) should use Resource Manager (RM) calculated value for machine's memory available

For a complete list of bugs and improvements included in this release
please see https://issues.apache.org/jira/browse/UIMA-5746?jql=project%20%3D%20UIMA%20AND%20fixVersion%20%3D%20%222.2.2-Ducc%22%20

To download DUCC please visit: https://uima.apache.org/downloads.cgi

-- Jerry Cwiklik, for the Apache UIMA development team

Reply via email to