dkpro-core's BreakIteratorSegmenter (rather: its base class) takes the same 
approach. It allows you to specify that segmentation should occur within 
"zones", defined by some other annotation type.

https://code.google.com/p/dkpro-core-asl/source/browse/de.tudarmstadt.ukp.dkpro.core-asl/trunk/de.tudarmstadt.ukp.dkpro.core.api.segmentation-asl/src/main/java/de/tudarmstadt/ukp/dkpro/core/api/segmentation/SegmenterBase.java


Usage:

        pipeline.add(createEngineDescription(BreakIteratorSegmenter.class,
                BreakIteratorSegmenter.PARAM_ZONE_TYPES, new String[] { 
MyZoneAnnotation.class.getName() }));

Cheers, Oli

-----Original Message-----
From: Thomas Ginter [mailto:thomas.gin...@utah.edu] 
Sent: Tuesday, June 17, 2014 2:20 PM
To: user@uima.apache.org
Subject: Re: Restricting a aggregate engine to a substring or mention

We do this by having a parameter for some of our standard annotators, like our 
RegexAnnotator, that allows the user to specify an annotation type.  If a type 
is specified then the operations of the annotator are restricted to the covered 
text of the annotation type instances specified.  If no annotation type is 
provided then the entire document is assumed.  In that way we can have 
annotators that perform some logic to find the regions of interest and then the 
subsequent annotators only operate on those regions.

Thanks,

Thomas Ginter
801-448-7676
thomas.gin...@utah.edu




On Jun 12, 2014, at 4:00 AM, Dr. Armin Wegner <arminweg...@googlemail.com> 
wrote:

> Hello!
> 
> Is there an UIMA component which restricts an aggregated analysis 
> engine to a substring of the document text or to mentions of a given 
> annotation type? That is, is there a UIMA aquivalent to GATE's Segment 
> Processing PR?
> 
> Thanks,
> Armin

Reply via email to