Hi,

I'm working with the cTAKES pipeline to annotate some clinical text. The cTAKES 
syntactic chunker generates overlapping and nested annotations with the same 
syntactic type. For example:

[NP ascending [NP colon polyps]]

What I would like to do is to use TextMarker rules to eliminate nested 
annotations, so at the end of the day just have the following:

[NP colon ascending polyps]

I've tried to use UNMARK, but the following two rules appear to remove all NPs 
starting at the first match, even the containing annotation:

NP{PARTOF(NP)->UNMARK(NP)};
NP{CONTAINS(NP)->UNMARK(NP)};

Is there way to accomplish this that I'm missing? Using loops perhaps?

Thanks,

Will

Reply via email to