Hi Sekhar,
You can add the RegexSectionizer in ctakes-core to your pipeline, then code
such as:
final Collection<Segment> sections = JCasUtil.select( jCas, Segment.class
);
for ( Segment section : sections ) {
System.out.println( section.getId() );
System.out.println( section.getCoveredText() );
}
The section names (id) come from a file listing names and regular expressions
in ctakes-core-res org.apache.ctakes.core.sections.DefaultSectionRegex.bsv
It is an incomplete list, please add to it if you can.
Sean
-----Original Message-----
From: Liam Bui [mailto:[email protected]]
Sent: Monday, August 21, 2017 12:18 PM
To: [email protected]
Cc: [email protected]
Subject: Re: Clinical documents section heading recognizer [EXTERNAL]
Hi Sekhar,
What you described seems to be related to Clinical Document Architecture
(CDA):
https://urldefense.proofpoint.com/v2/url?u=https-3A__apache.googlesource.com_ctakes_-2B_trunk_ctakes-2Dclinical-2Dpipeline_SystemArchitectureOverview.txt&d=DwIBaQ&c=qS4goWBT7poplM69zy_3xhKwEW14JZMSdioCoppxeFU&r=fs67GvlGZstTpyIisCYNYmQCP6r0bcpKGd4f7d4gTao&m=zrvmFugHM6SD_d67Z5AlmrKS-XJnKoFH3sPcFUjgu8g&s=2tf5LOZU9ITY3oyo-A7FetDbdBy8YmM57ys78r1rto8&e=
I never manage to get CDA working in cTAKES though.
On Sun, Aug 20, 2017 at 8:13 PM, Hari, Sekhar <[email protected]> wrote:
> Hello there -
>
> With the latest version of cTAKES, is it possible to recognize and
> extract the clinical section headings in clinical documents? For
> example, my use case is like this:
>
> 1. 'Extract the BP readings from the 'Vital Signs' or 'Physical
> Examination' section. If BP reading(s) are mentioned in other places
> of the document too, ignore those readings and consider only the one
> in 'Vital Signs' or 'Physical Examination' section.
>
> 2. Ignore everything mentioned under 'Family History' section.
>
> I would be most grateful if you can share your thoughts / code snippet
> examples in cTAKES.
>
> Thanks,
> Sekhar H.
>