[
https://issues.apache.org/jira/browse/CTAKES-314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14169308#comment-14169308
]
Pei Chen commented on CTAKES-314:
---------------------------------
UMLS Annotator requires: SentenceDetector, POSTagger, Chunker (The reason is
that there is a configuration where only Noun Phrases are used for the lookup.)
Is there an easy way I can try out this setup as well? Let me know if they
were removed because it caused other errors...
def getDefaultPipeline():AnalysisEngineDescription = {
def builder = new AggregateBuilder
builder.add(SimpleSegmentAnnotator.createAnnotatorDescription());
builder.add(SentenceDetector.createAnnotatorDescription());
builder.add(TokenizerAnnotatorPTB.createAnnotatorDescription());
builder.add(ContextDependentTokenizerAnnotator.createAnnotatorDescription());
builder.add(POSTagger.createAnnotatorDescription());
builder.add(Chunker.createAnnotatorDescription());
builder.add(AnalysisEngineFactory.createPrimitiveDescription(classOf[CopyNPChunksToLookupWindowAnnotations]));
builder.add(AnalysisEngineFactory.createPrimitiveDescription(classOf[RemoveEnclosedLookupWindows]));
builder.add(UmlsDictionaryLookupAnnotator.createAnnotatorDescription());
builder.add(PolarityCleartkAnalysisEngine.createAnnotatorDescription());
return builder.createAggregateDescription();
}
> BigTop/Hadoop cTAKES integration
> --------------------------------
>
> Key: CTAKES-314
> URL: https://issues.apache.org/jira/browse/CTAKES-314
> Project: cTAKES
> Issue Type: New Feature
> Affects Versions: 3.2.0
> Reporter: Pei Chen
> Fix For: 3.2.3
>
> Attachments: Napkin_cTAKES_Hadoop.JPG
>
>
> Placeholder to-
> Create a simple application that can take in different datasources (public
> forums, twitter, etc.), scale up cTAKES using BigTop/Hadoop ecosystem.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)