[ 
https://issues.apache.org/jira/browse/CTAKES-314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14169308#comment-14169308
 ] 

Pei Chen commented on CTAKES-314:
---------------------------------

UMLS Annotator requires: SentenceDetector, POSTagger, Chunker (The reason is 
that there is a configuration where only Noun Phrases are used for the lookup.) 
 Is there an easy way I can try out this setup as well?  Let me know if they 
were removed because it caused other errors...

 def getDefaultPipeline():AnalysisEngineDescription = {
    def builder = new AggregateBuilder
    builder.add(SimpleSegmentAnnotator.createAnnotatorDescription());
    builder.add(SentenceDetector.createAnnotatorDescription());
    builder.add(TokenizerAnnotatorPTB.createAnnotatorDescription());
    
builder.add(ContextDependentTokenizerAnnotator.createAnnotatorDescription());
    builder.add(POSTagger.createAnnotatorDescription());
    builder.add(Chunker.createAnnotatorDescription());
 
builder.add(AnalysisEngineFactory.createPrimitiveDescription(classOf[CopyNPChunksToLookupWindowAnnotations]));
 
builder.add(AnalysisEngineFactory.createPrimitiveDescription(classOf[RemoveEnclosedLookupWindows]));
    builder.add(UmlsDictionaryLookupAnnotator.createAnnotatorDescription());
    builder.add(PolarityCleartkAnalysisEngine.createAnnotatorDescription());
    return builder.createAggregateDescription();
  }

> BigTop/Hadoop cTAKES integration
> --------------------------------
>
>                 Key: CTAKES-314
>                 URL: https://issues.apache.org/jira/browse/CTAKES-314
>             Project: cTAKES
>          Issue Type: New Feature
>    Affects Versions: 3.2.0
>            Reporter: Pei Chen
>             Fix For: 3.2.3
>
>         Attachments: Napkin_cTAKES_Hadoop.JPG
>
>
> Placeholder to-
> Create a simple application that can take in different datasources (public 
> forums, twitter, etc.), scale up cTAKES using BigTop/Hadoop ecosystem.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to