Re: Apache cTAKES Example Application?

2014-04-17 Thread John Green
+1! Im mainly using ctakes as middleware, which is totally inline with this. What is NCBO?  JG — Sent from Mailbox for iPhone On Wed, Apr 16, 2014 at 6:53 PM, andy mcmurry mcmurry.a...@gmail.com wrote: Lowering the barrier to entry = worth the effort Notice the NCBO users mailing

lvg entries

2014-04-17 Thread Miller, Timothy
The LVG annotator creates an enormous number of lemmas for every WordToken in the CAS, and I'm wondering what the original purpose was? I think this is probably a minor bottleneck for speed but mostly a pretty big space hog (at least 50% of the space of xmi files in my tests). As of right now I'm

Re: lvg entries

2014-04-17 Thread Dligach, Dmitriy
Tim, this is a very interesting observation. Could you please send a few examples of what LVG generates? Both sensical and non :) Dima On Apr 17, 2014, at 11:28, Miller, Timothy timothy.mil...@childrens.harvard.edu wrote: The LVG annotator creates an enormous number of lemmas for every

Re: lvg entries

2014-04-17 Thread Miller, Timothy
Sure, just as an example, I gave it a note with about 1000 words. It generates 11500 NonEmptyFSList elements (each is basically one lexical variant). For the word symptomatic, these are the first 10 of 20 lexical variants: Symptomaticer/JJ Symptomaticer/RB Symptomaticed/VB Symptomaticcing/VB

Re: lvg entries

2014-04-17 Thread Dligach, Dmitriy
I don’t know of any applications within cTAKES that make use of this… The reverse (mapping from these “variants” to the normal form) may be useful though. Dima On Apr 17, 2014, at 11:50, Miller, Timothy timothy.mil...@childrens.harvard.edu wrote: Sure, just as an example, I gave it a note

RE: lvg entries

2014-04-17 Thread Finan, Sean
Those variants are not used by the dictionary lookup. I did look at them to see if it was worthwhile for the new dictionary, but they are all over the place so I passed. From: Miller, Timothy [timothy.mil...@childrens.harvard.edu] Sent: Thursday, April

RE: lvg entries

2014-04-17 Thread Masanz, James J.
Offhand I recall at least one of the dependency parsers used the Lemma annotations at one point. Not sure if still does. There is an option for turning off the posting of the lemmas to the cas. Hope that helps -Original Message- From: Miller, Timothy

Re: lvg entries

2014-04-17 Thread Miller, Timothy
Thanks James. Does it ring a bell to you that the original intention was something like query expansion for a dictionary lookup? Tim On 04/17/2014 01:57 PM, Masanz, James J. wrote: Offhand I recall at least one of the dependency parsers used the Lemma annotations at one point. Not sure if

RE: lvg entries

2014-04-17 Thread Masanz, James J.
Before the switch to OpenNLP (which was done before the first opensource release of cTAKES), I believe the Lemma annotations were used by the POS tagger and/or phrasal parser. As far as I know, that was the original intention of the Lemmas. I believe they were turned off by default for some

Re: lvg entries

2014-04-17 Thread andy mcmurry
There is a lot of config handling, maybe PostLemmas is being set to true or configInit() is not setting up the NLM wrapper incorrectly. ctakes-lvg *README* Note: as distributed, PostLemmas is set to false. This is done to reduce the size of the CAS. Set PostLemmas to true to have