Hi All
Having created a new dictionary from the 2020AA UMLS and added Genes and
Receptors to the dictionary-creator's default selections, I have a curious
problem where cTakes now assigns the most bizarre acronyms to ordinary
words used in POS contexts where it shouldn't find <XXX>Mentions.
Here are two examples:
1. soft (in "soft tissue...")
becomes "SHORT STATURE, ONYCHODYSPLASIA, FACIAL DYSMORPHISM, AND
HYPOTRICHOSIS SYNDROME",
2. bed in ("The wound bed was...")
becomes "BORNHOLM EYE DISEASE"
I have not changed the TermConsumer type in the descriptor XML.
Are the DictionaryCreator's defaults, the equivalent to the default sno_rx
that's delivered with the app?
Attached is the vocab subsets list I used
Peter