Tomasz Oliwa created CTAKES-389:
-----------------------------------
Summary: cTAKES dictionary lookup missed word starting string bug
Key: CTAKES-389
URL: https://issues.apache.org/jira/browse/CTAKES-389
Project: cTAKES
Issue Type: Bug
Components: ctakes-dictionary-lookup-fast
Affects Versions: 3.2.2, 3.2.3
Environment: All environments
Reporter: Tomasz Oliwa
cTAKES has a bug in its fast dictionary lookup.
"baby to" , "baby too" gets looked up as C1305907 of "baby tooth", however
"baby token" does not match it.
"electrolyte le", "electrolyte lev" gets found as C0428284 "electrolyte level",
but "electrolyte dev" does not match.
It seems if the "missed" word contains the same characters that the word found
in the fast dictionary starts with, a match is made.
This is a bug.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)