britt fitch created CTAKES-341:
----------------------------------
Summary: FractionFSM annotates incorrect span
Key: CTAKES-341
URL: https://issues.apache.org/jira/browse/CTAKES-341
Project: cTAKES
Issue Type: Bug
Components: ctakes-context-tokenizer
Affects Versions: 3.2.0
Reporter: britt fitch
Assignee: britt fitch
Fix For: 3.2.2
It appears that when a decimal is followed by a range that the FractionFSM
incorrectly annotates the FractionToken
given:
{code}
FOO 4.5 3.5-4.7
{code}
produces the following FractionTokens:
* "4.5"
* "4.5 3.5"
* "4.7"
after fsm.reset we need to also add the following in order to move the start
position and allow sequential END states to be handled correctly:
{code} tokenStartMap.put(fsm, tokenStartIndex); {code}
i will create a test case and verify this solution before committing it back to
trunk.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)