britt fitch created CTAKES-371:
----------------------------------

             Summary: update PTB tokenization logic in fast dictionary module
                 Key: CTAKES-371
                 URL: https://issues.apache.org/jira/browse/CTAKES-371
             Project: cTAKES
          Issue Type: Bug
          Components: ctakes-dictionary-lookup
    Affects Versions: 3.2.2
            Reporter: britt fitch
            Assignee: Sean Finan
             Fix For: 3.2.3


PTB tokenization logic is used in places like the tokenizer & dictionary 
building code. 

For example, given “22q11.2 deletion syndrome”:

PTB tokenizer: [22q11, .2, deletion, syndrome]

Dictionary module:  [22q11, ., 2, deletion, syndrome]   
(RareWordTermMapCreator.getTokens)

Dictionary module should be updated to match PTB tokenization logic used 
elsewhere in ctakes.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to