seanfinan commented on issue #19:
URL: https://github.com/apache/ctakes/issues/19#issuecomment-2142787585

   There is/was documentation somewhere, but it is spread out and would take me 
some time to find the latest and most accurate.  However, you have have a look 
at the example here 
https://github.com/apache/ctakes/blob/main/ctakes-dictionary-lookup-fast/src/user/resources/org/apache/ctakes/dictionary/lookup/fast/bsv/tinyDict.bsv
   Column 1 has the cui, 2 has the tui, 3 the synonym.  If there is a 4th entry 
then it is used as the preferred text for the concept.  The lookupXml for the 
example is here: 
https://github.com/apache/ctakes/blob/main/ctakes-dictionary-lookup-fast/src/user/resources/org/apache/ctakes/dictionary/lookup/fast/bsv/tinyDictSpec.xml
   You just want to point to that lookupXml file.  on the command line "-l 
bsv/tinyDictSpec.xml".  You may need to use the full path.  This all uses the 
code here: 
https://github.com/apache/ctakes/blob/main/ctakes-dictionary-lookup-fast/src/main/java/org/apache/ctakes/dictionary/lookup2/dictionary/BsvRareWordDictionary.java
   and here: 
https://github.com/apache/ctakes/blob/main/ctakes-dictionary-lookup-fast/src/main/java/org/apache/ctakes/dictionary/lookup2/concept/BsvConceptFactory.java
   If you aren't certain about the tui you can use T000 for Unknown


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@ctakes.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to