seanfinan commented on issue #19: URL: https://github.com/apache/ctakes/issues/19#issuecomment-2142787585
There is/was documentation somewhere, but it is spread out and would take me some time to find the latest and most accurate. However, you have have a look at the example here https://github.com/apache/ctakes/blob/main/ctakes-dictionary-lookup-fast/src/user/resources/org/apache/ctakes/dictionary/lookup/fast/bsv/tinyDict.bsv Column 1 has the cui, 2 has the tui, 3 the synonym. If there is a 4th entry then it is used as the preferred text for the concept. The lookupXml for the example is here: https://github.com/apache/ctakes/blob/main/ctakes-dictionary-lookup-fast/src/user/resources/org/apache/ctakes/dictionary/lookup/fast/bsv/tinyDictSpec.xml You just want to point to that lookupXml file. on the command line "-l bsv/tinyDictSpec.xml". You may need to use the full path. This all uses the code here: https://github.com/apache/ctakes/blob/main/ctakes-dictionary-lookup-fast/src/main/java/org/apache/ctakes/dictionary/lookup2/dictionary/BsvRareWordDictionary.java and here: https://github.com/apache/ctakes/blob/main/ctakes-dictionary-lookup-fast/src/main/java/org/apache/ctakes/dictionary/lookup2/concept/BsvConceptFactory.java If you aren't certain about the tui you can use T000 for Unknown -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
