Dear Wiki user, You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification.
The "cTAKESParser" page has been changed by ChrisMattmann: https://wiki.apache.org/tika/cTAKESParser?action=diff&rev1=1&rev2=2 Tika now has the ability to leverage [[http://ctakes.apache.org|Apache cTAKES]] for use in parsing biomedical information from text. Read on to figure out how to install it. + = Installing cTAKES = + + The first step to getting the parser up and running is installing Apache cTAKES. + Read on the following should work well on *nix systems. Windows directions are TODO. + It's very important to install at least cTAKES version 3.2.2 or later. + + 1. `mkdir -p $HOME/src && cd $HOME/src` + 2. `curl -O http://mirrors.ibiblio.org/apache//ctakes/ctakes-3.2.2/apache-ctakes-3.2.2-bin.tar.gz` + 3. `tar xvzf *.tar.gz` + 4. `export CTAKES_HOME=$HOME/src/apache-ctakes-3.2.2` + + Now you have to download a separate resources package for cTAKES: + + 1. `cd $HOME/src` + 2. `curl -o ctakes-resources-3.2.1.1-bin.zip "http://downloads.sourceforge.net/project/ctakesresources/ctakes-resources-3.2.1.1-bin.zip?r=http%3A%2F%2Fsourceforge.net%2Fprojects%2Fctakesresources%2F%3Fsource%3Dtyp_redirect&ts=1433609725&use_mirror=softlayer-dal"` + 3. `mv *.zip apache-ctakes-3.2.2` + 4. `cd apache-ctakes-3.2.2` + 5. `unzip ctakes-resources-3.2.1.1-bin.zip` + + After the above is done, cTAKES is installed. + + = Signing up for a UMLS account = + + To use cTAKES and the cTAKES Tika Parser you need a Unified Medical Language System (UMLS) account. + You can sign up for one [[https://uts.nlm.nih.gov/home.html|here]]. +
