Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Tika Wiki" for change 
notification.

The "cTAKESParser" page has been changed by ChrisMattmann:
https://wiki.apache.org/tika/cTAKESParser?action=diff&rev1=1&rev2=2

  Tika now has the ability to leverage [[http://ctakes.apache.org|Apache 
cTAKES]] for use in parsing biomedical information from text. Read on to figure 
out how to install it.
  
+ = Installing cTAKES =
+ 
+ The first step to getting the parser up and running is installing Apache 
cTAKES.
+ Read on the following should work well on *nix systems. Windows directions 
are TODO.
+ It's very important to install at least cTAKES version 3.2.2 or later.
+ 
+  1. `mkdir -p $HOME/src && cd $HOME/src` 
+  2. `curl -O 
http://mirrors.ibiblio.org/apache//ctakes/ctakes-3.2.2/apache-ctakes-3.2.2-bin.tar.gz`
+  3. `tar xvzf *.tar.gz`
+  4. `export CTAKES_HOME=$HOME/src/apache-ctakes-3.2.2`
+ 
+ Now you have to download a separate resources package for cTAKES:
+ 
+  1. `cd $HOME/src`
+  2. `curl -o ctakes-resources-3.2.1.1-bin.zip 
"http://downloads.sourceforge.net/project/ctakesresources/ctakes-resources-3.2.1.1-bin.zip?r=http%3A%2F%2Fsourceforge.net%2Fprojects%2Fctakesresources%2F%3Fsource%3Dtyp_redirect&ts=1433609725&use_mirror=softlayer-dal"`
+  3. `mv *.zip apache-ctakes-3.2.2`
+  4. `cd apache-ctakes-3.2.2`
+  5. `unzip ctakes-resources-3.2.1.1-bin.zip`
+ 
+ After the above is done, cTAKES is installed.
+ 
+ = Signing up for a UMLS account =
+ 
+ To use cTAKES and the cTAKES Tika Parser you need a Unified Medical Language 
System (UMLS) account.
+ You can sign up for one [[https://uts.nlm.nih.gov/home.html|here]].
+ 

Reply via email to