Re: Hello cTAKES Mailing List

2015-02-23 Thread Pei Chen
Raymond,
Probably a combination of UMLS *Consumer Health Vocabulary + Custom
Dictionary (as Sean described) *may work for the use case*:*
OAC CHV connects informal, common words and phrases about health to
technical terms used by health care professionals. It includes jargon,
slang, ambiguous, and misspelled words as used by consumers and health care
professionals. Due to its nature, OAC CHV includes concepts that are not
represented by other source vocabularies within the Metathesaurus.

[1] http://www.nlm.nih.gov/research/umls/sourcereleasedocs/current/CHV/

On Sun, Feb 22, 2015 at 10:37 AM, Finan, Sean 
sean.fi...@childrens.harvard.edu wrote:

 Hi Raymond,

 If you use the dictionary-fast module there exists an entry feeling bad
 with cui 557911 and cui 231218.  There is also feel bad and feeling bad
 emotionally

 You will find horrible present pain but no other entry with horrible.
  You will not find any terms with awful and probably many other desired
 words.  If you are really interested in slang crappy, lousy, etc. then
 they are definitely not present.

 What you can do is create a second dictionary.  There are example custom
 dictionaries in
 -dictionary-lookup-fast-res/src/main/resources/org/apache/ctakes/dictionary/lookup/fast/example/bsv/
 You should look at custom_cui_bsv.bsv if you want to specify term unique
 id codes and term text alone.  If you want to add tui/group codes then look
 at custom_cui_tui_bsv.bsv  - you will probably want to model your
 dictionary after this so that you can tag your terms with tuis for
 symptoms.

 You will want to imitate sections from the corresponding .xml file in that
 directory.   Make a copy of cTakesHsql.xml (two dirs up) and add lines:
   dictionary
  nameCustomCuiRareWord/name

  
 implementationNameorg.apache.ctakes.dictionary.lookup2.BsvRareWordDictionary/implementationName
  properties
 property key=bsvPath
 value=org/apache/ctakes/dictionary/fast/example/custom_cui_tui_bsv.bsv/
  /properties
   /dictionary

 And

   conceptFactory
  nameCustomCuiConcept/name

  
 implementationNameorg.apache.ctakes.dictionary.lookup2.concept.BsvConceptFactory/implementationName
  properties
 property key=bsvPath
 value=org/apache/ctakes/dictionary/fast/example/custom_cui_tui_bsv.bsv/
  /properties
   /conceptFactory

 And

   dictionaryConceptPair
  nameCustomPair/name
  dictionaryNameCustomCuiRareWord/dictionaryName
  conceptFactoryNameCustomCuiConcept/conceptFactoryName
   /dictionaryConceptPair

 Then make sure that you point to your custom cTakesHsql.xml in
 dictionary-fast/desc/analysis_engine/UmlsLookupAnnotator.xml (or Overlap
 depending upon your use):

 nameDictionaryDescriptorFile/name
 description/
 fileResourceSpecifier

  
 fileUrlfile:org/apache/ctakes/dictionary/lookup/fast/cTakesHsqlYourCopy.xml/fileUrl
 /fileResourceSpecifier

 You can also skip the UMLS dictionary altogether and just use your custom
 dictionary.

 If you do give this a try then let me know  how it goes.  If you need
 additional assistance let me know and I will help the best I can.

 Sean


 -Original Message-
 From: Raymond Li [mailto:ray...@bu.edu]
 Sent: Saturday, February 21, 2015 1:26 PM
 To: dev@ctakes.apache.org
 Subject: Hello cTAKES Mailing List

 Hello, my name is is Raymond Li and I am currently working on a team
 project involving cTAKES. The goal of our project would be to use cTAKES to
 analyze posts on social media (such as tweets, forum posts, public
 available data) in order to catch in real-time any adverse effects of
 prescribed drugs and do a public service of protecting people from harmful
 drugs.

 Aside from this introduction, I do have only one question to ask to
 proceed with this project: Is cTAKES capable of understanding slang words
 as symptoms. An example is if I were to say I took Crestor and feeling bad
 is there a way for cTAKES to recognize that Crestor had a negative effect?
 My team has not been able to isolate 'bad' as a negative effect as it is
 not a defined medical symptom, but it would be nice to figure out if such a
 solution exists, or if we would need to develop our own solution and how we
 could go around doing it.

 My team and I would appreciate any comments or assistance regarding our
 project and this current issue. Thank you and have a nice day!

 --
 Sincerely,

 Raymond Li



RE: Hello cTAKES Mailing List

2015-02-23 Thread Finan, Sean
The CHV is a good resource for some things, but before going through the 
motions of porting it to a ctakes format, take a look inside.  

 
-Original Message-
From: Pei Chen [mailto:chen...@apache.org] 
Sent: Monday, February 23, 2015 1:52 PM
To: dev@ctakes.apache.org
Subject: Re: Hello cTAKES Mailing List

Raymond,
Probably a combination of UMLS *Consumer Health Vocabulary + Custom Dictionary 
(as Sean described) *may work for the use case*:* OAC CHV connects informal, 
common words and phrases about health to technical terms used by health care 
professionals. It includes jargon, slang, ambiguous, and misspelled words as 
used by consumers and health care professionals. Due to its nature, OAC CHV 
includes concepts that are not represented by other source vocabularies within 
the Metathesaurus.

[1] 
https://urldefense.proofpoint.com/v2/url?u=http-3A__www.nlm.nih.gov_research_umls_sourcereleasedocs_current_CHV_d=BQIBaQc=qS4goWBT7poplM69zy_3xhKwEW14JZMSdioCoppxeFUr=fs67GvlGZstTpyIisCYNYmQCP6r0bcpKGd4f7d4gTaom=1Bkpeno1tqLjX78o0wYm5DmJHCHlK7hrxpeEgPnGtRMs=-rEmTgTCe0mkSXT34XK56zkiuy_VxIfFvngGJzUwem8e=
 

On Sun, Feb 22, 2015 at 10:37 AM, Finan, Sean  
sean.fi...@childrens.harvard.edu wrote:

 Hi Raymond,

 If you use the dictionary-fast module there exists an entry feeling bad
 with cui 557911 and cui 231218.  There is also feel bad and feeling 
 bad emotionally

 You will find horrible present pain but no other entry with horrible.
  You will not find any terms with awful and probably many other 
 desired words.  If you are really interested in slang crappy, 
 lousy, etc. then they are definitely not present.

 What you can do is create a second dictionary.  There are example 
 custom dictionaries in 
 -dictionary-lookup-fast-res/src/main/resources/org/apache/ctakes/dicti
 onary/lookup/fast/example/bsv/ You should look at custom_cui_bsv.bsv 
 if you want to specify term unique id codes and term text alone.  If 
 you want to add tui/group codes then look at custom_cui_tui_bsv.bsv  - 
 you will probably want to model your dictionary after this so that you 
 can tag your terms with tuis for symptoms.

 You will want to imitate sections from the corresponding .xml file in that
 directory.   Make a copy of cTakesHsql.xml (two dirs up) and add lines:
   dictionary
  nameCustomCuiRareWord/name

  
 implementationNameorg.apache.ctakes.dictionary.lookup2.BsvRareWordDictionary/implementationName
  properties
 property key=bsvPath
 value=org/apache/ctakes/dictionary/fast/example/custom_cui_tui_bsv.bsv/
  /properties
   /dictionary

 And

   conceptFactory
  nameCustomCuiConcept/name

  
 implementationNameorg.apache.ctakes.dictionary.lookup2.concept.BsvConceptFactory/implementationName
  properties
 property key=bsvPath
 value=org/apache/ctakes/dictionary/fast/example/custom_cui_tui_bsv.bsv/
  /properties
   /conceptFactory

 And

   dictionaryConceptPair
  nameCustomPair/name
  dictionaryNameCustomCuiRareWord/dictionaryName
  conceptFactoryNameCustomCuiConcept/conceptFactoryName
   /dictionaryConceptPair

 Then make sure that you point to your custom cTakesHsql.xml in 
 dictionary-fast/desc/analysis_engine/UmlsLookupAnnotator.xml (or 
 Overlap depending upon your use):

 nameDictionaryDescriptorFile/name
 description/
 fileResourceSpecifier

  
 fileUrlfile:org/apache/ctakes/dictionary/lookup/fast/cTakesHsqlYourCopy.xml/fileUrl
 /fileResourceSpecifier

 You can also skip the UMLS dictionary altogether and just use your 
 custom dictionary.

 If you do give this a try then let me know  how it goes.  If you need 
 additional assistance let me know and I will help the best I can.

 Sean


 -Original Message-
 From: Raymond Li [mailto:ray...@bu.edu]
 Sent: Saturday, February 21, 2015 1:26 PM
 To: dev@ctakes.apache.org
 Subject: Hello cTAKES Mailing List

 Hello, my name is is Raymond Li and I am currently working on a team 
 project involving cTAKES. The goal of our project would be to use 
 cTAKES to analyze posts on social media (such as tweets, forum posts, 
 public available data) in order to catch in real-time any adverse 
 effects of prescribed drugs and do a public service of protecting 
 people from harmful drugs.

 Aside from this introduction, I do have only one question to ask to 
 proceed with this project: Is cTAKES capable of understanding slang 
 words as symptoms. An example is if I were to say I took Crestor and feeling 
 bad
 is there a way for cTAKES to recognize that Crestor had a negative effect?
 My team has not been able to isolate 'bad' as a negative effect as it 
 is not a defined medical symptom, but it would be nice to figure out 
 if such a solution exists, or if we would need to develop our own 
 solution and how we could go around doing it.

 My team and I would appreciate any comments or assistance regarding

RE: Hello cTAKES Mailing List

2015-02-22 Thread Finan, Sean
Hi Raymond,

If you use the dictionary-fast module there exists an entry feeling bad with 
cui 557911 and cui 231218.  There is also feel bad and feeling bad 
emotionally

You will find horrible present pain but no other entry with horrible.   You 
will not find any terms with awful and probably many other desired words.  If 
you are really interested in slang crappy, lousy, etc. then they are 
definitely not present.

What you can do is create a second dictionary.  There are example custom 
dictionaries in 
-dictionary-lookup-fast-res/src/main/resources/org/apache/ctakes/dictionary/lookup/fast/example/bsv/
You should look at custom_cui_bsv.bsv if you want to specify term unique id 
codes and term text alone.  If you want to add tui/group codes then look at 
custom_cui_tui_bsv.bsv  - you will probably want to model your dictionary after 
this so that you can tag your terms with tuis for symptoms.

You will want to imitate sections from the corresponding .xml file in that 
directory.   Make a copy of cTakesHsql.xml (two dirs up) and add lines: 
  dictionary
 nameCustomCuiRareWord/name
 
implementationNameorg.apache.ctakes.dictionary.lookup2.BsvRareWordDictionary/implementationName
 properties
property key=bsvPath 
value=org/apache/ctakes/dictionary/fast/example/custom_cui_tui_bsv.bsv/
 /properties
  /dictionary

And

  conceptFactory
 nameCustomCuiConcept/name
 
implementationNameorg.apache.ctakes.dictionary.lookup2.concept.BsvConceptFactory/implementationName
 properties
property key=bsvPath 
value=org/apache/ctakes/dictionary/fast/example/custom_cui_tui_bsv.bsv/
 /properties
  /conceptFactory

And

  dictionaryConceptPair
 nameCustomPair/name
 dictionaryNameCustomCuiRareWord/dictionaryName
 conceptFactoryNameCustomCuiConcept/conceptFactoryName
  /dictionaryConceptPair

Then make sure that you point to your custom cTakesHsql.xml in 
dictionary-fast/desc/analysis_engine/UmlsLookupAnnotator.xml (or Overlap 
depending upon your use):

nameDictionaryDescriptorFile/name
description/
fileResourceSpecifier
   
fileUrlfile:org/apache/ctakes/dictionary/lookup/fast/cTakesHsqlYourCopy.xml/fileUrl
/fileResourceSpecifier

You can also skip the UMLS dictionary altogether and just use your custom 
dictionary.

If you do give this a try then let me know  how it goes.  If you need 
additional assistance let me know and I will help the best I can.

Sean


-Original Message-
From: Raymond Li [mailto:ray...@bu.edu] 
Sent: Saturday, February 21, 2015 1:26 PM
To: dev@ctakes.apache.org
Subject: Hello cTAKES Mailing List

Hello, my name is is Raymond Li and I am currently working on a team project 
involving cTAKES. The goal of our project would be to use cTAKES to analyze 
posts on social media (such as tweets, forum posts, public available data) in 
order to catch in real-time any adverse effects of prescribed drugs and do a 
public service of protecting people from harmful drugs.

Aside from this introduction, I do have only one question to ask to proceed 
with this project: Is cTAKES capable of understanding slang words as symptoms. 
An example is if I were to say I took Crestor and feeling bad
is there a way for cTAKES to recognize that Crestor had a negative effect?
My team has not been able to isolate 'bad' as a negative effect as it is not a 
defined medical symptom, but it would be nice to figure out if such a solution 
exists, or if we would need to develop our own solution and how we could go 
around doing it.

My team and I would appreciate any comments or assistance regarding our project 
and this current issue. Thank you and have a nice day!

--
Sincerely,

Raymond Li