You may find the patient de-identifcation features of this project interesting:
http://www.mii.ucla.edu/index.php/MainSite:NLPHome
-Original Message-
From: [EMAIL PROTECTED]
To: openhealth@yahoogroups.com
Sent: Sat, 3 Mar 2007 11:31 AM
Subject: Re: [openhealth] Suppressing Sensi
Nandalal,
I believe you are exactly right. In the case of this interesting
problem the key issue is to identify the appropriate middleware
services to safely expose legacy patient data. This is a different
problem from the opportunity to create structured and coded data as
new systems
If one uses a structured report along the lines of the ASTM CCR, then
I think it would be "relatively" easy to remove the sensitive information,
since all of the data would be tagged.
Dave
Nandalal Gunaratne wrote:
> Will,
>
> It is not a good idea to have sensitive information in
> free text. I
Will Ross wrote:
>
> I'm looking for a tool to suppress sensitive information (e.g., HIV
> status, etc.) from free text clinical notes prior to allowing the
> notes to be published from a protected, physician-only area into
> general circulation patient records for the clinic. What existing
> FOSS
Thanks Ross!
Due to your question i have come to know the present
state of text mining and NLP. These will give you your
solution I guess.
http://portal.acm.org/citation.cfm?id=1089824&dl=acm&coll=&CFID=15151515&CFTOKEN=6184618
nandalal
--- Will Ross <[EMAIL PROTECTED]> wrote:
> Dear 80n,
>
>
Will,
It is not a good idea to have sensitive information in
free text. If you do, it should not go to "general
circulation", right?
How can one extract such info from free text? One way
is to remove such words from free text files using a
macro of some sort. "FInd and replace" can be used to
re