Re: [openhealth] Suppressing Sensitive Info From Free Text

2007-03-03 Thread madhusasidhar
You may find the patient de-identifcation features of this project interesting: http://www.mii.ucla.edu/index.php/MainSite:NLPHome -Original Message- From: [EMAIL PROTECTED] To: openhealth@yahoogroups.com Sent: Sat, 3 Mar 2007 11:31 AM Subject: Re: [openhealth] Suppressing Sensi

Re: [openhealth] Suppressing Sensitive Info From Free Text

2007-03-03 Thread Will Ross
Nandalal, I believe you are exactly right. In the case of this interesting problem the key issue is to identify the appropriate middleware services to safely expose legacy patient data. This is a different problem from the opportunity to create structured and coded data as new systems

Re: [openhealth] Suppressing Sensitive Info From Free Text

2007-03-03 Thread David Forslund
If one uses a structured report along the lines of the ASTM CCR, then I think it would be "relatively" easy to remove the sensitive information, since all of the data would be tagged. Dave Nandalal Gunaratne wrote: > Will, > > It is not a good idea to have sensitive information in > free text. I

Re: [openhealth] Suppressing Sensitive Info From Free Text

2007-03-03 Thread Adrian Midgley
Will Ross wrote: > > I'm looking for a tool to suppress sensitive information (e.g., HIV > status, etc.) from free text clinical notes prior to allowing the > notes to be published from a protected, physician-only area into > general circulation patient records for the clinic. What existing > FOSS

Re: [openhealth] Suppressing Sensitive Info From Free Text

2007-03-03 Thread Nandalal Gunaratne
Thanks Ross! Due to your question i have come to know the present state of text mining and NLP. These will give you your solution I guess. http://portal.acm.org/citation.cfm?id=1089824&dl=acm&coll=&CFID=15151515&CFTOKEN=6184618 nandalal --- Will Ross <[EMAIL PROTECTED]> wrote: > Dear 80n, > >

Re: [openhealth] Suppressing Sensitive Info From Free Text

2007-03-03 Thread Nandalal Gunaratne
Will, It is not a good idea to have sensitive information in free text. If you do, it should not go to "general circulation", right? How can one extract such info from free text? One way is to remove such words from free text files using a macro of some sort. "FInd and replace" can be used to re