How / when can we go about getting started on this?
From: "Chen, Pei" <[email protected]>
To: "[email protected]" <[email protected]>; Ted Strall
<[email protected]>
Sent: Friday, July 24, 2015 12:52 PM
Subject: RE: Combining Knowledge- and Data-driven Methods for
De-identification of Clinical Narratives
Ted- Welcome to the community!
I think this would be a great enhancement.
Jay- I think the BigTop folks did a lot with the smoke and integration tests...
Do you how they did it? Something we can reuse?
--Pei
-----Original Message-----
From: Ted Strall [mailto:[email protected]]
Sent: Friday, July 24, 2015 12:31 PM
To: [email protected]
Subject: Re: Combining Knowledge- and Data-driven Methods for De-identification
of Clinical Narratives
I would be interested in helping to develop / maintain a regression testing
framework for that.
I'm new to ctakes (and just recently started stalking the dev mailing list) but
I've been a software engineer for 20 years and have done a lot of framework
automation stuff that will probably be required. As I write this, I am working
on an automated integration test that will run on Jenkins that fires up and
load an h2 database, a solr instance, an in-house indexing pipeline and an
in-house search service, indexes 10k documents and executes and evaluates some
canned queries before shutting itself down.
I'm also working on a MS in Predictive Analytics and I am interested in
applying machine learning and NLP to medical informatics, so I would welcome
the chance to get dirty with that side of stuff, also.
From: Jay Vyas <[email protected]>
To: "[email protected]" <[email protected]>
Sent: Friday, July 24, 2015 10:44 AM
Subject: Re: Combining Knowledge- and Data-driven Methods for
De-identification of Clinical Narratives
Yes this is very interesting work.
- If we have access to a large corpus of de identified records we can
recession test the ctakes platform.
- I can help collaborate on a regression testing framework if someone else
wants to help Maintain it.
> On Jul 24, 2015, at 11:12 AM, Pei Chen <[email protected]> wrote:
>
> Hi,
> Re:
> https://urldefense.proofpoint.com/v2/url?u=http-3A__www.sciencedirect.
> com_science_article_pii_S1532046415001392&d=BQIFaQ&c=qS4goWBT7poplM69z
> y_3xhKwEW14JZMSdioCoppxeFU&r=huK2MFkj300qccT8OSuuoYhy_xEYujfPwiAxhPVz5
> WY&m=IdFJ0ChLqz9-dg435_5Rea2_0EUPNDw0uCUKnNp_N7k&s=DOgavsLa7IIU0rgq8lx
> DXTb33J8-4zgCWuKzL83CZyw&e= This is very interesting work and I think
> it would be very valuable for the general community. Is this
> something that you may be in interested in contributing/sharing the
> code with the Apache cTAKES community?
> Thanks,
> Pei