Opennlp is a standard lib used by many apache NLP projects. The clinical text engine (ctakes.apache.org) is one such use of open NLP. There is a medical data privacy engine (de-identification) that does medical concept recognition and privacy features described in the paper. We used it to conduct some medical studies.
Dev list committers: I'm speaking up because this potential student is looking for a project, and hasn't yet found one. We could certainly use the help if rohit is interested. On Mar 15, 2015 10:13 PM, "Rohit Shinde" <rohit.shinde12...@gmail.com> wrote: > Could you please elaborate a bit more on this? I didn't really get this. > What exactly is de-identification? > > And what do you mean by apache sandbox? > > Thank you. > > On Mon, Mar 16, 2015 at 10:21 AM, andy mcmurry <mcmurry.a...@gmail.com> > wrote: > > > How about a project based on open NLP that is still in apache sandbox? > > > > http://www.biomedcentral.com/1472-6947/13/112 > > Hello everyone, > > > > I still haven't got a reply to my previous email and I would really > > appreciate a reply to that. > > > > I would like to contribute as soon as possible. > > > > Thank you. > > >