Re: Training parser & head rules

David Sanderson Thu, 14 Apr 2016 10:10:01 -0700

The theory behind the head_rules file is described in this dissertation:


*Michael Collins, Head-driven Statistical Models for Natural Language
Parsing, Ph.D. Dissertation Computer and Information Science, University of
Pennsylvania, 1999.*

>From the research I have done, I believe that the OpenNLP parser
implementation is based on Collins' work.

This dissertation can be found on the Internet.


On Thu, Apr 14, 2016 at 11:39 AM, Joern Kottmann <[email protected]> wrote:

> I also never really created it myself, but it was once explained to me that
> the format is very and frequently used in research papers. I think it must
> be possible to track that down a bit and maybe write some documentation.
>
> I would start looking in papers from Tom Morton or Adwait Ratnaparkhi, I am
> sure they have written about it or referenced it somewhere.
>
> HTH,
> Jörn
>
> On Thu, Apr 14, 2016 at 3:55 PM, Yohanes Gultom <[email protected]>
> wrote:
>
> > Thanks for the fast response.
> > Yes, I have. Unfortunately it doesn't explain how to generate the head
> > rules nor explaining the format of given example
> > http://opennlp.sourceforge.net/models/english/parser/head_rules
> >
> >
> > Have a great day,
> >
> > Yohanes Gultom
> > Enterprise Software Developer
> > LinkedIn <http://www.linkedin.com/profile/view?id=41941925> | Blog
> > <http://www.yohanesgultom.co.cc> | Facebook
> > <http://www.facebook.com/yohanes.gultom> | Twitter
> > <http://twitter.com/yohanesgultom>
> >
> > On Thu, Apr 14, 2016 at 8:51 PM, Russ, Daniel (NIH/CIT) [E] <
> > [email protected]> wrote:
> >
> > > I have never done this, but have you seen:
> > > http://stackoverflow.com/questions/6359050/opennlp-headrules
> > >
> > > Daniel Russ, Ph.D.
> > > Staff Scientist, Office of Intramural Research
> > > Center for Information Technology
> > > National Institutes of Health
> > > U.S. Department of Health and Human Services
> > > 12 South Drive
> > > Bethesda,  MD 20892-5624
> > >
> > > On Apr 14, 2016, at 9:30 AM, Yohanes Gultom <[email protected]
> > > <mailto:[email protected]>> wrote:
> > >
> > > Hi all,
> > >
> > > I'm trying to do a constituency parsing on Non-English sentence
> > > (Indonesian)
> > >
> > >  
> > > using OpenNLP. I
> > >  
> > > already have a training data (attached) but I don't know how to create
> a
> > > head rules. As far as I can understand, head rules is a set of CFG
> > > production rules but I can't understand the format. Could anyone help
> to
> > > explain how to create a head rules from a training data?
> > >
> > >
> > >
> > > I also notice that in documentation (
> > >
> >
> https://opennlp.apache.org/documentation/1.6.0/manual/opennlp.html#tools.parser.training
> > ),
> > > there is no example yet on using API to train a parser. Could anyone
> > > provide a brief example on this?
> > >
> > >
> > > Many t
> > > hanks.
> > >
> > >
> > > Yohanes Gultom
> > >
> > >
> > >
> > >
> > >
> >
>



-- 
David Sanderson
Natural Language Processing Developer
CrowdCare Corporation
wysdom.com

Re: Training parser & head rules

Reply via email to