The theory behind the head_rules file is described in this dissertation:
*Michael Collins, Head-driven Statistical Models for Natural Language Parsing, Ph.D. Dissertation Computer and Information Science, University of Pennsylvania, 1999.* >From the research I have done, I believe that the OpenNLP parser implementation is based on Collins' work. This dissertation can be found on the Internet. On Thu, Apr 14, 2016 at 11:39 AM, Joern Kottmann <[email protected]> wrote: > I also never really created it myself, but it was once explained to me that > the format is very and frequently used in research papers. I think it must > be possible to track that down a bit and maybe write some documentation. > > I would start looking in papers from Tom Morton or Adwait Ratnaparkhi, I am > sure they have written about it or referenced it somewhere. > > HTH, > Jörn > > On Thu, Apr 14, 2016 at 3:55 PM, Yohanes Gultom <[email protected]> > wrote: > > > Thanks for the fast response. > > Yes, I have. Unfortunately it doesn't explain how to generate the head > > rules nor explaining the format of given example > > http://opennlp.sourceforge.net/models/english/parser/head_rules > > > > > > Have a great day, > > > > Yohanes Gultom > > Enterprise Software Developer > > LinkedIn <http://www.linkedin.com/profile/view?id=41941925> | Blog > > <http://www.yohanesgultom.co.cc> | Facebook > > <http://www.facebook.com/yohanes.gultom> | Twitter > > <http://twitter.com/yohanesgultom> > > > > On Thu, Apr 14, 2016 at 8:51 PM, Russ, Daniel (NIH/CIT) [E] < > > [email protected]> wrote: > > > > > I have never done this, but have you seen: > > > http://stackoverflow.com/questions/6359050/opennlp-headrules > > > > > > Daniel Russ, Ph.D. > > > Staff Scientist, Office of Intramural Research > > > Center for Information Technology > > > National Institutes of Health > > > U.S. Department of Health and Human Services > > > 12 South Drive > > > Bethesda, MD 20892-5624 > > > > > > On Apr 14, 2016, at 9:30 AM, Yohanes Gultom <[email protected] > > > <mailto:[email protected]>> wrote: > > > > > > Hi all, > > > > > > I'm trying to do a constituency parsing on Non-English sentence > > > (Indonesian) > > > > > > > > > using OpenNLP. I > > > > > > already have a training data (attached) but I don't know how to create > a > > > head rules. As far as I can understand, head rules is a set of CFG > > > production rules but I can't understand the format. Could anyone help > to > > > explain how to create a head rules from a training data? > > > > > > > > > > > > I also notice that in documentation ( > > > > > > https://opennlp.apache.org/documentation/1.6.0/manual/opennlp.html#tools.parser.training > > ), > > > there is no example yet on using API to train a parser. Could anyone > > > provide a brief example on this? > > > > > > > > > Many t > > > hanks. > > > > > > > > > Yohanes Gultom > > > > > > > > > > > > > > > > > > -- David Sanderson Natural Language Processing Developer CrowdCare Corporation wysdom.com
