Here is a URL for the Collins dissertation: http://www.dfki.de/~neumann/dop-seminar/References/collins-thesis.pdf
Sincerely, Rich Cooper, Rich Cooper, Chief Technology Officer, MetaSemantics Corporation MetaSemantics AT EnglishLogicKernel DOT com ( 9 4 9 ) 5 2 5-5 7 1 2 http://www.EnglishLogicKernel.com -----Original Message----- From: David Sanderson [mailto:[email protected]] Sent: Thursday, April 14, 2016 10:09 AM To: [email protected] Subject: Re: Training parser & head rules The theory behind the head_rules file is described in this dissertation: *Michael Collins, Head-driven Statistical Models for Natural Language Parsing, Ph.D. Dissertation Computer and Information Science, University of Pennsylvania, 1999.* >From the research I have done, I believe that the OpenNLP parser implementation is based on Collins' work. This dissertation can be found on the Internet. On Thu, Apr 14, 2016 at 11:39 AM, Joern Kottmann <[email protected]> wrote: > I also never really created it myself, but it was once explained > to me > that the format is very and frequently used in research papers. I > think it must be possible to track that down a bit and maybe > write some documentation. > > I would start looking in papers from Tom Morton or Adwait > Ratnaparkhi, > I am sure they have written about it or referenced it somewhere. > > HTH, > Jörn > > On Thu, Apr 14, 2016 at 3:55 PM, Yohanes Gultom > <[email protected]> > wrote: > > > Thanks for the fast response. > > Yes, I have. Unfortunately it doesn't explain how to generate > > the > > head rules nor explaining the format of given example > > http://opennlp.sourceforge.net/models/english/parser/head_rules > > > > > > Have a great day, > > > > Yohanes Gultom > > Enterprise Software Developer > > LinkedIn <http://www.linkedin.com/profile/view?id=41941925> | > > Blog > > <http://www.yohanesgultom.co.cc> | Facebook > > <http://www.facebook.com/yohanes.gultom> | Twitter > > <http://twitter.com/yohanesgultom> > > > > On Thu, Apr 14, 2016 at 8:51 PM, Russ, Daniel (NIH/CIT) [E] < > > [email protected]> wrote: > > > > > I have never done this, but have you seen: > > > http://stackoverflow.com/questions/6359050/opennlp-headrules > > > > > > Daniel Russ, Ph.D. > > > Staff Scientist, Office of Intramural Research Center for > > > Information Technology National Institutes of Health U.S. > > > Department of Health and Human Services > > > 12 South Drive > > > Bethesda, MD 20892-5624 > > > > > > On Apr 14, 2016, at 9:30 AM, Yohanes Gultom > > > <[email protected] <mailto:[email protected]>> > > > wrote: > > > > > > Hi all, > > > > > > I'm trying to do a constituency parsing on Non-English > > > sentence > > > (Indonesian) > > > > > > > > > using OpenNLP?. I > > > > > > already have a training data (attached) but I don't know how > > > to > > > create > a > > > head rules. As far as I can understand, head rules is a set > > > of CFG > > > production rules but I can't understand the format. Could > > > anyone > > > help > to > > > explain how to create a head rules from a training data? > > > > > > > > > > > > I also notice that in documentation ( > > > > > > https://opennlp.apache.org/documentation/1.6.0/manual/opennlp.html#too > ls.parser.training > > ), > > > there is no example yet on using API to train a parser. Could > > > anyone provide a brief example on this? > > > > > > > > > ?Many t > > > hanks. > > > > > > > > > ?Yohanes Gultom > > > > > > > > > > > > > > > > > > -- David Sanderson Natural Language Processing Developer CrowdCare Corporation wysdom.com
