Thanks very much, Jörn. Btw, I'm open to any suggestions or advice for determining the subject of the sentence if anyone has some to share. Today I'm using the DocumentCategorizer for categorization of request types, but I have to constrain the input such that the subject of the sentence is put in quotes. I'm interested in removing that constraint.

Thanks,
Chris

Sent from my mobile device. Please excuse any typos.

On Aug 6, 2012, at 4:43 AM, Jörn Kottmann <[email protected]> wrote:

> On 08/05/2012 04:52 PM, Christopher Dellario wrote:
>> I was lucky enough to find a post that had a link to a head_rules file:
>> http://opennlp.sourceforge.net/models/english/parser/head_rules
>>
>> So I can now train the Parser, but I'd like to understand what the
>> head_rules files does and its syntax. Anybody know?
>
> The current one is located in opennlp-tools/lang/en/parser.
>
> I was told the format is the one used by the collins parser.
>
> Each line starts with a number, thats the count of item to follow.
> That is some C left over where they wanted to know the size of
> the array in advance.
>
> Anyway I do not know more about it, because I never really worked
> on the parser.
> Would be nice to investigate this a bit further and then update our
> documentation about it.
>
> Jörn
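
For anyone who wants to poke at the file, here is a minimal sketch of reading one head_rules line according to the description above: a leading number, then that many items. The class HeadRuleLine is hypothetical and not part of OpenNLP. Splitting the items into a constituent label, a direction flag, and a list of tags is my assumption about the layout and is not confirmed anywhere in this thread, so please check it against the actual file before relying on it.

```java
import java.util.Arrays;
import java.util.List;

/** Hypothetical helper for inspecting one head_rules line; not an OpenNLP class. */
final class HeadRuleLine {
    final String label;      // ASSUMPTION: constituent label the rule applies to
    final String direction;  // ASSUMPTION: search-direction flag, kept as raw text
    final List<String> tags; // ASSUMPTION: candidate head tags, in file order

    private HeadRuleLine(String label, String direction, List<String> tags) {
        this.label = label;
        this.direction = direction;
        this.tags = tags;
    }

    /** Parses "count item1 item2 ..." and checks that count matches the items that follow. */
    static HeadRuleLine parse(String line) {
        String[] tokens = line.trim().split("\\s+");
        if (tokens.length < 3) {
            throw new IllegalArgumentException(
                "Expected a count, a label, and a direction flag: " + line);
        }
        // The leading number is the count of items that follow it on the line.
        int count = Integer.parseInt(tokens[0]);
        if (count != tokens.length - 1) {
            throw new IllegalArgumentException(
                "Count says " + count + " items, found " + (tokens.length - 1));
        }
        List<String> tags = Arrays.asList(Arrays.copyOfRange(tokens, 3, tokens.length));
        return new HeadRuleLine(tokens[1], tokens[2], tags);
    }
}
```

With a made-up line such as "5 NP 0 NN NNS NNP" (not copied from the real file), HeadRuleLine.parse(...) yields the label NP, the direction flag 0, and the tags NN, NNS, NNP. A line whose leading count disagrees with the number of items that follow is rejected with an IllegalArgumentException.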
