Re: better explanation of some features?

Jim foo.bar Mon, 26 Nov 2012 04:16:13 -0800

Can anyone help me with this? I'm struggling to find extra documentationregarding feature generation...

Jim



On 20/11/12 12:08, Jim foo.bar wrote:

Hi all,
I am trying to properly understand all the built-in features ofopenNLP but I'm having some trouble with some of them...
The maxent introduction page [1] mentions:
So, say you want to implement a program which uses maxent to findnames in a text., such as:
    /He succeeds Terrence D. Daniels, formerly a W.R. Grace vice
chairman, who resigned./If you are currently looking at the word /Terrence/ and are trying todecide if it is a name or not, examples of the kinds of features youmight use are "previous=succeeds", "current=Terrence", "next=D.", and"currentWordIsCapitalized". You might even add a feature that saysthat "Terrence" was seen as a name before.
I am particularly interested in the last sentence: *" You might evenadd a feature that says that "Terrence" was seen as a name before. "*
Does this refer to the "PreviousMapFeatureGenerator" ?
also, a while back I had asked about the*OutcomePriorFeatureGenerator* and Jorn replied with this:
_it is there to measure the distribution of the outcome_

E.g. for the name it could be:
start 5%
cont 10%
other 85%
In a name-finding context, what does the above example mean? Theoutcome is either TRUE or FALSE yes? So the name-finder eitherrecognizes a name or it doesn't. If Jorn had not shown this example Iwould understand that this feature-generator calculates distributionsfor these 2 boolean values...However, Jorn's example shows somethingdifferent which I don't understand...what is 'start', 'cont' & 'end'?How are these outcomes and how does that help the name-finder?
thanks in advance...

Jim

[1] http://maxent.sourceforge.net/howto.html

Re: better explanation of some features?

Reply via email to