Hi Jörn,
thanks a lot for replying. I've found the 12 'classes' you're
describing in the 'StringPattern.java' file located in the 'featuregen'
package. This helps me a lot... :-)
How about the other 2 feature generators that I asked about
(PreviousMapFeatureGenerator & OutcomePriorFeatureGenerator)? Could you
elaborate a bit further? I'm really sorry, but I haven't understood your
example about the *OutcomePriorFeatureGenerator*...
The documentation says:
"You might even add a feature that says that "Terrence" was seen as a
name before."
Does that refer to the *PreviousMapFeatureGenerator*?
thanks a million...
Jim
On 26/11/12 13:02, Jörn Kottmann wrote:
On 11/20/2012 01:33 PM, Jim foo.bar wrote:
also, the only information that I could find about the
*TokenClassFeatureGenerator* is this oddly phrased sentence:
_"Generates features for different for the class of the token."_
How does this generator work?
What 'class' does this refer to in a name-finding context? semantic
class? If we're looking for genes and drugs, would the classes be
"gene", "drug" & presumably "none"?
It assigns a category to a token based on the characters used in it,
for example:
- token is initial capital
- token is all upper case
- token is numeric
- token is alpha numeric
...
Have a look at the code to see all the classes and the conditions
under which they are assigned.
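
To make the idea concrete, here is a rough sketch of character-based token classes covering only the four categories listed above. It is not the OpenNLP implementation: the class name TokenClassSketch, the method tokenClass and the label strings are all made up for illustration, and the real code in StringPattern.java distinguishes more classes and may order its checks differently.

```java
public class TokenClassSketch {

    // Hypothetical labels, chosen for this sketch only. The actual class
    // names and conditions are defined in OpenNLP's featuregen package.
    static String tokenClass(String token) {
        boolean hasLetter = false;
        boolean hasDigit = false;
        boolean hasOther = false;
        boolean allUpper = true;

        // Scan the token once and record which kinds of characters occur.
        for (int i = 0; i < token.length(); i++) {
            char c = token.charAt(i);
            if (Character.isLetter(c)) {
                hasLetter = true;
                if (!Character.isUpperCase(c)) {
                    allUpper = false;
                }
            } else if (Character.isDigit(c)) {
                hasDigit = true;
            } else {
                hasOther = true;
            }
        }

        // Empty tokens and tokens with punctuation fall outside the four
        // categories sketched here.
        if (token.isEmpty() || hasOther) {
            return "other";
        }
        if (hasDigit && !hasLetter) {
            return "numeric";
        }
        if (hasDigit) {
            return "alphanumeric";
        }
        // From here on the token consists of letters only.
        if (allUpper) {
            return "all-upper";
        }
        if (Character.isUpperCase(token.charAt(0))) {
            return "initial-capital";
        }
        return "other";
    }

    public static void main(String[] args) {
        String[] tokens = {"Terrence", "DNA", "2012", "p53", "gene"};
        for (String t : tokens) {
            System.out.println(t + " -> " + tokenClass(t));
        }
    }
}
```

With this sketch, "Terrence" comes out as initial-capital, "DNA" as all-upper, "2012" as numeric and "p53" as alphanumeric. The point is that the class describes the surface shape of the token, not a semantic label such as "gene" or "drug".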
Jörn