On 8/22/11 11:41 PM, Jörn Kottmann wrote:
> No, these models are statistical. That means they can learn with training
> data what is an entity and what is not.


Oops, something is missing here.

During training, each token is "transformed" into a set of features. This set of features is combined with an outcome that describes how the token should be labeled. The features are generated by all kinds of rules, e.g. the token itself, the capitalization of the token, the token before, the token after, etc. These features can be adjusted by hand to work with Romanian.
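
To make the idea concrete, here is a rough sketch in Python of how a token could be turned into features and paired with an outcome. This is not the actual OpenNLP code: the function names, the feature strings ("w=", "cap", "prev=", "next=") and the outcome labels are made up for illustration, and the example sentence is invented.

```python
def token_features(tokens, i):
    """Return illustrative context features for the token at index i.

    The feature names are hypothetical, chosen only to mirror the rules
    mentioned above: the token, its capitalization, the token before,
    and the token after.
    """
    token = tokens[i]
    features = ["w=" + token.lower()]
    if token[:1].isupper():
        features.append("cap")
    # Use boundary markers when there is no previous or next token.
    prev_token = tokens[i - 1].lower() if i > 0 else "<s>"
    next_token = tokens[i + 1].lower() if i + 1 < len(tokens) else "</s>"
    features.append("prev=" + prev_token)
    features.append("next=" + next_token)
    return features


def training_events(tokens, outcomes):
    """Pair each token's features with the outcome it should be labeled with."""
    return [(outcomes[i], token_features(tokens, i)) for i in range(len(tokens))]


# Invented example sentence with hypothetical outcome labels.
tokens = ["Maria", "Popescu", "lives", "in", "Cluj"]
outcomes = ["person-start", "person-cont", "other", "other", "other"]

for outcome, features in training_events(tokens, outcomes):
    print(outcome, features)
```

A statistical trainer would then learn from many such (outcome, features) pairs which features point to which label.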

Jörn