You will want to look into part-of-speech (POS) tagging. There are many
natural language processing (NLP) programs out there that will do this
type of functionality. It is not something that is built into nutch but
it can be attached by writing a custom map reduce job which processes
nutch content. I suggest looking at lingpipe and at the stanford POS
tagger:
http://nlp.stanford.edu/software/tagger.shtml
Dennis
dealmaker wrote:
Hi,
I am trying to implement a feature that can parse sentences and determine
whether a word is verb, noun or adjective. Does Nutch already have similar
functionality? Can it even be done or just science fiction?
Thanks.