What you are looking for is part-of-speech (POS) tagging. Many natural language processing (NLP) toolkits do this. It is not built into Nutch, but you can add it by writing a custom MapReduce job that processes the Nutch content. I suggest looking at LingPipe and the Stanford POS tagger:

http://nlp.stanford.edu/software/tagger.shtml
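
To give a rough idea of what a tagger produces, here is a toy sketch in Python. It is not LingPipe or the Stanford tagger, which rely on trained statistical models. The lexicon, the suffix rules, and the names tag_word and tag_sentence are all made up for illustration, so treat it as a picture of the input and output rather than something to deploy.

```python
import re

# Hypothetical hand-written lexicon (an assumption for this sketch);
# real taggers learn word/tag statistics from annotated corpora.
LEXICON = {
    "the": "DET", "a": "DET", "an": "DET",
    "dog": "NOUN", "fox": "NOUN", "crawler": "NOUN", "page": "NOUN",
    "jumps": "VERB", "fetches": "VERB", "is": "VERB", "parses": "VERB",
    "quick": "ADJ", "lazy": "ADJ", "large": "ADJ",
    "over": "ADP",
}

# Very rough fallback suffix heuristics (also assumed), tried in order.
SUFFIX_RULES = [
    ("ing", "VERB"),
    ("ed", "VERB"),
    ("ly", "ADV"),
    ("ous", "ADJ"),
    ("ful", "ADJ"),
    ("ness", "NOUN"),
    ("tion", "NOUN"),
]


def tag_word(word):
    """Return a coarse POS tag for a single word."""
    lower = word.lower()
    if lower in LEXICON:
        return LEXICON[lower]
    for suffix, tag in SUFFIX_RULES:
        if lower.endswith(suffix):
            return tag
    # Default guess for unknown words; a common heuristic, not a rule.
    return "NOUN"


def tag_sentence(sentence):
    """Split a sentence into alphabetic tokens and tag each one."""
    words = re.findall(r"[A-Za-z']+", sentence)
    return [(w, tag_word(w)) for w in words]


if __name__ == "__main__":
    for word, tag in tag_sentence("The quick crawler fetches a large page"):
        print(f"{word}/{tag}")
```

A lookup like this breaks down quickly on ambiguous words ("fetches" can be a noun or a verb), which is exactly the problem the statistical taggers above are built to handle by looking at the surrounding context.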

Dennis

dealmaker wrote:
Hi,
  I am trying to implement a feature that can parse sentences and determine
whether a word is a verb, noun, or adjective.  Does Nutch already have similar
functionality?  Can it even be done, or is it just science fiction?

Thanks.
