This may be of interest to Mahout people working on NLP: http://code.google.com/p/cleartk/
> ClearTK is a toolkit for developing statistical natural language
> processing components in Java and is based on the Apache UIMA
> framework for text analysis.

(The toolkit itself is available under the "New BSD license", but it wraps libraries with various licenses that are not BSD.)

Isabel
