Hi Lance,
> About removing non-nouns: the OpenNLP patch includes two simple
> TokenFilters for manipulating terms with payloads. The
> FilterPayloadFilter lets you keep or remove terms with given payloads.
yes, I used this already in the schema.xml
> payloadList="NN,NNS,NNP,NNPS,FM" keepPayloa
Thanks, Kai!
About removing non-nouns: the OpenNLP patch includes two simple
TokenFilters for manipulating terms with payloads. The
FilterPayloadFilter lets you keep or remove terms with given payloads.
In the demo schema.xml, there is an example type that keeps only
nouns&verbs.
There is a
UIMA:
I just found this issue https://issues.apache.org/jira/browse/SOLR-3013
Now I am able to use this analyzer for english texts and filter (un)wanted
token types :-)
Open issue -> How to set the ModelFile for the Tagger to
"german/TuebaModel.dat" ???
OpenNLP:
And a mod