On 11/9/11 9:23 PM, Boris Galitsky wrote:
Furthermore it would be nice if you can do the change you did for the>pos tagger
also for the chunker, where you extract the pos tags from the Parse
objects instead
of running the POS Tagger. The Parse object also includes the chunk
information,
so there should be no need to run the chunker.
Hi
I am doing further chunks processing which might be useful for other apps, not 
just this 'similarity' project
I need to get all phrases grouped by type (noun, verb, adj, pp, ...) from 
chunking results, and it is not clear how can I get phrases other than noun 
from 'Parse' object.
Once I get all phrases, I do matching inside my component for each phrase type 
separately.
So far I have to process chunking results [1..3 4..5 6...8 6..10] + POS + lemmas 
->  lists of phrases for each group.
I suspect there's a better way!
RegardsBoris                                    

Sorry for the late reply. As far as I know it should be possible.

We have code in opennlp.tools.parser.ChunkSampleStream which does it to train
a chunker based on Parse trees.

Can you try this out and see if it work for you? I guess more people will need this
anyway, maybe we should create a method somewhere to do this.

Jörn

Reply via email to