Yes, I will try opennlp.tools.parser.ChunkSampleStream. Meanwhile, the question is: what is wrong with using opennlp.tools.chunker?

Regards
Boris

> Date: Thu, 17 Nov 2011 11:19:02 +0100
> From: [email protected]
> To: [email protected]
> Subject: Re: any hints on how to get chunking info from Parse?
>
> On 11/9/11 9:23 PM, Boris Galitsky wrote:
> >> Furthermore it would be nice if you can do the change you did for the
> >> pos tagger also for the chunker, where you extract the pos tags from
> >> the Parse objects instead of running the POS Tagger. The Parse object
> >> also includes the chunk information, so there should be no need to run
> >> the chunker.
> > Hi
> > I am doing further chunks processing which might be useful for other apps,
> > not just this 'similarity' project.
> > I need to get all phrases grouped by type (noun, verb, adj, pp, ...) from
> > chunking results, and it is not clear how I can get phrases other than noun
> > from the 'Parse' object.
> > Once I get all phrases, I do matching inside my component for each phrase
> > type separately.
> > So far I have to process chunking results [1..3 4..5 6...8 6..10] + POS +
> > lemmas -> lists of phrases for each group.
> > I suspect there's a better way!
> > Regards
> > Boris
>
> Sorry for the late reply. As far as I know it should be possible.
>
> We have code in opennlp.tools.parser.ChunkSampleStream which does it to train
> a chunker based on Parse trees.
>
> Can you try this out and see if it works for you? I guess more people will
> need this anyway, maybe we should create a method somewhere to do this.
>
> Jörn
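
For illustration, here is a minimal sketch of the idea discussed above: walk a parse tree and collect the flat phrases (nodes whose children are all POS tags), grouped by phrase type. It is self-contained and does not use OpenNLP. The Node class is a hypothetical stand-in for a parse-tree node, not the real opennlp.tools.parser.Parse, and the names PhraseGrouper, groupByType, leaf and phrase are invented for this example. The traversal is only assumed to resemble what ChunkSampleStream does, so treat it as a starting point rather than the library's method.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: groups flat phrases of a parse tree by their type.
class PhraseGrouper {

    // Stand-in for a parse-tree node (NOT the OpenNLP Parse class).
    static final class Node {
        final String type;   // phrase label ("NP", "VP", ...) or POS tag ("NN", ...)
        final String word;   // non-null only for POS-tag nodes
        final List<Node> children = new ArrayList<>();

        private Node(String type, String word) {
            this.type = type;
            this.word = word;
        }

        static Node leaf(String posTag, String word) {
            return new Node(posTag, word);
        }

        static Node phrase(String type, Node... kids) {
            Node node = new Node(type, null);
            for (Node kid : kids) {
                node.children.add(kid);
            }
            return node;
        }

        boolean isPosTag() {
            return word != null;
        }

        // Words covered by this node, joined with single spaces.
        String text() {
            if (isPosTag()) {
                return word;
            }
            List<String> parts = new ArrayList<>();
            for (Node kid : children) {
                parts.add(kid.text());
            }
            return String.join(" ", parts);
        }
    }

    // Returns phrase type -> covered texts, in order of first appearance.
    static Map<String, List<String>> groupByType(Node root) {
        Map<String, List<String>> groups = new LinkedHashMap<>();
        collect(root, groups);
        return groups;
    }

    private static void collect(Node node, Map<String, List<String>> groups) {
        if (node.isPosTag()) {
            return; // a word that is not inside any flat phrase
        }
        boolean allKidsAreTags = !node.children.isEmpty();
        for (Node kid : node.children) {
            if (!kid.isPosTag()) {
                allKidsAreTags = false;
                break;
            }
        }
        if (allKidsAreTags) {
            // Flat phrase: record it under its type.
            groups.computeIfAbsent(node.type, k -> new ArrayList<>()).add(node.text());
        } else {
            // Nested constituent: descend instead of recording it.
            for (Node kid : node.children) {
                collect(kid, groups);
            }
        }
    }
}
```

For a tree such as (S (NP (DT the) (NN fox)) (VP (VBD ran) (ADVP (RB very) (RB fast)))), this collects "the fox" under NP and "very fast" under ADVP. The nested VP is descended into rather than recorded, so an application that needs full verb phrases would have to adjust the rule for which nodes count as a phrase.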
