Re: Merging the output of multiple name finders

Jim - FooBar(); Tue, 17 Apr 2012 07:01:32 -0700

On 17/04/12 13:52, Jörn Kottmann wrote:

If you don't want to handle these cases, you can simply copy all names together
into a list, and then do evaluation on this list.
This approach works with our evaluation, but will usually be an issue for applications which expect output
where the ambiguities mentioned earlier are resolved.

That is exactly what my current AggregateNameFinder does... It just gets rid of duplicates...

I propose that we make a simple baseline implementation
which takes all output spans, orders them and then resolves
the ambiguities based on the order. This will prefer longer
names over shorter names, but ignores the type.
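
For illustration, the baseline proposed above could be sketched roughly as follows. This is only a sketch under assumptions: NameSpan and BaselineSpanMerger are hypothetical names made up for this example (not OpenNLP classes), spans are half-open token ranges, and "order" is taken to mean start position first, then longest first.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a name span: token range [start, end) plus a type.
final class NameSpan {
    final int start;
    final int end;
    final String type;

    NameSpan(int start, int end, String type) {
        this.start = start;
        this.end = end;
        this.type = type;
    }

    int length() {
        return end - start;
    }

    @Override
    public String toString() {
        return "[" + start + ".." + end + ") " + type;
    }
}

// Hypothetical baseline: order all spans, then drop every span that overlaps
// one already kept. The type is ignored when deciding which span wins.
final class BaselineSpanMerger {
    static List<NameSpan> resolve(List<NameSpan> allSpans) {
        List<NameSpan> sorted = new ArrayList<>(allSpans);
        // Assumed order: earlier start first; for equal starts, longer span first.
        sorted.sort((a, b) -> a.start != b.start
                ? Integer.compare(a.start, b.start)
                : Integer.compare(b.length(), a.length()));

        List<NameSpan> kept = new ArrayList<>();
        int lastEnd = Integer.MIN_VALUE;
        for (NameSpan span : sorted) {
            // Keep the span only if it starts at or after the end of the last kept span.
            if (span.start >= lastEnd) {
                kept.add(span);
                lastEnd = span.end;
            }
        }
        return kept;
    }
}
```

Given the spans (0,2,person), (0,3,organization), (2,4,location) and (5,6,person), resolve keeps (0,3,organization) and (5,6,person). Note that with this ordering the longer span only wins among spans that start at the same token; otherwise the earlier span wins.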

There are more sophisticated ways of handling this,
e.g. taking probabilities from the statistical name finders into
account, but these might be a bit more restrictive as well.

I agree on the baseline implementation, but I don't see why the spans need to be ordered and why ambiguities need resolving... The only true ambiguity that can occur is having the exact same span with a different type, in which case we need to make a decision. Taking the probabilities from maxent is also a bit naive, because you will not know which model to trust (maybe the weakest model gives you the highest probs)...
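
To make the point above concrete, here is a minimal sketch of merging distinct predictions and flagging only the "same span, different type" case. It is not the actual AggregateNameFinder code: DistinctSpanMerger and TypedSpan are hypothetical names for this example, and spans are assumed to be half-open token ranges.

```java
import java.util.LinkedHashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Objects;
import java.util.Set;

final class DistinctSpanMerger {

    // Hypothetical span: token range [start, end) with an entity type.
    static final class TypedSpan {
        final int start;
        final int end;
        final String type;

        TypedSpan(int start, int end, String type) {
            this.start = start;
            this.end = end;
            this.type = type;
        }

        @Override
        public boolean equals(Object o) {
            if (!(o instanceof TypedSpan)) {
                return false;
            }
            TypedSpan other = (TypedSpan) o;
            return start == other.start && end == other.end && type.equals(other.type);
        }

        @Override
        public int hashCode() {
            return Objects.hash(start, end, type);
        }
    }

    // Union of all finder outputs; exact duplicates (same range and type) collapse.
    static Set<TypedSpan> merge(List<List<TypedSpan>> outputs) {
        Set<TypedSpan> merged = new LinkedHashSet<>();
        for (List<TypedSpan> output : outputs) {
            merged.addAll(output);
        }
        return merged;
    }

    // Maps "start-end" to its types, keeping only ranges tagged with more than one type.
    static Map<String, Set<String>> typeConflicts(Set<TypedSpan> merged) {
        Map<String, Set<String>> byRange = new LinkedHashMap<>();
        for (TypedSpan span : merged) {
            byRange.computeIfAbsent(span.start + "-" + span.end, k -> new LinkedHashSet<>())
                   .add(span.type);
        }
        byRange.values().removeIf(types -> types.size() < 2);
        return byRange;
    }
}
```

Overlapping spans with different boundaries pass through merge untouched; only the entries returned by typeConflicts would need a decision, and how to make that decision is left open here.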

Jim

P.S.: I have to clarify that, using a real dictionary and a real-world corpus, the AggregateNameFinder, which simply merges the distinct predictions, achieved a 29% improvement on recall and no change in precision, which was already very high.
