On 02/24/2012 11:38 AM, Jim - FooBar(); wrote:
Firstly and more importantly I cannot find multi-word entities even
though they do exist in the dictionary and the test data.
In this sample
"... the drug Denileukin diftitox is ...."
it only matches Denileukin?
The DictionaryNameFinder should actually match "Denileukin diftitox".
If it doesn't, it sounds like a bug (assuming there is no problem with
the tokenization).
Does your dictionary contains "Denileukin" (single token) as en entry?
Secondly, even though i'm setting case_sensitive="false" in both the
xml file and the constructor of the DictionaryNameFinder, the actual
results that i 'm getting are always case-sensitive!!!
That is a bug. Please open a jira for it. it would be nice if you can
reproduce the problem in
junit test (just attach the .java file or make a patch).
Jörn