On 1/19/12 2:05 PM, Riccardo Tasso wrote:
I'm working on NameFinder too. How can I determine the right
parameters (iterations, cutoff and feature generation) for my use
case? Is there any guideline?
No, we don't have any guides yet (any contributions are welcome).
When I train a model I always take our defaults as a baseline and then
modify the parameters to see how that changes the performance. When you
are working with a training set which grows over time, I suggest
starting again from the defaults once in a while and verifying that
your modifications still give an improvement.
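
For example, with a recent 1.5.x/trunk API that workflow looks roughly
like the sketch below (untested, just to show the idea; the file names,
language and entity type are made up, adapt them to your data):

import java.io.FileInputStream;
import java.io.FileOutputStream;
import java.util.Collections;
import opennlp.tools.namefind.NameFinderME;
import opennlp.tools.namefind.NameSample;
import opennlp.tools.namefind.NameSampleDataStream;
import opennlp.tools.namefind.TokenNameFinderModel;
import opennlp.tools.util.ObjectStream;
import opennlp.tools.util.PlainTextByLineStream;
import opennlp.tools.util.TrainingParameters;
import opennlp.tools.util.featuregen.AdaptiveFeatureGenerator;

public class NameFinderTrainingSketch {
  public static void main(String[] args) throws Exception {
    // training data in the name finder format, one sentence per line
    ObjectStream<String> lines = new PlainTextByLineStream(
        new FileInputStream("en-ner-person.train"), "UTF-8");
    ObjectStream<NameSample> samples = new NameSampleDataStream(lines);

    // the defaults are maxent with 100 iterations and a cutoff of 5;
    // change one parameter at a time and re-evaluate
    TrainingParameters params = new TrainingParameters();
    params.put("Algorithm", "MAXENT");
    params.put("Iterations", "300"); // e.g. more iterations for a small data set
    params.put("Cutoff", "5");
    // perceptron variant:
    // params.put("Algorithm", "PERCEPTRON"); params.put("Cutoff", "0");

    // a null feature generator means the built-in default feature generation
    TokenNameFinderModel model = NameFinderME.train("en", "person", samples,
        params, (AdaptiveFeatureGenerator) null,
        Collections.<String, Object>emptyMap());
    samples.close();

    model.serialize(new FileOutputStream("en-ner-person.bin"));
  }
}
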
A few hints:
- Using more iterations for the maxent model helps, especially when your
data set is small, e.g. try 300 to 500 instead of the default 100.
- Feature generation should be adapted depending on your domain and
language; try our XML feature generation (for this use the trunk
version, there was a severe bug in 1.5.2). See the descriptor sketch
after this list.
- Try the perceptron, it usually has a higher recall; train it with a
cutoff of 0.
- Use our built-in evaluation to test how a model performs; it can
output performance numbers and print out misclassified samples (see the
evaluation sketch after this list).
- Look carefully at the misclassified samples, maybe there are patterns
which do not really work with your model.
- Add training data which contains cases which should work but do not.
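
On the XML feature generation: a descriptor that roughly mirrors the
built-in default feature generators looks like the following (the file
name pers.xml is just an example, and you should check the element
names against the current documentation):

<generators>
  <cache>
    <generators>
      <window prevLength="2" nextLength="2">
        <tokenclass/>
      </window>
      <window prevLength="2" nextLength="2">
        <token/>
      </window>
      <definition/>
      <prevmap/>
      <bigram/>
      <sentence begin="true" end="false"/>
    </generators>
  </cache>
</generators>

You can load it with GeneratorFactory.create(new
FileInputStream("pers.xml"), null) (the second argument provides
resources such as dictionaries, null should be fine if the descriptor
does not reference any) and pass the returned AdaptiveFeatureGenerator
to NameFinderME.train(...) instead of the null generator in the sketch
above. Start from something like this and then add or remove generators
for your domain and language.
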
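And here is a rough sketch of the built-in evaluation against a
held-out file (again, the file names are made up); the error listener
prints every misclassified sample while evaluating:

import java.io.FileInputStream;
import opennlp.tools.cmdline.namefind.NameEvaluationErrorListener;
import opennlp.tools.namefind.NameFinderME;
import opennlp.tools.namefind.NameSample;
import opennlp.tools.namefind.NameSampleDataStream;
import opennlp.tools.namefind.TokenNameFinderEvaluator;
import opennlp.tools.namefind.TokenNameFinderModel;
import opennlp.tools.util.ObjectStream;
import opennlp.tools.util.PlainTextByLineStream;

public class NameFinderEvalSketch {
  public static void main(String[] args) throws Exception {
    TokenNameFinderModel model =
        new TokenNameFinderModel(new FileInputStream("en-ner-person.bin"));

    // held-out data in the same format as the training data
    ObjectStream<String> lines = new PlainTextByLineStream(
        new FileInputStream("en-ner-person.eval"), "UTF-8");
    ObjectStream<NameSample> samples = new NameSampleDataStream(lines);

    // the listener prints misclassified samples to stderr
    TokenNameFinderEvaluator evaluator = new TokenNameFinderEvaluator(
        new NameFinderME(model), new NameEvaluationErrorListener());

    evaluator.evaluate(samples);
    samples.close();

    // precision, recall and F-measure
    System.out.println(evaluator.getFMeasure());
  }
}
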
Hope this helps,
Jörn