On 08/15/2013 12:46 AM, Ryan Josal wrote:
I want to train a model to detect addresses in English text. I think it may give better results than a RegexNameFinder when there are many variations, though I will compare the two. Is there somewhere I can get a corpus of annotated text for this, or else a corpus of plain text plus something that can annotate addresses automatically?
The learnable Name Finder is quite good at detecting addresses. You will probably get the best results if you take your own data and annotate a few hundred of your documents.
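For the annotation step, the name finder trainer reads tokenized text with one sentence per line and each entity wrapped in <START:type> ... <END> markers. Below is a minimal sketch of turning hand-marked token spans into such lines. The helper name to_opennlp_line, the "address" type label, and the sample sentence are only illustrations, not part of OpenNLP, and the exact format details are worth checking against the manual for your version.

```python
def to_opennlp_line(tokens, spans):
    """Render one tokenized sentence with <START:type> ... <END> markers.

    tokens: list of str, already tokenized
    spans:  list of (start, end, label) token indices, end exclusive,
            assumed non-empty and non-overlapping
    """
    starts = {start: label for start, _end, label in spans}
    ends = {end for _start, end, _label in spans}
    out = []
    for i, tok in enumerate(tokens):
        if i in starts:
            # Open the entity just before its first token.
            out.append(f"<START:{starts[i]}>")
        out.append(tok)
        if i + 1 in ends:
            # Close the entity just after its last token.
            out.append("<END>")
    # Tokens and markers are separated by single spaces.
    return " ".join(out)


# Hypothetical example sentence with one address span (tokens 3..7).
tokens = ["Ship", "it", "to", "221B", "Baker", "Street", ",", "London",
          "by", "Friday", "."]
line = to_opennlp_line(tokens, [(3, 8, "address")])
print(line)
```

Writing one such line per sentence gives you a plain-text training file, and sentences without any address simply go in unmarked.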
HTH, Jörn
