*not* speaking on behalf of the team but I wrote my own little tool to do the annotation whenever there is a dictionary or dictionaries available. It is open-source, it exposes a Java and a clojure API or you can even run it from the command line. You can find it here:

https://github.com/jimpil/annotator-clj

proper annotation in a specialised domain however is usually done by domain experts. Don't expect to get gold-data using annotator-clj or in fact any dictionary based annotator...the good with annotator-clj is that it can produce annotations compatible with openNLP, stanfordNLP or NLTK which is great sometimes. It is also very fast as it leverages the fork-join framework available in java7.

hope that helps :)

Jim

ps: now that you've made me think about it again, instead of only allowing raw dictionaries I could add support for the user to provide his own specialised regex Pattern objects...hmmm...I'll investigate over the weekend.


On 06/08/13 22:23, Girivaraprasad Nambari wrote:
Hi Team,

I ran pre-trained English NER models on text I have and results are not as
I expected. So, I thought to train NER on the "trainining" text to generate
new models, but I am stuck on *Annotation tools*.

Could someone help me on which tools are being used by Open-NLP team for
text annotation? As well as any references around text annotation topics
would be really appreciated.

Thanks for your time and help.

Thank you,
Giri


On Tue, Aug 6, 2013 at 5:12 PM, Girivaraprasad Nambari <
[email protected]> wrote:

Hi Team,

I ran pre-trained English NER models on text I have and results are not as
I expected. So, I thought to train NER on the "trainining" text to generate
new models, but I am stuck on *Annotation tools*.

Could someone help me on which tools are being used by Open-NLP team for
text annotation? As well as any references around text annotation topics
would be really appreciated.

Thanks for your time and help.

Thank you,
Giri


Reply via email to