We should have a look at Phrase Detectives as a sample:
http://anawiki.essex.ac.uk/phrasedetectives/
They have a short tutorial/test that each user needs to complete, they
have a leaderboard, and the progress made by each user is tracked.
Jörn
On 6/24/11 1:47 PM, Jörn Kottmann wrote:
On 6/24/11 1:38 PM, Hannes Korte wrote:
On 06/24/2011 11:54 AM, Olivier Grisel wrote:
2011/6/24 Hannes Korte:
On 24.06.2011 10:03, Jörn Kottmann wrote:
Hannes and Olivier, do you want to take over the part about the
web-based annotation tooling? For now I called it Corpus Refiner, but
we can of course change the name to something else.
Yes, I'll try to find some time in the next few days to have a look at
what Olivier has already committed and to work on the JavaScript part
of the web GUI.
My stuff does not follow the new design: in particular, it treats each
sentence as an individual document. Maybe you should go on from the
existing Walter design, which treats a CAS as an individual,
multi-sentence document instead, and try to align it with the tooling
available in opennlp-uima. But we need to agree on a CAS type system
first. I don't know opennlp-uima myself and unfortunately won't have
time to invest more effort in this project before mid-July.
OK, until then I'll work with some dummy documents. By the way, who
wants to build the active learning component, which decides which
sentences come next?
Would it be possible for you to contribute the Walter code to OpenNLP?
In a previous project I used some kind of filtering to find CASes that
should be annotated. Maybe that approach could work well for us here
too.
The corpus server will index all CASes with their annotations, and then
the Corpus Refiner or the Walter server can query the index to find
CASes it should hand out to annotators. Of course, this logic would be
task dependent.
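
As a rough illustration of the index-and-query idea, here is a minimal
in-memory sketch. Everything in it is an assumption made for the
example: the `CorpusIndex` class, the `find_missing` method, and the
type names such as "Person" are hypothetical and are not part of
OpenNLP, UIMA, or the proposed corpus server.

```python
from dataclasses import dataclass, field


@dataclass
class IndexedCas:
    """Hypothetical index entry: a CAS id plus the annotation types it holds."""
    cas_id: str
    annotation_types: set = field(default_factory=set)


class CorpusIndex:
    """Toy in-memory stand-in for the corpus server's index (an assumption)."""

    def __init__(self):
        self._entries = {}

    def add(self, cas_id, annotation_types):
        # Record which annotation types are present in the given CAS.
        self._entries[cas_id] = IndexedCas(cas_id, set(annotation_types))

    def find_missing(self, required_type, limit=10):
        """Return ids of CASes that do not yet carry `required_type`."""
        hits = [entry.cas_id for entry in self._entries.values()
                if required_type not in entry.annotation_types]
        return sorted(hits)[:limit]


index = CorpusIndex()
index.add("doc-1", {"Sentence", "Token", "Person"})
index.add("doc-2", {"Sentence", "Token"})
index.add("doc-3", {"Sentence"})

# Task-dependent query: a name finder task asks for CASes that have no
# Person annotations yet and could be handed out to annotators.
to_annotate = index.find_missing("Person")
print(to_annotate)
```

A different task would simply pass a different type name or use a
different filter, which is where the task-dependent logic would live.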
This could also be done in a more controlled way, where we insert
annotations into the CAS that mark an area to be labeled manually.
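
A small sketch of the marker idea, under stated assumptions: the
`Annotation` class and the "NeedsManualLabel" type name are made up for
this example and do not come from any existing CAS type system. The
annotation tool would only need to look up the marked spans.

```python
from dataclasses import dataclass


@dataclass
class Annotation:
    """Simplified stand-in for a CAS annotation: a type plus text offsets."""
    type_name: str
    begin: int
    end: int


def regions_to_label(text, annotations, marker_type="NeedsManualLabel"):
    """Return the text spans covered by marker annotations, in document order."""
    markers = sorted((a for a in annotations if a.type_name == marker_type),
                     key=lambda a: a.begin)
    return [text[a.begin:a.end] for a in markers]


text = "Pierre Vinken joined the board. He is chairman of Elsevier."
second_start = text.index("He is")

annotations = [
    Annotation("Sentence", 0, second_start - 1),
    Annotation("Sentence", second_start, len(text)),
    # Hypothetical marker: this area should be labeled manually.
    Annotation("NeedsManualLabel", second_start, len(text)),
]

spans = regions_to_label(text, annotations)
print(spans)
```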
Let's update the OpenNLP Annotations proposal a little to describe
these things.
Jörn