On 28 February 2013 09:01, Per Tunedal <per.tune...@operamail.com> wrote:
> Hi,
> it might be helpful with some information on how the tagger (le Tageur
> redoutable?) actually works. How can I help the tagger when I add words
> and paradigms to the dictionaries? I suppose the structure of the
> dictionaries, and specifically the paradigms, has a great impact on the
> work of the tagger.

Not directly. The tagger is entirely independent of the dictionaries.

The fine tags (the tags coming from the dictionary) need to have
corresponding coarse tags (the tags used by the tagger) that are
sufficient to disambiguate the text. Coarse tags group together
equivalent fine tags, which helps to alleviate the data sparseness
problem: not all words occur in all contexts, so we group them
together so that what we know about classes of words applies to all
words in that class. The coarse tags should be as broad as possible,
but not too broad: for example, if two analyses of the same word form
map to the same coarse tag, the tagger cannot choose between them, and
that tag needs to be split. See my next mail.
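
To make the grouping concrete, here is a rough Python sketch. It is not
how the tagger is implemented: the tag names, the COARSE mapping and the
needs_split helper are all made up for illustration, and it reads "two
word forms" above as two competing analyses of one ambiguous surface
form.

```python
# Hypothetical fine-to-coarse mapping; these tag names are illustrative
# only and are not taken from any real tagger definition file.
COARSE = {
    "n.m.sg": "NOUN",
    "n.f.pl": "NOUN",
    "vblex.inf": "VERB",
    "vblex.pri.p3.sg": "VERB",
}


def needs_split(analyses):
    """Return the coarse tags shared by more than one analysis.

    `analyses` is the list of fine tags the dictionary gives for ONE
    surface form. If two of them collapse into the same coarse tag, a
    tagger working on coarse tags has no way to tell them apart.
    """
    grouped = {}
    for fine in analyses:
        grouped.setdefault(COARSE[fine], []).append(fine)
    return {coarse: fines for coarse, fines in grouped.items() if len(fines) > 1}


# Noun vs. verb: the coarse tags differ, so they are broad enough.
print(needs_split(["n.m.sg", "vblex.inf"]))

# Two verb readings: both collapse into VERB, so VERB is too broad here.
print(needs_split(["vblex.inf", "vblex.pri.p3.sg"]))
```

An empty result means the coarse tags are enough to separate the
analyses of that word; a non-empty one lists the coarse tag that would
have to be split.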

-- 
<Sefam> Are any of the mentors around?
<jimregan> yes, they're the ones trolling you

_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff
