Hi all. Thank you very much for your suggestion. 
 I have learned HMMs in the class of NLP. Sorry for that I don't know that how 
HMMs are used in Apertium to perform part-of-speech disambiguation. I will take 
time to learning soon.
I think I  can use the phoneme based approaches. the problem I need to deal 
with is: how to deal with the European languages besides Asian languages.
(a) Apertium (by installing it, studying its PoS tagger, etc.); 
(b) the documentation available about Apertium part-of-speech tagging; 
(c) the problem of  "hidden" unknown words, and that makes a proposal to detect 
these with an indication 

Someone who has suggestion to me please feel free to tell me !
Thank you very much !

I think this topic is quite interesting

Thank you very much
Chen Xiajian


2011-04-03 



xmujay 



发件人: Jimmy O'Regan 
发送时间: 2011-04-03  09:23:27 
收件人: ftyers; apertium-stuff 
抄送: 
主题: Re: [Apertium-stuff] Detect 'hidden' unknown words inApertium_chenxiajian 
 
On 3 April 2011 02:01, Francis Tyers <fty...@prompsit.com> wrote:
> El ds 02 de 04 de 2011 a les 21:21 +0100, en/na Jimmy O'Regan va
> escriure:
>> On Saturday, 2 April 2011, Mikel Forcada <m...@dlsi.ua.es> wrote:
>> >
>> >     Dear Xiajian,
>> >
>> >     I think you haven't taken your time to study how HMMs are used in
>> >     Apertium to perform part-of-speech disambiguation. Your proposal is
>> >     completely wrong as it confuses HMMs as used in speech recognition
>>
>> Sorry to contradict, but the phoneme based approaches are valid. The
>> problem is, these are based on the considerations required for Asian
>> languages and not for European languages.
>>
>> >  with HMMs as used for part-of-speech tagging. Make sure you write a
>> >     proposal that shows that you have studied (a) Apertium (by
>> >     installing it, studying its PoS tagger, etc.); (b) the documentation
>> >     available about Apertium part-of-speech tagging; (c) the problem of
>> >     "hidden" unknown words, and that makes a proposal to detect these
>> >     with an indication on how you plan to evaluate its performance (for
>> >     instance, by modifying existing dictionaries for a language pair to
>> >     induce unknowns).
>>
>> Modifying dictionaries would be the least efficient of the approaches
>> I can imagine.
>
> Surely modifying the dictionaries based on suggestions from the program
> is the only way to go, unless you were thinking of getting the taggser
> to just add analyses based on what it thinks the words might be ??
No, you can evaluate by subtracting existing analyses, not just by
adding new ones.
-- 
<Leftmost> jimregan, that's because deep inside you, you are evil.
<Leftmost> Also not-so-deep inside you.
------------------------------------------------------------------------------
Create and publish websites with WebMatrix
Use the most popular FREE web apps or write code yourself; 
WebMatrix provides all the features you need to develop and 
publish your website. http://p.sf.net/sfu/ms-webmatrix-sf
_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff
------------------------------------------------------------------------------
Create and publish websites with WebMatrix
Use the most popular FREE web apps or write code yourself; 
WebMatrix provides all the features you need to develop and 
publish your website. http://p.sf.net/sfu/ms-webmatrix-sf
_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to