Spellcheck algorithms should detect the languages by themself.
Detecting the language can done inside the spellchecker itself or
external editor applications can pass the detected language as the
input argument to the spellchecker.
A text with English and Hindi(or other indian language) is a trivial
usecase. If text editors are not supporting the language detection
based spellchecking, we need to fix it.

KDE4 comes with a spellchecker named Sonnet[1]. It can work with
hunspell and aspell. It does this language detection. In the initial
days of KDE4 it broke the Indic spellcheck completely[2].  But later
fixed. But I missed to notice the language detection in latest
version(I need to try again).

Gedit or any other text editors should do this language detection and
should switch the dictionaries dynamically. I had written about this
in 2008.[3]

By using the silpa project's apis[4], I have  written an online
spellchecker which can do spellcheck even if the input text is a mix
of English and Indian languages. You can try it here
http://thottingal.in/projects/spellchecker/


[1] http://www.linux.com/archive/articles/59963
[2] 
http://thottingal.in/blog/2008/11/30/kde-spellchecker-not-working-for-indian-languages/
[3] http://thottingal.in/blog/2008/11/13/language-detection-and-spellcheckers/
[4] http://silpa.org.in



Thanks
Santhosh

------------------------------------------------------------------------------
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security 
threats, fraudulent activity, and more. Splunk takes this data and makes 
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2d-c2
_______________________________________________
IndLinux-group mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/indlinux-group

Reply via email to