On 03/11/2011 01:19 PM, Ravishankar Shrivastava wrote:
> On 11-03-2011 12:58, Pravin Satpute wrote:
>> I am working on a project https://fedorahosted.org/indic-typing-booster
>>
>> "
> Great work Pravin.
> If you need a handsome wordlist (one word per line) for Hindi, I can
> send you one. It is about 99% accurate and contain ~30K words.
>
Me and parag prepared word list from hindi wikipedia, one can find word 
list for hindi at 
http://git.fedorahosted.org/git?p=indic-typing-booster.git;a=blob_plain;f=hindi-typing-booster/tables/hindi-phonetic.txt;hb=HEAD

We required contributions to check this list and remove invalid words
Example:

अंग्रेज़ी, अन्ग्रेज़ी and अन्ग्रेजी
AFAIK first one is correct

Presently we have 0.2 millions word in this. I agree correcting complete 
list at a time will be difficult but even if we try to target say 
correcting 10000 or something in each release that will be good 
achievement, will appreciate help in this regards.

Regards,
Pravin S

------------------------------------------------------------------------------
Colocation vs. Managed Hosting
A question and answer guide to determining the best fit
for your organization - today and in the future.
http://p.sf.net/sfu/internap-sfd2d
_______________________________________________
IndLinux-group mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/indlinux-group

Reply via email to