Re: New Language Turkish

2013-07-20 Thread Daniel Naber
On 20.07.2013 10:21, Sinan Yolal wrote: > Hunspell may be good for other languages, but it is poor when it > comes to Turkish. It has around 100,000 words while the one I > suggested before has 1.4 mil words. Yes, but hunspell will accept more words than there are in its list because it knows

Re: New Language Turkish

2013-07-20 Thread Sinan Yolal
Hunspell may be good for other languages, but it is poor when it comes to Turkish. It has around 100,000 words while the one I suggested before has 1.4 million words. I suggest we benefit from both. But as you said, we can first write some grammar rules and I will try writing a few and share them wit

Re: New Language Turkish

2013-07-20 Thread Ruud Baars
What about abbreviations? Hunspell accepts abbreviations without the . even when these are actually incorrect. LT knows about abbreviations, using srx for sentence endings at least, but what about in-sentence abbreviations? Ruud On 20-07-13 08:51, Daniel Naber wrote: > Am 20.07.2013 00:19, sc

Re: New Language Turkish

2013-07-20 Thread Ruud Baars
As long as there is a Hunspell one, and it is of good quality. On 20-07-13 08:51, Daniel Naber wrote: > On 20.07.2013 00:19, Daniel Naber wrote: > >>> http://code.google.com/p/zemberek/ [4] >> Yes, if the MPL license of the project also applies to the word list >> we >> could use that. > Actuall

Re: New Language Turkish

2013-07-19 Thread Daniel Naber
On 20.07.2013 00:19, Daniel Naber wrote: >> http://code.google.com/p/zemberek/ [4] > > Yes, if the MPL license of the project also applies to the word list > we > could use that. Actually, the easier approach is to just take an existing hunspell dictionary and convert that. Morfologik suppor

Re: New Language Turkish

2013-07-19 Thread Ruud Baars
For spell checking, you might want to pay some attention to the capitals. Proper names and adjectives are very much the same in Turkish, I was told. But for postagging they might need different postags. Not all spell checking lists are good enough for that. I might be able to even contribute s

Re: New Language Turkish

2013-07-19 Thread Ruud Baars
You could start by getting the old postag file, and check out the postags in that. From there, it might be possible to apply these postags to the new words. On 19-07-13 23:26, Sinan Yolal wrote: Thank you for your collaboration. I think we can extract 1.4 million Turkish words from another op

Re: New Language Turkish

2013-07-19 Thread Daniel Naber
On 19.07.2013 23:26, Sinan Yolal wrote: > I think we can extract 1.4 million Turkish words from another open > source project if it is okay: > > http://code.google.com/p/zemberek/ [4] Yes, if the MPL license of the project also applies to the word list we could use that. There's a descripti

Re: New Language Turkish

2013-07-19 Thread Sinan Yolal
Thank you for your collaboration. I think we can extract 1.4 million Turkish words from another open source project if it is okay: http://code.google.com/p/zemberek/ We can start by creating a spellcheck database and then move on to creating grammar rules. What do you think? Ruud, I would real

Re: New Language Turkish

2013-07-19 Thread R.J. Baars
You are right. It is right there for download. Ruud > I think there was a complete dictionary and a tagger in the abandoned > project we mentioned earlier... But I may be wrong. > > Best, > Marcin > > On 2013-07-19 08:09, R.J. Baars wrote: >> When you are going to need postags, I would be happ

Re: New Language Turkish

2013-07-19 Thread Marcin Miłkowski
I think there was a complete dictionary and a tagger in the abandoned project we mentioned earlier... But I may be wrong. Best, Marcin On 2013-07-19 08:09, R.J. Baars wrote: > When you are going to need postags, I would be happy to co-operate on > Turkish; I can provide a Turkish words list

Re: New Language Turkish

2013-07-18 Thread R.J. Baars
When you are going to need postags, I would be happy to co-operate on Turkish; I can provide a Turkish word list and word frequencies. Ruud (Dutch) > Hi everyone, > > My name is Sinan Yolal. I do not know so much about Java and I am learning > XML. If it is okay, I would like to add Turkish to L

Re: New Language Turkish

2013-07-18 Thread Daniel Naber
Hi Sinan, > My name is Sinan Yolal. I do not know so much about Java and I am > learning XML. If it is okay, I would like to add Turkish to > LanguageTool program using only the XML rules. welcome to LanguageTool! We're looking forward to contributions for Turkish. There once was a version of LT

New Language Turkish

2013-07-18 Thread Sinan Yolal
Hi everyone, My name is Sinan Yolal. I do not know much about Java and I am learning XML. If it is okay, I would like to add Turkish to the LanguageTool program using only the XML rules. Waiting for your comments and replies... Sinan Yolal