Am 20.07.2013 10:21, schrieb Sinan Yolal:
> Hunspell may be good for other languages, but it is poor when it
> comes to Turkish. It has around 100,000 words while the one I
> suggested before has 1.4 mil words.
Yes, but hunspell will accept more words than there are in its list
because it knows
Hunspell may be good for other languages, but it is poor when it comes to
Turkish. It has around 100,000 words while the one I suggested before has
1.4 mil words.
I suggest we benefit from both.
But as you said, we can first write some grammer rules and I will try
writing a few and share them wit
What about abbreviations?
Hunspell accepts abbreviations without the . even when these are
actually incorrect.
LT knows about abbreviations, using srx for sentence endings at least,
but what about in-sentence abbreviations?
Ruud
On 20-07-13 08:51, Daniel Naber wrote:
> Am 20.07.2013 00:19, sc
As long as there is a Hunspell one, and it is of good quality.
On 20-07-13 08:51, Daniel Naber wrote:
> Am 20.07.2013 00:19, schrieb Daniel Naber:
>
>>> http://code.google.com/p/zemberek/ [4]
>> Yes, if the MPL license of the project also applies to the word list
>> we
>> could use that.
> Actuall
Am 20.07.2013 00:19, schrieb Daniel Naber:
>> http://code.google.com/p/zemberek/ [4]
>
> Yes, if the MPL license of the project also applies to the word list
> we
> could use that.
Actually, the easier approach is to just take an existing hunspell
dictionary and convert that. Morfologik suppor
For spell checking, you might want to pay some attention to the
capitals. Proper names and adjectives are very much the same in Turkish,
I was told. But for postagging they might need different postags.
Not all spell checking lists are good enough for that.
I might be able to even contribute s
You could start by getting the old postag file, and check out the
postags in that.
From there, it might be possible to apply these postags to the new words.
On 19-07-13 23:26, Sinan Yolal wrote:
Thank you for your collaboration.
I think we can extract 1.4 million Turkish words from another op
Am 19.07.2013 23:26, schrieb Sinan Yolal:
> I think we can extract 1.4 million Turkish words from another open
> source project if it is okay:
>
> http://code.google.com/p/zemberek/ [4]
Yes, if the MPL license of the project also applies to the word list we
could use that.
There's a descripti
Thank you for your collaboration.
I think we can extract 1.4 million Turkish words from another open source
project if it is okay:
http://code.google.com/p/zemberek/
We can start with creating a spellcheck database and then move onto
creating grammar rules. What do you think?
Ruud, I would real
You are right.
It is right there for download.
Ruud
> I think there was a complete dictionary and a tagger in the abandoned
> project we mentioned earlier... But I may be wrong.
>
> Best,
> Marcin
>
> W dniu 2013-07-19 08:09, R.J. Baars pisze:
>> When you are going to need postags, I would be happ
I think there was a complete dictionary and a tagger in the abandoned
project we mentioned earlier... But I may be wrong.
Best,
Marcin
W dniu 2013-07-19 08:09, R.J. Baars pisze:
> When you are going to need postags, I would be happy to co-operate on
> Turkish; I can provide a Turkish words list
When you are going to need postags, I would be happy to co-operate on
Turkish; I can provide a Turkish words list and word frequencies.
Ruud (Dutch)
> Hi everyone,
>
> My name is Sinan Yolal. I do not know so much about Java and I am learning
> XML. If it is okay, I would like to add Turkish to L
Hi Sinan,
> My name is Sinan Yolal. I do not know so much about Java and I am
> learning XML. If it is okay, I would like to add Turkish to
> LanguageTool program using only the XML rules.
welcome to LanguageTool! We're looking forward to contributions for
Turkish. There once was a version of LT
Hi everyone,
My name is Sinan Yolal. I do not know so much about Java and I am learning
XML. If it is okay, I would like to add Turkish to LanguageTool program
using only the XML rules.
Waiting for your comments and replies...
Sinan Yolal
-
14 matches
Mail list logo