Re: Questions about new date checking rule

2014-08-31 Thread Marcin Miłkowski
W dniu 2014-08-30 23:35, Dominique Pellé pisze: > Daniel Naber > wrote: > > On 2014-08-29 21:50, Dominique Pellé wrote: > > > Message: The date 31 September 2014 is not a Monday, but a Wednesday. > > Monday, 31 September 2014 > > I've now mad

Re: Questions about new date checking rule

2014-08-31 Thread Daniel Naber
On 2014-08-30 23:35, Dominique Pellé wrote: > By the way, it would be good to list somewhere those XML > grammar rules that can be useful in all or most languages. An external list is difficult to keep up-to-date. What about adding an attribute i18n="yes" to those rules? It would be used only f

Re: Questions about new date checking rule

2014-08-30 Thread Dominique Pellé
Daniel Naber wrote: On 2014-08-29 21:50, Dominique Pellé wrote: > > > Message: The date 31 September 2014 is not a Monday, but a Wednesday. > > Monday, 31 September 2014 > > I've now made date parsing more strict, but the rule won't complain > about these dates and just ignore them. So to catch t

Re: Questions about new date checking rule

2014-08-30 Thread Daniel Naber
On 2014-08-29 21:50, Dominique Pellé wrote: > Message: The date 31 September 2014 is not a Monday, but a Wednesday. > Monday, 31 September 2014 I've now made date parsing more strict, but the rule won't complain about these dates and just ignore them. So to catch them, you need other rules. See

Re: Questions about new date checking rule

2014-08-29 Thread Dominique Pellé
Daniel Naber wrote: > On 2014-08-29 07:47, Dominique Pellé wrote: > >> Would there be a way to say something like instead: >> >> The date October 7, 2014 is not a Monday, but a Tuesday. > > This is now implemented, you can use "\realDay" in your message and it > will be replaced. Note that Java o

Re: Questions about new date checking rule

2014-08-29 Thread Daniel Naber
On 2014-08-29 07:47, Dominique Pellé wrote: > However, I'm not able to write a rule to detect > error in *Vendredi 28-08-2014*. I've added a rule for that now: (...) I don't claim this is super elegant, as it requires a new Java class, but it's the best I could think of. Regards Daniel -

Re: Questions about new date checking rule

2014-08-29 Thread Daniel Naber
On 2014-08-29 07:47, Dominique Pellé wrote: > Would there be a way to say something like instead: > > The date October 7, 2014 is not a Monday, but a Tuesday. This is now implemented, you can use "\realDay" in your message and it will be replaced. Note that Java only knows the day names for a f

Re: Questions about new date checking rule

2014-08-29 Thread Daniel Naber
On 2014-08-29 07:47, Dominique Pellé wrote: > However, I'm not able to write a rule to detect > error in *Vendredi 28-08-2014*. You're right, I'll try to extend the rule to support that. > Would there be a way to say something like instead: > > The date October 7, 2014 is not a Monday, but a Tu

Re: Questions about new date checking rule

2014-08-29 Thread R.J. Baars
In Dutch texts, all of the stupid forms are present. Nevertheless, it might help having rules forcing them into the official Dutch standard. But then, most CMS's have to be tweaked to show a correct Dutch date. Had to dive deep in my Joomla to get that fixed... I will think about these 'forcing'

Re: Questions about new date checking rule

2014-08-29 Thread Dominique Pellé
R.J. Baars wrote: > A different question: what about dates like '08-07-2014'or '2014/08/07' > One cannot tell which is month and which is day, isn't it? Are both > options considered then? And what of notations '04/05/06'; it is > completely unclear which is month, year and day. > > (All cases ac

Re: Questions about new date checking rule

2014-08-28 Thread R.J. Baars
A different question: what about dates like '08-07-2014'or '2014/08/07' One cannot tell which is month and which is day, isn't it? Are both options considered then? And what of notations '04/05/06'; it is completely unclear which is month, year and day. (All cases actually are in source texts ...

Questions about new date checking rule

2014-08-28 Thread Dominique Pellé
Hi I saw that date checking was added to LT. Thanks for that. I've added support for date checkin in French (as was done already in "en", "de", "pl", "ca"). I have 2 remarks: 1) LT detects date inconsistency in French as in: * Vendredi 28/08/2014 (it should be a Thursday, not a Friday) * Vendr

Re: "Ignore All" issue/bug in LibO (was: Questions on Using LT in LibO)

2014-08-14 Thread Elanjelian Venugopal
Hi Daniel, On 13 August 2014 19:43, Daniel Naber wrote: > > This should be fixed now, at least for documents that are mostly Tamil > (we don't use the cursor position, as I assumed, but we look at the > beginning of the text and assume Tamil if at least one of the first 100 > characters is in th

Re: "Ignore All" issue/bug in LibO (was: Questions on Using LT in LibO)

2014-08-13 Thread Daniel Naber
On 2014-08-13 22:05, Elanjelian Venugopal wrote: > Also, btw, the UI in LibO (concerning LT) is still in English. I think > I've already translated these strings. Please see languagetool-office-extension/src/main/resources/Addons.xcu and description.xml, these are not part of our standard i18n p

Re: "Ignore All" issue/bug in LibO (was: Questions on Using LT in LibO)

2014-08-13 Thread Elanjelian Venugopal
OK, great. Will test and report. Also, btw, the UI in LibO (concerning LT) is still in English. I think I've already translated these strings. On 13 August 2014 19:43, Daniel Naber wrote: > On 2014-08-13 11:55, Elanjelian Venugopal wrote: > > > not for Tamil. As I've explained in my other e-ma

Re: "Ignore All" issue/bug in LibO (was: Questions on Using LT in LibO)

2014-08-13 Thread Daniel Naber
On 2014-08-13 11:55, Elanjelian Venugopal wrote: > not for Tamil. As I've explained in my other e-mail, Tamil uses common > characters like numbers, symbols and punctuation from other areas, but > the Tamil block itself starts at U+0B82 and ends at U+0BFA. This should be fixed now, at least for d

Re: Questions on Using LT in LibO

2014-08-13 Thread Daniel Naber
On 2014-08-13 11:42, Elanjelian Venugopal wrote: > The current arrangement works for most word processing needs; so, I'm > not sure if this could be considered a LibO bug? Interestingly there > are no issues in LT standalone. That's because we consider the whole text to be Tamil in the stand-alon

Re: "Ignore All" issue/bug in LibO (was: Questions on Using LT in LibO)

2014-08-13 Thread Elanjelian Venugopal
On 13 August 2014 17:29, Daniel Naber wrote: > > This would need to be implemented in LO. LT doesn't know which document > is being checked, we only see the text. > Hmm... if there is consensus among the members here a bug report could be filed, I suppose... > It's supposed to use the language

Re: Questions on Using LT in LibO

2014-08-13 Thread Elanjelian Venugopal
2014-08-13 15:56 GMT+08:00 Daniel Naber : > > When I paste that into LO 4.2, the quote characters always end up being > set to German, even when I mark all the text and set it to Tamil. This > will probably lead to LT getting called several times with sentence > fragments instead of the whole sent

Re: "Ignore All" issue/bug in LibO (was: Questions on Using LT in LibO)

2014-08-13 Thread Daniel Naber
On 2014-08-13 10:54, Elanjelian Venugopal wrote: > This method doesn't work for me, I fear. I prefer the way MS Word > handles this matter -- i.e. the "Ignore All" only ignores the rule for > the particular document. This would need to be implemented in LO. LT doesn't know which document is bein

Re: "Ignore All" issue/bug in LibO (was: Questions on Using LT in LibO)

2014-08-13 Thread Elanjelian Venugopal
Hi Daniel, On 13 August 2014 15:32, Daniel Naber wrote: > > That's how it's supposed to work, more or less. Although it should > probably be called "Turn off Rule" or something like that, but I'm not > sure if that makes sense for other checkers. > This method doesn't work for me, I fear. I pre

Re: Questions on Using LT in LibO

2014-08-13 Thread Daniel Naber
On 2014-08-11 11:45, Elanjelian Venugopal wrote: > 1) Take this sample sentence (in Tamil): > > இந்த பெட்டியில் உங்கள் > உரையை பிடிக்க தடுமாறலாம் > ஒட்டி சரிவர சோதிக்கிறதா > என பாருங்கள் பல. பிழைகளை > பிடிக்க தடுமாறலாம். > > "லேங்குவேஜ் டூல்" சில > இலக்கண பிழைகளை சரியாக > கண்டுபிடிக்கும். When

Re: Questions on Using LT in LibO

2014-08-13 Thread Daniel Naber
On 2014-08-11 14:27, Elanjelian Venugopal wrote: > So, it works fine for: > > "He is also know as the first Chechen writer." - name="also know (known)"> > > But not for: > > "All though the day: a song by Anita O'Day." - id="ALLTHOUGH" name="all though (although)"> > > Is this is a bug? Wel

Re: "Ignore All" issue/bug in LibO (was: Questions on Using LT in LibO)

2014-08-13 Thread Daniel Naber
On 2014-08-12 11:33, Elanjelian Venugopal wrote: > 2) "Ignore All" however is problematic. If used, it ignores not merely > all the instances in the particular document, but all future instances > in all future documents, as well. That's how it's supposed to work, more or less. Although it should

Re: "Ignore All" issue/bug in LibO (was: Questions on Using LT in LibO)

2014-08-12 Thread Elanjelian Venugopal
nstances within a document, and not ignore in a another, separate document. I think MS Word does this well. Should a bug report be filed at LO? Please advise. Best regards, Elanjelian On 11 August 2014 20:27, Elanjelian Venugopal wrote: > Hi, > > This is a follow-up to my earlier

Re: Questions on Using LT in LibO

2014-08-11 Thread Elanjelian Venugopal
Hi, This is a follow-up to my earlier questions. I'd like to refer to Q3 that relate to "Ignore Rule" command in LibO. It appears the problem exists also for English when the rule is part of a rulegroup. It works fine if the rule is defined individually. So, it works fine for: &

Questions on Using LT in LibO

2014-08-11 Thread Elanjelian Venugopal
Hi, While using the LT extension in LibO I encountered a number of issues. Not sure if these are specific to Tamil (given its limited state of development), and hence my writing about it here. 1) Take this sample sentence (in Tamil): இந்த பெட்டியில் உங்கள் உரையை பிடிக்க தடுமாறலாம் ஒட்டி சரிவர சோ

Re: Questions about creating a synthesizer dictionary

2014-07-13 Thread Dominique Pellé
Daniel Naber wrote: > On 2014-07-11 22:43, Dominique Pellé wrote: > > > 1/ Why does the above command create files in /tmp rather than > > providing command line options to specify the outputs? > > There's no specific reason that I can remember. Feel free to change the > command. > > > 2/ Langua

Re: Questions about creating a synthesizer dictionary

2014-07-12 Thread Marcin Miłkowski
W dniu 2014-07-11 23:01, Daniel Naber pisze: > On 2014-07-11 22:43, Dominique Pellé wrote: > >> 1/ Why does the above command create files in /tmp rather than >> providing command line options to specify the outputs? > > There's no specific reason that I can remember. Feel free to change the > comm

Re: Questions about creating a synthesizer dictionary

2014-07-11 Thread Daniel Naber
On 2014-07-11 22:43, Dominique Pellé wrote: > 1/ Why does the above command create files in /tmp rather than > providing command line options to specify the outputs? There's no specific reason that I can remember. Feel free to change the command. > 2/ LanguageTool source tree contains *.sh and

Questions about creating a synthesizer dictionary

2014-07-11 Thread Dominique Pellé
files: /tmp/SynthDictionaryBuilder2431379053960263718.txt_tags.txt /tmp/DictionaryBuilder2731894273062211374.dict I'm not sure that the above steps are correct as I have not yet made the Java changes to use the synthesizer dictionary. But I have 2 questions: 1/ Why does the above command cr

Re: Questions

2013-09-07 Thread Daniel Naber
On 2013-09-07 06:35, Kumara Bhikkhu wrote: > Does the LT db link English adjectives with their adverbs? Unfortunately not. Most other forms are linked, e.g. one can get the gerund, but not the adverb. Regards Daniel -- http://www.danielnaber.de -

Re: Questions

2013-09-06 Thread Kumara Bhikkhu
Does the LT db link English adjectives with their adverbs? If so, I'm thinking of making the below come with a quick Correct. Kumara Bhikkhu wrote thus at 12:31 PM 07-09-13: Never mind. I've decided that JJ is better way. Here's what I've come up with: in a regexp="yes">pr

Re: Questions

2013-09-06 Thread Kumara Bhikkhu
Never mind. I've decided that JJ is better way. Here's what I've come up with: in a regexp="yes">preparatory|good|modern|christian manner Consider replacing "in a X manner" with adverb for X; eg, "in a hasty manner" with "hastily". Possibly wordy He c

Re: Questions

2013-09-06 Thread gulp21
>>> Is .*? the best way to specify "any-word" as a token? I wonder >>> if it might be overkill. >> >> You can just use for that. > > What if I need to specify exceptions? Use …. Regards Markus -- Learn the latest--Vis

Re: Questions

2013-09-06 Thread Kumara Bhikkhu
Daniel Naber wrote thus at 04:26 PM 06-09-13: > > Is .*? the best way to specify "any-word" as a token? I wonder if it > > might be overkill. > >You can just use for that. What if I need to specify exceptions? kb Can we possible teach a computer English? -

Re: Questions

2013-09-06 Thread Marcin Miłkowski
W dniu 2013-09-06 05:07, Kumara Bhikkhu pisze: > Does LT identify parts of speech? If so, how do I indicate adjective? In which language? Parts of speech tags are defined per language. For example, the pos tag for English is "JJ". > Is .*? the best way to specify "any-word" as a token? I wonder

Re: Questions

2013-09-06 Thread Daniel Naber
On 2013-09-06 05:07, Kumara Bhikkhu wrote: > Does LT identify parts of speech? If so, how do I indicate adjective? Yes, this is documented here, including a link to all part-of-speech tags: http://languagetool.org/development/#xmlrules > Is .*? the best way to specify "any-word" as a token? I w

Questions

2013-09-05 Thread Kumara Bhikkhu
Does LT identify parts of speech? If so, how do I indicate adjective? Is .*? the best way to specify "any-word" as a token? I wonder if it might be overkill. -- Learn the latest--Visual Studio 2012, SharePoint 2013, SQL

Re: some questions on Java rules

2013-04-05 Thread Daniel Naber
On 04.04.2013, 23:54:37 Paolo Bianchini wrote: > I would probably need to override that method. What's the correct way of > doing that? I just do not want to copy and paste the whole rule to make > an ItalianLongSentence Probably like this: public class ItalianLongSentenceRule extends LongSenten

some questions on Java rules

2013-04-05 Thread Paolo Bianchini
Hey Guys, I wanted to start looking into Java rules for Italian. I have some questions. I have been studying th ecode to understand how English javae rules work. First of all I see that some of the generic rules that are implemented into org.languagetool.rules are active for Italian because

Re: [Languagetool] A couple of questions on syntax and language differences

2012-07-29 Thread Mike Unwalla
I agree with Henrik. In English, ellipsis does not always end a sentence. In English, the rules for ellipsis are more complex than Henrik suggests. Four dots is an option. For American English, the Chicago Manual of Style, 15th Edition explains the 'three-dot method', the 'three-or-four-dot meth

Re: [Languagetool] A couple of questions on syntax and language differences

2012-07-25 Thread Henrik Bendt
2012/7/24 Susana Sotelo > Henrik Bendt escribiu: > > > I am having trouble with catching everything but a dot. > > The following code is testing for a string with "[something] A > [something]" > > which should be "[something] A. [something]" > > > > > > > > A > > . > >

Re: [Languagetool] A couple of questions on syntax and language differences

2012-07-24 Thread Daniel Naber
On Dienstag, 24. Juli 2012, Henrik Bendt wrote: > Regarding ellipsis (the three dots [...]), the Danish gramma dictates > that it is not the end of a sentence unless the letter after is in > uppercase. Is this not the case for English or other languages? I'm not sure if there are strict rules abo

Re: [Languagetool] A couple of questions on syntax and language differences

2012-07-24 Thread Susana Sotelo
Henrik Bendt escribiu: > I am having trouble with catching everything but a dot. > The following code is testing for a string with "[something] A [something]" > which should be "[something] A. [something]" > > > > A > . > > Should be A. > Possible missin

[Languagetool] A couple of questions on syntax and language differences

2012-07-24 Thread Henrik Bendt
Hello everybody I am not sure wether to ask programming-concerned questions here or on the forum, but here goes: I am having trouble with catching everything but a dot. The following code is testing for a string with "[something] A [something]" which should be "[something