Hi Alan,
You also need to remove Hebrew script from $ALetter, otherwise another
general rule may take charge on Hebrew word.
$Hebrew = [:Script = Hebrew:];
$DoubleQuote = \u0022;
$Hebrew+ $DoubleQuote $Hebrew+;
$ALetter = [\u0002 [:Alphabetic:] [:name= COMMERCIAL AT:] [:name=
HEBREW PUNCTUATION GERESH:]
- $Ideographic
- $Katakana
- $Hangul
- $Hebrew
- [:Script = Thai:]
- [:Script = Lao:]
- [:Script = Hiragana:]];
Regards,
Karl.
On 2008年11月06日 11:44, Alan Yaniger wrote:
Hi Karl,
I've added mt rule to dict_word_he.txt, and rebuilt, but still no
change. Is the syntax of my rule OK?
Alan
Karl Hong wrote:
Hi Alan,
There is a dict_word_he.txt in the directory, it will be used for
Hebrew text, you need to add the rule in this file.
Thanks,
Karl.
On 2008年11月06日 11:15, Alan Yaniger wrote:
Hi Karl,
Thanks for your help, but I'm still having difficulty.
I've added the following at the end of data/dict_word.txt:
$HebrewLetter = [\u05d0-\u05ea];
$DoubleQuote = \u0022;
$HebrewLetter+ $DoubleQuote $HebrewLetter+;
but it doesn't have any effect. If I have a word like L"HZ, the
spell checker still marks HZ as a separate word. Is there something
wrong with my syntax?
Alan
Karl Hong wrote:
Hi Alan,
I would suggest you write a rule in data/dict_word.txt, something like
<hebrew_letter>+<quotation_mark><hebrew_letter>+;
it means a Hebrew word is one or more Hebrew letters, following by
a quotation mark, and following by one or more Hebrew letters. for
rule syntax, check ICU user guide
http://icu-project.org/userguide/boundaryAnalysis.html
Regards,
Karl.
On 2008年11月05日 11:11, Alan Yaniger wrote:
Hi Karl,
I'm trying to address issue 51772. Single or double-quotes are
used in Hebrew within a word to specify the sound "j" or acronyms,
respectively. At present, they are considered as word breaks
during spellchecking, because their script type is not COMPLEX,
but LATIN. endOfScript() treats this script change as a word
break, but in Hebrew it's not. I'd like to avoid having double
quotes within a word () as a word break, while treating them as a
word break if they are at the beginning or end of a word (preceded
or followed by whitespace, beginning or end of a paragraph).
Alan
Karl Hong wrote:
Hi Alan,
ScriptType breakiterator is not controlled by language, but
Unicode script type definition. It does not like
character/word/sentence/line breakiterators, which can be
customized by language, only one script type breakiterator for
all languages.
What would you like to do with endOfScript for Hebrew exactly?
Regards,
Karl.
On 2008?11?05? 06:41, Mathias Bauer wrote:
Hi Alan,
Alan Yaniger wrote:
Hi list-members,
For Hebrew text, I would like to override the
BreakIteratorImpl::endOfScript() function.
I tried:
- writing a Breakiterator_he class (with hxx and cxx files) ,
- I added it to the SLOFILES section of makefile.mk,
- I added it to the instances array in registerservices.cxx
- I rebuilt OOo
But it's still not getting called from the Writer code I'm
testing it with.
What should I do differently?
In case you don't get an answer here, I think you should try to
repeat
your question on the sw dev-list.
Regards,
Mathias
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]