Hi Alan,
There is a dict_word_he.txt in the directory, it will be used for Hebrew
text, you need to add the rule in this file.
Thanks,
Karl.
On 2008年11月06日 11:15, Alan Yaniger wrote:
Hi Karl,
Thanks for your help, but I'm still having difficulty.
I've added the following at the end of data/dict_word.txt:
$HebrewLetter = [\u05d0-\u05ea];
$DoubleQuote = \u0022;
$HebrewLetter+ $DoubleQuote $HebrewLetter+;
but it doesn't have any effect. If I have a word like L"HZ, the spell
checker still marks HZ as a separate word. Is there something wrong
with my syntax?
Alan
Karl Hong wrote:
Hi Alan,
I would suggest you write a rule in data/dict_word.txt, something like
<hebrew_letter>+<quotation_mark><hebrew_letter>+;
it means a Hebrew word is one or more Hebrew letters, following by a
quotation mark, and following by one or more Hebrew letters. for rule
syntax, check ICU user guide
http://icu-project.org/userguide/boundaryAnalysis.html
Regards,
Karl.
On 2008年11月05日 11:11, Alan Yaniger wrote:
Hi Karl,
I'm trying to address issue 51772. Single or double-quotes are used
in Hebrew within a word to specify the sound "j" or acronyms,
respectively. At present, they are considered as word breaks during
spellchecking, because their script type is not COMPLEX, but LATIN.
endOfScript() treats this script change as a word break, but in
Hebrew it's not. I'd like to avoid having double quotes within a
word () as a word break, while treating them as a word break if they
are at the beginning or end of a word (preceded or followed by
whitespace, beginning or end of a paragraph).
Alan
Karl Hong wrote:
Hi Alan,
ScriptType breakiterator is not controlled by language, but Unicode
script type definition. It does not like
character/word/sentence/line breakiterators, which can be
customized by language, only one script type breakiterator for all
languages.
What would you like to do with endOfScript for Hebrew exactly?
Regards,
Karl.
On 2008?11?05? 06:41, Mathias Bauer wrote:
Hi Alan,
Alan Yaniger wrote:
Hi list-members,
For Hebrew text, I would like to override the
BreakIteratorImpl::endOfScript() function.
I tried:
- writing a Breakiterator_he class (with hxx and cxx files) ,
- I added it to the SLOFILES section of makefile.mk,
- I added it to the instances array in registerservices.cxx
- I rebuilt OOo
But it's still not getting called from the Writer code I'm
testing it with.
What should I do differently?
In case you don't get an answer here, I think you should try to
repeat
your question on the sw dev-list.
Regards,
Mathias
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]