Hi Alan,

You also need to remove Hebrew script from $ALetter, otherwise another general rule may take charge on Hebrew word.

$Hebrew = [:Script = Hebrew:];
$DoubleQuote = \u0022;
$Hebrew+ $DoubleQuote $Hebrew+;

$ALetter = [\u0002 [:Alphabetic:] [:name= COMMERCIAL AT:] [:name= HEBREW PUNCTUATION GERESH:]
                          - $Ideographic
                          - $Katakana
                          - $Hangul
                          - $Hebrew
                          - [:Script = Thai:]
                          - [:Script = Lao:]
                          - [:Script = Hiragana:]];

Regards,
Karl.

On 2008年11月06日 11:44, Alan Yaniger wrote:
Hi Karl,

I've added mt rule to dict_word_he.txt, and rebuilt, but still no change. Is the syntax of my rule OK?

Alan

Karl Hong wrote:
Hi Alan,

There is a dict_word_he.txt in the directory, it will be used for Hebrew text, you need to add the rule in this file.

Thanks,
Karl.

On 2008年11月06日 11:15, Alan Yaniger wrote:
Hi Karl,

Thanks for your help, but I'm still having difficulty.

I've added the following at the end of data/dict_word.txt:

$HebrewLetter   = [\u05d0-\u05ea];
$DoubleQuote         = \u0022;
$HebrewLetter+  $DoubleQuote  $HebrewLetter+;

but it doesn't have any effect. If I have a word like L"HZ, the spell checker still marks HZ as a separate word. Is there something wrong with my syntax?

Alan

Karl Hong wrote:
Hi Alan,

I would suggest you write a rule in data/dict_word.txt, something like

<hebrew_letter>+<quotation_mark><hebrew_letter>+;

it means a Hebrew word is one or more Hebrew letters, following by a quotation mark, and following by one or more Hebrew letters. for rule syntax, check ICU user guide

http://icu-project.org/userguide/boundaryAnalysis.html

Regards,
Karl.

On 2008年11月05日 11:11, Alan Yaniger wrote:
Hi Karl,

I'm trying to address issue 51772. Single or double-quotes are used in Hebrew within a word to specify the sound "j" or acronyms, respectively. At present, they are considered as word breaks during spellchecking, because their script type is not COMPLEX, but LATIN. endOfScript() treats this script change as a word break, but in Hebrew it's not. I'd like to avoid having double quotes within a word () as a word break, while treating them as a word break if they are at the beginning or end of a word (preceded or followed by whitespace, beginning or end of a paragraph).

Alan

Karl Hong wrote:
Hi Alan,

ScriptType breakiterator is not controlled by language, but Unicode script type definition. It does not like character/word/sentence/line breakiterators, which can be customized by language, only one script type breakiterator for all languages.

What would you like to do with endOfScript for Hebrew exactly?

Regards,
Karl.

On 2008?11?05? 06:41, Mathias Bauer wrote:
Hi Alan,

Alan Yaniger wrote:


Hi list-members,

For Hebrew text, I would like to override the BreakIteratorImpl::endOfScript() function.

I tried:

- writing a Breakiterator_he class (with hxx and cxx files) ,
- I added it to the SLOFILES section of makefile.mk,
- I added it to the instances array in registerservices.cxx
- I rebuilt OOo

But it's still not getting called from the Writer code I'm testing it with.
What should I do differently?

In case you don't get an answer here, I think you should try to repeat
your question on the sw dev-list.

Regards,
Mathias




---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to