Re: Hebrew *.doc and e-mail experiment

2003-02-04 Thread Arie Folger
On Tuesday 04 February 2003 11:29, shlomo solomon wrote:
> I received an attachment containig a Hebrew
> Word .doc file. Since I had to read it, I tried an experiment and it
> worked. Here's what I did. I first opened the file with OpenOffice. This is
> **not** the Hebrew version being developed, but straight out-of-the box

> I know this is not an ideal solution for Hebrew, but in an emergency, it
> works. FWIW.

This solution is a better, though still cumbersome:
open the file in OO, then save as html, and voila, including formatting.

Arie
-- 
It is absurd to seek to give an account of the matter to a man 
who cannot himself give an account of anything; for insofar as
he is already like this, such a man is no better than a vegetable.
   -- Book IV of Aristotle's Metaphysics



=
To unsubscribe, send mail to [EMAIL PROTECTED] with
the word "unsubscribe" in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]




Re: Hebrew *.doc and e-mail experiment

2003-02-04 Thread Tzafrir Cohen
On Tue, 4 Feb 2003, shlomo solomon wrote:

> This may be of some interest. I received an attachment containig a Hebrew Word
> .doc file. Since I had to read it, I tried an experiment and it worked.
> Here's what I did. I first opened the file with OpenOffice. This is **not**
> the Hebrew version being developed, but straight out-of-the box Mandrake 9.0.
> The Hebrew fonts in the document were OK, but since there's no RTL support,
> everything was backward. I then cut-and-pasted into KMail's composer (again
> out-of-the box Mandrake 9.0). And the document was correctly detected as
> Hebrew - including the correct RTL direction.

Grab antiword. gets the text, the whole text, and only the text. Usually
this is what you need to read a document :-) . bidi support can be added
by piping the text to fribidi or to bidiv .

Version 0.33 supports OfficeXP as well.

There is also WVware (It is used in abiword as well), which can translate
word documents to HTML documents.

-- 
Tzafrir Cohen
mailto:[EMAIL PROTECTED]
http://www.technion.ac.il/~tzafrir


=
To unsubscribe, send mail to [EMAIL PROTECTED] with
the word "unsubscribe" in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]




Re: Hebrew *.doc and e-mail experiment

2003-02-04 Thread Nadav Har'El
On Tue, Feb 04, 2003, shlomo solomon wrote about "Hebrew *.doc and e-mail experiment":
> everything was backward. I then cut-and-pasted into KMail's composer (again 
> out-of-the box Mandrake 9.0). And the document was correctly detected as 
> Hebrew - including the correct RTL direction.

But you lost all the formatting of the text.
If text is enough for you, you can also use antiword, it's much easier and
more automatic. I have the following script to which I pipe Hebrew MS-Word
attachments I get:

#!/bin/zsh
# hebrew ms-word viewer
antiword -m 8859-8.txt =(cat) | bidiv

Which works quite nicely. It even keeps some of the text formating (with the
limitations of ascii, of course).

BTW, for the Debian users out there, thanks to Baruch Even bidiv is available
as a package (see http://packages.debian.org/unstable/text/bidiv.html).

-- 
Nadav Har'El|   Tuesday, Feb 4 2003, 3 Adar I 5763
[EMAIL PROTECTED] |-
Phone: +972-53-245868, ICQ 13349191 |Boat: A hole in the water surrounded by
http://nadav.harel.org.il   |wood into which one pours money.

=
To unsubscribe, send mail to [EMAIL PROTECTED] with
the word "unsubscribe" in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]




Hebrew *.doc and e-mail experiment

2003-02-04 Thread shlomo solomon
This may be of some interest. I received an attachment containig a Hebrew Word 
.doc file. Since I had to read it, I tried an experiment and it worked. 
Here's what I did. I first opened the file with OpenOffice. This is **not** 
the Hebrew version being developed, but straight out-of-the box Mandrake 9.0. 
The Hebrew fonts in the document were OK, but since there's no RTL support, 
everything was backward. I then cut-and-pasted into KMail's composer (again 
out-of-the box Mandrake 9.0). And the document was correctly detected as 
Hebrew - including the correct RTL direction.

BTW - I forwarded the new mail message to someone using Outlook on WindowsXP 
and the e-mail arrived with the Hebrew displayed correctly.

I know this is not an ideal solution for Hebrew, but in an emergency, it 
works. FWIW.

-- 
Shlomo Solomon
http://come.to/shlomo.solomon
Sent by KMail (KDE 3.0.5a) on LINUX Mandrake 9.0



To unsubscribe, send 
mail to [EMAIL PROTECTED] with
the word "unsubscribe" in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]