antiword

2001-04-16 Thread Christoph Bugel

My local unix guru just showed me antiword.
It is a really cool (and small and simple) utility to convert m$word files into
nicely formatted ascii text (also postscript). It works great for hebrew too.
(I pipe the output through 'rev'. The english is reversed as well, but at least
I can read all those hebrew attachments that I used to ignore until now, and
also for english it is so much faster and better than starting a slow and
annoying gui session with openoffice or koffice.)
I downloaded && installed it and it is really cool.

The website is http://www.winfield.demon.nl/
What I did was to pipe the output through |rev|more or |rev|less. For less you
should make sure that the LESSCHARSET variable is set correctly (I used
latin1). Also I wrote a script to start an xterm with -fn heb8x13, so that I
can easily invoke it from my mail client (mutt).
I was impressed by this useful little utility and I hope this posting will be
useful for somebody.
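
A minimal sketch of the pipeline described above, assuming antiword, rev and
less are on the PATH. The function name view_doc and the ANTIWORD and
PAGER_CMD override variables are hypothetical (they are not antiword
options); latin1 is simply the LESSCHARSET value mentioned above.

```shell
#!/bin/sh
# view_doc FILE: convert a Word file to text, reverse every line with rev,
# and page the result. LESSCHARSET defaults to latin1 unless already set.
# ANTIWORD and PAGER_CMD are hypothetical overrides for trying the pipeline
# with other commands.
view_doc() {
    "${ANTIWORD:-antiword}" "$1" | rev |
        LESSCHARSET="${LESSCHARSET:-latin1}" ${PAGER_CMD:-less}
}
```

Usage would be something like: view_doc attachment.doc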



=
To unsubscribe, send mail to [EMAIL PROTECTED] with
the word "unsubscribe" in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]




Re: antiword

2001-04-16 Thread Dan Kenigsberg

I would like to thank both you and your unix guru.
I downloaded and compiled antiword on a Solaris box without any problem. It
indeed seems cool.

I feel obliged to mention here that Nadav's bidiv is much better than rev for
viewing mixed English/Logical-Hebrew text files.
Check ftp://ftp.ivrix.org.il/pub/ivrix/src/cmdline/bidiv-1.2.tgz .

Thanks (really),

Dan.





Re: antiword

2001-04-16 Thread Tzafrir Cohen

On Mon, 16 Apr 2001, Christoph Bugel wrote:

> My local unix guru just showed me antiword.
> It is a really cool (and small and simple) utility to convert m$word files into
> nicely formatted ascii text. (also postscript). It works great for hebrew too.

But not the postscript, right?

> (I pipe the output through 'rev'. The english is reversed as well, but at least
> I can read all those hebrew attachments that I used to ignore until now, and
> also for english it is so much faster and better than starting a slow and
> annoying gui session with openoffice or koffice.)
> I downloaded && installed it and it is really cool.
> 
> The website is http://www.winfield.demon.nl/
> What I did was to pipe the output through |rev|more or |rev|less. for less you
> should make sure that the LESSCHARSET variable is ok. (I used latin1).

If your language settings are correct, then your charset is probably
iso-8859-8 by default, and then you don't have to set LESSCHARSET
explicitly.

See locale(7) and less(1)
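
A rough sketch of that check, assuming the standard "locale charmap" command
is available. The function name pick_lesscharset is hypothetical, and the
latin1 fallback is just the value used earlier in this thread, meant only for
a session that views antiword's 8859-8 output.

```shell
#!/bin/sh
# pick_lesscharset CHARMAP: print a LESSCHARSET fallback for the given locale
# charmap, or print nothing when the locale is already ISO-8859-8.
pick_lesscharset() {
    case "$1" in
        ISO-8859-8|ISO8859-8|iso-8859-8|iso8859-8) ;;  # locale is fine as-is
        *) echo latin1 ;;                              # fallback from the thread
    esac
}

charmap=$(locale charmap 2>/dev/null)
fallback=$(pick_lesscharset "$charmap")
# Only set LESSCHARSET when a fallback is needed and the user has not set it.
if [ -n "$fallback" ] && [ -z "${LESSCHARSET:-}" ]; then
    LESSCHARSET=$fallback
    export LESSCHARSET
fi
```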
 
> Also I
> wrote a script to start an xterm with -fn heb8x13, so that I can easily invoke
> it from my mail client (mutt).

In my .Xdefaults I had:

XTerm*Font: heb8x13

The downside of such a setting is that the bold font is not supplied.

> I was impressed by this useful little utility and I hope this posting will be
> useful for somebody.

My current 'wordv' script does:

antiword -m 8859-8.txt "$1" | bidiv


BTW: is there a simple way to make such a script work for standard-input?
antiword does not accept input from the standard input.

-- 
Tzafrir Cohen
mailto:[EMAIL PROTECTED]
http://www.technion.ac.il/~tzafrir






Re: antiword

2001-04-16 Thread Christoph Bugel

On Mon 2001-04-16, Tzafrir Cohen wrote:

[snip]

> > wrote a script to start an xterm with -fn heb8x13, so that I can easily invoke
> > it from my mail client (mutt).
> 
> In my .Xdefaults I had:
> 
> XTerm*Font: heb8x13
> 
> The downside for such a setting is that the bold font is not supplied.

And also that the hebrew font is too small :-( I need a hebrew font only
rarely when I need to read something, so I prefer to stay with a normal
font. I wonder why the hebrew font is still not available in larger sizes
and bold. (or is it?)

[snip]

> My current 'wordv' script does:
> 
> antiword -m 8859-8.txt "$1" | bidiv
> 
> 
> BTW: is there a simple way to make such a script work for standard-input?
> antiword does not accept input from the standard input.

I think antiword will eventually have this feature, since it is only at
version 0.3.*. (Maybe I should help them..), but currently antiword expects
a filename, so no matter how clever the shell script, there still has to be
an actual file on the disk, so using tempfiles is unavoidable.

My script is really ugly, I think, but it works for me :). This is what I do:
I use 'cat' to accept stdin and write it into a tmp file, which is
deleted later (officially I should use 'trap' to delete the tmp files).

#
#!/bin/sh
# note, LESSCHARSET should be set correctly. (otherwise only 'more' works)
# yes, or I should fix my locale..

script=/tmp/antiword-script$$
data=/tmp/antiword-data$$

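# antiword needs a real file, so save stdin first
cat > "$data"
# xterm -e wants a single command, so write the pipeline into a helper script
echo /home/chris/bin/antiword -m 8859-8.txt "$data" '|' rev '|' less > "$script"
chmod +x "$script"
xterm -fn heb8x13 -e "$script"
rm -f "$data" "$script"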
#
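
For what it's worth, the 'trap' variant mentioned above could look roughly
like this. It is only a sketch: the function name with_stdin_file is
hypothetical, and it assumes mktemp is installed. It saves stdin to a temp
file, runs the given command with that file name appended as the last
argument, and removes the file on exit or when interrupted.

```shell
#!/bin/sh
# with_stdin_file CMD [ARGS...]: save stdin to a temp file, run CMD with the
# file name appended as the last argument, then delete the file.
with_stdin_file() {
    # The subshell keeps the traps and the temp file local to this call.
    (
        data=$(mktemp "${TMPDIR:-/tmp}/antiword-data.XXXXXX") || exit 1
        trap 'rm -f "$data"' EXIT      # clean up on normal exit
        trap 'exit 1' HUP INT TERM     # turn signals into an exit, so EXIT runs
        cat > "$data"
        "$@" "$data"
        status=$?
        exit "$status"
    )
}
```

From mutt it could then be invoked as:
with_stdin_file xterm -fn heb8x13 -e sh -c 'antiword -m 8859-8.txt "$1" | rev | less' sh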

And thanks (also to Dan) for the bidiv tip. I will try it.







Re: antiword

2001-04-17 Thread Ely Levy

yep it's really nice
antiword -m /usr/local/share/antiword/8859-8.txt ~/tmp/file.doc |
    fribidi -charset 8859-8 | hpstext
does the work nicely
(thanks to our anonymous local hebrew guru;)


Ely Levy
System group
Hebrew University 
Jerusalem Israel



On Mon, 16 Apr 2001, Christoph Bugel wrote:

|  My local unix guru just showed me antiword.
|  It is a really cool (and small and simple) utility to convert m$word files into
|  nicely formatted ascii text. (also postscript). It works great for hebrew too.
|  (I pipe the output through 'rev'. the english is reversed as well, but at least
|  I can read all those hebrew attachments that I used to ignore until now, and
|  also for english it is so much faster and better than starting a slow and
|  annoying gui session with openoffice or koffice)
|  I downloaded && installed it and it is really cool.
|  
|  The website is http://www.winfield.demon.nl/
|  What I did was to pipe the output through |rev|more or |rev|less. for less you
|  should make sure that the LESSCHARSET variable is ok. (I used latin1). Also I
|  wrote a script to start an xterm with -fn heb8x13, so that I can easily invoke
|  it from my mail client (mutt).
|  I was impressed by this useful little utility and I hope this posting will be
|  useful for somebody.






Re: antiword

2001-04-17 Thread Michael Rozhavsky

Hi,

Where can I find hpstext?

Thanks.

--

  Rozhavsky Michael

On Tue, 17 Apr 2001, Ely Levy wrote:

> yep it's really nice
> antiword -m /usr/local/share/antiword/8859-8.txt ~/tmp/file.doc |
>     fribidi -charset 8859-8 | hpstext
> does the work nicely
> (thanks to our anonymous local hebrew guru;)
> 
> 
> Ely Levy
> System group
> Hebrew University 
> Jerusalem Israel
> 
[snip]






Re: antiword

2001-04-17 Thread Michael Rozhavsky

Hi,

where can I find hpstext?

Thank you.

On Tue, Apr 17, 2001 at 12:39:53PM +0300, Ely Levy wrote:
> yep it's really nice
> antiword -m /usr/local/share/antiword/8859-8.txt ~/tmp/file.doc |
>     fribidi -charset 8859-8 | hpstext
> does the work nicely
> (thanks to our anonymous local hebrew guru;)
> 
> 
> Ely Levy
> System group
> Hebrew University 
> Jerusalem Israel
> 
> 
[snip]

-- 
Regards,

   Michael Rozhavsky
   
   Optical Access
   Senior Software Engineer
   Tel:+972-4-9936248
   Fax:+972-4-9890564
   E-mail: [EMAIL PROTECTED]
   URL:www.opticalaccess.com





A Hebrew fix for antiword

2000-09-15 Thread Sam (Uli) Freed

I sent this to [EMAIL PROTECTED] a while back & it bounced. Dear
IGLU, whoever you are, PLEASE have an Email address.

Uli

Hi,

This is a fix to chartrans.c in antiword to make hebrew chars display
correctly:

241a242,247
> #define HEBREW
> #ifdef HEBREW
> 	if (iChar > 1487 && iChar < 1515) { /* Hebrew */
> 		return iChar-1488+224;
> 	}
> #endif /* HEBREW */
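
To see what the patch computes: Unicode alef is code point 1488 (U+05D0) and
tav is 1514 (U+05EA), while ISO-8859-8 puts alef at byte 224 (0xE0). The same
arithmetic as a small shell sketch; the function name heb_to_8859_8 is made
up for illustration, and the else branch merely stands in for the rest of
chartrans.c, which is not shown here.

```shell
#!/bin/sh
# heb_to_8859_8 CODEPOINT: map a decimal Unicode code point in the Hebrew
# letter range (1488..1514) to its ISO-8859-8 byte value, as the patch does.
heb_to_8859_8() {
    if [ "$1" -gt 1487 ] && [ "$1" -lt 1515 ]; then
        echo $(( $1 - 1488 + 224 ))
    else
        echo "$1"   # placeholder: other characters are handled elsewhere
    fi
}

heb_to_8859_8 1488   # alef, prints 224
heb_to_8859_8 1514   # tav, prints 250
```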

A neat script is:
$ cat ~sam/bin/hantiword
#!/bin/csh -f
antiword $1 | fribidi -charset 8859-8

Use "more" to view, since "less" considers hebrew non-displayable. Also,
use xterm  -fn heb6x13 to display.

I know this is primitive and that const-const can be optimised. It
works. It's late. This is my contribution to fighting Microsoft for the
night. "Veletif'eret Medinat Yisrael".

Sam
--
Sam (Uli) Freed






Re: A Hebrew fix for antiword

2000-09-15 Thread Matan Ziv-Av


On Fri, 15 Sep 2000, Sam (Uli) Freed wrote:

> Use "more" to view, since "less" considers hebrew non-displayable. 

Use less -r to display 8-bit chars.




-- 
Matan Ziv-Av. [EMAIL PROTECTED]

