Re: Converting to UTF-8 encoding

2007-02-18 Thread Oded Arbel
On Sun, 2007-02-18 at 09:15 +0200, Uri Even-Chen wrote:
 On 2/18/07, Shlomi Fish [EMAIL PROTECTED] wrote:
  You can use the following command to convert a single file:
 
  
  iconv -f WINDOWS-1255 -t UTF-8 oldfile  newfile
  
 
 Thanks!  It works!
 
 Anyone knows about good UTF-8 text editors?

All of the editors below support UTF-8 text, and also allow you to open
a file in any character set and save it in another - which will let you
convert files as you edit them. OTOH, none of them have particularly
good BiDi support, so YMMV as to the convinience of actually editing
HTML with Hebrew or any mixed Hebrew english text:

gedit (the default GNOME simple text editor. IMHO it has the best BiDi
support of the editors listed, but it still sucks)
leafpad (the XFCE default editor)
kwrite (the default KDE editor. Also kate which offers many enhancements
and advanced features over kwrite, but uses the same editor engine).

I've also had mixed experiences with vim using a utf-8 enabled console.

--
Oded
::..
Your program is sick!  Shoot it and put it out of its memory.



=
To unsubscribe, send mail to [EMAIL PROTECTED] with
the word unsubscribe in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]



Re: Converting to UTF-8 encoding

2007-02-18 Thread Shachar Shemesh
Uri Even-Chen wrote:
 Thanks!  It works!

 Anyone knows about good UTF-8 text editors?
On Linux - any of them. Just set LC_CTYPE to something which is UTF-8.

On Windows, any browser can display UTF-8, but notepad is the only
editor that can actually edit it, as far as I know. I'm not sure you can
call it a good text editor.
 Uri.
Shachar

-- 
Shachar Shemesh
Lingnu Open Source Consulting ltd.
Have you backed up today's work? http://www.lingnu.com/backup.html


=
To unsubscribe, send mail to [EMAIL PROTECTED] with
the word unsubscribe in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]



Re: Converting to UTF-8 encoding

2007-02-18 Thread Ira Abramov
Quoting Uri Even-Chen, from the post of Sun, 18 Feb:
 
 Thanks!  It works!
 
 Anyone knows about good UTF-8 text editors?

I use VIM...

-- 
Playboy photographer
Ira Abramov
http://ira.abramov.org/email/

=
To unsubscribe, send mail to [EMAIL PROTECTED] with
the word unsubscribe in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]



Re: Converting to UTF-8 encoding

2007-02-18 Thread Uri Even-Chen

I wrote:

 Anyone knows about good UTF-8 text editors?


Let me explain: I have to use it on Windows, and I need a search 
replace feature on many files simultaneously.  Currently I use Crimson
Editor, but it doesn't support Hebrew, doesn't display UTF-8 text
properly, and doesn't have an undo feature for search  replace.

I know notepad, but I agree it's not a good text editor, because it
doesn't support search  replace (and you can open only one file each
time).  But on the other hand, it does support Hebrew and UTF-8
encoding (but only if the first character is something specific).

Uri.

=
To unsubscribe, send mail to [EMAIL PROTECTED] with
the word unsubscribe in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]



Re: Converting to UTF-8 encoding

2007-02-18 Thread Peter



On Sun, 18 Feb 2007, Uri Even-Chen wrote:


I wrote:

 Anyone knows about good UTF-8 text editors?


Let me explain: I have to use it on Windows, and I need a search 
replace feature on many files simultaneously.


Emacs ?

Peter

=
To unsubscribe, send mail to [EMAIL PROTECTED] with
the word unsubscribe in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]



Re: Converting to UTF-8 encoding

2007-02-18 Thread Uri Even-Chen

On 2/18/07, Peter [EMAIL PROTECTED] wrote:

Emacs ?


Does it support UTF-8?  Does it support search  replace on many files
simultaneously?  Which version of Emacs (for Windows XP) do you
recommend?  And where do I download it?

Uri.

=
To unsubscribe, send mail to [EMAIL PROTECTED] with
the word unsubscribe in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]



Re: Converting to UTF-8 encoding

2007-02-18 Thread Moshe Gorohovsky

On Sun, 18 Feb 2007, Uri Even-Chen wrote:



On 2/18/07, Peter [EMAIL PROTECTED] wrote:

Emacs ?


Does it support UTF-8?  Does it support search  replace on many files
simultaneously?  Which version of Emacs (for Windows XP) do you
recommend?  And where do I download it?




Hi Uri,

Try VIM for WinXP with easy WinXP settings, it supports UTF-8
out of the box on WinXP with _all_ fonts installed.
http://mesh.dl.sourceforge.net/sourceforge/cream/cream-0-38-gvim-7-0-152.exe
Easy-vim settings have keyboards shortcuts like in notepad.

Or try regular VIM:
http://mesh.dl.sourceforge.net/sourceforge/cream/gvim-7-0-191.exe

- Moshe Gorohovsky

 A6 CC A7 E1 C2 BD 8C 1B  30 8E A4 C3 4C 09 88 47   Tk Open Systems Ltd.
 ---
  - tel: +972.2.679.5364, http://www.tkos.co.il -

=
To unsubscribe, send mail to [EMAIL PROTECTED] with
the word unsubscribe in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]



Re: Converting to UTF-8 encoding

2007-02-18 Thread Peter



On Sun, 18 Feb 2007, Uri Even-Chen wrote:


On 2/18/07, Peter [EMAIL PROTECTED] wrote:

Emacs ?


Does it support UTF-8?  Does it support search  replace on many files
simultaneously?  Which version of Emacs (for Windows XP) do you
recommend?  And where do I download it?

Uri.



http://math.claremontmckenna.edu/ALee/emacs/emacs.html



=
To unsubscribe, send mail to [EMAIL PROTECTED] with
the word unsubscribe in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]



Re: Converting to UTF-8 encoding

2007-02-18 Thread Ira Abramov
Quoting Uri Even-Chen, from the post of Sun, 18 Feb:
 
 I know notepad, but I agree it's not a good text editor, because it
 doesn't support search  replace (and you can open only one file each
 time).  But on the other hand, it does support Hebrew and UTF-8
 encoding (but only if the first character is something specific).

it's a HORRIBLE editor. there are quite a few drop-in replacements for
it, just search sourceforge. Notepad++ is one I got stuck with but
maybe there are better ones these days. Hebrew is far from perfect
there. gvim for windows may do a better job.

-- 
Minute man
Ira Abramov
http://ira.abramov.org/email/

=
To unsubscribe, send mail to [EMAIL PROTECTED] with
the word unsubscribe in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]



Re: Converting to UTF-8 encoding

2007-02-18 Thread Uri Even-Chen

Thanks for all your suggestions.  I tried VIM, XEmacs, EditPlus and
Notepad++.  I didn't like the Linux GUI of VIM and XEmacs and gave
them up.  EditPlus and Notepad++ are OK, but they have a few flaws in
them:

1. EditPlus has a feature of replace in all open files, but it
doesn't have a feature of find in all open files.  It supports
Hebrew, but when I type in Hebrew I see all the words in א (such as
 אאא).  Only when I refresh the window I see what I typed.

2. Notepad++ has both replace in all open files and find in all
open files features.  But it doesn't recognize Hebrew (Windows)
encoding.  It displays the Hebrew in such files as gibrish.

3. Both editors don't have the Notepad feature of aligning the text to
the right, which is very important when typing in Hebrew.  I guess
that I will have to keep using Notepad for Hebrew align-to-the-right
typing.

4. Both editors don't have a global undo feature for replace in all
open files.  You have to undo in each file manually.

So in general, both EditPlus and Notepad++ are OK for me, but far from
perfect.  I will start with Notepad++ since it's GPL  free, and maybe
I'll switch to EditPlus later.  I wouldn't mind paying if it didn't
have these flaws.

Uri.

On 2/18/07, Oded Arbel [EMAIL PROTECTED] wrote:


On Sun, 2007-02-18 at 10:38 +0200, Uri Even-Chen wrote:
 I wrote:
  Anyone knows about good UTF-8 text editors?

 Let me explain: I have to use it on Windows, and I need a search 
 replace feature on many files simultaneously.



For windows development, I use Editplus, which is a developer editor with lots and 
lots of tools (including search  replace in files). Its a shareware program 
which costs $30 for lifetime updates and I have to say that even though I rarely 
used it in the last few years (where I don't get many MS-Windows hours, maybe 2-3 
hours/month), I still would have bought it.


--
Oded
::..
Sex on TV is bad for you - you might fall down.




Converting to UTF-8 encoding

2007-02-17 Thread Uri Even-Chen

Dear Linux people,

In addition to my previous message, I decided it's about time to
convert my Hebrew websites from Windows-1255 encoding to Unicode
(UTF-8).  (By the way, is it a smart decision?)

Anyway, since I didn't find a better way to do it - that's how I did
it: I opened each file that contains Hebrew text (English files didn't
need any conversion) with Windows Notepad, saved it as UTF-8 encoding,
then FTP'd it to my Linux server, converted it to unix using dos2unix,
edited it with pico and removed the first 3 characters, which are
created by Notepad but don't work well with PHP (at least my version,
PHP 4.4.2), then I FTP'd it back and replaced the original file.  The
problem is - it takes too much time for each file, and I have hundreds
of files.  And also, Notepad doesn't recognize these files as UTF-8
encoded files.  Is there a way to do it simultaneously to hundreds of
files?

Now, when I view these files with SSH, I get all these strange symbols
['���×��× ��×�¨�×, ���×��× ×��× ��×�¢�(c)��,
�×�¨�×��× ¨��×, ���×��× � ��× ��(c),
�(c)�¢�¨���¨�¦��¤��', and that's a quote].  I also can't edit them
with the editors I currently use.  Do you know about good editors who
can edit UTF-8 encoded files, and also search and replace text within
many files simultaneously?  I also need the editor to run under
Windows, because that's the operating system I currently use, both at
home and at work (sorry).  And it has to be compatible with Linux and
PHP.

If you're interested - here's a website I converted:
http://www.pazgal.co.il/

And here's a website I didn't convert yet:
http://www.speedy.net/

By the way, one of the reasons I decided to convert the website to
UTF-8, is because Google displays the page descriptions backwards:
http://www.google.co.il/search?hl=iwq=site%3Apazgal.co.ilmeta=

Best Regards,
Uri Even-Chen


Re: Converting to UTF-8 encoding

2007-02-17 Thread Shlomi Fish
Hi Uri! (and all)

On Sunday 18 February 2007, Uri Even-Chen wrote:
 Dear Linux people,

 In addition to my previous message, I decided it's about time to
 convert my Hebrew websites from Windows-1255 encoding to Unicode
 (UTF-8).  (By the way, is it a smart decision?)

 Anyway, since I didn't find a better way to do it - that's how I did
 it: I opened each file that contains Hebrew text (English files didn't
 need any conversion) with Windows Notepad, saved it as UTF-8 encoding,
 then FTP'd it to my Linux server, converted it to unix using dos2unix,
 edited it with pico and removed the first 3 characters, which are
 created by Notepad but don't work well with PHP (at least my version,
 PHP 4.4.2), then I FTP'd it back and replaced the original file.  The
 problem is - it takes too much time for each file, and I have hundreds
 of files.  And also, Notepad doesn't recognize these files as UTF-8
 encoded files.  Is there a way to do it simultaneously to hundreds of
 files?


Maybe I'm missing something, but perhaps you should look at iconv:

http://www.gnu.org/software/libiconv/

You can use the following command to convert a single file:


iconv -f WINDOWS-1255 -t UTF-8 oldfile  newfile


And you can convert a group of files using a shell script or a find command. 
If you want to do it the Perl way then read perldoc Encode:

http://perldoc.perl.org/Encode.html

Regards,

Shlomi Fish

-
Shlomi Fish  [EMAIL PROTECTED]
Homepage:http://www.shlomifish.org/

Chuck Norris wrote a complete Perl 6 implementation in a day but then
destroyed all evidence with his bare hands, so no one will know his secrets.

=
To unsubscribe, send mail to [EMAIL PROTECTED] with
the word unsubscribe in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]



Re: Converting to UTF-8 encoding

2007-02-17 Thread Uri Even-Chen

On 2/18/07, Shlomi Fish [EMAIL PROTECTED] wrote:

You can use the following command to convert a single file:


iconv -f WINDOWS-1255 -t UTF-8 oldfile  newfile



Thanks!  It works!

Anyone knows about good UTF-8 text editors?

Uri.

=
To unsubscribe, send mail to [EMAIL PROTECTED] with
the word unsubscribe in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]