Re: Cygwin programs doesn't support non-ASCII filenames
On 2009-5-9 23:44, Corinna Vinschen wrote: On May 9 23:12, Lenik wrote: The same result, it shows that `cat' from binutils can support locale well, while `d' isn't. Ok, but that's not Cygwin's problem, just the d tool would need an update at one point, perhaps. OTOH, what you're doing is a bit borderline. When you start this stuff from cmd, you will have to enter the filename in the notation valid for the locale in which the application works. For d, which only works in the C locale, you would have to give the pathname using the SO/UTF-8 sequences. Right now I have no idea if there's a workaround for that, but keep in mind that we're at the beginning of real native language support. Unfortunately it's all a bit more complicated than on non-Windows systems, given the UTF-16-ness of the underlying system. I'd like to know if there is any build plan to upgrade tools like d, zip, unzip, jar, etc. to support locale settings, rather than C only. So I can tell customers when our cygwin-based scripts will work for Chinese path names. Or is there documented ways to build with locale support from source code? Currently the two tools `jar' and `unzip' which don't support locale settings prevent my scripts being widely deployed. Thank you Corinna, Lenik -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Re: Cygwin programs doesn't support non-ASCII filenames
On May 12 15:49, Lenik wrote: I'd like to know if there is any build plan to upgrade tools like d, zip, unzip, jar, etc. to support locale settings, rather than C only. So I can tell customers when our cygwin-based scripts will work for Chinese path names. That depends on the package maintainers and it will certainly not be done within just a couple of days. After all, the Cygwin distro is a volunteer effort. Corinna -- Corinna Vinschen Please, send mails regarding Cygwin to Cygwin Project Co-Leader cygwin AT cygwin DOT com Red Hat -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Re: Setup 1.7 2.621
On May 11 21:17, Karl M wrote: I was able to reproduce the symptom with another clean install on my XP Pro machine. The /etc/group file was interesting...different than the output of mkgroup. Attached are the /etc/passwd, /etc/group and mkgroup output. The /etc/passwd file is sanitized for user and machine names. The mkpasswd output matched the /etc/passwd contents. Looks normal. The group file contains a root group because it gets added by the postinstall script. mkgroup does not add that automatically. And files group owned by root are actually owned by the administrators group for hopefully obvious reasons: root:S-1-5-32-544:0: [...] Administrators:S-1-5-32-544:544: Corinna -- Corinna Vinschen Please, send mails regarding Cygwin to Cygwin Project Co-Leader cygwin AT cygwin DOT com Red Hat -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
WMIC
Hi, I´m trying to run the WMIC (WMI Command-line interface) via Cygwin to parse the results with grep, awk, etc. But, when I try to run it, the cursor indicates that the command is executing, but nothing is printed on screen and the cursor is not released. I maked a batch script (.cmd) which executes the WMIC and log the output into a log file. Then, I tried to call it with Cygwin, but the same behaviour happens. The command I want to execute is: WMIC CPU LIST FULL Anyone can help me to resolve the above problem? -- Att. Bruno Galindro da Costa bruno.galin...@gmail.com Florianópolis - SC -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
RE: Can't run Bash!
Yarin wrote on Monday, May 11, 2009 8:27 PM: I'm trying to compile Linux binaries on Windows. To do this, I've downloaded and installed Cygwin (along with it's GCC packages). You are a bit ambiguous here. Are you trying to compile linux binaries of POSIX code? For that you need a cross-compiler (about which I know nothing). Or are you trying to compile Windows binaries of POSIX code that is often found on Linux systems? In which case, you want to do something like this. $ tar -xvf package-ver.tar.gz $ cd package-ver $ ./configure $ make $ make install It doesn't recognize the command su, See the FAQ: Why doesn't su work? http://cygwin.com/faq/faq-nochunks.html#faq.using.su I'm confronted with this: bash-3.2$ _ and it doesn't begin with x...@x You are getting bash's default prompt. See $ man bash for how and where to customize your prompt. (Though Gary Johnson's advice about using a login shell may address the issue.) I hope this helps. - Barry -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Re: Cygwin programs doesn't support non-ASCII filenames
On May 9 23:12, Lenik wrote: (This mail is encoded in utf-8) [...] The two chinese characters encoding in: GB2312: d7 c0 c3 e6 UTF-8: e6 a1 8c e9 9d a2 Unicode: \u684c \u9762 [...] This is a new test don't use cygpath: C:\Profiles\Shecti set LANG= bash -c cat ?? cat: ??: No such file or directory I'm just looking into this issue and I do not quite understand how you came up with the filename in this example. Above you mention that the mail is in UTF-8. However, when I look into this email using `od -t x1', the multibyte sequence in your example is e4 bd a0 e5 a5 bd, rather than the aforementioned UTF-8 sequence e6 a1 8c e9 9d a2. Nor does it match the aforementioned GB2312 sequence d7 c0 c3 e6. Can you please explain how the multibyte sequence in the example is related to the above GB2312 and UTF-8 sequences? Corinna -- Corinna Vinschen Please, send mails regarding Cygwin to Cygwin Project Co-Leader cygwin AT cygwin DOT com Red Hat -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Re: WMIC
Bruno Galindro da Costa wrote: I´m trying to run the WMIC (WMI Command-line interface) via Cygwin to parse the results with grep, awk, etc. But, when I try to run it, the cursor indicates that the command is executing, but nothing is printed on screen and the cursor is not released. I maked a batch script (.cmd) which executes the WMIC and log the output into a log file. Then, I tried to call it with Cygwin, but the same behaviour happens. The command I want to execute is: WMIC CPU LIST FULL Anyone can help me to resolve the above problem? The following works for me in Cygwin console (notty, tty, and mintty) on XP: $ echo '' | wmic cpu list full or $ wmic cpu list full /dev/null -- Christian Franke -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Re: Cygwin programs doesn't support non-ASCII filenames
On 2009-5-12 16:30, Corinna Vinschen wrote: On May 12 15:49, Lenik wrote: I'd like to know if there is any build plan to upgrade tools like d, zip, unzip, jar, etc. to support locale settings, rather than C only. So I can tell customers when our cygwin-based scripts will work for Chinese path names. That depends on the package maintainers and it will certainly not be done within just a couple of days. After all, the Cygwin distro is a volunteer effort. Corinna Is there any hint on how to add locale support to existing packages at source code level? I guess that a specific package maintainer maybe work on a central version, so any change to `grep' or `unzip' for example, will be applied on that central version, so various distros like RHEL, Ubuntu, etc. can share the same changement? If so that is, my changement on `grep' may be required to be test/return-test on the various distros, and only if the package maintainer considers the changement won't break the consistency between the various distros, he will then accept the changes and release it to the cygwin? If I changed a specific package in cygwin distro, I shall send to that specific package maintainer, right? And I still think it may be better to add locale support in cygwin layer(maybe it's not enabled by default, though), if you write a program operates with path names, (most programs access files) you won't do anything about locale settings in common sense, and I didn't see there is necessary to setlocale or somewhat before fopen/fstat() operations in that famous APUX book. C:\Profiles\Shecti set LANG=zh_CN.GBK cat 你好 123 C:\Profiles\Shecti set LANG=C cat 你好 123 C:\Profiles\Shecti set LANG= cat 你好 cat: 你好: No such file or directory The default LANG isn't C neither GBK in codepage 936, I guess it is set to GB2312, but I'm not sure. How can I know the default LANG? Lenik -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
d tool doesn't support locale setting
(This mail is encoded in utf-8) bash-3.2$ ls .zip .gz bash-3.2$ echo $LANG bash-3.2$ export LANG=zh_CN.GBK bash-3.2$ ls 好.gz 你好 世界.zip bash-3.2$ d 世界.zip /mnt/c/Profiles/Shecti/lt/世界.zip doesn't exist! FYI 你 = \u4f60 好 = \u597d 世 = \u4e16 界 = \u754c -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
RE: Setup 1.7 2.621
Date: Tue, 12 May 2009 10:33:13 +0200 From: corinna Subject: Re: Setup 1.7 2.621 Looks normal. The group file contains a root group because it gets added by the postinstall script. mkgroup does not add that automatically. And files group owned by root are actually owned by the administrators group for hopefully obvious reasons: root:S-1-5-32-544:0: [...] Administrators:S-1-5-32-544:544: So what is the root group for and why is it added? Thanks, ...Karl _ Windows Live™: Keep your life in sync. http://windowslive.com/explore?ocid=TXT_TAGLM_BR_life_in_synch_052009 -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
zip, unzip doesn't support locale settings
(This mail is encoded in utf-8) bash-3.2$ ls .zip .gz bash-3.2$ echo $LANG bash-3.2$ export LANG=zh_CN.GBK bash-3.2$ zip a * zip warning: name not matched: 好.gz zip warning: name not matched: 你好 zip error: Nothing to do! (a.zip) bash-3.2$ unzip 世界.zip unzip: cannot find or open 世界.zip, 世界.zip.zip or 世界.zip.ZIP. FYI 你 = \u4f60 好 = \u597d 世 = \u4e16 界 = \u754c -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
gzip, gunzip doesn't support locale settings
(This mail is encoded in utf-8) bash-3.2$ ls .zip .gz bash-3.2$ echo $LANG bash-3.2$ export LANG=zh_CN.GBK bash-3.2$ ls 好.gz 你好 世界.zip bash-3.2$ gzip 你好 gzip: 你好: No such file or directory bash-3.2$ gunzip 好.gz gzip: 好.gz: No such file or directory FYI 你 = \u4f60 好 = \u597d 世 = \u4e16 界 = \u754c -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Re: Cygwin programs doesn't support non-ASCII filenames
On 2009-5-12 21:54, Corinna Vinschen wrote: On May 9 23:12, Lenik wrote: (This mail is encoded in utf-8) [...] The two chinese characters encoding in: GB2312: d7 c0 c3 e6 UTF-8: e6 a1 8c e9 9d a2 Unicode: \u684c \u9762 [...] This is a new test don't use cygpath: C:\Profiles\Shecti set LANG= bash -c cat ?? cat: ??: No such file or directory I'm just looking into this issue and I do not quite understand how you came up with the filename in this example. Above you mention that the mail is in UTF-8. However, when I look into this email using `od -t x1', the multibyte sequence in your example is e4 bd a0 e5 a5 bd, rather than the aforementioned UTF-8 sequence e6 a1 8c e9 9d a2. Nor does it match the aforementioned GB2312 sequence d7 c0 c3 e6. Can you please explain how the multibyte sequence in the example is related to the above GB2312 and UTF-8 sequences? Corinna Sorry, there are two examples, the first using 桌面, and the second using 你好. You may test either. 桌面:e6 a1 8c e9 9d a2, GB2312=d7 c0 c3 e6 你好:e4 bd a0 e5 a5 bd, GB2312=c4 e3 ba c3 Thanks, Lenik -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
[1.7] wcwidth failing configure tests
I noticed this failure in various configure scripts (findutils, coreutils, ...): checking whether wcwidth works reasonably in UTF-8 locales... no I've reduced it to a STC: #include locale.h #include wchar.h int main () { int i = 0; if (setlocale (LC_ALL, fr_FR.UTF-8) != NULL) { if (wcwidth (0x0301) 0) i |= 1; if (wcwidth (0x200B) 0) i |= 2; } return i; } The return value should be 0 but is coming back as 3; 0x0301 is a combining mark which should occupy no space on its own, and 0x200b is a 0-width space, according to Unicode 5.1 (and earlier, to some extent). And that probably means that other places within wcwidth() are broken. -- Eric Blake -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Re: 2 questions about fonts
Oops, rush to judgment here. Caught in the old assumption trap--- I thought that if it worked for gs then it would follow that it should work for gv! Silly me!! However both will work with a symlink to the appropiate .PFB. So a change to the script: $ cat fonts.pl #!/usr/bin/perl # # fonts.pl -- create list of fonts for fontmap.GS from commandline use strict; use warnings; use PostScript::Font; use File::Basename; our $VERSION = '1.01'; my @files; for (@ARGV) { push(@files,glob($_)); } print \n% -- Additional User Fonts --\n\n; for (@files) { my $info = new PostScript::Font ($_,format = 'pfb'); my $filename = basename($info-FileName()); print /,$info-FontName(),\t\t($filename)\t;\n; } Now I cleaned out the full path version of my fonts from Fontmap.GS and then ran: $ perl ./fonts.pl /cygdrive/c/windows/fonts/*.PFB | cat /usr/share/ghostscript/8.63/lib/Fontmap.GS Next I created the symlinks in /usr/share/ghostscript/fonts with the following: $ ln /cygdrive/c/windows/fonts/*.PFB . At this point I have a working gs and gv. This makes writing raw postscript about as easy as it is going to get. --hsm On Mon, May 11, 2009 at 7:32 PM, Hugh Myers hsmy...@gmail.com wrote: The primary answer is yes. And since my belief is that lpr goes through ghostscript for Postscript evaluation, things all resolve down to getting ghostscript to find wanted fonts. After a great deal of trial and error, it came down to a single file: Fontmap.GS. On my system this is found at /usr/share/ghostscript/8.63/lib-- your mileage may vary. Modifying this file leads to the desired result: for instance in order for the /BriemMono line in gs to work, you need an entry in Fontmap.GS that looks like: /BriemMono (/cygdrive/c/windows/fonts/BRIEM___.PFB) ; Two things to note--- the whitespace you see is comprised of tabs (old tradition in unix, possible makes no difference, but still...) and the location of the file (which may be different for you) is couched in cygwin terms; /cygdrive/c/ etc. There after font name resolution is no longer a problem. Actually, since annotation by hand would in my case be a very large PIA (190 fonts to add), I wrote a small script that creates output suitable for concatenation with Fontmap.GS: #!/usr/bin/perl # # fonts.pl -- create list of fonts for fontmap.GS from commandline use strict; use warnings; use PostScript::Font; my @files; for (@ARGV) { push(@files,glob($_)); } print \n% -- Additional User Fonts --\n\n; for (@files) { my $info = new PostScript::Font ($_,format = 'pfb'); print /,$info-FontName(),\t\t(,$info-FileName(),)\t;\n; } Given the above, the following command gratifies greatly! perl ./fonts.pl /cygdrive/c/windows/fonts/*.PFB | cat /usr/share/ghostscript/8.63/lib/Fontmap.GS Problem solved, back to work;) On Fri, May 8, 2009 at 3:37 AM, Hugh Myers hsmy...@gmail.com wrote: I've a great many postscript fonts installed under Windows XP--- is there a way to acquaint cygwin of these? How are font names resolved for lpr? For instance in a non cygwin situation, I might have a line in a postscript file such as: /BriemMono findfont 8 scalefont setfont (typeset these words) show. This runs without problems, I'd like to be able to do something similar using cygwin (obviously using cygwin installed fonts). I am somewhat clueless here, hence these questions. Thanks for any help... --hsm -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Re: WMIC
Christian, It Works perfectly for both! Can you explain why that commands works and not only wmic cpu list full? Thank you very much! 2009/5/12 Christian Franke christian.fra...@t-online.de: Bruno Galindro da Costa wrote: I´m trying to run the WMIC (WMI Command-line interface) via Cygwin to parse the results with grep, awk, etc. But, when I try to run it, the cursor indicates that the command is executing, but nothing is printed on screen and the cursor is not released. I maked a batch script (.cmd) which executes the WMIC and log the output into a log file. Then, I tried to call it with Cygwin, but the same behaviour happens. The command I want to execute is: WMIC CPU LIST FULL Anyone can help me to resolve the above problem? The following works for me in Cygwin console (notty, tty, and mintty) on XP: $ echo '' | wmic cpu list full or $ wmic cpu list full /dev/null -- Christian Franke -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/ -- Att. Bruno Galindro da Costa bruno.galin...@gmail.com Florianópolis - SC -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
[Fwd: [1.7] wcwidth failing configure tests]
Forwarded to newlib. - Forwarded message from Eric Blake - Date: Tue, 12 May 2009 16:02:04 + (UTC) From: Eric Blake Subject: [1.7] wcwidth failing configure tests To: cygwin AT cygwin DOT com I noticed this failure in various configure scripts (findutils, coreutils, ...): checking whether wcwidth works reasonably in UTF-8 locales... no I've reduced it to a STC: #include locale.h #include wchar.h int main () { int i = 0; if (setlocale (LC_ALL, fr_FR.UTF-8) != NULL) { if (wcwidth (0x0301) 0) i |= 1; if (wcwidth (0x200B) 0) i |= 2; } return i; } The return value should be 0 but is coming back as 3; 0x0301 is a combining mark which should occupy no space on its own, and 0x200b is a 0-width space, according to Unicode 5.1 (and earlier, to some extent). And that probably means that other places within wcwidth() are broken. - End forwarded message - wcwidth returns 1 if iswprint returns true. I had a quick debug attempt and it turns out that the entire range 0x0300..0x034f is marked as printable in the u3 array in libc/ctype/utf8print.h. The entire range 0x0300..0x034f are combining characters which are printable, but have zero width. 200b..200d are all three zero-width characters but all three are also printable. Scanning the Unicode 5.1 standard, I see a couple of these characters, which are printable but have zero width: 0300..036f 0483..0489 200b..200f 20d0..20ea 3099..309a fe20..fe23 (not sure about them. Each of them is the half of a full combined char which doesn't make sense alone, afaics) feff and a couple of musical symbols in the 0x1d1xx range How can we fix this problem? Should we hardcode a check for the above character values in wcwidth? And here's another question. The utf8*.h files claim they have been generated from the unicode.txt file of the Unicode 3.2 standard. Do we have the script which generated the utf8*.h files? Can we regenerate the files to match the current Unicode 5.1 standard? Corinna -- Corinna Vinschen Please, send mails regarding Cygwin to Cygwin Project Co-Leader cygwin AT cygwin DOT com Red Hat -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Re: Setup 1.7 2.621
On May 12 07:59, Karl M wrote: Date: Tue, 12 May 2009 10:33:13 +0200 From: corinna Subject: Re: Setup 1.7 2.621 Looks normal. The group file contains a root group because it gets added by the postinstall script. mkgroup does not add that automatically. And files group owned by root are actually owned by the administrators group for hopefully obvious reasons: root:S-1-5-32-544:0: [...] Administrators:S-1-5-32-544:544: So what is the root group for and why is it added? I don't recall the orignal reason to add it, but it's a useful fix point in terms of name and gid. The Administrators group can have different names in different languages. Corinna -- Corinna Vinschen Please, send mails regarding Cygwin to Cygwin Project Co-Leader cygwin AT cygwin DOT com Red Hat -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Re: [Fwd: [1.7] wcwidth failing configure tests]
And here's another question. The utf8*.h files claim they have been generated from the unicode.txt file of the Unicode 3.2 standard. Do we have the script which generated the utf8*.h files? Can we regenerate the files to match the current Unicode 5.1 standard? There's Markus Kuhn's wcwidth implementation, which says it's based on Unicode 5.0: http://www.cl.cam.ac.uk/~mgk25/ucs/wcwidth.c Trouble is, there's the thorny issue of the CJK Ambiguous Width category of characters, which consists of things like Greek and Cyrillic letters as well as line drawing symbols. Those have a width of 1 in Western use, yet with CJK fonts they have a width of 2. That's why Markus Kuhn's code includes the mk_wcswidth_cjk() variant. Andy -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Re: Cygwin cat command failure for large files
CHATURVEDI PRABUDDHA wrote: Hi All, While accessing a file through rsh on my system and using cat command on that file. if file is less than 64 kb ... command works else i get a error cat: write error: No space left on device Is this known problem in cygwin. Kindly help I am using CYGWIN_NT-5.1 p9 1.5.25(0.156/4/2) 2007-12-14 19:21 i686 Cygwin on winXP See the problem reporting guidelines here: Problem reports: http://cygwin.com/problems.html How does it work for you without 'rsh' in the picture? It worked fine for me with a file of several hundred MB locally and through 'ssh' (i.e. 'ssh localhist cat /tmp/foo'). I don't use 'rsh'. -- Larry Hall http://www.rfk.com RFK Partners, Inc. (508) 893-9779 - RFK Office 216 Dalton Rd. (508) 893-9889 - FAX Holliston, MA 01746 _ A: Yes. Q: Are you sure? A: Because it reverses the logical flow of conversation. Q: Why is top posting annoying in email? -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
[1.7] Proposal: the filename encoding in C locale uses UTF-8 instead of SO/UTF-8
Hi. I propose that the filename encoding in C locale uses UTF-8 instead of SO/UTF-8. There are three reasons: 1. for the interoperability between Cygwin and various UNIX-like systems (Linux, *BSD, Solaris, and so on). UNIX-like systems treat the filename as 8bit byte array, and many applications on the systems send or receive filename information without locale. (mercurial, git, rsync, and so on). 2. UTF-8 is the only encoding that can treat multi languages. 3. Today, the default encoding of modern UNIX-like systems is UTF-8. Please examine it. Thanks. -- IWAMURO Motnori http://vmi.jp/ -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Re: [Fwd: [1.7] wcwidth failing configure tests]
On May 12 17:56, Andy Koppe wrote: And here's another question. The utf8*.h files claim they have been generated from the unicode.txt file of the Unicode 3.2 standard. Do we have the script which generated the utf8*.h files? Can we regenerate the files to match the current Unicode 5.1 standard? There's Markus Kuhn's wcwidth implementation, which says it's based on Unicode 5.0: http://www.cl.cam.ac.uk/~mgk25/ucs/wcwidth.c This looks nice. Trouble is, there's the thorny issue of the CJK Ambiguous Width category of characters, which consists of things like Greek and Cyrillic letters as well as line drawing symbols. Those have a width of 1 in Western use, yet with CJK fonts they have a width of 2. That's why Markus Kuhn's code includes the mk_wcswidth_cjk() variant. We should use the standard variation alone, imho. And we need some workaround for UTF-16 systems like Cygwin. Unfortunately, surrogate pairs only work well as part of a string, not as standalone chars. So wcwidth would return -1 for each single char, but wcswidth could be tweaked to handle them gracefully. Corinna -- Corinna Vinschen Cygwin Project Co-Leader Red Hat -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Re: [1.7] Proposal: the filename encoding in C locale uses UTF-8 instead of SO/UTF-8
On May 13 02:29, IWAMURO Motonori wrote: Hi. I propose that the filename encoding in C locale uses UTF-8 instead of SO/UTF-8. There are three reasons: 1. for the interoperability between Cygwin and various UNIX-like systems (Linux, *BSD, Solaris, and so on). UNIX-like systems treat the filename as 8bit byte array, and many applications on the systems send or receive filename information without locale. (mercurial, git, rsync, and so on). 2. UTF-8 is the only encoding that can treat multi languages. 3. Today, the default encoding of modern UNIX-like systems is UTF-8. That's an interesting thought. Do you have a patch and, if so, did you try it? Does it, for instance, help for the issue reported in the thread starting at http://cygwin.com/ml/cygwin/2009-05/msg00245.html? Corinna -- Corinna Vinschen Please, send mails regarding Cygwin to Cygwin Project Co-Leader cygwin AT cygwin DOT com Red Hat -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Re: [1.7] Proposal: the filename encoding in C locale uses UTF-8 instead of SO/UTF-8
On May 13 02:29, IWAMURO Motonori wrote: Hi. I propose that the filename encoding in C locale uses UTF-8 instead of SO/UTF-8 What the heck is SO/UTF-8? -- Mark J. Reed markjr...@gmail.com -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Re: [1.7] Proposal: the filename encoding in C locale uses UTF-8 instead of SO/UTF-8
On May 12 15:13, Mark J. Reed wrote: On May 13 02:29, IWAMURO Motonori wrote: Hi. I propose that the filename encoding in C locale uses UTF-8 instead of SO/UTF-8 What the heck is SO/UTF-8? http://cygwin.com/1.7/cygwin-ug-net/using-specialnames.html#pathnames-unusual Corinna -- Corinna Vinschen Please, send mails regarding Cygwin to Cygwin Project Co-Leader cygwin AT cygwin DOT com Red Hat -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Re: [1.7] Proposal: the filename encoding in C locale uses UTF-8 instead of SO/UTF-8
On Tue, May 12, 2009 at 3:22 PM, Corinna Vinschen http://cygwin.com/1.7/cygwin-ug-net/using-specialnames.html#pathnames-unusual OK, got it. So Mr. Iwamuro's proposal is that Cygwin ignore the locale setting, and just automatically convert the Windows UTF-16 filenames to UTF-8 (and back) no matter what. That seems rife with possible confusion, though. If I have my codepage set to ISO-2022 and paste in a filename, I expect it to be interpreted as ISO-2022, not as UTF-8 (which will probably fail with an invalid encoding sequence). OTOH, the SO/UTF-8 hack would seem to bode ill for the portability of, say, tar archives created under Cygwin. -- Mark J. Reed markjr...@gmail.com -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Re: [1.7] Proposal: the filename encoding in C locale uses UTF-8 instead of SO/UTF-8
On May 12 15:53, Mark J. Reed wrote: On Tue, May 12, 2009 at 3:22 PM, Corinna Vinschen http://cygwin.com/1.7/cygwin-ug-net/using-specialnames.html#pathnames-unusual OK, got it. So Mr. Iwamuro's proposal is that Cygwin ignore the locale setting, and just automatically convert the Windows UTF-16 filenames to UTF-8 (and back) no matter what. No. Only if LANG=C. That seems rife with possible confusion, though. If I have my codepage set to ISO-2022 and paste in a filename, I expect it to be interpreted Cygwin 1.7 doesn't use the codepage. It uses what $LANG says. See http://cygwin.com/1.7/cygwin-ug-net/setup-locale.html as ISO-2022, not as UTF-8 (which will probably fail with an invalid encoding sequence). OTOH, the SO/UTF-8 hack would seem to bode ill for the portability of, say, tar archives created under Cygwin. The filenames potentially look weird, but they are valid filenames. If anybody has a better idea how to workaround the problem of UTF-16 chars which don't translate into the current singlebyte or multibyte charset, feel free to suggest. Corinna -- Corinna Vinschen Please, send mails regarding Cygwin to Cygwin Project Co-Leader cygwin AT cygwin DOT com Red Hat -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Selling management on Cygwin
Hi, First time post. Believe I have read and carried out all specified do this before posting guidelines. Ok. I work for a 5-person company whose IT infrastructure is exclusively Windows Server-based, and whose mindset is very narrowly Microsoftian. I prefer *nix. Four months ago I quietly created a Windows Server 2003 machine running in a VM on a test box, installed Cygwin, and have been successfully writing running tools (mostly Perl) all this time. Now I want to persuade management to let me install Cygwin directly on the main Server 2003 box. This is not only for better interactive performance (I work remotely and need to go through one extra screen- scraper layer to get to my current Cygwin command line), but also to access some directories on the main box that aren't being shared and, consequently, can't be accessed from my current Cygwin. I expect to be met with plenty of FUD. I honestly don't know what kind of concerns arguments will be raised, but I feel certain they will be garden variety. However, since I'm not a management or IT type, nor a Windows expert, nor a Cygwin expert, I am unprepared to argue the case. If someone could help, perhaps by briefly explaining what it is they're worried about, and why they needn't be, I would greatly appreciate it. Alternately, a link to an article would be nice (I haven't found any so far). In some ways this is more of an issue about open source software in general, but I'm sure there will be questions specifically about Cygwin and the extent to which it touches Windows OS innards. Any guidance would be helpful! Thanks, Chap Harrison -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
RE: Selling management on Cygwin
I expect to be met with plenty of FUD. I honestly don't know what kind of concerns arguments will be raised, but I feel certain they will be garden variety. However, since I'm not a management or IT type, nor a Windows expert, nor a Cygwin expert, I am unprepared to argue the case. Presumably they care about keeping the company profitable, argue cost benefit in terms of whatever your company does. I know someone else had to sell me on cygwin at first. I hadn't had a programming-only job probably since high school and found myself doing java for media as the mainline task. Basically I didn't care one way or the other but I eventually figured out how much more useful it is for things I want to do and how big a problem graphics can be for information processing, even when trying to assess and test audio or video you try to reduce the media to a tractable set of numbers ( distortion for example) and the big thing with Windoze, as you can tell by the name, is the UI and graphics. I've personally been more worried about government or other non-profit information providing sites making their information available in a versatile form ( usually this means text without a bunch of graphics and something that works well with cygwin ) but a similar audience is involved. You can find some of my arguments online if they address topics relevant to your needs. The argument here is compelling public interest which may or may not help you. I've used cygwin to do custom processing of everything from financial and business data to medical literature and DNA sequence analyses. I'm not sure what alternatives there are within Windoze, except maybe the DOS prompt and whatever you can run from that. If someone could help, perhaps by briefly explaining what it is they're worried about, and why they needn't be, I would greatly appreciate it. Alternately, a link to an article would be nice (I haven't found any so far). In some ways this is more of an issue about open source software in general, but I'm sure there will be questions specifically about Cygwin and the extent to which it touches Windows OS innards. Any guidance would be helpful! _ Windows Live™: Keep your life in sync. http://windowslive.com/explore?ocid=TXT_TAGLM_BR_life_in_synch_052009 -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Re: WMIC
Bruno Galindro da Costa wrote: Christian, It Works perfectly for both! Can you explain why that commands works and not only wmic cpu list full? 2009/5/12 Christian Franke Christian.Franke Bruno Galindro da Costa wrote: I´m trying to run the WMIC (WMI Command-line interface) via Cygwin to parse the results with grep, awk, etc. But, when I try to run it, the cursor indicates that the command is executing, but nothing is printed on screen and the cursor is not released. I maked a batch script (.cmd) which executes the WMIC and log the output into a log file. Then, I tried to call it with Cygwin, but the same behaviour happens. The command I want to execute is: WMIC CPU LIST FULL Anyone can help me to resolve the above problem? The following works for me in Cygwin console (notty, tty, and mintty) on XP: $ echo '' | wmic cpu list full or $ wmic cpu list full /dev/null So, are some of you guys using a DOS console, and some using rxvt/xterm/whatever gui console? cheers, DaveK -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
gs--yes, gv--yes, lpr--no Font problems still...
After getting both gs and gv to find and use the .PFB fonts on my system I am now stumped by lpr. I had thought that lpr would use ghostscripts font mechanism to resolve font references in a postscript file, but apparently not. While both of the other utilities correctly find and display the desired font, lpr defaults to some version of courier (same thing others did before I found a fix.) Could anyone explain this? Don't need much, but do need to be drop-kicked in the right direction. Thanks in advance... --hsm -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Re: gs--yes, gv--yes, lpr--no Font problems still...
Hugh Myers wrote: After getting both gs and gv to find and use the .PFB fonts on my system I am now stumped by lpr. I had thought that lpr would use ghostscripts font mechanism to resolve font references in a postscript file, but apparently not. While both of the other utilities correctly find and display the desired font, lpr defaults to some version of courier (same thing others did before I found a fix.) Could anyone explain this? Don't need much, but do need to be drop-kicked in the right direction. Thanks in advance... lpr uses whatever is the default printer for Windows, if that printer understands postscript or not, and if the application or driver sends the fonts or not when needed, is part of the printer configuration. What usually works with no problem is letting GS control the printing, that is setting GhostScript as a port (with the printer as parameter). I haven't done this in a long time; very similar to what you do on Unix to use PS with a plain printer. -- René Berber -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Re: GREP: Memory Exhausted
On Tue, May 12, 2009 at 03:42:15AM +0100, Dave Korn wrote: Christopher Faylor wrote: On Mon, May 11, 2009 at 11:55:26PM +0100, Dave Korn wrote: Some weeks ago... Shailesh Dadure wrote: I am a support engineer from Microsoft trying to help my Customer Maziyar Samadzadeh. We have been notified by Maziyar that when they perform a Query on a bigger database using GREP we get the following error GREP: Memory Exhausted Just to tie up the loose ends of this thread, which kind of petered out without any resolution: I've now notified Shailesh off-list about the apparent bug we uncovered in the WriteConsole API(*), and suggested he show the testcase around internally at Microsoft. I'll keep the list posted if anything interesting transpires. It didn't really peter out. This is a well-known problem even on linux where it is trivially easy to trigger. It doesn't necessarily have anything to do with the console. http://www.cygwin.com/ml/cygwin/2009-04/msg00374.html Well, I couldn't trigger any kind of anomalous memory behaviour from grepping through a few gigs of binary data with Cygwin grep-2.5.3-1; usage stayed rock steady at around 3.5MB. So I took a guess that Shailesh might have had the same problem that was giving me the same symptom. Maybe with a bit of luck we can get the console fixed, you never know. You apparently haven't taken the advice of googling the problem. It really is very easy to trigger on linux. cgf -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Re: zip, unzip doesn't support locale settings
On Tue, May 12, 2009 at 10:58:00PM +0800, Lenik wrote: (This mail is encoded in utf-8) What is the point of three separate messages when you've already made the point in another thread? cgf -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Re: GREP: Memory Exhausted
Christopher Faylor wrote: You apparently haven't taken the advice of googling the problem. Sure I did. http://img23.imageshack.us/img23/8914/sureidid.png Those links didn't go purple all by themselves. It really is very easy to trigger on linux. Which of the many different bugs that can cause this error message when running grep on Linux are you referring to? It wasn't obvious to me. cheers, DaveK -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
Cygwin 1.7 new install /bin permissions
Hi All... On a new 1.7 install I noticed $ find /bin -perm 644 -print /bin/d2u.exe /bin/dos2unix.exe /bin/gawk-3.1.6.exe /bin/perlthanks5.10.0 /bin/pgawk-3.1.6.exe /bin/pstruct5.10.0 /bin/s2p5.10.0 /bin/u2d.exe /bin/unix2dos.exe /bin/zipinfo.exe Thanks, ...Karl _ Hotmail® goes with you. http://windowslive.com/Tutorial/Hotmail/Mobile?ocid=TXT_TAGLM_WL_HM_Tutorial_Mobile1_052009 -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/