Re: MARC::Charset 1.34

2013-02-11 Thread Galen Charlton
Hi, On Mon, Feb 11, 2013 at 12:12 PM, Leif Andersson wrote: > So - however it happened - the "corrupt" file had been gzipped twice. > > At least a partial explanation to this mystery :) > Wild. Thanks for chasing down further. Regards, Galen -- Galen Charlton gmcha...@gmail.com

Re: MARC::Charset 1.34

2013-02-11 Thread Leif Andersson
Ah, I still had one of those bad archives left, so I could see that the file size differed. The bad .tar.gz was 182,056 bytes; the good one, 191,865. And after unzipping the corrupt file I got a .tar file of size 191,865! Gzipped twice? Well, I renamed the unzipped corrupt .tar file to .tar.gz And now
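The size comparison above suggests a quick check for any suspect download: count how many gzip layers wrap the payload. The following is a minimal Python sketch, not anything taken from the thread. The helper name `gzip_layers` is hypothetical, and the archive is built in memory purely for illustration. It relies only on the gzip magic bytes (0x1f 0x8b) that start every gzip stream.

```python
import gzip
import io
import tarfile

GZIP_MAGIC = b"\x1f\x8b"  # first two bytes of every gzip stream


def gzip_layers(data):
    """Peel gzip layers; return (number of layers, innermost payload)."""
    layers = 0
    while data[:2] == GZIP_MAGIC:
        data = gzip.decompress(data)
        layers += 1
    return layers, data


# Build a tiny tar archive in memory (illustrative stand-in for a real dist).
buf = io.BytesIO()
with tarfile.open(fileobj=buf, mode="w") as tar:
    payload = b"hello\n"
    info = tarfile.TarInfo(name="README")
    info.size = len(payload)
    tar.addfile(info, io.BytesIO(payload))
tar_bytes = buf.getvalue()

good = gzip.compress(tar_bytes)  # a normal .tar.gz
bad = gzip.compress(good)        # the same archive gzipped a second time

print(gzip_layers(good)[0])  # 1
print(gzip_layers(bad)[0])   # 2
```

A result of 2 would match the symptom reported here: the first gunzip succeeds, but what comes out is still a .tar.gz rather than a tar file.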

Re: MARC::Charset 1.34

2013-02-11 Thread Galen Charlton
Hi, On Mon, Feb 11, 2013 at 11:52 AM, Leif Andersson wrote: > Corrupt tar file RESOLVED. > Thanks for the update. I'm inclined to agree with your guess that a mirror was corrupt. Out of curiosity, are you using ActiveState or Strawberry? Regards, Galen -- Galen Charlton gmcha...@gmail.com

Re: MARC::Charset 1.34

2013-02-11 Thread Leif Andersson
uari 2013 20:01 To: Leif Andersson Cc: perl4lib Subject: Re: MARC::Charset 1.34 Hi, On Mon, Feb 11, 2013 at 10:50 AM, Leif Andersson <leif.anders...@sub.su.se> wrote: It gunzips fine, but then there seems to be something wrong with the tar file... Could you elaborate? In particul

Re: MARC::Charset 1.34

2013-02-11 Thread Galen Charlton
Hi, On Mon, Feb 11, 2013 at 10:50 AM, Leif Andersson wrote: > It gunzips fine, but then there seems to be something wrong with the tar > file... > Could you elaborate? In particular, what platform are you on and what error message are you getting? I tried installing MARC::Charset 1.34 via a 'c

Re: MARC::Charset 1.34

2013-02-11 Thread Leif Andersson
It gunzips fine, but then there seems to be something wrong with the tar file... /Leif Andersson Stockholm University Library

Re: MARC::Charset 1.33 released

2011-08-05 Thread Ed Summers
Galen, thanks very much for continuing to develop MARC::Charset. You should feel free to update the Makefile.PL and README to list you as the author now, since you have taken an active role in maintaining it. One of the most gratifying parts of my work as a software developer has been seeing these

RE: MARC::Charset 'utf8_to_marc8'

2007-09-18 Thread Doran, Michael D
Hi Laurence, > I'm trying to create MARC records from serials data exported > from SFX, using MARC::Charset version 0.98 to convert UTF-8 > strings to MARC-8. It seems to be failing on extended latin > characters like U+00C5 CAPITAL LETTER A WITH RING ABOVE The encoding, U+00C5 (CAPITAL LETTE

Re: MARC::Charset question

2007-05-20 Thread moconnor59
Thanks for the info (I'm unable to choose font Arial Unicode MS so I still can't display the data properly in my browser, but that's not a problem). Here's a small script that reproduces the problem I'm interested in: use MARC::Charset qw(marc8_to_utf8); binmode STDOUT, ":utf8"; my $marc8

RE: MARC::Charset question

2007-05-18 Thread Doran, Michael D
817-688-1926 mobile # [EMAIL PROTECTED] # http://rocky.uta.edu/doran/ > -----Original Message----- > From: Doran, Michael D > Sent: Friday, May 18, 2007 1:17 PM > To: perl4lib@perl.org > Subject: RE: MARC::Charset question > > Hi Michael, > > > An example is t

Re: MARC::Charset question

2007-05-18 Thread Ed Summers
Michael, would you be willing to work with me to come up with an automated test case to see if this is a problem w/ MARC::Charset? //Ed

RE: MARC::Charset question

2007-05-18 Thread Doran, Michael D
Hi Michael, > An example is the author (personal name) of the book that can > be found at http://catalog.loc.gov/ by searching for ISBN > 5040039875 (I'm guessing the fact that the website appears to > be displaying a corrupted name may be part of the problem here). The Library of Congress cat

Re: MARC::Charset

2007-03-14 Thread Ashley Sanders
Michael, So, basically, you either need prior knowledge about the actual character encoding used, or you have to test. Testing for UTF-8 is fairly straightforward... How are you testing for UTF-8? There's a handy perl regexp on the W3C web site at: http://www.w3.org/International/questi
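One common way to run the test discussed above is to attempt a strict UTF-8 decode and see whether it fails. The Python sketch below illustrates that approach; it is a different technique from the W3C regular expression mentioned in the message, and the function name `looks_like_utf8` is hypothetical.

```python
def looks_like_utf8(raw):
    """Return True if the byte string decodes cleanly as UTF-8."""
    try:
        raw.decode("utf-8", errors="strict")
    except UnicodeDecodeError:
        return False
    return True


# "ergò" encoded as UTF-8 passes the check.
print(looks_like_utf8("erg\u00f2".encode("utf-8")))  # True

# The byte 0xE1 followed by a plain "o" is not a valid UTF-8 sequence.
print(looks_like_utf8(b"erg\xe1o"))  # False
```

Pure ASCII data passes this check too, and it is also valid MARC-8, so a positive result only proves something when the record contains non-ASCII bytes.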

RE: MARC::Charset

2007-03-14 Thread Doran, Michael D
ent: Wednesday, March 14, 2007 4:59 AM > Cc: perl4lib > Subject: Re: MARC::Charset > > > Your MARC records appear to be encoded in MARC-8 as evidenced by > > "ergáo" in which the combining accent character comes before the > > character to be modified. I.

RE: MARC::Charset

2007-03-14 Thread Doran, Michael D
du/doran/ > -----Original Message----- > From: Henri-Damien LAURENT [mailto:[EMAIL PROTECTED] > Sent: Wednesday, March 14, 2007 4:18 AM > To: Doran, Michael D; perl4lib > Subject: Re: MARC::Charset > > Doran, Michael D wrote: > > Hi Henri, > > > > Althou

Re: MARC::Charset

2007-03-14 Thread Ashley Sanders
Your MARC records appear to be encoded in MARC-8 as evidenced by "ergáo" in which the combining accent character comes before the character to be modified. I.e. the byte string that displays as "ergáo" in your email would display as "ergò" (with a Latin small letter o with grave) in a MARC-8 a
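The ordering difference described above can be made concrete with a toy converter. This is a minimal Python sketch and not MARC::Charset's implementation. The one-entry table `MARC8_COMBINING` and the function `toy_marc8_to_unicode` are hypothetical, and the mapping of byte 0xE1 to a combining grave accent is an assumption drawn from the "ergáo" / "ergò" example in the message. Only ASCII base characters are handled.

```python
import unicodedata

raw = b"erg\xe1o"

# Read as Latin-1, byte 0xE1 is a precomposed a-acute, hence the odd display.
print(raw.decode("latin-1"))  # ergáo

# Hypothetical one-entry table (assumption): 0xE1 is a combining grave accent.
MARC8_COMBINING = {0xE1: "\u0300"}


def toy_marc8_to_unicode(data):
    """Move each combining mark after the base character it modifies."""
    out = []
    pending = []  # marks seen but not yet attached to a base character
    for byte in data:
        if byte in MARC8_COMBINING:
            pending.append(MARC8_COMBINING[byte])
        else:
            out.append(chr(byte))  # toy: ASCII base characters only
            out.extend(pending)    # Unicode wants marks after the base
            pending.clear()
    out.extend(pending)  # keep any dangling marks instead of dropping them
    return unicodedata.normalize("NFC", "".join(out))


print(toy_marc8_to_unicode(raw))  # ergò
```

The final NFC normalization composes the base letter and the accent into a single code point (U+00F2), which is the form most displays and comparisons expect.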

Re: MARC::Charset

2007-03-14 Thread Henri-Damien LAURENT
Doran, Michael D wrote: > Hi Henri, > > Although in my email client, the character in question appears as a MICRO > SIGN ("µ"), I am assuming that it is actually meant to be a LOWERCASE DIGRAPH > AE ("æ") since that is consistent with the Latin vernacular text in your > record. In MARC-8,

RE: MARC::Charset

2007-03-13 Thread Doran, Michael D
Hi Henri, > MARC::Charset ... fails on each µ character. > ad Scripturµ sensum Although in my email client, the character in question appears as a MICRO SIGN ("µ"), I am assuming that it is actually meant to be a LOWERCASE DIGRAPH AE ("æ") since that is consistent with the Latin vernacular tex

Re: MARC::Charset and transcoding of MARC::Record objects

2006-09-25 Thread bargioni
On 25 Sep 2006, at 12:38, Edward Summers wrote: How would you guys go about transcoding a whole MARC record, contained in a MARC::Record object, from MARC8 to UTF8? I'd probably extend marc8_to_utf8() in MARC::Charset so that it not only transcoded strings, but records as well. The function would

Re: MARC::Charset and transcoding of MARC::Record objects

2006-09-25 Thread Edward Summers
On Sep 22, 2006, at 6:06 AM, Leif Andersson wrote: How would you guys go about transcoding a whole MARC record, contained in a MARC::Record object, from MARC8 to UTF8? I'd probably extend marc8_to_utf8() in MARC::Charset so that it not only transcoded strings, but records as well. The function wo

Re: MARC::Charset problem

2006-06-27 Thread moconnor59
Edward Summers wrote: > Perhaps when you are writing out your data you aren't preparing the > filehandle for utf8? - you're right (as printing to STDOUT and using binmode() shows), however I got confused because I'm actually trying to save the transcoded data in a Berkeley-db file (via DB_File.pm

Re: MARC::Charset problem

2006-06-22 Thread Edward Summers
On Jun 22, 2006, at 5:34 AM, [EMAIL PROTECTED] wrote: I'm using MARC::Charset::marc8_to_utf8() v0.95 to transcode some Library of Congress data to UTF-8; however, I'm finding a problem with the character 'ø' (hex 0xB2 - lowercase Scandinavian o / Latin small letter o with stroke). This character is tra

Re: MARC::Charset testing

2005-12-28 Thread Edward Summers
On Dec 28, 2005, at 8:25 PM, Bryan Baldus wrote: I was able to successfully compile and test MARC::Charset v. 0.8 from CPAN with the following minor modifications, while using MacPerl 5.8.0a2 on MacOS 9.2.2: Thanks for trying it out and for emailing the list. If anyone else is interested

Re: MARC-Charset-0.5 questions

2003-07-30 Thread Edward Summers
Hi Michael: I've sent this message to [EMAIL PROTECTED] since the list at Rice is being retired. If you need information about the new list visit http://perl4lib.perl.org. On Wed, 30 Jul 2003, Michael D Doran wrote: > 1) In the process of converting MARC-8 to UTF-8, are escape sequences, if > p