On Sat, Jun 24, 2006 at 12:09:34PM -0400, Anthony DeRobertis wrote:
> Package: id3v2
> Version: 0.1.11-3
> Severity: normal
> File: /usr/bin/id3v2
> 
> On a test file:
> 
> $ id3v2 -l J??????\ -\ #nipponsei\ @\ irc.rizon.net\,\ di??il????????.mp3 | 
> grep TIT2 | xxd
> 0000000: 5449 5432 2028 5469 746c 652f 736f 6e67  TIT2 (Title/song
> 0000010: 6e61 6d65 2f63 6f6e 7465 6e74 2064 6573  name/content des
> 0000020: 6372 6970 7469 6f6e 293a 2064 69df 696c  cription): di.il
> 0000030: dfec f3f1 0a                             .....
> 
> The tag in the MP3 file is UTF-8:
> 
> 00000b0: 697a 6f6e 2e6e 6574 5449 5432 0000 000f  izon.netTIT2....
> 00000c0: 0000 0364 69c3 9f69 6cc3 9fc3 acc3 b3c3  ...di..il.......
> 00000d0: b154 4954 3300 0000 0100 0003 544f 5045  .TIT3.......TOPE
> 
> And my system uses UTF-8:
> 
> $ locale | grep LC_CTYPE
> LC_CTYPE="en_US.UTF-8"

I am a character encoding newbie - you likely know more than me about the
subject. With that said:

* The ID3v2 2.3.0 standard is not the latest but it happens to be what
  id3lib (which id3v2 uses) implements. (The latest is 2.4.0).

* The ID3v2 2.3.0 standard only supports ISO-8859-1 and UCS-2. 2.4.0 did add
  support for UTF-16BE and UTF-8.

* I'll have to check whether it's id3v2 or id3lib that lacks support for
  UCS-2, or whether it does support it but chooses the wrong encoding to
  convert to.
-- 
Robert Woodcock - [EMAIL PROTECTED]
"Everything has been thought of before. The trick is to think of it again."
        -- Johann Wolfgang von Goethe


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]

Reply via email to