Dan? Robin? Anyone?

(I suppose I should really be asking this on the ActiveState lists,
since I'm presently working with their perl, but I'd like to know if
this is supposed to be working.)

Does an RE like 

    $line =~ /^\s*(<表>)/

work for perl 5.8.0?

I'm opening the file with 

    unless ( open ( CSVTEXT, "<:encoding(shiftjis)", $commaText ) )

and my text editor and I think I'm storing the source code as UTF-8. I'm
telling the browser 

    -encoding => 'utf-8'

and getting good text in the browser window, so I'm pretty sure I'm
straight on this stuff. I can even print 表 out by writing in it UTF-8
Unicode:

    print "\xE8\xA1\xA8<br />\n";

The above RE doesn't work, and neither do these:

    /^\s*(<\xE8A1A8>)/      (no surprise, really)
    /^\s*(<\xE8\xA1\xA8>)/  (Should this work?)
    /^\s*(<\x8868>)/        (one could sort of hope, anyway)

I can grab it with  /^\s*(<.*>)/, but that only gets me so far, of
course.  I can also pick off <caption> and </caption> with

    /^\s*(<\/?caption>)/i

without any problem, but I really wanted to use Japanese in the tags. 
(Japanese for our customers, and 表 and 表題 collapse so nicely.)

Am I expecting too much, too early? 

What are you guys doing for these? Any suggestions?

-- 
Joel Rees, programmer, Systems Group
Altech Corporation (Alpsgiken), Osaka, Japan
http://www.alpsgiken.co.jp

Reply via email to