Re: use Encode; # on Japanese; LONG!

Dan Kogai Wed, 09 Jan 2002 19:49:22 -0800

jhi,

On 2002.01.10, at 11:18, Jarkko Hietaniemi wrote:
> Yes, I've heard of you :-)


   Thanks.

> (I'm CCing Nick Ing-Simmons, the original author of Encode, and
> SADAHIRO Tomoyuki, who has worked on it a little bit, and who might
> also know some Japanese :-)

   Therefore I'm cc'ing this to all recepients.

> First off, I'm really thankful that you took a careful look the
> current state of Encode as per Japanese encodings.

   And my apology for not moving quick enough despite being a mainterner 
of Jcode.  Well, I have an excuse.  My 1st ($[ = 0, of course :) child 
was born 18 days ahead of ETA.  She was supposed to be born on the 7th 
this month but "hello world"ed on the 20th last month.  So my holiday 
season schedule was quite an disarray....

> I won't (can't) comment on much the Encode details, since I'm pretty
> unfamiliar the design or the implementation, all I've done is to add
> some (eight-bit) encodings many moons ago.  I'm hoping Nick and Sadahiro
> will join in and comment.

   The surprising thing however broken it functioned somewhat.  Not bad 
for a character set you have virtually no idea on.  It's as miraculous 
as assembling the Machine out of the blueprint sent over the stars 
(Read/Seen 'Contact' by late Carl Sagan?).

> How about "not at all"? :-)

   How do you say that in Finnish?  In Japanese it would be "Zenzen 
Wakarimasen".

>> don't have to; I don't grok Finnish either :).  It takes more than a
>> simple table lookup to handle Japanese well enough to make native
>> grokkers happy.  It has to automatically detect which of many charsets
>> are used, it has to be robust, and most of all, it must be documented 
>> in
>> Japanese :)  I can do all that.
>
> Excellent.

   Or is it?  As a matter of fact Jcode POD contains no Japanese since 
pod parser groks no Japanese.  It just has a web page in both languages 
and mailing list, however....

>>    If I submit Encode::Japanese, are you going to merge it standard
>> module?
>
> Definitely, yes.  Implementation-wise you'll have to discuss with Nick
> since whatever we use should work with the Tcl/Tk scheme (hence the
> name Encode::Tcl, as you no doubt guessed.)  Sadahiro can comment on
> both Encode and Japanese.

   I'm honored to be a gene donator of the beast!

>> Dan the Man with Too Many Charsets to Deal With
>
> Sounds good :-)
>
> One nit, though: the sooner you can start *and* finish the task,
> the better.  For delivery dates, I would prefer "yesterday"... Why?
> I want to release a 5.7.3 really, REALLY soon now, so that module
> authors and users can test their stuff against it, so that 5.8.0 can
> be released in a few months.  So I hope you haven't got any previous
> commitmentents, like a day job or a family :-)
>
   Okay, I'll move as quickly as possible but if the worse gets the worst 
I can still upload it to CPAN (I just want to make sure the name space 
remains untouched).
   If I just code a bridging module to Jcode that would be just a few 
hours away but I wouldn't want to do that knowing I can implement much 
simpler and more elegantly.
   I also believe the same scheme can be applied to other CJKV 
languages/charset.  But once again I need some help to come that far.  I 
know some chinese (perhaps enough to debug the code.  I can at least 
tell if certain string is a sentence or line noise :) but I know little 
Korean and absolutely no Vietnamese...
   Well, enough mubling done.  Back to coding....

Dan the Man with Too Many Breed of Camels (that is, too many versions of 
Camels to babysit; I still have a customer that sticks with perl4, 
y'know).

Re: use Encode; # on Japanese; LONG!

Reply via email to