Searching, counting characters etc. are easier to convert UTF-8 to Unicode
(wide), doing whatever, then converting back to UTF-8.


On Tue, Feb 25, 2014 at 9:10 AM, Björn Helgason <[email protected]> wrote:

> a. and especially i. a. - looking up chars indexes used to be useful.
>
> It is not as easy anymore.
>
> The national chars are often not in there with a single number.
>
> Sometimes two or three.
>
> Reading files also sometimes with unicode markings.
>
> -
> Björn Helgason
> gsm:6985532
> skype:gosiminn
> On 25.2.2014 14:03, "Don Guinn" <[email protected]> wrote:
>
> > I tried that a while back. I extended the table for ;: to treat the bytes
> > for _128{.a to be treated as letters which made all multi-byte UTF-8
> > treated as alphas. Statements were broken into tokens properly. But then
> I
> > found that the interpreter used the top half of a. internally. I
> mentioned
> > that in the forum a while back when someone noticed that some character
> in
> > there acted weird. Roger said that could be changed if needed. Might be
> > easy for Roger to change that but it didn't look so easy to me.
> >
> > I looked at the tables for Unicode (wide characters) and in the form of
> > UTF-8 and couldn't see any easy to distinguish the category of a
> character.
> > Those that one would consider an alpha were mixed in with graphics and
> > controls. APL characters were not grouped together but scattered all over
> > the place.
> >
> > For trying it out and seeing what happens shouldn't be too difficult to
> see
> > how it would work but there are a lot of questions to answer before
> making
> > it a production tool.
> >
> >
> >
> >
> >
> >
> > On Mon, Feb 24, 2014 at 10:11 PM, bill lam <[email protected]> wrote:
> >
> > > This seems simpler. The first thing to do is build a prototype
> > > implementaton,
> > > and then we can see what are other problems out there.
> > >
> > > Пн, 24 фев 2014, Don Guinn писал(а):
> > > > A middle ground might be to allow for some Unicode (UTF-8​​) to be
> > > > considered letters like a-z,A-Z. Then one could name APL iota to
> > > something
> > > > like i. . In addition, it would allow non-English languages not be
> > > > restricted to ASCII characters for names. Greek letters in
> mathematics
> > > > could be used as names making statements look a little more like
> > > > traditional mathematics. It would be simpler to allow all Unicode
> > > > characters be considered letters, but that might lend to other
> > problems.
> > > >
> ----------------------------------------------------------------------
> > > > For information about J forums see
> http://www.jsoftware.com/forums.htm
> > >
> > > --
> > > regards,
> > > ====================================================
> > > GPG key 1024D/4434BAB3 2008-08-24
> > > gpg --keyserver subkeys.pgp.net --recv-keys 4434BAB3
> > > gpg --keyserver subkeys.pgp.net --armor --export 4434BAB3
> > > ----------------------------------------------------------------------
> > > For information about J forums see http://www.jsoftware.com/forums.htm
> > ----------------------------------------------------------------------
> > For information about J forums see http://www.jsoftware.com/forums.htm
> ----------------------------------------------------------------------
> For information about J forums see http://www.jsoftware.com/forums.htm
----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm

Reply via email to