Tedd Hopp asked:

> Tell me if I'm wrong please, but isn't moving characters (however
> it's disguised) as much of a violation of the stability policy as is
> changing combining classes of the existing vowels?

You're not wrong. It is a violation of the stability policy.

> The Hebrew vowels interact typographically and the combining classes should
> have been assigned accordingly originally. That's what should be fixed now.

But that is water under the bridge at this point. I am looking for
a *feasible* technical solution to the current *technical* problem.

> I recognize the powerful political issues involved, and that these are
> barriers to this happening. But trying to find technical solutions to
> political problems is extremely short-sighted. I would urge that all
> technical efforts be directed away from solving the politicians' problems
> and focus on how to minimize whatever damage may be caused by changing the
> combining classes.

What we have currently are:

  a. a minor technical problem (that certain sequences of vowel
     points in Biblical Hebrew cannot be reliably distinguished
     in normalized Unicode plain text)
     
and

  b. a minor political problem (that certain communities of Biblical
     scholars are badmouthing Unicode because it "can't fix its
     obvious mistakes")
     
Changing the combining classes of Hebrew points will create:

  a. a major technical problem (destabilization of normalization)
  
and

  b. a major political problem (in both IETF and W3C, at least,
     as well as between members in the UTC, with the non-zero
     risk that the rift will result in the definition of competing
     specifications of normalization, which will compound the
     technical problem)
     
> 
> From my company's perspective, all other proposals I've seen would be more
> damaging to us than doing the right thing. It would be beneficial to hear
> from others on this list about what the specific technical (not political)
> impacts would be (both positive and negative) on their work and their
> products that would come from fixing the combining classes of the existing
> vowels.

Speaking for Sybase products, "fixing" the combining classes of the
existing vowels would have *no* positive impacts. It would have
a large number of negative impacts, the ultimate ramifications
of which I cannot even follow to their eventual conclusions. It
would impact the implementation of normalization code, as well
as its testing. It would lead to nasty meetings where I would
try to explain to server developers why normalization on the
servers wasn't quite reliable, since it has this little Hebrew
"hole" over here. It might require figuring out how to specify
versions of normalization -- and I have no idea how the labels
for that would be reliably attached to data. It puts normalized
data into a kind of a Catch-22 situation where you could never
rely on its stability, since if it contained any of the offending
characters, the determination of whether it would change or not
if normalized again would depend on which *version* of the
normalization code it hit. The reaction, in the context of
some protocols or even database implementations, might be to
deny access to the offending characters -- essentially to rule
them offlimits because of their impact on normalization. And
how would *that* benefit Biblical Hebrew scholars?

The whole situation just stinks of the neverending cascade of
problems that result, for example, from the small set of
persistent interoperability mismatches between various
interpretations of Shift-JIS encoding. That's the kind of
problem that can persist for a decade or more and which just
gets passed as the hot potato from one generation of
developers to the next.

I expect you could hear testimonials from other database
developers on the list about the evils of destabilizing the
definition of Unicode normalization.

--Ken


Reply via email to