On 15 Sep 2013, at 22:52, Stephan Stiller wrote:
On 9/15/2013 1:04 PM, Doug Ewell wrote:
André Schappo wrote:
U+2026 is useful for microblogs when one is looking to save characters
Not if the microblog is in UTF-8, as almost all are.
That's an astute observation, but André was talking
Twitter - Until recently, characters outside the BMP resulted in a Counter
decrement of 2 and BMP characters gave a decrement of 1. Not sure when the change
happened but now both BMP non BMP characters result in a decrement of 1
Yes!! How might that have happened? ;-)
And the date line of
① Twitter - [...]
② Sina Weibo - [...]
About a year ago I blogged about it
http://schappo.blogspot.co.uk/2012/10/weibo-character-count.html
And your post on Twitter is this one:
http://schappo.blogspot.co.uk/2012/10/twitter-character-count.html
Stephan
On Thu, Sep 12, 2013 at 1:05 AM, Whistler, Ken ken.whist...@sap.com wrote:
The two currently relevant documents are:
Draft repertoire for FDAM2 of ISO/IEC 10646:2012 (3rd edition) (WG2 N4458):
http://www.unicode.org/L2/L2013/13150-n4458.pdf
and
Draft additional repertoire for ISO/IEC
2013/9/16 Stephan Stiller stephan.stil...@gmail.com
That's exactly what happens when people confuse code point with scalar
value ;-) Hmm, whom might we blame? :-)
Actually you never count scalar values. You are confusing tham with code
units. Twitter was orignally counting UTF-16 code units, but
You haven't been following the thread, have you. When you count code
points you can: either count the original code points, which is the
same as counting scalar values, /because that's what an encoding form
encodes/; or count code points corresponding to code units because,
well, you can match
* Philippe Verdy wrote:
2013/9/16 Stephan Stiller stephan.stil...@gmail.com
That's exactly what happens when people confuse code point with scalar
value ;-) Hmm, whom might we blame? :-)
Actually you never count scalar values. You are confusing tham with code
units. Twitter was orignally
On 9/16/2013 7:48 AM, Stephan Stiller wrote:
or count code points corresponding to code units because, well, you
can match them up
= or count code points corresponding to UTF-16 code units; those
happen to be BMP code points.
Twitter has been claiming since /at least/ April 2012 that they're
Nah!!! STRICTLY NOBODY counts scalar values.
Every one counts either
- (a) code units (most often 8-bit bytes, more rarely 16-bit bytes e.g.
with basic Javascript code), or
- (b) code points (independantly of code units used in the storage or
communication message format).
The application *may*
Is the Unicode Consortium allowed to receive dedicated grants like a public
foundation under US law ?
And if so, how does this conform with the UTC working policies ? I suppose
that the Henry Luce Foundation (HLF) will monitor the progresses (to
provide payments) but will it influence the agenda
On 9/16/2013 1:41 PM, Doug Ewell wrote:
This has nothing to do with UTF-Anything or Normalization Form Anything.
But all with keeping the discussion alive for any reason, however
insignificant :)
A./
Oh, for heaven's sake:
Code Point. (1) Any value in the Unicode codespace; that is, the range
of integers from 0 to 10₁₆. (See definition D10 in Section 3.4,
Characters and Encoding.) Not all code points are assigned to encoded
characters. See code point type. (2) A value, or position, for a
|status changed to become a non-profit charity foundation dedicated to
|wordlwide promotion of education and culture. Thanks.
Oh -- that would be adorable!
--steffen
---BeginMessage---
Is the Unicode Consortium allowed to receive dedicated grants like a public
foundation under US law ?
And if
Asmus Freytag asmusf at ix dot netcom dot com wrote:
On 9/16/2013 1:41 PM, Doug Ewell wrote:
This has nothing to do with UTF-Anything or Normalization Form
Anything.
But all with keeping the discussion alive for any reason, however
insignificant :)
I guess it was too soon to try to come
Sorry to say but a lot of nonsense in your message.
For status of the Unicode Consortium please refer to
http://www.unicode.org/consortium/consort.html
Unicode has always been a member based nonprofit organization and was always
welcoming grants of any sort to help its work. Many experts and
On 9/16/2013 2:18 PM, Doug Ewell wrote:
Asmus Freytag asmusf at ix dot netcom dot com wrote:
On 9/16/2013 1:41 PM, Doug Ewell wrote:
This has nothing to do with UTF-Anything or Normalization Form
Anything.
But all with keeping the discussion alive for any reason, however
insignificant :)
I
Please stop, I've enough replies about the Unicode Consortium status.
But my questions about consequences of **dedicated** grants remain as it
affects how you'll organize works and manage it, within a limited
timeframe. We've not seen this discussed before and nothing is found on the
website
On 9/16/2013 2:05 PM, Steffen Daode Nurpmeso wrote:
But this may be a sign that the Unicode Consortium is about to have
its own status changed to become a non-profit charity foundation
dedicated to wordlwide promotion of education and culture. Thanks. But
this should be clear, and some
Again look at the encoding proposals. For example, many have been sponsored by
SEI. Funds were disbursed to experts doing the ground work. They are all
dedicated by definition. Of course it is easier to get traction for an
encoding proposals when there have been some source of funding. What is
On 9/16/2013 3:01 PM, Philippe Verdy wrote:
Please stop, I've enough replies about the Unicode Consortium status.
But my questions about consequences of **dedicated** grants remain as
it affects how you'll organize works and manage it, within a limited
timeframe. We've not seen this discussed
Hey, Michel, let me correct the history here. The Unicode Consortium was
originally incorporated as a 501 c (6) not-for-profit. In 2012, we
converted to a 501 c (3) not-for-profit. The conversion was approved in
2013, but the IRS counted the date of filing as the conversion date. The
21 matches
Mail list logo