Re: font-encoded hacks

2016-10-07 Thread Neil Harris

On 07/10/16 07:42, Denis Jacquerye wrote:

In may case people resort to these hacks because it is an easier short term
solution. All they have to do is use a specific font. They don't have to
switch or find and install a keyboard layout and they don't have to upgrade
to an OS that supports their script with Unicode properly. Because of these
sort term solutions it's hard for a switch to Unicode to gain proper
momentum. Unfortunately, not everybody sees the long term benefit, or often
they see it but cannot do it practically.

Too often Unicode compliant fonts or keyboard layouts have been lacking or
at least have taken much longer to be implemented.
One could wonder if a technical group for keyboards layouts would help this
process.


What might also help is a reconceptualization of these hacks as being in 
effect non-standard character encodings: the existing software 
infrastructure for handling charsets could then be co-opted to convert 
them to (and possibly from) Unicode if desired.


Neil



Re: Why incomplete subscript/superscript alphabet ?

2016-10-03 Thread Neil Harris

On 03/10/16 18:59, Steve Swales wrote:

On Oct 3, 2016, at 10:14 AM, Doug Ewell  wrote:

a.lukyanov wrote:


I think that the right thing to do would be to create several new
control/formatting characters, like this:

"previous character is superscript"
"previous character is subscript"
"previous character is small caps (for use in phonetic transcription
only)"
"previous character is mathematical blackletter"
etc

Then people will be able to apply this features on any character as
long as their font supports it.

I happen to think this would be exactly the wrong thing to do,
completely contrary to the principles of plain text that Unicode was
founded upon. But you never know what might gain traction, so stay
tuned.

I guess I don’t see how it is fundamentally different from other variant 
selector uses within Unicode, and the ability to write properly formatted 
mathematical and chemical formulas (for example) in a plain text environment 
like text messaging seems like a fairly compelling use case.

-steve





Yes, but since there are existing well-standardized higher-level 
protocols already in existence (HTML, MATHML, TeX, etc. etc.) that do 
exactly that. They should be used instead, as opposed to trying to make 
Unicode something other than a plain-text character encoding, contrary 
to its design principles. Moreover, while you describe seems 
superficially simple, as soon as you try to expand it, you will find you 
end up with systems like this: 
http://unicode.org/notes/tn28/UTN28-PlainTextMath.pdf which are neither 
one nor the other, and in spite of their proposal as a plain-text 
notation, actually ends up being an ad-hoc higher-level protocol anyway.


Neil




Re: APL Under-bar Characters

2015-08-17 Thread Neil Harris

On 17/08/15 17:23, Doug Ewell wrote:


In that case, despite the text in Section 22.7 that Ken quoted, it seems
that U+0331 COMBINING MACRON might be a better choice for APL
"underlined letters" than U+0332 COMBINING LOW LINE. Compare A̱ḆC̱
with A̲B̲C̲, noting that your font and rendering engine mileage may
vary.

"Voting again" to change one of the basic rules of Unicode, on the basis
that "perhaps feelings about the under-bar characters have changed since
then," is not expected to be an option, as David said.



Doug is right. One small correction: U+0331 is COMBINING MACRON BELOW, 
not COMBINING MACRON.


Wikipedia has an excellent article on this topic:

https://en.wikipedia.org/wiki/Macron_below

-- Neil





Re: Plain text (from Re: Avoidance variants)

2015-03-27 Thread Neil Harris

On 26/03/15 23:27, Mark E. Shoulson wrote:

On 03/26/2015 11:18 AM, William_J_G Overington wrote:
> Blocks of boring plain text, no italics or effects any more complex 
than justification, simple notes written all in one font with no 
formatting to speak of etc.



I am wondering if it is considered a good idea to define into Plane 
14 some formatting characters, so that plain text could in the future 
contain italics and so on.


And we could define "plain water" to include sugar and flavorings, and 
have Coke run out of our taps.  But that isn't "plain water" anymore.  
And yes, we DO allow some additives in water and still call it 
"plain", even as we do have some formatting characters in Unicode and 
call it plain text (e.g. tab, formfeed, ZWJ, RLO, PDF, etc)


Alternatively, you could say we already have such things encodable as 
plain text, using character sequences, like U+003C U+0069 U+003E to 
indicate "BEGIN ITALICS", etc...  Just need the right reader...


~mark


Or you could just redefine "&" and "<" as

U+0026 START HTML ENTITY

and

U+003C START HTML TAG

and be done with it, and just incorporate HTML5 into Unicode forever, 
thus eliminating these discussions from this list, and moving them to 
the W3C and WHATWG lists...


-- Neil

___
Unicode mailing list
Unicode@unicode.org
http://unicode.org/mailman/listinfo/unicode


Admuncher javascript on Unicode site

2014-12-25 Thread Neil Harris

I've just noticed that loading the web page

http://www.unicode.org/L2/L2014/14250.htm

loads a script from "interceptedby.admuncher.com"

This seems pretty peculiar to me. Is this intended?

Neil Harris

___
Unicode mailing list
Unicode@unicode.org
http://unicode.org/mailman/listinfo/unicode


Re: Websites in Hindi

2014-03-03 Thread Neil Harris

On 03/03/14 18:14, James Lin wrote:

another problem you may need to consider is the support of the glyph/fonts
on your system.  Not all fonts are supported/install by default when
installing the OS.

Warm Regards,
-James




This is where webfonts should be extremely useful -- I believe recent 
versions of at least Firefox, and probably other modern browsers, should 
support both webfonts and text shaping for Indic scripts by default, 
whether or not the underlying platform has the correct fonts.


Neil

___
Unicode mailing list
Unicode@unicode.org
http://unicode.org/mailman/listinfo/unicode


Re: The Ruble sign has been approved

2013-12-12 Thread Neil Harris

On 12/12/13 15:14, William_J_G Overington wrote:

Michael Everson  wrote:


I’m already on it.

Excellent.

Would it be possible please for encoding to include specific official guidance, 
going back to a source with provenance, as to whether a glyph for the symbol in 
a serif font should or should not have serifs?

William Overington

12 December 2013






I would imagine that this is left to the discretion of the font 
designer, as in all other simlar currency symbols, where the style of 
the currency symbol follows the style of the rest of the font.


-- N.




Re: Why blackletter letters?

2013-09-12 Thread Neil Harris

On 12/09/13 11:26, Johan Winge wrote:
On Wed, 11 Sep 2013 20:29:51 +0200, Hans Aberg  
wrote:


... The symbol for the empty set ∅ is originally a Greek letter phi 
ϕ, ans some use the latter.


According to the autobiography of André Weil, quoted at 
http://jeff560.tripod.com/set.html, the empty set symbol ∅ was 
inspired by the Scandinavian Ø, and would then have nothing to do with 
the Greek phi, except for a superficial resemblance. I'm aware that 
some mathematician indeed do use Φ/φ, supposedly due to this 
misconception and/or lacking coverage in fonts and/or carelessness, 
but I find it terribly annoying. Really, it is no more correct than 
using ß in lieu of β.


-- Johan Winge




Do some mathematicians _really_ use Φ/φ instead of ∅, or does it just 
look like they're doing so?


Careless handwriting of ∅ could indeed look like Φ or even φ, but I 
doubt they're thinking "phi, the symbol for the empty set" as they do 
so.  TeX is universal in the typesetting of mathematics, and the symbol 
is visually quite distinct from the Greek letter, which mathematicians 
will also see on a daily basis: if they've ever been exposed to TeX, 
they must surely have made the distinction, and just reading papers 
should be enough to make the difference clear.


Neil






Re: Suggestion for new dingbats/symbols

2013-05-31 Thread Neil Harris

On 31/05/13 20:37, Asmus Freytag (w) wrote:

I think that research that does precisely this kind of task of correlating 
symbol repertoires against each other is extremely valuable in its own right.

Additional research that documents the usage of these symbols -- in computing 
environments -- would also be useful.

Reliable facts on users and the tasks in which they use particular symbols 
(represented in filed and data) would be a better basis to argue about possible 
encodings than just the existence of symbols or whether they are highly 
recognizable when seen on signage.

Having said that, documenting the details of ongoing efforts at understanding 
symbols by posting each small finding on this list is probably inappropriate. 
That kind of effort belongs in a research project aimed at symbols.

A./



Thanks! I agree that a mailing list is a very poor venue for this -- I 
just wanted to demonstrate that the repertoire of public information 
symbols was quite coherent, and very amenable to unification, instead of 
being a random grab-bag of pictograms with no defined boundaries -- and 
then I got carried away.


ISO has its TC 145 committee to talk about exactly this, and no doubt 
they will have a lot of this sewn up already, but it's not really a 
public forum, and their documents are not freely available.


Is there an alternative forum that could be used to develop something in 
a crowdsourced, collaborative way that could later be refined to 
generate a more formal document such as a Unicode encoding proposal? 
Something as simple as a wiki would work fine in the short term...


Neil





Re: Suggestion for new dingbats/symbols

2013-05-31 Thread Neil Harris


The ECOMO of symbols set is particularly interesting, as it seems to be a compendium of not just 
the ISO 7001 symbols, but also the DOT and AIGA symbols, and almost all the other symbols I could 
think of as meeting my earlier criteria, such as the ubiquitous "running man" emergency 
exit symbol. the stylized "i" information symbol, and the symbols for sports activities.

There is a list of all 125 of them here:

http://www.ecomo.or.jp/barrierfree/pictogram/picto_top.html



After a bit more research, I found this blog post:

http://tavmjong.free.fr/blog/?p=700

which suggests that the AIGA set and the Department of Transportation 
set are one and the same, and that the ECOMO and ISO 7001 pictograph 
sets were derived from this starting set of pictograms.


I've had a quick go at unifying the ECOMO and AGIA sets with one another 
and symbols already encoded in Unicode, which suggests that around 126 
new characters might need to be encoded to complete both the ECOMO/AGIA 
sets.


Details below the fold: I've tried to keep the various names I've used 
close to the names in the actual standards themselves. I'm sure there 
are errors in this, as well as missed opportunities for unification, but 
it's a start on estimating the scope of the task of completing the set 
of commonly used public information symbols.


-- Neil

AIGA_TELEPHONE = ECOMO_TELEPHONE
AIGA_MAIL = ECOMO_POST
AIGA_CASHIER
AIGA_ESCALATOR = ECOMO_ESCALATOR

AIGA_TOILETS = ECOMO_TOILETS = UNICODE_1F6BB_RESTROOM
AIGA_NURSERY = ECOMO_NURSERY = UNICODE_1F6BC_BABY_SYMBOL
AIGA_DRINKING_FOUNTAIN
AIGA_TAXI = ECOMO_TAXI_/_TAXI_STAND = UNICODE_1F696_ONCOMING_TAXI
AIGA_BUS = ECOMO_BUS_/_BUS_STOP = UNICODE_1F68D_ONCOMING_BUS
AIGA_LOST_AND_FOUND = ECOMO_LOST_AND_FOUND = UNICODE_1F6C5_LEFT_LUGGAGE
AIGA_STAIRS = ECOMO_STAIRS
AIGA_ELEVATOR = ECOMO_ELEVATOR
AIGA_WAITING_ROOM = ECOMO_LOUNGE_/_WAITING_ROOM
AIGA_INFORMATION = ECOMO_INFORMATION
AIGA_HOTEL_INFORMATION
AIGA_AIR_TRANSPORTATION = ECOMO_AIRCRAFT_/_AIRPORT
AIGA_HELIPORT = ECOMO_HELICOPTER_/_HELIPORT = UNICODE_1F681_HELICOPTER
AIGA_RAIL_TRANSPORTATION = ECOMO_RAILWAY_/_RAILWAY_STATION
AIGA_WATER_TRANSPORTATION = ECOMO_SHIP_/_FERRY_/_PORT = UNICODE_1F6A2_SHIP
AIGA_ROAD_TRANSPORTATION
AIGA_TOILETS_FEMALE = ECOMO_WOMEN = UNICODE_1F6BA_WOMENS_SYMBOL
AIGA_CAR_RENTAL = ECOMO_RENT_A_CAR
AIGA_RESTUARANT = ECOMO_RESTAURANT
AIGA_COFFEESHOP = ECOMO_COFFEE_SHOP
AIGA_BAR = ECOMO_BAR
AIGA_SHOPS = ECOMO_SHOP
AIGA_BARBER_SHOP_BEAUTY_SALON = ECOMO_BARBER_/_BEAUTY_SALON
AIGA_BARBER_SHOP
AIGA_BEAUTY_SALON
AIGA_TICKET_PURCHASE = ECOMO_TICKETS_/_FARE_ADJUSTMENT
AIGA_BAGGAGE_CHECK_IN = AIGA_BAGGAGE_COLLECTION
AIGA_DEPARTING_FLIGHTS
AIGA_ARRIVING_FLIGHTS = ECOMO_ARRIVALS
AIGA_SMOKING = ECOMO_SMOKING_AREA = UNICODE_1F6AC_SMOKING_SYMBOL
AIGA_NO_SMOKING = ECOMO_NO_SMOKING = UNICODE_1F6AD_NO_SMOKING_SYMBOL
AIGA_PARKING = ECOMO_PARKING
AIGA_NO_PARKING = ECOMO_NO_PARKING
AIGA_NO_DOGS = ECOMO_NO_UNCAGED_ANIMALS
AIGA_NO_ENTRY = ECOMO_NO_ENTRY = UNICODE_26D4_NO_ENTRY
AIGA_EXIT
AIGA_FIRE_EXTINGUISHER = ECOMO_FIRE_EXTINGUISHER
AIGA_LITTER_DISPOSAL
AIGA_TOILETS_MEN = ECOMO_MEN = UNICODE_1F6B9_MENS_SYMBOL
AIGA_STAIRS_UP
AIGA_STARIS_DOWN
AIGA_CURRENCY_EXCHANGE
AIGA_FIRST_AID = ECOMO_FIRST_AID
AIGA_BAGGAGE_LOCKERS
AIGA_ESCALATOR_DOWN
AIGA_ESCALATOR_UP
AIGA_COAT_CHECK = ECOMO_CLOAKROOM

AGIA_SYMBOL_RIGHT_POINTING_ARROW = ECOMO_SYMBOL_RIGHT_POINTING_ARROW = 
UNICODE_27A1_BLACK_RIGHTWARDS_ARROW
AGIA_SYMBOL_FORWARD_AND_RIGHT_POINTING_ARROW = 
ECOMO_SYMBOL_FORWARD_AND_RIGHT_POINTING_ARROW = 
UNICODE_2B08_NORTH_EAST_BLACK_ARROW
AGIA_SYMBOL_FORWARD_POINTING_ARROW = ECOMO_SYMBOL_FORWARD_POINTING_ARROW 
= UNICODE_2B06_UPWARDS_BLACK_ARROW
AGIA_SYMBOL_FORWARD_AND_LEFT_POINTING_ARROW = 
ECOMO_SYMBOL_FORWARD_AND_LEFT_POINTING_ARROW = 
UNICODE_2B09_NORTH_WEST_BLACK_ARROW
AGIA_SYMBOL_LEFT_POINTING_ARROW = ECOMO_SYMBOL_LEFT_POINTING_ARROW = 
UNICODE_2B05_LEFTWARDS_BLACK_ARROW
AGIA_SYMBOL_DOWNWARD_AND_LEFT_POINTING_ARROW = 
ECOMO_SYMBOL_DOWNWARD_AND_LEFT_POINTING_ARROW = 
UNICODE_2B0B_SOUTH_WEST_BLACK_ARROW
AGIA_SYMBOL_DOWNWARD_POINTING_ARROW = 
ECOMO_SYMBOL_DOWNWARD_POINTING_ARROW = UNICODE_2B07_DOWNWARDS_BLACK_ARROW
AGIA_SYMBOL_DOWNWARD_AND_RIGHT_POINTING_ARROW = 
ECOMO_SYMBOL_DOWNWARD_AND_RIGHT_POINTING_ARROW = 
UNICODE_2B0A_SOUTH_EAST_BLACK_ARROW


ECOMO_CUSTOMS_/_BAGGAGE_CHECK = AIGA_CUSTOMS = UNICODE_1F6C3_CUSTOMS
ECOMO_IMMIGRATION_/_QUARANTINE_/_INSPECTION = AIGA_IMMIGRATION = 
UNICODE_1F6C2_PASSPORT_CONTROL


ECOMO_QUESTION_AND_ANSWER
ECOMO_HOSPITAL
ECOMO_POLICE
ECOMO_ACCESSIBLE_FACILITY
ECOMO_ACCESSIBLE_SLOPE
ECOMO_DRINKING_WATER
ECOMO_CHECK-IN_/_RECEPTION
ECOMO_HOTEL_/_ACCOMMODATION
ECOMO_BAGGAGE_STORAGE
ECOMO_COIN_LOCKERS
ECOMO_MEETING_POINT
ECOMO_BANK,_MONEY_EXCHANGE
ECOMO_CASH_SERVICE
ECOMO_FAX
ECOMO_CART

ECOMO_DRESSING_ROOM
ECOMO_DRESSING_ROOM(WOMEN)
ECOMO_SHOWER = UNICODE_1F6BF_SHOWER
ECOMO_BATH = UNICODE_1F6C0_BATH
ECOMO_WATER_FOUNTAIN
ECOMO_TRASH_BOX
ECOMO_COLLECTION_FACILITY_FOR_THE_RECYCLING_PRODUCTS

ECOMO_BICYCLE = UNICODE_1F6B2_BICYC

Re: Suggestion for new dingbats/symbols

2013-05-31 Thread Neil Harris

On 31/05/13 01:13, Dreiheller, Albrecht wrote:

Watching the discussion on symbols, icons, signs, emoticons of the last days,
I'm thinking a little bit philosophically about the question:
Where will we end up?

Is communicating with symbols like a new easy-to-learn universal language?
Is this our new Lingua Franca?
Even if there will be more different symbols than Chinese characters?

Well, it might be easy to understand symbols, if the context is clear and
if they are displayed or printed with good quality.
So the _receptive vocabulary_ might be pretty big for many people.



That's certainly my rationale. I'm not arguing for the encoding of all 
pictograms, and even more so not advocating the use or development of 
novel pictographic languages or any such similar development.


I'm only arguing for the encoding of pictograms that are

* in widespread use internationally
* that are used commonly in contexts where they are _used like, or in, 
text_, in particular their use in Rosetta Stone-like multilingual signs

* that belong to standardized sets
* and that are instantly recognised understood by a _very large_ number 
of people from a wide range of cultures


Over and above this, I think that this can also be combined with a 
"complete the existing set" rationale.


After a lot of Googling, I have discovered that many, but not all, of 
the ISO 7001 symbols already seem to have been encoded in the block 
U+1F680 - 1F6FF, Transport and Map Symbols, which also contains other 
similar symbols which to the best of my knowledge are not part of ISO 7001.


A quick glance at the Unicode code tables shows that several other ISO 
7001 symbols are already encoded elsewhere:


* U+267F WHEELCHAIR SYMBOL
* U+26D4 NO ENTRY
* U+260E BLACK TELEPHONE
* U+2B06 UPWARDS BLACK ARROW, in a block with seven other arrows 
pointing to the other cardinal points

* U+1F481 INFORMATION DESK PERSON has an obvious equivalent in ISO 7001

I also spotted a number of other symbols that I think meet the 
definition I gave above which are already encoded, presumably using a 
similar rationale to the one I gave above:


* U+2623 BIOHAZARD SIGN
* U+2620 SKULL AND CROSSBONES
* U+2622 RADIOACTIVE SIGN

The ECOMO of symbols set is particularly interesting, as it seems to be 
a compendium of not just the ISO 7001 symbols, but also the DOT and AIGA 
symbols, and almost all the other symbols I could think of as meeting my 
earlier criteria, such as the ubiquitous "running man" emergency exit 
symbol. the stylized "i" information symbol, and the symbols for sports 
activities.


There is a list of all 125 of them here:

http://www.ecomo.or.jp/barrierfree/pictogram/picto_top.html

Neil




Re: Suggestion for new dingbats/symbols

2013-05-29 Thread Neil Harris

A quick look at

http://signcollection.com/media/wysiwyg/DOT_ISO_7001_Pictograms.jpg

http://defound.com/2011/10/the-helvetica-of-pictograms/

and

http://www.aiga.org/symbol-signs/

and making up names for the DOT/ISO 7001 symbols, gives the following 
set of equivalences, suggesting a high degree of commonality between the 
symbol sets:


Symbols apparently common to DOT and AIGA:

DOT SYMBOL TELEPHONE = AIGA SYMBOL TELEPHONE
DOT SYMBOL ENVELOPE = AIGA SYMBOL MAIL
DOT SYMBOL DOLLAR IN CIRCLE = AIGA SYMBOL CASHIER
DOT SYMBOL ESCALATOR = AIGA SYMBOL ESCALATOR
DOT SYMBOL TOILETS MALE FEMALE = AIGA SYMBOL TOILETS
DOT SYMBOL BABY = AIGA SYMBOL NURSERY
DOT SYMBOL DRINKING FOUNTAIN = AIGA SYMBOL DRINKING FOUNTAIN
DOT SYMBOL TAXI = AIGA SYMBOL TAXI
DOT SYMBOL BUS = AIGA SYMBOL BUS
DOT SYMBOL LOST PROPERTY = AIGA SYMBOL LOST AND FOUND
DOT SYNBOL STAIRCASE = AIGA SYMBOL STAIRS
DOT SYMBOL ELEVATOR = AIGA SYMBOL ELEVATOR
DOT SYMBOL WAITING AREA = AIGA SYMBOL WAITING ROOM
DOT SYMBOL QUESTION MARK IN CIRCLE = AIGA SYMBOL INFORMATION
DOT SYMBOL BED AND QUESTION MARK IN CIRCLE = AIGA SYMBOL HOTEL INFORMATION
DOT SYMBOL AIRPLANE = AIGA SYMBOL AIR TRANSPORTATION
DOT SYMBOL HELICOPTER = AIGA SYMBOL HELIPORT
DOT SYMBOL TRAIN = AIGA SYMBOL RAIL TRANSPORTATION
DOT SYMBOL RIGHT POINTING ARROW = AIGA equivalent of similar name
DOT SYMBOL FORWARD AND RIGHT POINTING ARROW = AIGA equivalent of similar 
name

DOT SYMBOL FORWARD POINTING ARROW = AIGA equivalent of similar name
DOT SYMBOL FORWARD AND LEFT POINTING ARROW = AIGA equivalent of similar name
DOT SYMBOL LEFT POINTING ARROW = AIGA equivalent of similar name
DOT SYMBOL DOWNWARD AND LEFT POINTING ARROW = AIGA equivalent of similar 
name

DOT SYMBOL DOWNWARD POINTING ARROW = AIGA equivalent of similar name
DOT SYMBOL DOWNWARD AND RIGHT POINTING ARROW = AIGA equivalent of 
similar name

DOT SYMBOL SHIP = AIGA SYMBOL WATER TRANSPORTATION
DOT SYMBOL TAXI AND BUS = AIGA SYMBOL ROAD TRANSPORTATION
DOT SYMBOL WOMAN = AIGA SYMBOL TOILETS FEMALE
DOT SYMBOL CAR AND KEY = AIGA SYMBOL CAR RENTAL
DOT SYMBOL KNIFE AND FORK = AIGA SYMBOL RESTUARANT
DOT SYMBOL CUP AND SAUCER = AIGA SYMBOL COFFEESHOP
DOT SYMBOL COCKTAIL GLASS = AIGA SYMBOL BAR
DOT SYMBOL GIFT SHOP [?] = AIGA SYMBOL SHOPS
DOT SYMBOL HAIRDRESSER = AIGA SYMBOL BARBER SHOP BEAUTY SALON
DOT SYMBOL HAIRDRESSER FOR MEN = AIGA SYMBOL BARBER SHOP
DOT SYMBOL HAIRDRESSER FOR WOMEN = AIGA SYMBOL BEAUTY SALON
DOT SYMBOL TICKET COUNTER = AIGA SYMBOL TICKET PURCHASE
DOT SYMBOL SUITCASE = AIGA SYMBOL BAGGAGE CHECK IN [?] = AIGA SYMBOL 
BAGGAGE COLLECTION [?]

DOT SYMBOL AIRPLANE TAKEOFF [?] = AIGA SYMBOL DEPARTING FLIGHTS
DOT SYMBOL MAN WITH SUITCASE WAVING [?] = AIGA SYMBOL ARRIVING FLIGHTS
DOT SYMBOL LIT CIGARETTE = AIGA SYMBOL SMOKING
DOT SYMBOL NO SMOKING = AIGA SYMBOL NO SMOKING
DOT SYMBOL PARKING P = AIGA SYMBOL PARKING
DOT SYMBOL NO PARKING = AIGA SYMBOL NO PARKING
DOT SYMBOL NO DOGS = AIGA SYMBOL NO DOGS
DOT SYMBOL NO ENTRY = AIGA SYMBOL NO ENTRY
DOT SYMBOL BLACK CIRCLE DIVIDED BY VERTICAL WHITE LINE [?] = AIGA SYMBOL 
EXIT

DOT SYMBOL FIRE EXTINGUISHER = AIGA SYMBOL FIRE EXTINGUISHER
DOT SYMBOL PUTTING LITTER INTO BIN = AIGA SYMBOL LITTER DISPOSAL
DOT SYMBOL MAN = AIGA SYMBOL TOILETS MEN
DOT SYMBOL UP STAIRCASE = AIGA SYMBOL STAIRS UP
DOT SYMBOL DOWN STAIRCASE = AIGA SYMBOL STARIS DOWN
DOT SYMBOL NOTE WITH CURRENCY SYMBOLS = AIGA SYMBOL CURRENCY EXCHANGE
DOT SYMBOL SWISS CROSS = AIGA SYMBOL FIRST AID
DOT SYMBOL SUITCASE IN BOX WITH KEY = AIGA SYMBOL BAGGAGE LOCKERS
DOT SYMBOL DOWN ESCALATOR = AIGA SYMBOL ESCALATOR DOWN
DOT SYMBOL UP ESCALATOR = AIGA SYMBOL ESCALATOR UP
DOT SYMBOL COATHANGER = AIGA SYMBOL COAT CHECK

AIGA-only symbols:

AIGA SYMBOL CUSTOMS
AIGA SYMBOL IMMIGRATION

Mere colour-reversed variants of other symbols:

DOT SYMBOL RIGHT POINTING ARROW WHITE ON BLACK -- see AIGA 
black-on-white equivalent
DOT SYMBOL FORWARD AND RIGHT POINTING ARROW WHITE ON BLACK -- see AIGA 
black-on-white equivalent
DOT SYMBOL FORWARD POINTING ARROW WHITE ON BLACK -- see AIGA 
black-on-white equivalent
DOT SYMBOL FORWARD AND LEFT POINTING ARROW WHITE ON BLACK -- see AIGA 
black-on-white equivalent
DOT SYMBOL LEFT POINTING ARROW WHITE ON BLACK -- see AIGA black-on-white 
equivalent
DOT SYMBOL DOWNWARD AND LEFT POINTING ARROW WHITE ON BLACK -- see AIGA 
black-on-white equivalent
DOT SYMBOL DOWNWARD POINTING ARROW WHITE ON BLACK -- see AIGA 
black-on-white equivalent
DOT SYMBOL DOWNWARD AND RIGHT POINTING ARROW WHITE ON BLACK -- see AIGA 
black-on-white equivalent






Re: Suggestion for new dingbats/symbols

2013-05-29 Thread Neil Harris

On 29/05/13 15:56, Asmus Freytag wrote:

On 5/29/2013 1:39 AM, Andreas Stötzner wrote:


Am 29.05.2013 um 01:06 schrieb David Starner:


And what you'll run into is the fact that people don't agree that that
belongs in Unicode.



What Andreas was suggesting is rigorous study. I think that is a 
commendable suggestion.


The more interesting question is what aspects should such a study 
encompass, what are to be its starting points and what kind of 
conclusions should be possible after it is completed?


With better facts in hand it will be much easier to double-check 
whether currently-held assumptions about their relevance for encoding 
hold up or need revisiting. Without facts, this kind of discussion 
just deals in pre-conceived notions, and therefore adds little value.


A./



ISO have a technical committee, ISO/TC 145, that deals with graphical 
symbols, and a standard, ISO 7001, that defines a set of "public 
information symbols".


http://www.iso.org/iso/graphical-symbols_booklet.pdf

I can find several sets of public information symbols currently in 
public use: ISO 7001, DOT, AIGA, and ECOMO.


There appear to be

* 67 [?] U.S. Department of. Transportation public information pictograms
* 50 AIGA symbols: http://www.aiga.org/symbol-signs/
* 125 ECOMO symbols: http://www.ecomo.or.jp/english/picto_top.html

There seems to be quite a lot of overlap between all these sets.

The 125 ECOMO symbols chosen for unification and standardization were 
based on a study of around 1200 individual pictograms from 63 different 
sources: see 
http://www.accessibletourism.org/?i=enat.en.enat_projects_and_good_practices.307


I'm not sure how many ISO 7001 symbols there are, because the document 
is not freely publicly available: this press release: 
http://www.iso.org/iso/home/news_index/news_archive/news.htm?refid=Ref1097 
suggests that there were 79 of them in 2007.


Unlike universal pictogram schemes like the Noun Project, these public 
information symbols seem to me to be pretty good candidates for encoding 
right now. Not only are they capable of being referenced back to 
authoritative sources for their design and meaning, they are evidently 
in common use in signs. frequently uses in the same context as text, and 
in a text-like way. For example, an airport sign might contain the 
English word for something, the local word for the same thing, and the 
corresponding public information symbol, all side by side, like a 
Rosetta stone for pictograms. Moreover, I believe they have clear 
character-spirit: in many cases, there are different graphical forms of 
the same symbol from different sets that are manifestly the same 
semantic entity, in terms of both intended meaning and graphic design 
intent.


With a bit of unification there are probably less than 200 of these to 
encode, fewer than the emoji. Some of these can be identified with 
already encoded symbols used elsewhere, and there is also a natural way 
to use encoding to avoid the need to encode different semantic variants 
of the same symbol, using combining characters as modifiers to signify 
that the character following should be put in a warning shape, or 
overlaid with a circle and diagonal bar to symbolize prohibition.


Neil






Re: Suggestion for new dingbats/symbols

2013-05-28 Thread Neil Harris

On 26/05/13 23:37, Michael Everson wrote:

On 26 May 2013, at 23:15, David Starner  wrote:


Problems from Unicode generally come from of two places; compatibility with 
non-Unicode data sets, and people with different goals working on it. For 
pictographs, when Google comes forth saying this is the set we need supported, 
that was the set they needed supported for compatibility.

And then experts in symbols from Germany and Ireland said "We'll accept this 
incomplete set but we insist on some additions so that the set makes better sense." 
And that gave us what we have.

For instance, the Japanese telco sets had nearly, but not all, the animals in 
the Asian Zodiacs. The encoded set has all of them however. That's way better 
than it would have been had we just accepted Google's minimal request.


Goals can not be decided in an scientific way, and there are many people who 
have the goals for Unicode to support text, and not to support an arbitrary set 
of pictographs.

In my experience I have learned that splitters win out over lumpers. Often it 
takes a long time due to the stubbornness of the lumpers.

Michael Everson * http://www.evertype.com/






The Noun Project seem determined to create a pictogram for every noun, 
and many short phrases:


See http://blog.thenounproject.com/

I don't have any statistics about how far they've got to date, but I 
would imagine that this project is likely to generate at least some tens 
of thousands of pictograms, just in the near term.


-- N.





Re: Private Use Area

2013-02-18 Thread Neil Harris

On 18/02/13 18:09, William_J_G Overington wrote:

The first sentence of section 16.5 is as follows.

quote

Private-use characters are assigned Unicode code points whose interpretation is 
not specified by this standard and whose use may be determined by private 
agreement among cooperating users.

end quote

Suppose that there is a person, perhaps someone who is trying to produce a 
font, for whom the Private Use Areas is something that he or she has not 
encountered before he or she reads that sentence.

Is there ambiguity in that sentence such that such a person, a not unreasonable 
person, could ask himself or herself the following question?

Where can I find some cooperating users who will make a private agreement among 
themselves in order that I can make my font using code points from a Private 
Use Area?

William Overington

18 February 2013




It makes sense to implement something for yourseld first, before you 
start to recruit people to use it.


Produce a font, and some documentation for it. Publicise it. See who 
uses it.


You might want to try Twitter, Facebook, your own website, etc. etc. to 
do this.


There's your group of cooperating users.

-- Neil




Re: German »ß«

2013-02-17 Thread Neil Harris

On 17/02/13 10:48, Philippe Verdy wrote:

I was not citing empirical results but things that are regulated by legislation.
And your existing empirical results are just nfomal tests ignoring
important parts of the population of drivers, notably:
- those driving by night : the effet of some visual defects like
asygmatism, which is only partially corrected and which can only be
compensated by sufficient contrast (lowercase letters do not contrast
enough, because their strokes are too near of each other)
- the effect of presbytia on vision of aging population : here again
the size of letters does matter (look at those phones sold to ages
people: most of them are completely unable to use modern smartphones
for example, they are unreadable even with the best visual
correction), even if they wear "progressive glasses", they have a
reduced angle of good focusing, and if letters are too small, they
need to stop looking at the road to fix the displays on roads for
longer time. Every people above the age of 40 starts suffering this
visual defficiency where adaptation to vision depth is more difficult
and longer. larger letters that can be read easily even before there's
a full focus helps reducing the adaptation time.
- also by night, the effect of tireness also slows down the visual
adaptation and reduuces the angle of good focusing.
In all these cases, you need less density of strokes, and capital
letters are better constrasting.
Of course there are other factors like the effective constrast of
colors used on those displays, the negative impact of too narrow
fonts, insufficient intercharacter advance gaps, and insufficient
boldness.
Note that a perfect 10/10 vision (or better) is not mandatory to
drive, there are legal minimums where people with only 8/10 can
legally drive (and other visual defects are NOT tested at all, notably
the various forms of color blindness (mot cases being full or partial
deuteranopia, affecting about 1 on 6-8 male human in Europe, depending
on test methods : this is definitely not a small population).


Here are some excellent articles about the evidence-based approach that 
led to the development of current road signage in the United States.


http://www.nytimes.com/2007/08/12/magazine/12fonts-t.html?pagewanted=1&_r=0

and this, on research on the legibility of mixed/lower case vs. ALL CAPS:

http://www.microsoft.com/typography/ctfonts/WordRecognition.aspx

Regarding Clearview and older drivers, this:

http://deldot.gov/information/pubs_forms/manuals/de_mutcd/pdf/20080731061923147.pdf

is particularly interesting: the take-home quote is this:



The greatest improvement in legibility distance afforded by Clearview 
was realized by older drivers when viewed under headlamp illumination 
during nighttime conditions (an increase in legibility distance of 
between 6.0 percent and 6.8 percent)


-- Neil








Re: Mayan numerals

2012-08-16 Thread Neil Harris

On 16/08/12 16:42, Andreas Prilop wrote:

On Wed, 15 Aug 2012, Jameson Quinn wrote:


...
I'd like to see at least 20 glyphs for the (horizontal-barred) numerals.
...
Do others agree that it's needed?

Certainly not. Mayan numerals will disappear after 21 December 2012.




I think we can be confident they'll still be worth having, even in the 
14th b'ak'tun.


-- N.




Re: UTS46 "transitional period"

2011-06-29 Thread Neil Harris

On 29/06/11 23:25, Chris Weber wrote:

On 6/28/2011 11:20 PM, Peter Krefting wrote:

Den 2011-06-29 04:41:22 skrev Chris Weber :

I was trying to understand the implementation differences in some 
browsers and registrars. Using your example from UTS46 
http://xn--fa-hia.de/


Opera - error, doesn't resolve


We have recently implemented support for IDNA 2008 internally, but it 
was not finished in time for the just recently released Opera 11.50. 
Our latest greatest internal test build resolves that URL just fine 
(not that there is much interesting to see).




Thanks for the responses Mark and Peter.   So do you happen to know if 
Chrome, Safari, and Firefox are implementing IDNA2008 already or if 
they punycode example resolves under their implementation of IDNA2003 
because of the "murky" processing rules?


Best regards,
Chris




Firefox currently has an open bug for IDNA 2008 implementation:

https://bugzilla.mozilla.org/show_bug.cgi?id=479520

-- Neil




Re: Why not just change the glyph of 20A8 RUPEE SIGN?

2010-07-28 Thread Neil Harris

On 28/07/10 10:22, Shriramana Sharma wrote:
This is a somewhat late mail and please forgive me if this question 
has already been asked but there are too many hits for 20A8 Rupee when 
searching through the lists. Isn't it possible to just change the 
glyph of 20A8? To my knowledge, few people actually use 20A8 to 
display the existing representative glyph.


The Rupee sign is what it is -- the sign of India's currency. The 
character 20A8 encodes that sign. If the Indian Govt decides to change 
the glyph (in fact it is only now *deciding* on a glyph) then the same 
character should be maintained with only the glyph changed, no?




Unfortunately, that would have undesirable side effects, since there's 
more than one currency called the "rupee", and presumably some 
significant number of documents using that symbol: changing the generic 
rupee sign to look like the Indian rupee sign would make them all 
appear, misleadingly, to refer to the Indian rupee.


There's also the issue of canonical decomposition: the "₨" sign clearly 
looks like LATIN CAPITAL LETTER R, LATIN SMALL LETTER S -- the Indian 
rupee sign does not like anything of the sort.


-- Neil