Re: Unicode "no-op" Character?

2019-06-22 Thread Rebecca T via Unicode
Perhaps a codepoint from a private use area and another processing step to add/ remove them would work for you? On Sat, Jun 22, 2019, 1:39 AM Mark Davis ☕️ via Unicode wrote: > There nothing like what you are describing. Examples: > >1. Display — There are a few of the Default Ignorables tha

Re: Correct way to express in English that a string is encoded ... using UTF-8 ... with UTF-8 ... in UTF-8?

2019-05-15 Thread Rebecca T via Unicode
I think that colloquially “the file contains a UTF-8 string” is best, but perhaps not in more formal communications. On Wed, May 15, 2019, 7:24 AM Costello, Roger L. via Unicode < unicode@unicode.org> wrote: > Hello Unicode experts! > > Which is correct: > > (a) The input file contains a string.

Re: Reminder Ribbon U+1F397

2018-06-23 Thread Rebecca T via Unicode
> > Perhaps a larger set of ribbon characters, with defined colors for > each, is called for? [image: crying pointing gun.jpg] But uh, seriously, the ribbon was encoded in the Great Wingdings and Webdings Migration of 2011 (see L2/12-368 p.21) a

Re: Can NFKC turn valid UAX 31 identifiers into non-identifiers?

2018-06-04 Thread Rebecca T via Unicode
I think that the benefits of inclusion from allowing non-ASCII identifiers far outweigh any corner cases this might cause. (Although ironing out and analyzing those is of course important, I don’t think they should be obstacles for implementing this kind of thing.) Something I’d love to see is tra

Re: Unicode education in UK Schools

2017-07-07 Thread Rebecca T via Unicode
> That might be a good thing. Yeah. Very seriously, it’s very important to introduce Unicode early on in CS education, even in a “hey, it’s not OK to exclude people who don’t speak English or people whose names have diacritics from using the programs you create” sort of way. Ignorance and apathy

Re: unihan-etl: create exports of UNIHAN db to csv, json and yaml

2017-05-30 Thread Rebecca T via Unicode
Oh, thank god. I’ve wanted something like this for ages, but I’ve been too lazy to invest the time to create a serious tool — I’ve used a lot of messy one-time regular expressions. Will definitely be starring your repo!

Re: Petition to ban Google from designing emoji

2017-05-18 Thread Rebecca T via Unicode
Nick, Don’t even joke. Sincerely, — Rebecca On Thu, May 18, 2017 at 2:53 PM, Phake Nick via Unicode wrote: > Is it possible to introduce variation selector for emoji with large design > variation among vendors so that when users send emoji with selectors their > variation among vendors can be

Re: Petition to ban Google from designing emoji

2017-05-18 Thread Rebecca T via Unicode
Well, you’re certainly not alone in your distaste for the new design. @eevee just today said “cool how we improved gender diversity by slowly changing from ‘ambiguous/neutral’ to ‘explicit color-coded binary, default usually

Re: Proposal to add standardized variation sequences for chess notation

2017-04-07 Thread Rebecca T
> while evidently there are users who need to send BROCCOLI to one another, > nobody but nobody needs to send an 8 x 8 chessboard matrix in a tweet. Get > it? I simply must disagree; sending a textual chessboard sounds awesome! A twitter bot that plays chess with you and shows you a graphical repr

Re: PETSCII mapping?

2017-04-06 Thread Rebecca T
Count me in! I’m partial for one large unified proposal, FWIW. On Thu, Apr 6, 2017 at 2:24 PM, Rebecca Bettencourt wrote: > On Thu, Apr 6, 2017 at 10:43 AM, Doug Ewell wrote: > >> Michael Everson wrote: >> >> > Everybody interested, raise your hand… >> >> I'm in. > > > I'm in as well of cours

Re: PETSCII mapping?

2017-04-06 Thread Rebecca T
Here’s a copy of the Teletext character set; it includes box-drawing characters for all combinations of a 2×3 grid of cells. 2⁶ = 64 characters, so we might need a new block. [1]: http://www.galax.xyz/TELETEXT/CHARSET.HTM

Re: PETSCII mapping?

2017-04-05 Thread Rebecca T
The Wikipedia page for PETSCII [1] only marks 20 characters as not having Unicode equivalents; 2px (light) and 3px (heavy) horizontal and vertical bars at various non-center positions, diagonal shading characters, and corner characters. I’ve done some processing to the table on [1] to filter out t

Re: Coloured Punctuation and Annotation

2017-04-05 Thread Rebecca T
> The hieroglyphs don't have the emoji property What does the emoji property mean, semantically? That the codepoint represents a pictograph or that vendors have “permission” to give it a colored, stylized representation? If we go with the first, then hieroglyphs should certainly be emoji. Although

Re: PETSCII mapping?

2017-04-05 Thread Rebecca T
> If there's a credible need to convert files between Unicode-based systems and > those using PETSCII There is! It’s called “sharing textual information” and it’s how our society functions. Can we afford to blithely abandon data from the best selling computer in history [1] because nobody cared to

Re: Proposal to add standardized variation sequences for chess notation

2017-04-01 Thread Rebecca T
> No chess symbols, encoded or proposed, are emoji, nor should they be. Except on Samsung .

Re: New tool unidump

2017-03-19 Thread Rebecca T
I maintain a list of various Unicode tools and resources at unicode.yea.rs and always welcome new additions! On Sat, Mar 18, 2017 at 1:42 AM, Janusz S. Bien wrote: > Quote/Cytat - Manuel Strehl (Fri 17 Mar 2017 > 09:44:15 PM CET): > > Hi, >> >> for my work on codepoints.net and Emojipedia I

Combining solidus above for transcription of poetic meter

2017-03-17 Thread Rebecca T
When transcribing poetic meter (scansion ), it is common to use two symbols above the line (usually a breve [U+306 ̆] for stressed syllables and a solidus / slash [U+2F /] for unstressed syllables) to indicate stress patterns. Ex: ̆/ ̆ / ̆

Re: Emoji Packs

2016-12-22 Thread Rebecca T
at, do I just run process.py? > > I'm so sorry for being dumb, but thanks again! > > Chris > > On Wed, Dec 21, 2016 at 12:07 PM, Rebecca T <637...@gmail.com> wrote: > >> Okay, I threw something together. >> >> github.com/years/emoji has all 18,615 ima

Re: Emoji Packs

2016-12-21 Thread Rebecca T
better way to organize them, please let me know! On Wed, Dec 21, 2016 at 10:28 AM, Chris Monteleone wrote: > "Unfamiliar" would be an understatement. If you feel like putting that > together it would be appreciated, but no pressure. > > Thank you! > > On Tue, Dec 20

Re: a character for an unknown character

2016-12-21 Thread Rebecca T
U+FFFD REPLACEMENT CHARACTER � On Wed, Dec 21, 2016 at 3:05 AM Philippe Verdy wrote: > there's a "replacement" control, whose rendering is undefined. It may > represent any missing part covering more than one character, such as parts > that have been burned, or overstrikken. This Unicode charact

Re: Emoji end goal

2016-10-12 Thread Rebecca T
Agreed. I think a good response to “that’d _double_ the codepoints, so we should just add a ligature” is “if it would be such a burden to implement that you don’t want to use space in the charts for what are, fundamentally, hundreds of *semantically different* ideographs, why are we dumping that bu

Re: Emoji end goal

2016-10-12 Thread Rebecca T
Sure, and kanji have romanisations but that doesn’t make the latin alphabet language neutral. And yes, emoji were supposed to be language neutral but all the implementers made them default to male. I think you have an *argument* with skin-tone neutrality but I think you’d be hard-pressed to find an

Re: Emoji end goal

2016-10-12 Thread Rebecca T
Well, I think it’s definitely important to have representation and expression for people of all skin tones and genders even in things like emoji. I think we’re rapidly reaching a limit for variation sequences, and I’m personally not begging for hair color modifiers (although I would welcome them).