The problem is the word boundary (\b). There's no word boundary next to a
mathematical symbol. Try this one instead:
replace_tag N1 (?:1|l|\xf0\x9d\x9f\x8f)
replace_tag DIGIT (?:[0-9Ol]|\xf0\x9d\x9f[\x8e-\x97])
replace_rules OB_PHONE_S
body OB_PHONE_S /(?<!\d)(?:<N1>[^a-zA-Z0-9]*)?<DIGIT>{3}[^a-zA-Z0-9]+<DIGIT>{3}[^a-zA-Z0-9]+<DIGIT>{4}(?!\d)/
This should detect US phone numbers in various formats:
1 (800) 555-1212
1-800-555-1212
800.555.1212
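The rule can be tried outside SpamAssassin with a rough Python sketch. It assumes the body is matched as raw UTF-8 bytes and that the <N1> and <DIGIT> tags are substituted into the pattern as plain text; the names SEP, to_math_bold and BOUNDED are made up for this illustration and are not part of any SpamAssassin API.

```python
import re

# Byte-level expansions of the <N1> and <DIGIT> tags (an assumption about
# how the tags are substituted into the rule).
N1 = rb"(?:1|l|\xf0\x9d\x9f\x8f)"
DIGIT = rb"(?:[0-9Ol]|\xf0\x9d\x9f[\x8e-\x97])"
SEP = rb"[^a-zA-Z0-9]"  # hypothetical helper name for the separator class

OB_PHONE_S = re.compile(
    rb"(?<!\d)(?:" + N1 + SEP + rb"*)?"
    + DIGIT + rb"{3}" + SEP + rb"+"
    + DIGIT + rb"{3}" + SEP + rb"+"
    + DIGIT + rb"{4}(?!\d)"
)

def to_math_bold(text: str) -> bytes:
    """Swap ASCII digits for MATHEMATICAL BOLD digits and encode as UTF-8."""
    bold = {ord(str(d)): chr(0x1D7CE + d) for d in range(10)}
    return text.translate(bold).encode("utf-8")

samples = ["1 (800) 555-1212", "1-800-555-1212", "800.555.1212"]
for sample in samples:
    plain_hit = OB_PHONE_S.search(sample.encode("utf-8")) is not None
    bold_hit = OB_PHONE_S.search(to_math_bold(sample)) is not None
    print(sample, plain_hit, bold_hit)

# A \b-anchored pattern finds the ASCII digits but not the bold ones: in
# byte mode every byte of a bold digit is a non-word character, so there
# is no word/non-word transition for \b to match.
BOUNDED = re.compile(rb"\b" + DIGIT + rb"{3}")
print(BOUNDED.search(b"call 800") is not None)
print(BOUNDED.search(to_math_bold("call 800")) is not None)
```

If the matching engine treated the body as decoded Unicode text instead of bytes, \b would behave differently, so the last two lines only illustrate the byte-mode case.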
Caveats:
1. This will also fire on non-obfuscated phone numbers.
2. This will not fire on obfuscated phone numbers if they use any symbols other
than MATHEMATICAL BOLD digits (U+1D7CE to U+1D7D7).
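Both caveats can be reproduced with a small Python sketch, again assuming byte-level matching. ALL_MATH is a hypothetical widening of <DIGIT>, not something proposed in this thread: it relies on the fact that all fifty mathematical digits (U+1D7CE to U+1D7FF) share the UTF-8 prefix F0 9D 9F and differ only in the last byte (8E to BF). The helper names phone_rule and restyle are also made up.

```python
import re

N1 = rb"(?:1|l|\xf0\x9d\x9f\x8f)"
BOLD_ONLY = rb"(?:[0-9Ol]|\xf0\x9d\x9f[\x8e-\x97])"  # <DIGIT> as posted
ALL_MATH = rb"(?:[0-9Ol]|\xf0\x9d\x9f[\x8e-\xbf])"   # hypothetical widening

def phone_rule(digit: bytes):
    """Build the phone pattern around a given <DIGIT> expansion."""
    sep = rb"[^a-zA-Z0-9]"
    return re.compile(
        rb"(?<!\d)(?:" + N1 + sep + rb"*)?"
        + digit + rb"{3}" + sep + rb"+"
        + digit + rb"{3}" + sep + rb"+"
        + digit + rb"{4}(?!\d)"
    )

def restyle(text: str, zero_code_point: int) -> bytes:
    """Replace ASCII digits with the digit block starting at zero_code_point."""
    table = {ord(str(d)): chr(zero_code_point + d) for d in range(10)}
    return text.translate(table).encode("utf-8")

plain = b"800-555-1212"
double_struck = restyle("800-555-1212", 0x1D7D8)  # MATHEMATICAL DOUBLE-STRUCK

# Caveat 1: an ordinary ASCII number matches as well.
print(phone_rule(BOLD_ONLY).search(plain) is not None)
# Caveat 2: double-struck digits fall outside the 8E-97 byte range.
print(phone_rule(BOLD_ONLY).search(double_struck) is not None)
# The widened class covers every mathematical digit style.
print(phone_rule(ALL_MATH).search(double_struck) is not None)
```

The widened class still misses every homoglyph outside the mathematical digit block, such as the fullwidth and superscript digits in the attached list.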
See attached for a more complete list of homoglyphs
Thanks
Kent
On Thu, Apr 3, 2025 at 02:43 AM, [email protected] wrote:
On 4/3/25 8:04 AM, Loren Wilton wrote:
Well, this is very strange and I don't know what is going on. I almost suspect
some sort of bug in the regex processor in SA.
replace_tag N1 (?:1|l|\xf0\x9d\x9f\x8f)
replace_tag DIGIT (?:[0-9Ol]|\xf0\x9d\x9f[\x8e-\x97])
replace_rules OB_PHONE_TEST4 OB_PHONE_TEST5 OB_PHONE_TEST6
body OB_PHONE_TEST4 /\b(?:\+?\s?<N1>\s?)?\(?<DIGIT>{3}\)?[-\s]{0,3}<DIGIT>{3}[-\s]{0,3}\b/
body OB_PHONE_TEST5 /\b(?:\+?\s?<N1>\s?)?\(?<DIGIT>{3}\)?[\s-]{0,3}<DIGIT>{3}[\s-]{0,3}<DIGIT>{4}/
body OB_PHONE_TEST6 /\(?<DIGIT>{3}\)?[\s-]{0,3}<DIGIT>{3}[\s-]{0,3}<DIGIT>{4}/
Rules 4 and 6 match. Rule 5, which is the complete match, does not. I have no
idea why.
I was getting the same results using your rule form before I simplified things
a bit. Partial overlapping matches work; a complete match does not. The
complete match DOES work if the phone number is in ASCII, but not if any digit
is Unicode.
Actually, OB_PHONE_TEST4 matches on "INV-854113" and OB_PHONE_TEST6 matches on
"andr9202822840@caosusaoviet[.]vn"; the regexps don't seem to work at all.
Giovanni
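These two hits are what the patterns would be expected to produce, as the following Python sketch suggests. It assumes the test rules read as below once the regex escapes are written out in full and the tags are expanded at byte level; STRICT is a made-up name for the OB_PHONE_S pattern from earlier in the thread.

```python
import re

N1 = rb"(?:1|l|\xf0\x9d\x9f\x8f)"
DIGIT = rb"(?:[0-9Ol]|\xf0\x9d\x9f[\x8e-\x97])"

# OB_PHONE_TEST4: only two groups of three digits, and separators are optional.
TEST4 = re.compile(
    rb"\b(?:\+?\s?" + N1 + rb"\s?)?\(?" + DIGIT + rb"{3}\)?[-\s]{0,3}"
    + DIGIT + rb"{3}[-\s]{0,3}\b"
)
# OB_PHONE_TEST6: no anchoring at either end, and separators are optional.
TEST6 = re.compile(
    rb"\(?" + DIGIT + rb"{3}\)?[\s-]{0,3}" + DIGIT + rb"{3}[\s-]{0,3}"
    + DIGIT + rb"{4}"
)
# OB_PHONE_S: digit lookarounds plus at least one separator between groups.
STRICT = re.compile(
    rb"(?<!\d)(?:" + N1 + rb"[^a-zA-Z0-9]*)?"
    + DIGIT + rb"{3}[^a-zA-Z0-9]+"
    + DIGIT + rb"{3}[^a-zA-Z0-9]+"
    + DIGIT + rb"{4}(?!\d)"
)

invoice = b"INV-854113"
address = b"andr9202822840@caosusaoviet[.]vn"

print(TEST4.search(invoice).group())   # the six digits after "INV-"
print(TEST6.search(address).group())   # the ten digits in the local part
print(STRICT.search(invoice), STRICT.search(address))
```

In this sketch the false positives come from the optional separators ({0,3} allows zero) rather than from the Unicode handling, which is why a run of six or ten plain digits is enough to match.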
U+07C0 \xDF\x80 ߀ NKO DIGIT ZERO
U+0966 \xE0\xA5\xA6 ० DEVANAGARI DIGIT ZERO
U+09E6 \xE0\xA7\xA6 ০ BENGALI DIGIT ZERO
U+0A66 \xE0\xA9\xA6 ੦ GURMUKHI DIGIT ZERO
U+0AE6 \xE0\xAB\xA6 ૦ GUJARATI DIGIT ZERO
U+0B66 \xE0\xAD\xA6 ୦ ORIYA DIGIT ZERO
U+0BE6 \xE0\xAF\xA6 ௦ TAMIL DIGIT ZERO
U+0C66 \xE0\xB1\xA6 ౦ TELUGU DIGIT ZERO
U+0CE6 \xE0\xB3\xA6 ೦ KANNADA DIGIT ZERO
U+0D66 \xE0\xB5\xA6 ൦ MALAYALAM DIGIT ZERO
U+0E50 \xE0\xB9\x90 ๐ THAI DIGIT ZERO
U+0ED0 \xE0\xBB\x90 ໐ LAO DIGIT ZERO
U+1040 \xE1\x81\x80 ၀ MYANMAR DIGIT ZERO
U+2070 \xE2\x81\xB0 ⁰ SUPERSCRIPT ZERO
U+2080 \xE2\x82\x80 ₀ SUBSCRIPT ZERO
U+3007 \xE3\x80\x87 〇 IDEOGRAPHIC NUMBER ZERO
U+3358 \xE3\x8D\x98 ㍘ IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR ZERO
U+FF10 \xEF\xBC\x90 ０ FULLWIDTH DIGIT ZERO
U+1D7CE \xF0\x9D\x9F\x8E 𝟎 MATHEMATICAL BOLD DIGIT ZERO
U+1D7D8 \xF0\x9D\x9F\x98 𝟘 MATHEMATICAL DOUBLE-STRUCK DIGIT ZERO
U+1D7E2 \xF0\x9D\x9F\xA2 𝟢 MATHEMATICAL SANS-SERIF DIGIT ZERO
U+1D7EC \xF0\x9D\x9F\xAC 𝟬 MATHEMATICAL SANS-SERIF BOLD DIGIT ZERO
U+1D7F6 \xF0\x9D\x9F\xB6 𝟶 MATHEMATICAL MONOSPACE DIGIT ZERO
U+1FBF0 \xF0\x9F\xAF\xB0 🯰 SEGMENTED DIGIT ZERO
U+00B9 \xC2\xB9 ¹ SUPERSCRIPT ONE
U+2081 \xE2\x82\x81 ₁ SUBSCRIPT ONE
U+32C0 \xE3\x8B\x80 ㋀ IDEOGRAPHIC TELEGRAPH SYMBOL FOR JANUARY
U+3359 \xE3\x8D\x99 ㍙ IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR ONE
U+33E0 \xE3\x8F\xA0 ㏠ IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY ONE
U+FF11 \xEF\xBC\x91 １ FULLWIDTH DIGIT ONE
U+1D7CF \xF0\x9D\x9F\x8F 𝟏 MATHEMATICAL BOLD DIGIT ONE
U+1D7D9 \xF0\x9D\x9F\x99 𝟙 MATHEMATICAL DOUBLE-STRUCK DIGIT ONE
U+1D7E3 \xF0\x9D\x9F\xA3 𝟣 MATHEMATICAL SANS-SERIF DIGIT ONE
U+1D7ED \xF0\x9D\x9F\xAD 𝟭 MATHEMATICAL SANS-SERIF BOLD DIGIT ONE
U+1D7F7 \xF0\x9D\x9F\xB7 𝟷 MATHEMATICAL MONOSPACE DIGIT ONE
U+1FBF1 \xF0\x9F\xAF\xB1 🯱 SEGMENTED DIGIT ONE
U+00B2 \xC2\xB2 ² SUPERSCRIPT TWO
U+01A7 \xC6\xA7 Ƨ LATIN CAPITAL LETTER TONE TWO
U+01A8 \xC6\xA8 ƨ LATIN SMALL LETTER TONE TWO
U+01BB \xC6\xBB ƻ LATIN LETTER TWO WITH STROKE
U+0242 \xC9\x82 ɂ LATIN SMALL LETTER GLOTTAL STOP
U+03E8 \xCF\xA8 Ϩ COPTIC CAPITAL LETTER HORI
U+03E9 \xCF\xA9 ϩ COPTIC SMALL LETTER HORI
U+0536 \xD4\xB6 Զ ARMENIAN CAPITAL LETTER ZA
U+14BF \xE1\x92\xBF ᒿ CANADIAN SYLLABICS SAYISI M
U+1616 \xE1\x98\x96 ᘖ CANADIAN SYLLABICS CARRIER JO
U+2082 \xE2\x82\x82 ₂ SUBSCRIPT TWO
U+32C1 \xE3\x8B\x81 ㋁ IDEOGRAPHIC TELEGRAPH SYMBOL FOR FEBRUARY
U+335A \xE3\x8D\x9A ㍚ IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR TWO
U+33E1 \xE3\x8F\xA1 ㏡ IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY TWO
U+A644 \xEA\x99\x84 Ꙅ CYRILLIC CAPITAL LETTER REVERSED DZE
U+A75A \xEA\x9D\x9A Ꝛ LATIN CAPITAL LETTER R ROTUNDA
U+FF12 \xEF\xBC\x92 ２ FULLWIDTH DIGIT TWO
U+1D7D0 \xF0\x9D\x9F\x90 𝟐 MATHEMATICAL BOLD DIGIT TWO
U+1D7DA \xF0\x9D\x9F\x9A 𝟚 MATHEMATICAL DOUBLE-STRUCK DIGIT TWO
U+1D7E4 \xF0\x9D\x9F\xA4 𝟤 MATHEMATICAL SANS-SERIF DIGIT TWO
U+1D7EE \xF0\x9D\x9F\xAE 𝟮 MATHEMATICAL SANS-SERIF BOLD DIGIT TWO
U+1D7F8 \xF0\x9D\x9F\xB8 𝟸 MATHEMATICAL MONOSPACE DIGIT TWO
U+1FBF2 \xF0\x9F\xAF\xB2 🯲 SEGMENTED DIGIT TWO
U+00B3 \xC2\xB3 ³ SUPERSCRIPT THREE
U+018E \xC6\x8E Ǝ LATIN CAPITAL LETTER REVERSED E
U+01B7 \xC6\xB7 Ʒ LATIN CAPITAL LETTER EZH
U+01EE \xC7\xAE Ǯ LATIN CAPITAL LETTER EZH WITH CARON
U+01EF \xC7\xAF ǯ LATIN SMALL LETTER EZH WITH CARON
U+021C \xC8\x9C Ȝ LATIN CAPITAL LETTER YOGH
U+021D \xC8\x9D ȝ LATIN SMALL LETTER YOGH
U+025C \xC9\x9C ɜ LATIN SMALL LETTER REVERSED OPEN E
U+025D \xC9\x9D ɝ LATIN SMALL LETTER REVERSED OPEN E WITH HOOK
U+0292 \xCA\x92 ʒ LATIN SMALL LETTER EZH
U+0293 \xCA\x93 ʓ LATIN SMALL LETTER EZH WITH CURL
U+0417 \xD0\x97 З CYRILLIC CAPITAL LETTER ZE
U+0437 \xD0\xB7 з CYRILLIC SMALL LETTER ZE
U+0498 \xD2\x98 Ҙ CYRILLIC CAPITAL LETTER ZE WITH DESCENDER
U+0499 \xD2\x99 ҙ CYRILLIC SMALL LETTER ZE WITH DESCENDER
U+04DE \xD3\x9E Ӟ CYRILLIC CAPITAL LETTER ZE WITH DIAERESIS
U+04DF \xD3\x9F ӟ CYRILLIC SMALL LETTER ZE WITH DIAERESIS
U+04E0 \xD3\xA0 Ӡ CYRILLIC CAPITAL LETTER ABKHASIAN DZE
U+04E1 \xD3\xA1 ӡ CYRILLIC SMALL LETTER ABKHASIAN DZE
U+04EC \xD3\xAC Ӭ CYRILLIC CAPITAL LETTER E WITH DIAERESIS
U+04ED \xD3\xAD ӭ CYRILLIC SMALL LETTER E WITH DIAERESIS
U+0545 \xD5\x85 Յ ARMENIAN CAPITAL LETTER YI
U+19A1 \xE1\xA6\xA1 ᦡ NEW TAI LUE LETTER HIGH DA
U+19A4 \xE1\xA6\xA4 ᦤ NEW TAI LUE LETTER LOW DA
U+1D4C \xE1\xB5\x8C ᵌ MODIFIER LETTER SMALL TURNED OPEN E
U+1D9F \xE1\xB6\x9F ᶟ MODIFIER LETTER SMALL REVERSED OPEN E
U+1DBE \xE1\xB6\xBE ᶾ MODIFIER LETTER SMALL EZH
U+2083 \xE2\x82\x83 ₃ SUBSCRIPT THREE
U+2128 \xE2\x84\xA8 ℨ BLACK-LETTER CAPITAL Z
U+2C9C \xE2\xB2\x9C Ⲝ COPTIC CAPITAL LETTER KSI
U+2C9D \xE2\xB2\x9D ⲝ COPTIC SMALL LETTER KSI
U+2CC4 \xE2\xB3\x84 Ⳅ COPTIC CAPITAL LETTER OLD COPTIC SHEI
U+2CC5 \xE2\xB3\x85 ⳅ COPTIC SMALL LETTER OLD COPTIC SHEI
U+2CCC \xE2\xB3\x8C Ⳍ COPTIC CAPITAL LETTER OLD COPTIC HORI
U+2D3A \xE2\xB4\xBA ⴺ TIFINAGH LETTER YADDH
U+32C2 \xE3\x8B\x82 ㋂ IDEOGRAPHIC TELEGRAPH SYMBOL FOR MARCH
U+335B \xE3\x8D\x9B ㍛ IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR THREE
U+33E2 \xE3\x8F\xA2 ㏢ IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY THREE
U+A76A \xEA\x9D\xAA Ꝫ LATIN CAPITAL LETTER ET
U+FF13 \xEF\xBC\x93 ３ FULLWIDTH DIGIT THREE
U+1D537 \xF0\x9D\x94\xB7 𝔷 MATHEMATICAL FRAKTUR SMALL Z
U+1D585 \xF0\x9D\x96\x85 𝖅 MATHEMATICAL BOLD FRAKTUR CAPITAL Z
U+1D59F \xF0\x9D\x96\x9F 𝖟 MATHEMATICAL BOLD FRAKTUR SMALL Z
U+1D7D1 \xF0\x9D\x9F\x91 𝟑 MATHEMATICAL BOLD DIGIT THREE
U+1D7DB \xF0\x9D\x9F\x9B 𝟛 MATHEMATICAL DOUBLE-STRUCK DIGIT THREE
U+1D7E5 \xF0\x9D\x9F\xA5 𝟥 MATHEMATICAL SANS-SERIF DIGIT THREE
U+1D7EF \xF0\x9D\x9F\xAF 𝟯 MATHEMATICAL SANS-SERIF BOLD DIGIT THREE
U+1D7F9 \xF0\x9D\x9F\xB9 𝟹 MATHEMATICAL MONOSPACE DIGIT THREE
U+1FBF3 \xF0\x9F\xAF\xB3 🯳 SEGMENTED DIGIT THREE
U+03E4 \xCF\xA4 Ϥ COPTIC CAPITAL LETTER FEI
U+03E5 \xCF\xA5 ϥ COPTIC SMALL LETTER FEI
U+0447 \xD1\x87 ч CYRILLIC SMALL LETTER CHE
U+04B8 \xD2\xB8 Ҹ CYRILLIC CAPITAL LETTER CHE WITH VERTICAL STROKE
U+04B9 \xD2\xB9 ҹ CYRILLIC SMALL LETTER CHE WITH VERTICAL STROKE
U+10AF \xE1\x82\xAF Ⴏ GEORGIAN CAPITAL LETTER ZHAR
U+13CE \xE1\x8F\x8E Ꮞ CHEROKEE LETTER SE
U+2074 \xE2\x81\xB4 ⁴ SUPERSCRIPT FOUR
U+2084 \xE2\x82\x84 ₄ SUBSCRIPT FOUR
U+2D0F \xE2\xB4\x8F ⴏ GEORGIAN SMALL LETTER ZHAR
U+32C3 \xE3\x8B\x83 ㋃ IDEOGRAPHIC TELEGRAPH SYMBOL FOR APRIL
U+335C \xE3\x8D\x9C ㍜ IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR FOUR
U+33E3 \xE3\x8F\xA3 ㏣ IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY FOUR
U+AB9E \xEA\xAE\x9E ꮞ CHEROKEE SMALL LETTER SE
U+FF14 \xEF\xBC\x94 ４ FULLWIDTH DIGIT FOUR
U+10A30 \xF0\x90\xA8\xB0 𐨰 KHAROSHTHI LETTER ZA
U+1D7D2 \xF0\x9D\x9F\x92 𝟒 MATHEMATICAL BOLD DIGIT FOUR
U+1D7DC \xF0\x9D\x9F\x9C 𝟜 MATHEMATICAL DOUBLE-STRUCK DIGIT FOUR
U+1D7E6 \xF0\x9D\x9F\xA6 𝟦 MATHEMATICAL SANS-SERIF DIGIT FOUR
U+1D7F0 \xF0\x9D\x9F\xB0 𝟰 MATHEMATICAL SANS-SERIF BOLD DIGIT FOUR
U+1D7FA \xF0\x9D\x9F\xBA 𝟺 MATHEMATICAL MONOSPACE DIGIT FOUR
U+1FBF4 \xF0\x9F\xAF\xB4 🯴 SEGMENTED DIGIT FOUR
U+01BC \xC6\xBC Ƽ LATIN CAPITAL LETTER TONE FIVE
U+2075 \xE2\x81\xB5 ⁵ SUPERSCRIPT FIVE
U+2085 \xE2\x82\x85 ₅ SUBSCRIPT FIVE
U+32C4 \xE3\x8B\x84 ㋄ IDEOGRAPHIC TELEGRAPH SYMBOL FOR MAY
U+335D \xE3\x8D\x9D ㍝ IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR FIVE
U+33E4 \xE3\x8F\xA4 ㏤ IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY FIVE
U+FF15 \xEF\xBC\x95 ５ FULLWIDTH DIGIT FIVE
U+1D7D3 \xF0\x9D\x9F\x93 𝟓 MATHEMATICAL BOLD DIGIT FIVE
U+1D7DD \xF0\x9D\x9F\x9D 𝟝 MATHEMATICAL DOUBLE-STRUCK DIGIT FIVE
U+1D7E7 \xF0\x9D\x9F\xA7 𝟧 MATHEMATICAL SANS-SERIF DIGIT FIVE
U+1D7F1 \xF0\x9D\x9F\xB1 𝟱 MATHEMATICAL SANS-SERIF BOLD DIGIT FIVE
U+1D7FB \xF0\x9D\x9F\xBB 𝟻 MATHEMATICAL MONOSPACE DIGIT FIVE
U+1FBF5 \xF0\x9F\xAF\xB5 🯵 SEGMENTED DIGIT FIVE
U+03EC \xCF\xAC Ϭ COPTIC CAPITAL LETTER SHIMA
U+03ED \xCF\xAD ϭ COPTIC SMALL LETTER SHIMA
U+0431 \xD0\xB1 б CYRILLIC SMALL LETTER BE
U+0573 \xD5\xB3 ճ ARMENIAN SMALL LETTER CHEH
U+13EE \xE1\x8F\xAE Ꮾ CHEROKEE LETTER WV
U+19C8 \xE1\xA7\x88 ᧈ NEW TAI LUE TONE MARK-1
U+2076 \xE2\x81\xB6 ⁶ SUPERSCRIPT SIX
U+2086 \xE2\x82\x86 ₆ SUBSCRIPT SIX
U+2CD2 \xE2\xB3\x92 Ⳓ COPTIC CAPITAL LETTER OLD COPTIC HEI
U+32C5 \xE3\x8B\x85 ㋅ IDEOGRAPHIC TELEGRAPH SYMBOL FOR JUNE
U+335E \xE3\x8D\x9E ㍞ IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR SIX
U+33E5 \xE3\x8F\xA5 ㏥ IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY SIX
U+ABBE \xEA\xAE\xBE ꮾ CHEROKEE SMALL LETTER WV
U+FF16 \xEF\xBC\x96 ６ FULLWIDTH DIGIT SIX
U+1041E \xF0\x90\x90\x9E 𐐞 DESERET CAPITAL LETTER ZEE
U+10446 \xF0\x90\x91\x86 𐑆 DESERET SMALL LETTER ZEE
U+1D7D4 \xF0\x9D\x9F\x94 𝟔 MATHEMATICAL BOLD DIGIT SIX
U+1D7DE \xF0\x9D\x9F\x9E 𝟞 MATHEMATICAL DOUBLE-STRUCK DIGIT SIX
U+1D7E8 \xF0\x9D\x9F\xA8 𝟨 MATHEMATICAL SANS-SERIF DIGIT SIX
U+1D7F2 \xF0\x9D\x9F\xB2 𝟲 MATHEMATICAL SANS-SERIF BOLD DIGIT SIX
U+1D7FC \xF0\x9D\x9F\xBC 𝟼 MATHEMATICAL MONOSPACE DIGIT SIX
U+1E031 \xF0\x9E\x80\xB1 𞀱 MODIFIER LETTER CYRILLIC SMALL BE
U+1E052 \xF0\x9E\x81\x92 𞁒 CYRILLIC SUBSCRIPT SMALL LETTER BE
U+1FBF6 \xF0\x9F\xAF\xB6 🯶 SEGMENTED DIGIT SIX
U+2077 \xE2\x81\xB7 ⁷ SUPERSCRIPT SEVEN
U+2087 \xE2\x82\x87 ₇ SUBSCRIPT SEVEN
U+32C6 \xE3\x8B\x86 ㋆ IDEOGRAPHIC TELEGRAPH SYMBOL FOR JULY
U+335F \xE3\x8D\x9F ㍟ IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR SEVEN
U+33E6 \xE3\x8F\xA6 ㏦ IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY SEVEN
U+FF17 \xEF\xBC\x97 ７ FULLWIDTH DIGIT SEVEN
U+1D7D5 \xF0\x9D\x9F\x95 𝟕 MATHEMATICAL BOLD DIGIT SEVEN
U+1D7DF \xF0\x9D\x9F\x9F 𝟟 MATHEMATICAL DOUBLE-STRUCK DIGIT SEVEN
U+1D7E9 \xF0\x9D\x9F\xA9 𝟩 MATHEMATICAL SANS-SERIF DIGIT SEVEN
U+1D7F3 \xF0\x9D\x9F\xB3 𝟳 MATHEMATICAL SANS-SERIF BOLD DIGIT SEVEN
U+1D7FD \xF0\x9D\x9F\xBD 𝟽 MATHEMATICAL MONOSPACE DIGIT SEVEN
U+1FBF7 \xF0\x9F\xAF\xB7 🯷 SEGMENTED DIGIT SEVEN
U+0222 \xC8\xA2 Ȣ LATIN CAPITAL LETTER OU
U+0223 \xC8\xA3 ȣ LATIN SMALL LETTER OU
U+09EA \xE0\xA7\xAA ৪ BENGALI DIGIT FOUR
U+0A6A \xE0\xA9\xAA ੪ GURMUKHI DIGIT FOUR
U+0B03 \xE0\xAC\x83 ଃ ORIYA SIGN VISARGA
U+1D3D \xE1\xB4\xBD ᴽ MODIFIER LETTER CAPITAL OU
U+2078 \xE2\x81\xB8 ⁸ SUPERSCRIPT EIGHT
U+2088 \xE2\x82\x88 ₈ SUBSCRIPT EIGHT
U+32C7 \xE3\x8B\x87 ㋇ IDEOGRAPHIC TELEGRAPH SYMBOL FOR AUGUST
U+3360 \xE3\x8D\xA0 ㍠ IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR EIGHT
U+33E7 \xE3\x8F\xA7 ㏧ IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY EIGHT
U+FF18 \xEF\xBC\x98 ８ FULLWIDTH DIGIT EIGHT
U+1031A \xF0\x90\x8C\x9A 𐌚 OLD ITALIC LETTER EF
U+10445 \xF0\x90\x91\x85 𐑅 DESERET SMALL LETTER ES
U+1D7D6 \xF0\x9D\x9F\x96 𝟖 MATHEMATICAL BOLD DIGIT EIGHT
U+1D7E0 \xF0\x9D\x9F\xA0 𝟠 MATHEMATICAL DOUBLE-STRUCK DIGIT EIGHT
U+1D7EA \xF0\x9D\x9F\xAA 𝟪 MATHEMATICAL SANS-SERIF DIGIT EIGHT
U+1D7F4 \xF0\x9D\x9F\xB4 𝟴 MATHEMATICAL SANS-SERIF BOLD DIGIT EIGHT
U+1D7FE \xF0\x9D\x9F\xBE 𝟾 MATHEMATICAL MONOSPACE DIGIT EIGHT
U+1FBF8 \xF0\x9F\xAF\xB8 🯸 SEGMENTED DIGIT EIGHT
U+09ED \xE0\xA7\xAD ৭ BENGALI DIGIT SEVEN
U+0A67 \xE0\xA9\xA7 ੧ GURMUKHI DIGIT ONE
U+0B68 \xE0\xAD\xA8 ୨ ORIYA DIGIT TWO
U+2079 \xE2\x81\xB9 ⁹ SUPERSCRIPT NINE
U+2089 \xE2\x82\x89 ₉ SUBSCRIPT NINE
U+2CCA \xE2\xB3\x8A Ⳋ COPTIC CAPITAL LETTER DIALECT-P HORI
U+32C8 \xE3\x8B\x88 ㋈ IDEOGRAPHIC TELEGRAPH SYMBOL FOR SEPTEMBER
U+3361 \xE3\x8D\xA1 ㍡ IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR NINE
U+33E8 \xE3\x8F\xA8 ㏨ IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY NINE
U+A76E \xEA\x9D\xAE Ꝯ LATIN CAPITAL LETTER CON
U+FF19 \xEF\xBC\x99 ９ FULLWIDTH DIGIT NINE
U+1D7D7 \xF0\x9D\x9F\x97 𝟗 MATHEMATICAL BOLD DIGIT NINE
U+1D7E1 \xF0\x9D\x9F\xA1 𝟡 MATHEMATICAL DOUBLE-STRUCK DIGIT NINE
U+1D7EB \xF0\x9D\x9F\xAB 𝟫 MATHEMATICAL SANS-SERIF DIGIT NINE
U+1D7F5 \xF0\x9D\x9F\xB5 𝟵 MATHEMATICAL SANS-SERIF BOLD DIGIT NINE
U+1D7FF \xF0\x9D\x9F\xBF 𝟿 MATHEMATICAL MONOSPACE DIGIT NINE
U+1FBF9 \xF0\x9F\xAF\xB9 🯹 SEGMENTED DIGIT NINE