This source, written in EuroAssembler, declares codepoints of Unicode characters and their properties.
It is included in EuroTool sources that work with Unicode (EuroConv, EuroSort, EuroText).
; Character categories and their assigned relevance in text:
Bm = +127 ; Byte order mark (FEFF).
Cc = -32  ; Other, control
Cf = -16  ; Other, format
Co = -24  ; Other, Private Use
Fm = +16  ; Format control (LF, CR, TAB, space)
L1 = +8   ; Lowcase letter 1
L3 = +24  ; Lowcase letter 3
L4 = +28  ; Lowcase letter 4
L5 = +32  ; Lowcase letter 5
L6 = +36  ; Lowcase letter 6
L7 = +40  ; Lowcase letter 7
Lm = -8   ; Letter, modifier
Lo = +4   ; Letter, other
Lp = -60  ; Letter, pseudo
Ls = +8   ; Letter, sign
Mn = -24  ; Mark, nonspacing
Nd = +16  ; Number, decimal digit
No = +8   ; Number, other
Ns = -32  ; Number, superscript
Pd = +4   ; Punctuation, dash
Pe = +4   ; Punctuation, close
Pf = +4   ; Punctuation, final quote
Pi = +4   ; Punctuation, initial quote
Po = +4   ; Punctuation, other
Ps = +4   ; Punctuation, open
Pw = -4   ; Punctuation, weird
Sc = +4   ; Symbol, currency
Sk = -20  ; Symbol, modifier
Sm = -16  ; Symbol, math
So = -20  ; Symbol, other
U3 = +20  ; Upcase letter 3
U4 = +24  ; Upcase letter 4
U5 = +28  ; Upcase letter 5
U6 = +32  ; Upcase letter 6
U7 = +36  ; Upcase letter 7
Zs = +8   ; Separator, space
?? = -128 ; Not a valid character.
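The source does not say how the EuroTools consume these relevance values. One plausible use, shown here only as an assumption, is to add up the relevance of every character in a decoded text so that a likely decoding scores higher than an unlikely one. The following Python sketch models that idea with a small subset of the weights above and a few rows of the table in this file; `RELEVANCE`, `CATEGORY` and `text_relevance` are hypothetical names, and treating every codepoint missing from the lookup as `??` is a simplification.

```python
# Relevance weights copied from the category list above (subset only).
RELEVANCE = {
    "Cc": -32, "Fm": 16, "L5": 32, "L7": 40,
    "U5": 28, "Nd": 16, "Po": 4, "??": -128,
}

# Hypothetical codepoint -> category lookup, filled from a few table rows
# of this file (0001 Cc, 0020 Fm, 0021 Po, 0041 U5, 0061 L5, 00E1 L7).
CATEGORY = {
    0x0001: "Cc", 0x0020: "Fm", 0x0021: "Po",
    0x0041: "U5", 0x0061: "L5", 0x00E1: "L7",
}

def text_relevance(text: str) -> int:
    """Sum the relevance of each character in text.

    Assumption: codepoints missing from CATEGORY are scored as '??'.
    """
    return sum(RELEVANCE[CATEGORY.get(ord(ch), "??")] for ch in text)
```

With these weights, `text_relevance("Aa!")` is 28 + 32 + 4, while a single control character such as `"\x01"` contributes -32.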
All valid Unicode characters, including those of Asian scripts and emoji, are convertible between Unicode encodings (UTF), but the following table defines only characters that exist in at least one of the supported 8-bit encodings, plus characters that have a named HTML entity.
The data will be distributed by the (not yet declared) macro UCP into sections of the [.rodata] segment at assembly time.
Not all columns of the following table are necessarily used in every EuroTool; the macro UCP selects only those
which are needed at the moment.
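To illustrate the idea of picking only some columns, here is a minimal Python model of one table row. It is not the EuroAssembler macro UCP itself. `UcpRow`, `parse_ucp` and the regular expression are hypothetical, and the quoting rules are simplified: a doubled `%%` in the transliteration is kept as written.

```python
import re
from typing import NamedTuple, Optional

class UcpRow(NamedTuple):
    codepoint: int   # numeric value of the hexadecimal codepoint column
    category: str    # two-character category such as 'L7'
    translit: str    # transliteration to ASCII, may be empty
    entity: str      # HTML entity name, may be empty
    comment: str     # glyph and character name from the trailing comment

# Hypothetical pattern for one row such as
#   UCP 00E1, L7, 'a', aacute ; LATIN SMALL LETTER A WITH ACUTE
_ROW = re.compile(
    r"""^UCP\s+([0-9A-Fa-f]{4,6})\s*,\s*  # codepoint in hex
        (\S\S)\s*,\s*                     # category
        (?:'([^']*)'|"([^"]*)")\s*,\s*    # transliteration, either quote style
        ([A-Za-z0-9]*)\s*                 # optional entity name
        (?:;\s?(.*))?$                    # optional comment
    """,
    re.VERBOSE,
)

def parse_ucp(line: str) -> Optional[UcpRow]:
    """Split one UCP row into its columns, or return None if it does not match."""
    m = _ROW.match(line.strip())
    if m is None:
        return None
    cp, cat, single_quoted, double_quoted, entity, comment = m.groups()
    translit = single_quoted if single_quoted is not None else double_quoted
    return UcpRow(int(cp, 16), cat, translit, entity, comment or "")
```

A tool that only needs transliteration could then keep `row.codepoint` and `row.translit` and ignore the other fields, which mirrors how each EuroTool is described as using only the columns it needs.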
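An assembler application that wants to use this resource should declare the macro UCP and include unicode.htm.
In the table below, codepoint values are written in hexadecimal without the 0x prefix, and HTML entities are written without the leading & and without the trailing ;.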
; ╔═codepoint value
; ║ ╔═character category, relevance in text
; ║ ║ ╔═transliteration to ASCII
; ║ ║ ║ ╔═HTML entity (0..8 ASCII characters, case-sensitive)
; ║ ║ ║ ║ ╔═glyph in browser
; ║ ║ ║ ║ ║ ╔═character name
UCP 0000, Cc, ' ', ; control NUL
UCP 0001, Cc, ' ', ; control SOH
UCP 0002, Cc, ' ', ; control STX
UCP 0003, Cc, ' ', ; control ETX
UCP 0004, Cc, ' ', ; control EOT
UCP 0005, Cc, ' ', ; control ENQ
UCP 0006, Cc, ' ', ; control ACK
UCP 0007, Cc, ' ', ; control BEL
UCP 0008, Cc, ' ', ; control BS
UCP 0009, Fm, ' ', Tab ; control HT
UCP 000A, Fm, ' ', NewLine ; control LF
UCP 000B, Cc, ' ', ; control VT
UCP 000C, Cc, ' ', ; control FF
UCP 000D, Fm, ' ', ; control CR
UCP 000E, Cc, ' ', ; control SO
UCP 000F, Cc, ' ', ; control SI
UCP 0010, Cc, ' ', ; control DLE
UCP 0011, Cc, ' ', ; control DC1
UCP 0012, Cc, ' ', ; control DC2
UCP 0013, Cc, ' ', ; control DC3
UCP 0014, Cc, ' ', ; control DC4
UCP 0015, Cc, ' ', ; control NAK
UCP 0016, Cc, ' ', ; control SYN
UCP 0017, Cc, ' ', ; control ETB
UCP 0018, Cc, ' ', ; control CAN
UCP 0019, Cc, ' ', ; control EM
UCP 001A, Cc, ' ', ; control SUB
UCP 001B, Cc, ' ', ; control ESC
UCP 001C, Cc, ' ', ; control FS
UCP 001D, Cc, ' ', ; control GS
UCP 001E, Cc, ' ', ; control RS
UCP 001F, Cc, ' ', ; control US
UCP 0020, Fm, ' ', ; SPACE
UCP 0021, Po, '!', excl ; ! EXCLAMATION MARK
UCP 0022, Po, '"', quot ; " QUOTATION MARK
UCP 0023, Po, '#', num ; # NUMBER SIGN
UCP 0024, Sc, '$', dollar ; $ DOLLAR SIGN
UCP 0025, Po, '%%', percnt ; % PERCENT SIGN
UCP 0026, Po, '&', amp ; & AMPERSAND
UCP 0027, Po, "'", apos ; ' APOSTROPHE
UCP 0028, Ps, '(', lpar ; ( LEFT PARENTHESIS
UCP 0029, Pe, ')', rpar ; ) RIGHT PARENTHESIS
UCP 002A, Po, '*', ast ; * ASTERISK
UCP 002B, Sm, '+', plus ; + PLUS SIGN
UCP 002C, Po, ',', comma ; , COMMA
UCP 002D, Pd, '-', ; - HYPHEN-MINUS
UCP 002E, Po, '.', period ; . FULL STOP
UCP 002F, Po, '/', sol ; / SOLIDUS
UCP 0030, Nd, '0', ; 0 DIGIT ZERO
UCP 0031, Nd, '1', ; 1 DIGIT ONE
UCP 0032, Nd, '2', ; 2 DIGIT TWO
UCP 0033, Nd, '3', ; 3 DIGIT THREE
UCP 0034, Nd, '4', ; 4 DIGIT FOUR
UCP 0035, Nd, '5', ; 5 DIGIT FIVE
UCP 0036, Nd, '6', ; 6 DIGIT SIX
UCP 0037, Nd, '7', ; 7 DIGIT SEVEN
UCP 0038, Nd, '8', ; 8 DIGIT EIGHT
UCP 0039, Nd, '9', ; 9 DIGIT NINE
UCP 003A, Po, ':', colon ; : COLON
UCP 003B, Po, ';', semi ; ; SEMICOLON
UCP 003C, Sm, '<', lt ; < LESS-THAN SIGN
UCP 003D, Sm, '=', equals ; = EQUALS SIGN
UCP 003E, Sm, '>', gt ; > GREATER-THAN SIGN
UCP 003F, Po, '?', quest ; ? QUESTION MARK
UCP 0040, Po, '@', commat ; @ COMMERCIAL AT
UCP 0041, U5, 'A', ; A LATIN CAPITAL LETTER A
UCP 0042, U5, 'B', ; B LATIN CAPITAL LETTER B
UCP 0043, U5, 'C', ; C LATIN CAPITAL LETTER C
UCP 0044, U5, 'D', ; D LATIN CAPITAL LETTER D
UCP 0045, U5, 'E', ; E LATIN CAPITAL LETTER E
UCP 0046, U5, 'F', ; F LATIN CAPITAL LETTER F
UCP 0047, U5, 'G', ; G LATIN CAPITAL LETTER G
UCP 0048, U5, 'H', ; H LATIN CAPITAL LETTER H
UCP 0049, U5, 'I', ; I LATIN CAPITAL LETTER I
UCP 004A, U5, 'J', ; J LATIN CAPITAL LETTER J
UCP 004B, U5, 'K', ; K LATIN CAPITAL LETTER K
UCP 004C, U5, 'L', ; L LATIN CAPITAL LETTER L
UCP 004D, U5, 'M', ; M LATIN CAPITAL LETTER M
UCP 004E, U5, 'N', ; N LATIN CAPITAL LETTER N
UCP 004F, U5, 'O', ; O LATIN CAPITAL LETTER O
UCP 0050, U5, 'P', ; P LATIN CAPITAL LETTER P
UCP 0051, U5, 'Q', ; Q LATIN CAPITAL LETTER Q
UCP 0052, U5, 'R', ; R LATIN CAPITAL LETTER R
UCP 0053, U5, 'S', ; S LATIN CAPITAL LETTER S
UCP 0054, U5, 'T', ; T LATIN CAPITAL LETTER T
UCP 0055, U5, 'U', ; U LATIN CAPITAL LETTER U
UCP 0056, U5, 'V', ; V LATIN CAPITAL LETTER V
UCP 0057, U5, 'W', ; W LATIN CAPITAL LETTER W
UCP 0058, U5, 'X', ; X LATIN CAPITAL LETTER X
UCP 0059, U5, 'Y', ; Y LATIN CAPITAL LETTER Y
UCP 005A, U5, 'Z', ; Z LATIN CAPITAL LETTER Z
UCP 005B, Ps, '[', lbrack ; [ LEFT SQUARE BRACKET
UCP 005C, Po, '\', bsol ; \ REVERSE SOLIDUS
UCP 005D, Pe, ']', rbrack ; ] RIGHT SQUARE BRACKET
UCP 005E, Sk, '^', Hat ; ^ CIRCUMFLEX ACCENT
UCP 005F, Pe, '_', lowbar ; _ LOW LINE
UCP 0060, Sk, '`', grave ; ` GRAVE ACCENT
UCP 0061, L5, 'a', ; a LATIN SMALL LETTER A
UCP 0062, L5, 'b', ; b LATIN SMALL LETTER B
UCP 0063, L5, 'c', ; c LATIN SMALL LETTER C
UCP 0064, L5, 'd', ; d LATIN SMALL LETTER D
UCP 0065, L5, 'e', ; e LATIN SMALL LETTER E
UCP 0066, L5, 'f', ; f LATIN SMALL LETTER F
UCP 0067, L5, 'g', ; g LATIN SMALL LETTER G
UCP 0068, L5, 'h', ; h LATIN SMALL LETTER H
UCP 0069, L5, 'i', ; i LATIN SMALL LETTER I
UCP 006A, L5, 'j', ; j LATIN SMALL LETTER J
UCP 006B, L5, 'k', ; k LATIN SMALL LETTER K
UCP 006C, L5, 'l', ; l LATIN SMALL LETTER L
UCP 006D, L5, 'm', ; m LATIN SMALL LETTER M
UCP 006E, L5, 'n', ; n LATIN SMALL LETTER N
UCP 006F, L5, 'o', ; o LATIN SMALL LETTER O
UCP 0070, L5, 'p', ; p LATIN SMALL LETTER P
UCP 0071, L5, 'q', ; q LATIN SMALL LETTER Q
UCP 0072, L5, 'r', ; r LATIN SMALL LETTER R
UCP 0073, L5, 's', ; s LATIN SMALL LETTER S
UCP 0074, L5, 't', ; t LATIN SMALL LETTER T
UCP 0075, L5, 'u', ; u LATIN SMALL LETTER U
UCP 0076, L5, 'v', ; v LATIN SMALL LETTER V
UCP 0077, L5, 'w', ; w LATIN SMALL LETTER W
UCP 0078, L5, 'x', ; x LATIN SMALL LETTER X
UCP 0079, L5, 'y', ; y LATIN SMALL LETTER Y
UCP 007A, L5, 'z', ; z LATIN SMALL LETTER Z
UCP 007B, Ps, '{', lbrace ; { LEFT CURLY BRACKET
UCP 007C, Sm, '|', verbar ; | VERTICAL LINE
UCP 007D, Pe, '}', rbrace ; } RIGHT CURLY BRACKET
UCP 007E, Sm, '~', tilde ; ~ TILDE
UCP 007F, Cc, ' ', ; control DEL
UCP 00A0, Zs, ' ', nbsp ; NO-BREAK SPACE
UCP 00A1, Pw, '!', iexcl ; ¡ INVERTED EXCLAMATION MARK
UCP 00A2, Sc, 'c', cent ; ¢ CENT SIGN
UCP 00A3, Sc, 'L', pound ; £ POUND SIGN
UCP 00A4, Sc, '$', curren ; ¤ CURRENCY SIGN
UCP 00A5, Sc, 'Y', yen ; ¥ YEN SIGN
UCP 00A6, So, '|', brvbar ; ¦ BROKEN BAR
UCP 00A7, Po, '#', sect ; § SECTION SIGN
UCP 00A8, Sk, '', uml ; ¨ DIAERESIS
UCP 00A9, So, '(c)', copy ; © COPYRIGHT SIGN
UCP 00AA, Lp, 'f', ordf ; ª FEMININE ORDINAL INDICATOR
UCP 00AB, Pi, '<<', laquo ; « LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
UCP 00AC, Sm, '_', not ; ¬ NOT SIGN
UCP 00AD, Cf, '-', shy ; SOFT HYPHEN
UCP 00AE, So, '(R)', reg ; ® REGISTERED SIGN
UCP 00AF, Sk, '', macr ; ¯ MACRON
UCP 00B0, So, '`', deg ; ° DEGREE SIGN
UCP 00B1, Sm, '+', plusmn ; ± PLUS-MINUS SIGN
UCP 00B2, Ns, '2', sup2 ; ² SUPERSCRIPT TWO
UCP 00B3, Ns, '3', sup3 ; ³ SUPERSCRIPT THREE
UCP 00B4, Sk, '', acute ; ´ ACUTE ACCENT
UCP 00B5, L1, 'u', micro ; µ MICRO SIGN
UCP 00B6, Po, 'P', para ; ¶ PILCROW SIGN
UCP 00B7, Po, '.', middot ; · MIDDLE DOT
UCP 00B8, Sk, '', cedil ; ¸ CEDILLA
UCP 00B9, Ns, '1', sup1 ; ¹ SUPERSCRIPT ONE
UCP 00BA, Lp, 'm', ordm ; º MASCULINE ORDINAL INDICATOR
UCP 00BB, Pf, '>>', raquo ; » RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
UCP 00BC, Ns, '1/4', frac14 ; ¼ VULGAR FRACTION ONE QUARTER
UCP 00BD, Ns, '1/2', frac12 ; ½ VULGAR FRACTION ONE HALF
UCP 00BE, Ns, '3/4', frac34 ; ¾ VULGAR FRACTION THREE QUARTERS
UCP 00BF, Po, '?', iquest ; ¿ INVERTED QUESTION MARK
UCP 00C0, U5, 'A', Agrave ; À LATIN CAPITAL LETTER A WITH GRAVE
UCP 00C1, U7, 'A', Aacute ; Á LATIN CAPITAL LETTER A WITH ACUTE
UCP 00C2, U5, 'A', Acirc ; Â LATIN CAPITAL LETTER A WITH CIRCUMFLEX
UCP 00C3, U3 , 'A', Atilde ; Ã LATIN CAPITAL LETTER A WITH TILDE
UCP 00C4, U5, 'A', Auml ; Ä LATIN CAPITAL LETTER A WITH DIAERESIS
UCP 00C5, U3 , 'A', Aring ; Å LATIN CAPITAL LETTER A WITH RING ABOVE
UCP 00C6, U5, 'AE', AElig ; Æ LATIN CAPITAL LETTER AE
UCP 00C7, U5, 'C', Ccedil ; Ç LATIN CAPITAL LETTER C WITH CEDILLA
UCP 00C8, U5, 'E', Egrave ; È LATIN CAPITAL LETTER E WITH GRAVE
UCP 00C9, U7, 'E', Eacute ; É LATIN CAPITAL LETTER E WITH ACUTE
UCP 00CA, U3 , 'E', Ecirc ; Ê LATIN CAPITAL LETTER E WITH CIRCUMFLEX
UCP 00CB, U5, 'E', Euml ; Ë LATIN CAPITAL LETTER E WITH DIAERESIS
UCP 00CC, U5, 'I', Igrave ; Ì LATIN CAPITAL LETTER I WITH GRAVE
UCP 00CD, U7, 'I', Iacute ; Í LATIN CAPITAL LETTER I WITH ACUTE
UCP 00CE, U5, 'I', Icirc ; Î LATIN CAPITAL LETTER I WITH CIRCUMFLEX
UCP 00CF, U3 , 'I', Iuml ; Ï LATIN CAPITAL LETTER I WITH DIAERESIS
UCP 00D0, U3 , 'D', ETH ; Ð LATIN CAPITAL LETTER ETH
UCP 00D1, U5, 'N', Ntilde ; Ñ LATIN CAPITAL LETTER N WITH TILDE
UCP 00D2, U5, 'O', Ograve ; Ò LATIN CAPITAL LETTER O WITH GRAVE
UCP 00D3, U7, 'O', Oacute ; Ó LATIN CAPITAL LETTER O WITH ACUTE
UCP 00D4, U5, 'O', Ocirc ; Ô LATIN CAPITAL LETTER O WITH CIRCUMFLEX
UCP 00D5, U5, 'O', Otilde ; Õ LATIN CAPITAL LETTER O WITH TILDE
UCP 00D6, U5, 'O', Ouml ; Ö LATIN CAPITAL LETTER O WITH DIAERESIS
UCP 00D7, Sm, 'x', times ; × MULTIPLICATION SIGN
UCP 00D8, U5, 'O', Oslash ; Ø LATIN CAPITAL LETTER O WITH STROKE
UCP 00D9, U5, 'U', Ugrave ; Ù LATIN CAPITAL LETTER U WITH GRAVE
UCP 00DA, U7, 'U', Uacute ; Ú LATIN CAPITAL LETTER U WITH ACUTE
UCP 00DB, U5, 'U', Ucirc ; Û LATIN CAPITAL LETTER U WITH CIRCUMFLEX
UCP 00DC, U5, 'U', Uuml ; Ü LATIN CAPITAL LETTER U WITH DIAERESIS
UCP 00DD, U7, 'Y', Yacute ; Ý LATIN CAPITAL LETTER Y WITH ACUTE
UCP 00DE, U3 , 'TH', THORN ; Þ LATIN CAPITAL LETTER THORN
UCP 00DF, L5, 'ss', szlig ; ß LATIN SMALL LETTER SHARP S
UCP 00E0, L5, 'a', agrave ; à LATIN SMALL LETTER A WITH GRAVE
UCP 00E1, L7, 'a', aacute ; á LATIN SMALL LETTER A WITH ACUTE
UCP 00E2, L3, 'a', acirc ; â LATIN SMALL LETTER A WITH CIRCUMFLEX
UCP 00E3, L3, 'a', atilde ; ã LATIN SMALL LETTER A WITH TILDE
UCP 00E4, L5, 'a', auml ; ä LATIN SMALL LETTER A WITH DIAERESIS
UCP 00E5, L5, 'a', aring ; å LATIN SMALL LETTER A WITH RING ABOVE
UCP 00E6, L5, 'ae', aelig ; æ LATIN SMALL LETTER AE
UCP 00E7, L5, 'c', ccedil ; ç LATIN SMALL LETTER C WITH CEDILLA
UCP 00E8, L5, 'e', egrave ; è LATIN SMALL LETTER E WITH GRAVE
UCP 00E9, L7, 'e', eacute ; é LATIN SMALL LETTER E WITH ACUTE
UCP 00EA, L5, 'e', ecirc ; ê LATIN SMALL LETTER E WITH CIRCUMFLEX
UCP 00EB, L5, 'e', euml ; ë LATIN SMALL LETTER E WITH DIAERESIS
UCP 00EC, L5, 'i', igrave ; ì LATIN SMALL LETTER I WITH GRAVE
UCP 00ED, L7, 'i', iacute ; í LATIN SMALL LETTER I WITH ACUTE
UCP 00EE, L5, 'i', icirc ; î LATIN SMALL LETTER I WITH CIRCUMFLEX
UCP 00EF, L5, 'i', iuml ; ï LATIN SMALL LETTER I WITH DIAERESIS
UCP 00F0, L3, 'd', eth ; ð LATIN SMALL LETTER ETH
UCP 00F1, L5, 'n', ntilde ; ñ LATIN SMALL LETTER N WITH TILDE
UCP 00F2, L5, 'o', ograve ; ò LATIN SMALL LETTER O WITH GRAVE
UCP 00F3, L7, 'o', oacute ; ó LATIN SMALL LETTER O WITH ACUTE
UCP 00F4, L5, 'o', ocirc ; ô LATIN SMALL LETTER O WITH CIRCUMFLEX
UCP 00F5, L5, 'o', otilde ; õ LATIN SMALL LETTER O WITH TILDE
UCP 00F6, L5, 'o', ouml ; ö LATIN SMALL LETTER O WITH DIAERESIS
UCP 00F7, Sm, '/', divide ; ÷ DIVISION SIGN
UCP 00F8, L5, 'o', oslash ; ø LATIN SMALL LETTER O WITH STROKE
UCP 00F9, L5, 'u', ugrave ; ù LATIN SMALL LETTER U WITH GRAVE
UCP 00FA, L7, 'u', uacute ; ú LATIN SMALL LETTER U WITH ACUTE
UCP 00FB, L7, 'u', ucirc ; û LATIN SMALL LETTER U WITH CIRCUMFLEX
UCP 00FC, L5, 'u', uuml ; ü LATIN SMALL LETTER U WITH DIAERESIS
UCP 00FD, L7, 'y', yacute ; ý LATIN SMALL LETTER Y WITH ACUTE
UCP 00FE, L3, 'th', thorn ; þ LATIN SMALL LETTER THORN
UCP 00FF, L5, 'y', yuml ; ÿ LATIN SMALL LETTER Y WITH DIAERESIS
UCP 0100, U5, 'A', Amacr ; Ā LATIN CAPITAL LETTER A WITH MACRON
UCP 0101, L5, 'a', amacr ; ā LATIN SMALL LETTER A WITH MACRON
UCP 0102, U5, 'A', Abreve ; Ă LATIN CAPITAL LETTER A WITH BREVE
UCP 0103, L5, 'a', abreve ; ă LATIN SMALL LETTER A WITH BREVE
UCP 0104, U7, 'A', Aogon ; Ą LATIN CAPITAL LETTER A WITH OGONEK
UCP 0105, L7, 'a', aogon ; ą LATIN SMALL LETTER A WITH OGONEK
UCP 0106, U7, 'C', Cacute ; Ć LATIN CAPITAL LETTER C WITH ACUTE
UCP 0107, L7, 'c', cacute ; ć LATIN SMALL LETTER C WITH ACUTE
UCP 0108, U5, 'C', Ccirc ; Ĉ LATIN CAPITAL LETTER C WITH CIRCUMFLEX
UCP 0109, L5, 'c', ccirc ; ĉ LATIN SMALL LETTER C WITH CIRCUMFLEX
UCP 010A, U5, 'C', Cdot ; Ċ LATIN CAPITAL LETTER C WITH DOT ABOVE
UCP 010B, L5, 'c', cdot ; ċ LATIN SMALL LETTER C WITH DOT ABOVE
UCP 010C, U7, 'C', Ccaron ; Č LATIN CAPITAL LETTER C WITH CARON
UCP 010D, L7, 'c', ccaron ; č LATIN SMALL LETTER C WITH CARON
UCP 010E, U5, 'D', Dcaron ; Ď LATIN CAPITAL LETTER D WITH CARON
UCP 010F, L5, 'd', dcaron ; ď LATIN SMALL LETTER D WITH CARON
UCP 0110, U5, 'D', Dstrok ; Đ LATIN CAPITAL LETTER D WITH STROKE
UCP 0111, L5, 'd', dstrok ; đ LATIN SMALL LETTER D WITH STROKE
UCP 0112, U5, 'E', Emacr ; Ē LATIN CAPITAL LETTER E WITH MACRON
UCP 0113, L5, 'e', emacr ; ē LATIN SMALL LETTER E WITH MACRON
UCP 0116, U5, 'E', Edot ; Ė LATIN CAPITAL LETTER E WITH DOT ABOVE
UCP 0117, L5, 'e', edot ; ė LATIN SMALL LETTER E WITH DOT ABOVE
UCP 0118, U7, 'E', Eogon ; Ę LATIN CAPITAL LETTER E WITH OGONEK
UCP 0119, L7, 'e', eogon ; ę LATIN SMALL LETTER E WITH OGONEK
UCP 011A, U7, 'E', Ecaron ; Ě LATIN CAPITAL LETTER E WITH CARON
UCP 011B, L7, 'e', ecaron ; ě LATIN SMALL LETTER E WITH CARON
UCP 011C, U5, 'G', Gcirc ; Ĝ LATIN CAPITAL LETTER G WITH CIRCUMFLEX
UCP 011D, L5, 'g', gcirc ; ĝ LATIN SMALL LETTER G WITH CIRCUMFLEX
UCP 011E, U5, 'G', Gbreve ; Ğ LATIN CAPITAL LETTER G WITH BREVE
UCP 011F, L5, 'g', gbreve ; ğ LATIN SMALL LETTER G WITH BREVE
UCP 0120, U5, 'G', Gdot ; Ġ LATIN CAPITAL LETTER G WITH DOT ABOVE
UCP 0121, L5, 'g', gdot ; ġ LATIN SMALL LETTER G WITH DOT ABOVE
UCP 0122, U5, 'G', Gcedil ; Ģ LATIN CAPITAL LETTER G WITH CEDILLA
UCP 0123, L5, 'g', gcedil ; ģ LATIN SMALL LETTER G WITH CEDILLA
UCP 0124, U5, 'H', Hcirc ; Ĥ LATIN CAPITAL LETTER H WITH CIRCUMFLEX
UCP 0125, L5, 'h', hcirc ; ĥ LATIN SMALL LETTER H WITH CIRCUMFLEX
UCP 0126, U5, 'H', Hstrok ; Ħ LATIN CAPITAL LETTER H WITH STROKE
UCP 0127, L5, 'h', hstrok ; ħ LATIN SMALL LETTER H WITH STROKE
UCP 0128, U5, 'I', Itilde ; Ĩ LATIN CAPITAL LETTER I WITH TILDE
UCP 0129, L5, 'i', itilde ; ĩ LATIN SMALL LETTER I WITH TILDE
UCP 012A, U5, 'I', Imacr ; Ī LATIN CAPITAL LETTER I WITH MACRON
UCP 012B, L5, 'i', imacr ; ī LATIN SMALL LETTER I WITH MACRON
UCP 012E, U5, 'I', Iogon ; Į LATIN CAPITAL LETTER I WITH OGONEK
UCP 012F, L5, 'i', iogon ; į LATIN SMALL LETTER I WITH OGONEK
UCP 0130, U5, 'I', Idot ; İ LATIN CAPITAL LETTER I WITH DOT ABOVE
UCP 0131, L5, 'i', inodot ; ı LATIN SMALL LETTER DOTLESS I
UCP 0134, U5, 'J', Jcirc ; Ĵ LATIN CAPITAL LETTER J WITH CIRCUMFLEX
UCP 0135, L5, 'j', jcirc ; ĵ LATIN SMALL LETTER J WITH CIRCUMFLEX
UCP 0136, U5, 'K', Kcedil ; Ķ LATIN CAPITAL LETTER K WITH CEDILLA
UCP 0137, L5, 'k', kcedil ; ķ LATIN SMALL LETTER K WITH CEDILLA
UCP 0138, L5, 'k', kgreen ; ĸ LATIN SMALL LETTER KRA
UCP 0139, U5, 'L', Lacute ; Ĺ LATIN CAPITAL LETTER L WITH ACUTE
UCP 013A, L5, 'l', lacute ; ĺ LATIN SMALL LETTER L WITH ACUTE
UCP 013B, U5, 'L', Lcedil ; Ļ LATIN CAPITAL LETTER L WITH CEDILLA
UCP 013C, L5, 'l', lcedil ; ļ LATIN SMALL LETTER L WITH CEDILLA
UCP 013D, U5, 'L', Lcaron ; Ľ LATIN CAPITAL LETTER L WITH CARON
UCP 013E, L5, 'l', lcaron ; ľ LATIN SMALL LETTER L WITH CARON
UCP 013F, U5, 'L', Lmidot ; Ŀ LATIN CAPITAL LETTER L WITH MIDDLE DOT
UCP 0140, L5, 'l', lmidot ; ŀ LATIN SMALL LETTER L WITH MIDDLE DOT
UCP 0141, U7, 'L', Lstrok ; Ł LATIN CAPITAL LETTER L WITH STROKE
UCP 0142, L7, 'l', lstrok ; ł LATIN SMALL LETTER L WITH STROKE
UCP 0143, U5, 'N', Nacute ; Ń LATIN CAPITAL LETTER N WITH ACUTE
UCP 0144, L5, 'n', nacute ; ń LATIN SMALL LETTER N WITH ACUTE
UCP 0145, U5, 'N', Ncedil ; Ņ LATIN CAPITAL LETTER N WITH CEDILLA
UCP 0146, L5, 'n', ncedil ; ņ LATIN SMALL LETTER N WITH CEDILLA
UCP 0147, U5, 'N', Ncaron ; Ň LATIN CAPITAL LETTER N WITH CARON
UCP 0148, L5, 'n', ncaron ; ň LATIN SMALL LETTER N WITH CARON
UCP 0149, L5, 'n', napos ; ʼn LATIN SMALL LETTER N PRECEDED BY APOSTROPHE
UCP 014A, U3 , 'N', ENG ; Ŋ LATIN CAPITAL LETTER ENG
UCP 014B, L3, 'n', eng ; ŋ LATIN SMALL LETTER ENG
UCP 014C, U5, 'O', Omacr ; Ō LATIN CAPITAL LETTER O WITH MACRON
UCP 014D, L5, 'o', omacr ; ō LATIN SMALL LETTER O WITH MACRON
UCP 0150, U7, 'O', Odblac ; Ő LATIN CAPITAL LETTER O WITH DOUBLE ACUTE
UCP 0151, L7, 'o', odblac ; ő LATIN SMALL LETTER O WITH DOUBLE ACUTE
UCP 0152, U5, 'OE', OElig ; Œ LATIN CAPITAL LIGATURE OE
UCP 0153, L5, 'oe', oelig ; œ LATIN SMALL LIGATURE OE
UCP 0154, U5, 'R', Racute ; Ŕ LATIN CAPITAL LETTER R WITH ACUTE
UCP 0155, L5, 'r', racute ; ŕ LATIN SMALL LETTER R WITH ACUTE
UCP 0156, U5, 'R', Rcedil ; Ŗ LATIN CAPITAL LETTER R WITH CEDILLA
UCP 0157, L5, 'r', rcedil ; ŗ LATIN SMALL LETTER R WITH CEDILLA
UCP 0158, U7, 'R', Rcaron ; Ř LATIN CAPITAL LETTER R WITH CARON
UCP 0159, L7, 'r', rcaron ; ř LATIN SMALL LETTER R WITH CARON
UCP 015A, U5, 'S', Sacute ; Ś LATIN CAPITAL LETTER S WITH ACUTE
UCP 015B, L5, 's', sacute ; ś LATIN SMALL LETTER S WITH ACUTE
UCP 015C, U5, 'S', Scirc ; Ŝ LATIN CAPITAL LETTER S WITH CIRCUMFLEX
UCP 015D, L5, 's', scirc ; ŝ LATIN SMALL LETTER S WITH CIRCUMFLEX
UCP 015E, U5, 'S', Scedil ; Ş LATIN CAPITAL LETTER S WITH CEDILLA
UCP 015F, L5, 's', scedil ; ş LATIN SMALL LETTER S WITH CEDILLA
UCP 0160, U7, 'S', Scaron ; Š LATIN CAPITAL LETTER S WITH CARON
UCP 0161, L7, 's', scaron ; š LATIN SMALL LETTER S WITH CARON
UCP 0162, U5, 'T', Tcedil ; Ţ LATIN CAPITAL LETTER T WITH CEDILLA
UCP 0163, L5, 't', tcedil ; ţ LATIN SMALL LETTER T WITH CEDILLA
UCP 0164, U5, 'T', Tcaron ; Ť LATIN CAPITAL LETTER T WITH CARON
UCP 0165, L5, 't', tcaron ; ť LATIN SMALL LETTER T WITH CARON
UCP 0166, U5, 'T', Tstrok ; Ŧ LATIN CAPITAL LETTER T WITH STROKE
UCP 0167, L5, 't', tstrok ; ŧ LATIN SMALL LETTER T WITH STROKE
UCP 0168, U5, 'U', Utilde ; Ũ LATIN CAPITAL LETTER U WITH TILDE
UCP 0169, L5, 'u', utilde ; ũ LATIN SMALL LETTER U WITH TILDE
UCP 016A, U5, 'U', Umacr ; Ū LATIN CAPITAL LETTER U WITH MACRON
UCP 016B, L5, 'u', umacr ; ū LATIN SMALL LETTER U WITH MACRON
UCP 016C, U5, 'U', Ubreve ; Ŭ LATIN CAPITAL LETTER U WITH BREVE
UCP 016D, L5, 'u', ubreve ; ŭ LATIN SMALL LETTER U WITH BREVE
UCP 016E, U7, 'U', Uring ; Ů LATIN CAPITAL LETTER U WITH RING ABOVE
UCP 016F, L7, 'u', uring ; ů LATIN SMALL LETTER U WITH RING ABOVE
UCP 0170, U7, 'U', Udblac ; Ű LATIN CAPITAL LETTER U WITH DOUBLE ACUTE
UCP 0171, L7, 'u', udblac ; ű LATIN SMALL LETTER U WITH DOUBLE ACUTE
UCP 0172, U5, 'U', Uogon ; Ų LATIN CAPITAL LETTER U WITH OGONEK
UCP 0173, L5, 'u', uogon ; ų LATIN SMALL LETTER U WITH OGONEK
UCP 0174, U5, 'W', Wcirc ; Ŵ LATIN CAPITAL LETTER W WITH CIRCUMFLEX
UCP 0175, L5, 'w', wcirc ; ŵ LATIN SMALL LETTER W WITH CIRCUMFLEX
UCP 0176, U5, 'Y', Ycirc ; Ŷ LATIN CAPITAL LETTER Y WITH CIRCUMFLEX
UCP 0177, L5, 'y', ycirc ; ŷ LATIN SMALL LETTER Y WITH CIRCUMFLEX
UCP 0178, U5, 'Y', Yuml ; Ÿ LATIN CAPITAL LETTER Y WITH DIAERESIS
UCP 0179, U5, 'Z', Zacute ; Ź LATIN CAPITAL LETTER Z WITH ACUTE
UCP 017A, L5, 'z', zacute ; ź LATIN SMALL LETTER Z WITH ACUTE
UCP 017B, U7, 'Z', Zdot ; Ż LATIN CAPITAL LETTER Z WITH DOT ABOVE
UCP 017C, L7, 'z', zdot ; ż LATIN SMALL LETTER Z WITH DOT ABOVE
UCP 017D, U7, 'Z', Zcaron ; Ž LATIN CAPITAL LETTER Z WITH CARON
UCP 017E, L7, 'z', zcaron ; ž LATIN SMALL LETTER Z WITH CARON
UCP 017F, L5, 's', ; ſ LATIN SMALL LETTER LONG S
UCP 018F, U5, 'E', ; Ə LATIN CAPITAL LETTER SCHWA
UCP 0192, L3, 'f', fnof ; ƒ LATIN SMALL LETTER F WITH HOOK
UCP 01A0, U5, 'O', ; Ơ LATIN CAPITAL LETTER O WITH HORN
UCP 01A1, L5, 'o', ; ơ LATIN SMALL LETTER O WITH HORN
UCP 01AF, U5, 'U', ; Ư LATIN CAPITAL LETTER U WITH HORN
UCP 01B0, L5, 'u', ; ư LATIN SMALL LETTER U WITH HORN
UCP 01C4, U5, 'D', ; DŽ LATIN CAPITAL LETTER DZ WITH CARON
UCP 01C5, U5, 'D', ; Dž LATIN CAPITAL LETTER D WITH SMALL LETTER Z WITH CARON
UCP 01C6, L5, 'd', ; dž LATIN SMALL LETTER DZ WITH CARON
UCP 01F1, U5, 'D', ; DZ LATIN CAPITAL LETTER DZ
UCP 01F2, U5, 'D', ; Dz LATIN CAPITAL LETTER D WITH SMALL LETTER Z
UCP 01F3, L5, 'd', ; dz LATIN SMALL LETTER DZ
UCP 01F5, L5, 'g', ; ǵ LATIN SMALL LETTER G WITH ACUTE
UCP 0218, U5, 'S', ; Ș LATIN CAPITAL LETTER S WITH COMMA BELOW
UCP 0219, L5, 's', ; ș LATIN SMALL LETTER S WITH COMMA BELOW
UCP 021A, U5, 'T', ; Ț LATIN CAPITAL LETTER T WITH COMMA BELOW
UCP 021B, L5, 't', ; ț LATIN SMALL LETTER T WITH COMMA BELOW
UCP 0237, L5, 'j', jmath ; ȷ LATIN SMALL LETTER DOTLESS J
UCP 0259, L5, 'e', ; ə LATIN SMALL LETTER SCHWA
UCP 027C, L5, 'r', ; ɼ LATIN SMALL LETTER R WITH LONG LEG
UCP 02C6, Lm, '', circ ; ˆ MODIFIER LETTER CIRCUMFLEX ACCENT
UCP 02C7, Lm, '', caron ; ˇ CARON
UCP 02CB, Lm, '`', ; ˋ MODIFIER LETTER GRAVE ACCENT
UCP 02D8, Sk, '', breve ; ˘ BREVE
UCP 02D9, Sk, '', dot ; ˙ DOT ABOVE
UCP 02DA, Sk, '', ring ; ˚ RING ABOVE
UCP 02DB, Sk, '', ogon ; ˛ OGONEK
UCP 02DC, Sk, '', tilde ; ˜ SMALL TILDE
UCP 02DD, Sk, '', dblac ; ˝ DOUBLE ACUTE ACCENT
UCP 0300, Mn, '', ; ̀ COMBINING GRAVE ACCENT
UCP 0301, Mn, '', ; ́ COMBINING ACUTE ACCENT
UCP 0303, Mn, '', ; ̃ COMBINING TILDE
UCP 0309, Mn, '', ; ̉ COMBINING HOOK ABOVE
UCP 0323, Mn, '', ; ̣ COMBINING DOT BELOW
UCP 0332, Mn, '', underbar ; ̲ COMBINING LOW LINE
UCP 037A, Lm, '', ; ͺ GREEK YPOGEGRAMMENI
UCP 0384, Sk, '', ; ΄ GREEK TONOS
UCP 0385, Sk, ' ', ; ΅ GREEK DIALYTIKA TONOS
UCP 0386, U5, 'A', ; Ά GREEK CAPITAL LETTER ALPHA WITH TONOS
UCP 0387, Po, '', ; · GREEK ANO TELEIA
UCP 0388, U5, 'E', ; Έ GREEK CAPITAL LETTER EPSILON WITH TONOS
UCP 0389, U5, 'H', ; Ή GREEK CAPITAL LETTER ETA WITH TONOS
UCP 038A, U5, 'I', ; Ί GREEK CAPITAL LETTER IOTA WITH TONOS
UCP 038C, U5, 'O', ; Ό GREEK CAPITAL LETTER OMICRON WITH TONOS
UCP 038E, U5, 'Y', ; Ύ GREEK CAPITAL LETTER UPSILON WITH TONOS
UCP 038F, U5, 'O', ; Ώ GREEK CAPITAL LETTER OMEGA WITH TONOS
UCP 0390, L5, 'i', ; ΐ GREEK SMALL LETTER IOTA WITH DIALYTIKA AND TONOS
UCP 0391, U5, 'A', Alpha ; Α GREEK CAPITAL LETTER ALPHA
UCP 0392, U5, 'B', Beta ; Β GREEK CAPITAL LETTER BETA
UCP 0393, U5, 'G', Gamma ; Γ GREEK CAPITAL LETTER GAMMA
UCP 0394, U5, 'D', Delta ; Δ GREEK CAPITAL LETTER DELTA
UCP 0395, U5, 'E', Epsilon ; Ε GREEK CAPITAL LETTER EPSILON
UCP 0396, U5, 'Z', Zeta ; Ζ GREEK CAPITAL LETTER ZETA
UCP 0397, U5, 'H', Eta ; Η GREEK CAPITAL LETTER ETA
UCP 0398, U5, 'Th', Theta ; Θ GREEK CAPITAL LETTER THETA
UCP 0399, U5, 'I', Iota ; Ι GREEK CAPITAL LETTER IOTA
UCP 039A, U5, 'K', Kappa ; Κ GREEK CAPITAL LETTER KAPPA
UCP 039B, U5, 'L', Lambda ; Λ GREEK CAPITAL LETTER LAMDA
UCP 039C, U5, 'M', Mu ; Μ GREEK CAPITAL LETTER MU
UCP 039D, U5, 'N', Nu ; Ν GREEK CAPITAL LETTER NU
UCP 039E, U5, 'X', Xi ; Ξ GREEK CAPITAL LETTER XI
UCP 039F, U5, 'O', Omicron ; Ο GREEK CAPITAL LETTER OMICRON
UCP 03A0, U5, 'P', Pi ; Π GREEK CAPITAL LETTER PI
UCP 03A1, U5, 'R', Rho ; Ρ GREEK CAPITAL LETTER RHO
UCP 03A3, U5, 'S', Sigma ; Σ GREEK CAPITAL LETTER SIGMA
UCP 03A4, U5, 'T', Tau ; Τ GREEK CAPITAL LETTER TAU
UCP 03A5, U5, 'Y', Upsilon ; Υ GREEK CAPITAL LETTER UPSILON
UCP 03A6, U5, 'F', Phi ; Φ GREEK CAPITAL LETTER PHI
UCP 03A7, U5, 'Ch', Chi ; Χ GREEK CAPITAL LETTER CHI
UCP 03A8, U5, 'Ps', Psi ; Ψ GREEK CAPITAL LETTER PSI
UCP 03A9, U5, 'O', Omega ; Ω GREEK CAPITAL LETTER OMEGA
UCP 03AA, U5, 'I', ; Ϊ GREEK CAPITAL LETTER IOTA WITH DIALYTIKA
UCP 03AB, U5, 'Y', ; Ϋ GREEK CAPITAL LETTER UPSILON WITH DIALYTIKA
UCP 03AC, L5, 'a', ; ά GREEK SMALL LETTER ALPHA WITH TONOS
UCP 03AD, L5, 'e', ; έ GREEK SMALL LETTER EPSILON WITH TONOS
UCP 03AE, L5, 'h', ; ή GREEK SMALL LETTER ETA WITH TONOS
UCP 03AF, L5, 'i', ; ί GREEK SMALL LETTER IOTA WITH TONOS
UCP 03B0, L5, 'u', ; ΰ GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND TONOS
UCP 03B1, L5, 'a', alpha ; α GREEK SMALL LETTER ALPHA
UCP 03B2, L5, 'b', beta ; β GREEK SMALL LETTER BETA
UCP 03B3, L5, 'g', gamma ; γ GREEK SMALL LETTER GAMMA
UCP 03B4, L5, 'd', delta ; δ GREEK SMALL LETTER DELTA
UCP 03B5, L5, 'e', epsilon ; ε GREEK SMALL LETTER EPSILON
UCP 03B6, L5, 'z', zeta ; ζ GREEK SMALL LETTER ZETA
UCP 03B7, L5, 'h', eta ; η GREEK SMALL LETTER ETA
UCP 03B8, L5, 'th', theta ; θ GREEK SMALL LETTER THETA
UCP 03B9, L5, 'i', iota ; ι GREEK SMALL LETTER IOTA
UCP 03BA, L5, 'k', kappa ; κ GREEK SMALL LETTER KAPPA
UCP 03BB, L5, 'l', lambda ; λ GREEK SMALL LETTER LAMDA
UCP 03BC, L5, 'm', mu ; μ GREEK SMALL LETTER MU
UCP 03BD, L5, 'n', nu ; ν GREEK SMALL LETTER NU
UCP 03BE, L5, 'x', xi ; ξ GREEK SMALL LETTER XI
UCP 03BF, L5, 'o', omicron ; ο GREEK SMALL LETTER OMICRON
UCP 03C0, L5, 'p', pi ; π GREEK SMALL LETTER PI
UCP 03C1, L5, 'r', rho ; ρ GREEK SMALL LETTER RHO
UCP 03C2, L5, 's', sigmaf ; ς GREEK SMALL LETTER FINAL SIGMA
UCP 03C3, L5, 's', sigma ; σ GREEK SMALL LETTER SIGMA
UCP 03C4, L5, 't', tau ; τ GREEK SMALL LETTER TAU
UCP 03C5, L5, 'u', upsilon ; υ GREEK SMALL LETTER UPSILON
UCP 03C6, L5, 'f', phi ; φ GREEK SMALL LETTER PHI
UCP 03C7, L5, 'ch', chi ; χ GREEK SMALL LETTER CHI
UCP 03C8, L5, 'ps', psi ; ψ GREEK SMALL LETTER PSI
UCP 03C9, L5, 'o', omega ; ω GREEK SMALL LETTER OMEGA
UCP 03CA, L5, 'i', ; ϊ GREEK SMALL LETTER IOTA WITH DIALYTIKA
UCP 03CB, L5, 'u', ; ϋ GREEK SMALL LETTER UPSILON WITH DIALYTIKA
UCP 03CC, L5, 'o', ; ό GREEK SMALL LETTER OMICRON WITH TONOS
UCP 03CD, L5, 'u', ; ύ GREEK SMALL LETTER UPSILON WITH TONOS
UCP 03CE, L5, 'o', ; ώ GREEK SMALL LETTER OMEGA WITH TONOS
UCP 03CF, So, 'K', ; Ϗ GREEK CAPITAL KAI SYMBOL
UCP 03D0, So, 'b', ; ϐ GREEK BETA SYMBOL
UCP 03D1, So, 'th', thetasym ; ϑ GREEK THETA SYMBOL
UCP 03D2, So, 'u', upsih ; ϒ GREEK UPSILON WITH HOOK SYMBOL
UCP 03D3, So, 'u', ; ϓ GREEK UPSILON WITH ACUTE AND HOOK SYMBOL
UCP 03D4, So, 'u', ; ϔ GREEK UPSILON WITH DIAERESIS AND HOOK SYMBOL
UCP 03D5, So, 'ph', ; ϕ GREEK PHI SYMBOL
UCP 03D6, So, 'pi', piv ; ϖ GREEK PI SYMBOL
UCP 03D7, So, 'K', ; ϗ GREEK KAI SYMBOL
UCP 03D8, Lo, 'Q', ; Ϙ GREEK LETTER ARCHAIC KOPPA
UCP 03D9, Lo, 'q', ; ϙ GREEK SMALL LETTER ARCHAIC KOPPA
UCP 03DA, Lo, 'C', ; Ϛ GREEK LETTER STIGMA
UCP 03DB, Lo, 'c', ; ϛ GREEK SMALL LETTER STIGMA
UCP 03DC, Lo, 'F', Gammad ; Ϝ GREEK LETTER DIGAMMA
UCP 03DD, Lo, 'f', gammad ; ϝ GREEK SMALL LETTER DIGAMMA
UCP 03DE, Lo, 'S', ; Ϟ GREEK LETTER KOPPA
UCP 03DF, Lo, 's', ; ϟ GREEK SMALL LETTER KOPPA
UCP 03E0, Lo, 'S', ; Ϡ GREEK LETTER SAMPI
UCP 03E1, Lo, 's', ; ϡ GREEK SMALL LETTER SAMPI
UCP 0401, U4, 'Io', IOcy ; Ё CYRILLIC CAPITAL LETTER IO
UCP 0402, U4, 'Dj', DJcy ; Ђ CYRILLIC CAPITAL LETTER DJE
UCP 0403, U4, 'G', GJcy ; Ѓ CYRILLIC CAPITAL LETTER GJE
UCP 0404, U5, 'E', Jukcy ; Є CYRILLIC CAPITAL LETTER UKRAINIAN IE
UCP 0405, U4, 'S', DScy ; Ѕ CYRILLIC CAPITAL LETTER DZE
UCP 0406, U4, 'I', Iukcy ; І CYRILLIC CAPITAL LETTER BYELORUSSIAN-UKRAINIAN I
UCP 0407, U4, 'I', YIcy ; Ї CYRILLIC CAPITAL LETTER YI
UCP 0408, U4, 'J', Jsercy ; Ј CYRILLIC CAPITAL LETTER JE
UCP 0409, U4, 'Lj', LJcy ; Љ CYRILLIC CAPITAL LETTER LJE
UCP 040A, U4, 'Nj', NJcy ; Њ CYRILLIC CAPITAL LETTER NJE
UCP 040B, U4, 'Cj', TSHcy ; Ћ CYRILLIC CAPITAL LETTER TSHE
UCP 040C, U4, 'K', KJcy ; Ќ CYRILLIC CAPITAL LETTER KJE
UCP 040D, U4, 'I', ; Ѝ CYRILLIC CAPITAL LETTER I WITH GRAVE
UCP 040E, U5, 'Y', Ubrcy ; Ў CYRILLIC CAPITAL LETTER SHORT U
UCP 040F, U4, 'Dz', DZcy ; Џ CYRILLIC CAPITAL LETTER DZHE
UCP 0410, U7, 'A', Acy ; А CYRILLIC CAPITAL LETTER A
UCP 0411, U5, 'B', Bcy ; Б CYRILLIC CAPITAL LETTER BE
UCP 0412, U5, 'V', Vcy ; В CYRILLIC CAPITAL LETTER VE
UCP 0413, U5, 'G', Gcy ; Г CYRILLIC CAPITAL LETTER GHE
UCP 0414, U5, 'D', Dcy ; Д CYRILLIC CAPITAL LETTER DE
UCP 0415, U7, 'E', IEcy ; Е CYRILLIC CAPITAL LETTER IE
UCP 0416, U5, 'Zh', ZHcy ; Ж CYRILLIC CAPITAL LETTER ZHE
UCP 0417, U5, 'Z', Zcy ; З CYRILLIC CAPITAL LETTER ZE
UCP 0418, U5, 'I', Icy ; И CYRILLIC CAPITAL LETTER I
UCP 0419, U5, 'J', Jcy ; Й CYRILLIC CAPITAL LETTER SHORT I
UCP 041A, U5, 'K', Kcy ; К CYRILLIC CAPITAL LETTER KA
UCP 041B, U5, 'L', Lcy ; Л CYRILLIC CAPITAL LETTER EL
UCP 041C, U5, 'M', Mcy ; М CYRILLIC CAPITAL LETTER EM
UCP 041D, U5, 'N', Ncy ; Н CYRILLIC CAPITAL LETTER EN
UCP 041E, U7, 'O', Ocy ; О CYRILLIC CAPITAL LETTER O
UCP 041F, U5, 'P', Pcy ; П CYRILLIC CAPITAL LETTER PE
UCP 0420, U5, 'R', Rcy ; Р CYRILLIC CAPITAL LETTER ER
UCP 0421, U5, 'S', Scy ; С CYRILLIC CAPITAL LETTER ES
UCP 0422, U5, 'T', Tcy ; Т CYRILLIC CAPITAL LETTER TE
UCP 0423, U5, 'U', Ucy ; У CYRILLIC CAPITAL LETTER U
UCP 0424, U5, 'F', Fcy ; Ф CYRILLIC CAPITAL LETTER EF
UCP 0425, U5, 'Kh', KHcy ; Х CYRILLIC CAPITAL LETTER HA
UCP 0426, U5, 'C', TScy ; Ц CYRILLIC CAPITAL LETTER TSE
UCP 0427, U5, 'Ch', CHcy ; Ч CYRILLIC CAPITAL LETTER CHE
UCP 0428, U5, 'Sh', SHcy ; Ш CYRILLIC CAPITAL LETTER SHA
UCP 0429, U5, 'Shch',SHCHcy ; Щ CYRILLIC CAPITAL LETTER SHCHA
UCP 042A, U4, "'", HARDcy ; Ъ CYRILLIC CAPITAL LETTER HARD SIGN
UCP 042B, U5, 'Y', Ycy ; Ы CYRILLIC CAPITAL LETTER YERU
UCP 042C, U5, '', SOFTcy ; Ь CYRILLIC CAPITAL LETTER SOFT SIGN
UCP 042D, U5, 'E', Ecy ; Э CYRILLIC CAPITAL LETTER E
UCP 042E, U6, 'Yu', YUcy ; Ю CYRILLIC CAPITAL LETTER YU
UCP 042F, U6, 'Ya', YAcy ; Я CYRILLIC CAPITAL LETTER YA
UCP 0430, L7, 'a', acy ; а CYRILLIC SMALL LETTER A
UCP 0431, L5, 'b', bcy ; б CYRILLIC SMALL LETTER BE
UCP 0432, L5, 'v', vcy ; в CYRILLIC SMALL LETTER VE
UCP 0433, L5, 'g', gcy ; г CYRILLIC SMALL LETTER GHE
UCP 0434, L5, 'd', dcy ; д CYRILLIC SMALL LETTER DE
UCP 0435, L7, 'e', iecy ; е CYRILLIC SMALL LETTER IE
UCP 0436, L5, 'zh', zhcy ; ж CYRILLIC SMALL LETTER ZHE
UCP 0437, L5, 'z', zcy ; з CYRILLIC SMALL LETTER ZE
UCP 0438, L5, 'i', icy ; и CYRILLIC SMALL LETTER I
UCP 0439, L5, 'j', jcy ; й CYRILLIC SMALL LETTER SHORT I
UCP 043A, L5, 'k', kcy ; к CYRILLIC SMALL LETTER KA
UCP 043B, L5, 'l', lcy ; л CYRILLIC SMALL LETTER EL
UCP 043C, L5, 'm', mcy ; м CYRILLIC SMALL LETTER EM
UCP 043D, L5, 'n', ncy ; н CYRILLIC SMALL LETTER EN
UCP 043E, L7, 'o', ocy ; о CYRILLIC SMALL LETTER O
UCP 043F, L5, 'p', pcy ; п CYRILLIC SMALL LETTER PE
UCP 0440, L5, 'r', rcy ; р CYRILLIC SMALL LETTER ER
UCP 0441, L5, 's', scy ; с CYRILLIC SMALL LETTER ES
UCP 0442, L5, 't', tcy ; т CYRILLIC SMALL LETTER TE
UCP 0443, L5, 'u', ucy ; у CYRILLIC SMALL LETTER U
UCP 0444, L5, 'f', fcy ; ф CYRILLIC SMALL LETTER EF
UCP 0445, L5, 'kh', khcy ; х CYRILLIC SMALL LETTER HA
UCP 0446, L5, 'c', tscy ; ц CYRILLIC SMALL LETTER TSE
UCP 0447, L5, 'ch', chcy ; ч CYRILLIC SMALL LETTER CHE
UCP 0448, L5, 'sh', shcy ; ш CYRILLIC SMALL LETTER SHA
UCP 0449, L5, 'shch',shchcy ; щ CYRILLIC SMALL LETTER SHCHA
UCP 044A, L4, "'", hardcy ; ъ CYRILLIC SMALL LETTER HARD SIGN
UCP 044B, L5, 'y', ycy ; ы CYRILLIC SMALL LETTER YERU
UCP 044C, L5, '', softcy ; ь CYRILLIC SMALL LETTER SOFT SIGN
UCP 044D, L5, 'e', ecy ; э CYRILLIC SMALL LETTER E
UCP 044E, L6, 'yu', yucy ; ю CYRILLIC SMALL LETTER YU
UCP 044F, L6, 'ya', yacy ; я CYRILLIC SMALL LETTER YA
UCP 0451, L5, 'io', iocy ; ё CYRILLIC SMALL LETTER IO
UCP 0452, L5, 'dj', djcy ; ђ CYRILLIC SMALL LETTER DJE
UCP 0453, L5, 'g', gjcy ; ѓ CYRILLIC SMALL LETTER GJE
UCP 0454, L5, 'e', jukcy ; є CYRILLIC SMALL LETTER UKRAINIAN IE
UCP 0455, L5, 's', dscy ; ѕ CYRILLIC SMALL LETTER DZE
UCP 0456, L5, 'i', iukcy ; і CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
UCP 0457, L5, 'i', yicy ; ї CYRILLIC SMALL LETTER YI
UCP 0458, L5, 'j', jsercy ; ј CYRILLIC SMALL LETTER JE
UCP 0459, L5, 'lj', ljcy ; љ CYRILLIC SMALL LETTER LJE
UCP 045A, L5, 'nj', njcy ; њ CYRILLIC SMALL LETTER NJE
UCP 045B, L5, 'cj', tshcy ; ћ CYRILLIC SMALL LETTER TSHE
UCP 045C, L5, 'k', kjcy ; ќ CYRILLIC SMALL LETTER KJE
UCP 045D, L5, 'i', ; ѝ CYRILLIC SMALL LETTER I WITH GRAVE
UCP 045E, L5, 'y', ubrcy ; ў CYRILLIC SMALL LETTER SHORT U
UCP 045F, L5, 'dz', dzcy ; џ CYRILLIC SMALL LETTER DZHE
UCP 0490, U3 , 'G', ; Ґ CYRILLIC CAPITAL LETTER GHE WITH UPTURN
UCP 0491, L3, 'g', ; ґ CYRILLIC SMALL LETTER GHE WITH UPTURN
UCP 0492, U3 , 'G', ; Ғ CYRILLIC CAPITAL LETTER GHE WITH STROKE
UCP 0493, L3, 'g', ; ғ CYRILLIC SMALL LETTER GHE WITH STROKE
UCP 049A, U3 , 'K', ; Қ CYRILLIC CAPITAL LETTER KA WITH DESCENDER
UCP 049B, L3, 'k', ; қ CYRILLIC SMALL LETTER KA WITH DESCENDER
UCP 04B2, U3 , 'H', ; Ҳ CYRILLIC CAPITAL LETTER HA WITH DESCENDER
UCP 04B3, L3, 'h', ; ҳ CYRILLIC SMALL LETTER HA WITH DESCENDER
UCP 04B6, U3 , 'Ch', ; Ҷ CYRILLIC CAPITAL LETTER CHE WITH DESCENDER
UCP 04B7, L3, 'ch', ; ҷ CYRILLIC SMALL LETTER CHE WITH DESCENDER
UCP 04E0, U3 , 'Z', ; Ӡ CYRILLIC CAPITAL LETTER ABKHASIAN DZE
UCP 04E1, L3, 'z', ; ӡ CYRILLIC SMALL LETTER ABKHASIAN DZE
UCP 04E2, U3 , 'I', ; Ӣ CYRILLIC CAPITAL LETTER I WITH MACRON
UCP 04E3, L3, 'i', ; ӣ CYRILLIC SMALL LETTER I WITH MACRON
UCP 04EE, U3 , 'U', ; Ӯ CYRILLIC CAPITAL LETTER U WITH MACRON
UCP 04EF, L3, 'u', ; ӯ CYRILLIC SMALL LETTER U WITH MACRON
UCP 05B0, Mn, '', ; ְ HEBREW POINT SHEVA
UCP 05B1, Mn, '', ; ֱ HEBREW POINT HATAF SEGOL
UCP 05B2, Mn, '', ; ֲ HEBREW POINT HATAF PATAH
UCP 05B3, Mn, '', ; ֳ HEBREW POINT HATAF QAMATS
UCP 05B4, Mn, '', ; ִ HEBREW POINT HIRIQ
UCP 05B5, Mn, '', ; ֵ HEBREW POINT TSERE
UCP 05B6, Mn, '', ; ֶ HEBREW POINT SEGOL
UCP 05B7, Mn, '', ; ַ HEBREW POINT PATAH
UCP 05B8, Mn, '', ; ָ HEBREW POINT QAMATS
UCP 05B9, Mn, '', ; ֹ HEBREW POINT HOLAM
UCP 05BA, Mn, '', ; ֺ HEBREW POINT HOLAM HASER FOR VAV
UCP 05BB, Mn, '', ; ֻ HEBREW POINT QUBUTS
UCP 05BC, Mn, '', ; ּ HEBREW POINT DAGESH OR MAPIQ
UCP 05BD, Mn, '', ; ֽ HEBREW POINT METEG
UCP 05BE, Pd, '-', ; ־ HEBREW PUNCTUATION MAQAF
UCP 05BF, Mn, '', ; ֿ HEBREW POINT RAFE
UCP 05C0, Po, '|', ; ׀ HEBREW PUNCTUATION PASEQ
UCP 05C1, Mn, '', ; ׁ HEBREW POINT SHIN DOT
UCP 05C2, Mn, '', ; ׂ HEBREW POINT SIN DOT
UCP 05C3, Po, ':', ; ׃ HEBREW PUNCTUATION SOF PASUQ
UCP 05D0, Lo, 'A', ; א HEBREW LETTER ALEF
UCP 05D1, Lo, 'B', ; ב HEBREW LETTER BET
UCP 05D2, Lo, 'G', ; ג HEBREW LETTER GIMEL
UCP 05D3, Lo, 'D', ; ד HEBREW LETTER DALET
UCP 05D4, Lo, 'H', ; ה HEBREW LETTER HE
UCP 05D5, Lo, 'V', ; ו HEBREW LETTER VAV
UCP 05D6, Lo, 'Z', ; ז HEBREW LETTER ZAYIN
UCP 05D7, Lo, 'H', ; ח HEBREW LETTER HET
UCP 05D8, Lo, 'T', ; ט HEBREW LETTER TET
UCP 05D9, Lo, 'Yi', ; י HEBREW LETTER YOD
UCP 05DA, Lo, 'Kh', ; ך HEBREW LETTER FINAL KAF
UCP 05DB, Lo, 'Kh', ; כ HEBREW LETTER KAF
UCP 05DC, Lo, 'L', ; ל HEBREW LETTER LAMED
UCP 05DD, Lo, 'M', ; ם HEBREW LETTER FINAL MEM
UCP 05DE, Lo, 'M', ; מ HEBREW LETTER MEM
UCP 05DF, Lo, 'N', ; ן HEBREW LETTER FINAL NUN
UCP 05E0, Lo, 'N', ; נ HEBREW LETTER NUN
UCP 05E1, Lo, 'S', ; ס HEBREW LETTER SAMEKH
UCP 05E2, Lo, 'A', ; ע HEBREW LETTER AYIN
UCP 05E3, Lo, 'P', ; ף HEBREW LETTER FINAL PE
UCP 05E4, Lo, 'P', ; פ HEBREW LETTER PE
UCP 05E5, Lo, 'Tz', ; ץ HEBREW LETTER FINAL TSADI
UCP 05E6, Lo, 'Tz', ; צ HEBREW LETTER TSADI
UCP 05E7, Lo, 'K', ; ק HEBREW LETTER QOF
UCP 05E8, Lo, 'R', ; ר HEBREW LETTER RESH
UCP 05E9, Lo, 'Sh', ; ש HEBREW LETTER SHIN
UCP 05EA, Lo, 'T', ; ת HEBREW LETTER TAV
UCP 05F0, Lo, 'W', ; װ HEBREW LIGATURE YIDDISH DOUBLE VAV
UCP 05F1, Lo, 'V', ; ױ HEBREW LIGATURE YIDDISH VAV YOD
UCP 05F2, Lo, 'W', ; ײ HEBREW LIGATURE YIDDISH DOUBLE YOD
UCP 05F3, Po, "'", ; ׳ HEBREW PUNCTUATION GERESH
UCP 05F4, Po, '"', ; ״ HEBREW PUNCTUATION GERSHAYIM
UCP 060C, Po, ',', ; ، ARABIC COMMA
UCP 061B, Po, ';', ; ؛ ARABIC SEMICOLON
UCP 061F, Po, '?', ; ؟ ARABIC QUESTION MARK
UCP 0621, Lo, "'", ; ء ARABIC LETTER HAMZA
UCP 0622, Lo, 'A', ; آ ARABIC LETTER ALEF WITH MADDA ABOVE
UCP 0623, Lo, 'A', ; أ ARABIC LETTER ALEF WITH HAMZA ABOVE
UCP 0624, Lo, 'W', ; ؤ ARABIC LETTER WAW WITH HAMZA ABOVE
UCP 0625, Lo, 'A', ; إ ARABIC LETTER ALEF WITH HAMZA BELOW
UCP 0626, Lo, 'Y', ; ئ ARABIC LETTER YEH WITH HAMZA ABOVE
UCP 0627, Lo, 'A', ; ا ARABIC LETTER ALEF
UCP 0628, Lo, 'B', ; ب ARABIC LETTER BEH
UCP 0629, Lo, 'T', ; ة ARABIC LETTER TEH MARBUTA
UCP 062A, Lo, 'T', ; ت ARABIC LETTER TEH
UCP 062B, Lo, 'Th', ; ث ARABIC LETTER THEH
UCP 062C, Lo, 'J', ; ج ARABIC LETTER JEEM
UCP 062D, Lo, 'H', ; ح ARABIC LETTER HAH
UCP 062E, Lo, 'Kh', ; خ ARABIC LETTER KHAH
UCP 062F, Lo, 'D', ; د ARABIC LETTER DAL
UCP 0630, Lo, 'Dh', ; ذ ARABIC LETTER THAL
UCP 0631, Lo, 'R', ; ر ARABIC LETTER REH
UCP 0632, Lo, 'Z', ; ز ARABIC LETTER ZAIN
UCP 0633, Lo, 'S', ; س ARABIC LETTER SEEN
UCP 0634, Lo, 'Sh', ; ش ARABIC LETTER SHEEN
UCP 0635, Lo, 'S', ; ص ARABIC LETTER SAD
UCP 0636, Lo, 'D', ; ض ARABIC LETTER DAD
UCP 0637, Lo, 'T', ; ط ARABIC LETTER TAH
UCP 0638, Lo, 'Z', ; ظ ARABIC LETTER ZAH
UCP 0639, Lo, "'", ; ع ARABIC LETTER AIN
UCP 063A, Lo, 'Gh', ; غ ARABIC LETTER GHAIN
UCP 0640, Lm, '_', ; ـ ARABIC TATWEEL
UCP 0641, Lo, 'F', ; ف ARABIC LETTER FEH
UCP 0642, Lo, 'Q', ; ق ARABIC LETTER QAF
UCP 0643, Lo, 'K', ; ك ARABIC LETTER KAF
UCP 0644, Lo, 'L', ; ل ARABIC LETTER LAM
UCP 0645, Lo, 'M', ; م ARABIC LETTER MEEM
UCP 0646, Lo, 'N', ; ن ARABIC LETTER NOON
UCP 0647, Lo, 'H', ; ه ARABIC LETTER HEH
UCP 0648, Lo, 'W', ; و ARABIC LETTER WAW
UCP 0649, Lo, 'A', ; ى ARABIC LETTER ALEF MAKSURA
UCP 064A, Lo, 'Y', ; ي ARABIC LETTER YEH
UCP 064B, Mn, 'A', ; ً ARABIC FATHATAN
UCP 064C, Mn, 'U', ; ٌ ARABIC DAMMATAN
UCP 064D, Mn, 'I', ; ٍ ARABIC KASRATAN
UCP 064E, Mn, 'A', ; َ ARABIC FATHA
UCP 064F, Mn, 'U', ; ُ ARABIC DAMMA
UCP 0650, Mn, 'I', ; ِ ARABIC KASRA
UCP 0651, Mn, '', ; ّ ARABIC SHADDA
UCP 0652, Mn, '', ; ْ ARABIC SUKUN
UCP 0660, Nd, '0', ; ٠ ARABIC-INDIC DIGIT ZERO
UCP 0661, Nd, '1', ; ١ ARABIC-INDIC DIGIT ONE
UCP 0662, Nd, '2', ; ٢ ARABIC-INDIC DIGIT TWO
UCP 0663, Nd, '3', ; ٣ ARABIC-INDIC DIGIT THREE
UCP 0664, Nd, '4', ; ٤ ARABIC-INDIC DIGIT FOUR
UCP 0665, Nd, '5', ; ٥ ARABIC-INDIC DIGIT FIVE
UCP 0666, Nd, '6', ; ٦ ARABIC-INDIC DIGIT SIX
UCP 0667, Nd, '7', ; ٧ ARABIC-INDIC DIGIT SEVEN
UCP 0668, Nd, '8', ; ٨ ARABIC-INDIC DIGIT EIGHT
UCP 0669, Nd, '9', ; ٩ ARABIC-INDIC DIGIT NINE
UCP 066A, Po, '%%', ; ٪ ARABIC PERCENT SIGN
UCP 0679, Lo, 'T', ; ٹ ARABIC LETTER TTEH
UCP 067E, Lo, 'P', ; پ ARABIC LETTER PEH
UCP 0686, Lo, 'Ch', ; چ ARABIC LETTER TCHEH
UCP 0688, Lo, 'D', ; ڈ ARABIC LETTER DDAL
UCP 0691, Lo, 'R', ; ڑ ARABIC LETTER RREH
UCP 0698, Lo, 'J', ; ژ ARABIC LETTER JEH
UCP 06A4, Lo, 'V', ; ڤ ARABIC LETTER VEH
UCP 06A9, Lo, 'Kh', ; ک ARABIC LETTER KEHEH
UCP 06AF, Lo, 'G', ; گ ARABIC LETTER GAF
UCP 06BA, Lo, 'N', ; ں ARABIC LETTER NOON GHUNNA
UCP 06BE, Lo, 'H', ; ھ ARABIC LETTER HEH DOACHASHMEE
UCP 06C1, Lo, 'H', ; ہ ARABIC LETTER HEH GOAL
UCP 06D2, Lo, 'Y', ; ے ARABIC LETTER YEH BARREE
UCP 06D5, Lo, 'Ae', ; ە ARABIC LETTER AE
UCP 06F0, Nd, '0', ; ۰ EXTENDED ARABIC-INDIC DIGIT ZERO
UCP 06F1, Nd, '1', ; ۱ EXTENDED ARABIC-INDIC DIGIT ONE
UCP 06F2, Nd, '2', ; ۲ EXTENDED ARABIC-INDIC DIGIT TWO
UCP 06F3, Nd, '3', ; ۳ EXTENDED ARABIC-INDIC DIGIT THREE
UCP 06F4, Nd, '4', ; ۴ EXTENDED ARABIC-INDIC DIGIT FOUR
UCP 06F5, Nd, '5', ; ۵ EXTENDED ARABIC-INDIC DIGIT FIVE
UCP 06F6, Nd, '6', ; ۶ EXTENDED ARABIC-INDIC DIGIT SIX
UCP 06F7, Nd, '7', ; ۷ EXTENDED ARABIC-INDIC DIGIT SEVEN
UCP 06F8, Nd, '8', ; ۸ EXTENDED ARABIC-INDIC DIGIT EIGHT
UCP 06F9, Nd, '9', ; ۹ EXTENDED ARABIC-INDIC DIGIT NINE
UCP 0E01, Lo, 'K', ; ก THAI CHARACTER KO KAI
UCP 0E02, Lo, 'Kh', ; ข THAI CHARACTER KHO KHAI
UCP 0E03, Lo, 'Kh', ; ฃ THAI CHARACTER KHO KHUAT
UCP 0E04, Lo, 'Kh', ; ค THAI CHARACTER KHO KHWAI
UCP 0E05, Lo, 'Kh', ; ฅ THAI CHARACTER KHO KHON
UCP 0E06, Lo, 'Kh', ; ฆ THAI CHARACTER KHO RAKHANG
UCP 0E07, Lo, 'Ng', ; ง THAI CHARACTER NGO NGU
UCP 0E08, Lo, 'Ch', ; จ THAI CHARACTER CHO CHAN
UCP 0E09, Lo, 'Ch', ; ฉ THAI CHARACTER CHO CHING
UCP 0E0A, Lo, 'Ch', ; ช THAI CHARACTER CHO CHANG
UCP 0E0B, Lo, 'S', ; ซ THAI CHARACTER SO SO
UCP 0E0C, Lo, 'Ch', ; ฌ THAI CHARACTER CHO CHOE
UCP 0E0D, Lo, 'Y', ; ญ THAI CHARACTER YO YING
UCP 0E0E, Lo, 'D', ; ฎ THAI CHARACTER DO CHADA
UCP 0E0F, Lo, 'T', ; ฏ THAI CHARACTER TO PATAK
UCP 0E10, Lo, 'Th', ; ฐ THAI CHARACTER THO THAN
UCP 0E11, Lo, 'Th', ; ฑ THAI CHARACTER THO NANGMONTHO
UCP 0E12, Lo, 'Th', ; ฒ THAI CHARACTER THO PHUTHAO
UCP 0E13, Lo, 'N', ; ณ THAI CHARACTER NO NEN
UCP 0E14, Lo, 'D', ; ด THAI CHARACTER DO DEK
UCP 0E15, Lo, 'T', ; ต THAI CHARACTER TO TAO
UCP 0E16, Lo, 'Th', ; ถ THAI CHARACTER THO THUNG
UCP 0E17, Lo, 'Th', ; ท THAI CHARACTER THO THAHAN
UCP 0E18, Lo, 'Th', ; ธ THAI CHARACTER THO THONG
UCP 0E19, Lo, 'N', ; น THAI CHARACTER NO NU
UCP 0E1A, Lo, 'B', ; บ THAI CHARACTER BO BAIMAI
UCP 0E1B, Lo, 'P', ; ป THAI CHARACTER PO PLA
UCP 0E1C, Lo, 'Ph', ; ผ THAI CHARACTER PHO PHUNG
UCP 0E1D, Lo, 'F', ; ฝ THAI CHARACTER FO FA
UCP 0E1E, Lo, 'Ph', ; พ THAI CHARACTER PHO PHAN
UCP 0E1F, Lo, 'F', ; ฟ THAI CHARACTER FO FAN
UCP 0E20, Lo, 'Ph', ; ภ THAI CHARACTER PHO SAMPHAO
UCP 0E21, Lo, 'M', ; ม THAI CHARACTER MO MA
UCP 0E22, Lo, 'Y', ; ย THAI CHARACTER YO YAK
UCP 0E23, Lo, 'R', ; ร THAI CHARACTER RO RUA
UCP 0E24, Lo, 'R', ; ฤ THAI CHARACTER RU
UCP 0E25, Lo, 'L', ; ล THAI CHARACTER LO LING
UCP 0E26, Lo, 'L', ; ฦ THAI CHARACTER LU
UCP 0E27, Lo, 'W', ; ว THAI CHARACTER WO WAEN
UCP 0E28, Lo, 'S', ; ศ THAI CHARACTER SO SALA
UCP 0E29, Lo, 'S', ; ษ THAI CHARACTER SO RUSI
UCP 0E2A, Lo, 'S', ; ส THAI CHARACTER SO SUA
UCP 0E2B, Lo, 'H', ; ห THAI CHARACTER HO HIP
UCP 0E2C, Lo, 'L', ; ฬ THAI CHARACTER LO CHULA
UCP 0E2D, Lo, 'O', ; อ THAI CHARACTER O ANG
UCP 0E2E, Lo, 'H', ; ฮ THAI CHARACTER HO NOKHUK
UCP 0E2F, Lo, 'A', ; ฯ THAI CHARACTER PAIYANNOI
UCP 0E30, Lo, 'A', ; ะ THAI CHARACTER SARA A
UCP 0E31, Mn, '', ; ั THAI CHARACTER MAI HAN-AKAT
UCP 0E32, Lo, 'A', ; า THAI CHARACTER SARA AA
UCP 0E33, Lo, 'A', ; ำ THAI CHARACTER SARA AM
UCP 0E34, Mn, '', ; ิ THAI CHARACTER SARA I
UCP 0E35, Mn, '', ; ี THAI CHARACTER SARA II
UCP 0E36, Mn, '', ; ึ THAI CHARACTER SARA UE
UCP 0E37, Mn, '', ; ื THAI CHARACTER SARA UEE
UCP 0E38, Mn, '', ; ุ THAI CHARACTER SARA U
UCP 0E39, Mn, '', ; ู THAI CHARACTER SARA UU
UCP 0E3A, Mn, '', ; ฺ THAI CHARACTER PHINTHU
UCP 0E3F, Sc, '$', ; ฿ THAI CURRENCY SYMBOL BAHT
UCP 0E40, Lo, 'E', ; เ THAI CHARACTER SARA E
UCP 0E41, Lo, 'AE', ; แ THAI CHARACTER SARA AE
UCP 0E42, Lo, 'O', ; โ THAI CHARACTER SARA O
UCP 0E43, Lo, 'I', ; ใ THAI CHARACTER SARA AI MAIMUAN
UCP 0E44, Lo, 'I', ; ไ THAI CHARACTER SARA AI MAIMALAI
UCP 0E45, Lo, 'A', ; ๅ THAI CHARACTER LAKKHANGYAO
UCP 0E46, Lm, '`', ; ๆ THAI CHARACTER MAIYAMOK
UCP 0E47, Mn, '', ; ็ THAI CHARACTER MAITAIKHU
UCP 0E48, Mn, '', ; ่ THAI CHARACTER MAI EK
UCP 0E49, Mn, '', ; ้ THAI CHARACTER MAI THO
UCP 0E4A, Mn, '', ; ๊ THAI CHARACTER MAI TRI
UCP 0E4B, Mn, '', ; ๋ THAI CHARACTER MAI CHATTAWA
UCP 0E4C, Mn, '', ; ์ THAI CHARACTER THANTHAKHAT
UCP 0E4D, Mn, '', ; ํ THAI CHARACTER NIKHAHIT
UCP 0E4E, Mn, '', ; ๎ THAI CHARACTER YAMAKKAN
UCP 0E4F, Po, '#', ; ๏ THAI CHARACTER FONGMAN
UCP 0E50, Nd, '0', ; ๐ THAI DIGIT ZERO
UCP 0E51, Nd, '1', ; ๑ THAI DIGIT ONE
UCP 0E52, Nd, '2', ; ๒ THAI DIGIT TWO
UCP 0E53, Nd, '3', ; ๓ THAI DIGIT THREE
UCP 0E54, Nd, '4', ; ๔ THAI DIGIT FOUR
UCP 0E55, Nd, '5', ; ๕ THAI DIGIT FIVE
UCP 0E56, Nd, '6', ; ๖ THAI DIGIT SIX
UCP 0E57, Nd, '7', ; ๗ THAI DIGIT SEVEN
UCP 0E58, Nd, '8', ; ๘ THAI DIGIT EIGHT
UCP 0E59, Nd, '9', ; ๙ THAI DIGIT NINE
UCP 0E5A, Po, '|', ; ๚ THAI CHARACTER ANGKHANKHU
UCP 0E5B, Po, '>>', ; ๛ THAI CHARACTER KHOMUT
UCP 1403, Lo, 'I', ; ᐃ CANADIAN SYLLABICS I
UCP 1404, Lo, 'Ii', ; ᐄ CANADIAN SYLLABICS II
UCP 1405, Lo, 'O', ; ᐅ CANADIAN SYLLABICS O
UCP 1406, Lo, 'Oo', ; ᐆ CANADIAN SYLLABICS OO
UCP 140A, Lo, 'A', ; ᐊ CANADIAN SYLLABICS A
UCP 140B, Lo, 'Aa', ; ᐋ CANADIAN SYLLABICS AA
UCP 1431, Lo, 'Pi', ; ᐱ CANADIAN SYLLABICS PI
UCP 1432, Lo, 'Pii', ; ᐲ CANADIAN SYLLABICS PII
UCP 1433, Lo, 'Po', ; ᐳ CANADIAN SYLLABICS PO
UCP 1434, Lo, 'Poo', ; ᐴ CANADIAN SYLLABICS POO
UCP 1438, Lo, 'Pa', ; ᐸ CANADIAN SYLLABICS PA
UCP 1439, Lo, 'Paa', ; ᐹ CANADIAN SYLLABICS PAA
UCP 1449, Lo, 'P', ; ᑉ CANADIAN SYLLABICS P
UCP 144E, Lo, 'Ti', ; ᑎ CANADIAN SYLLABICS TI
UCP 144F, Lo, 'Tii', ; ᑏ CANADIAN SYLLABICS TII
UCP 1450, Lo, 'To', ; ᑐ CANADIAN SYLLABICS TO
UCP 1451, Lo, 'Too', ; ᑑ CANADIAN SYLLABICS TOO
UCP 1455, Lo, 'Ta', ; ᑕ CANADIAN SYLLABICS TA
UCP 1456, Lo, 'Taa', ; ᑖ CANADIAN SYLLABICS TAA
UCP 1466, Lo, 'T', ; ᑦ CANADIAN SYLLABICS T
UCP 146D, Lo, 'Ki', ; ᑭ CANADIAN SYLLABICS KI
UCP 146E, Lo, 'Kii', ; ᑮ CANADIAN SYLLABICS KII
UCP 146F, Lo, 'Ko', ; ᑯ CANADIAN SYLLABICS KO
UCP 1470, Lo, 'Koo', ; ᑰ CANADIAN SYLLABICS KOO
UCP 1472, Lo, 'Ka', ; ᑲ CANADIAN SYLLABICS KA
UCP 1473, Lo, 'Kaa', ; ᑳ CANADIAN SYLLABICS KAA
UCP 1483, Lo, 'K', ; ᒃ CANADIAN SYLLABICS K
UCP 148B, Lo, 'Ci', ; ᒋ CANADIAN SYLLABICS CI
UCP 148C, Lo, 'Cii', ; ᒌ CANADIAN SYLLABICS CII
UCP 148D, Lo, 'Co', ; ᒍ CANADIAN SYLLABICS CO
UCP 148E, Lo, 'Coo', ; ᒎ CANADIAN SYLLABICS COO
UCP 1490, Lo, 'Ca', ; ᒐ CANADIAN SYLLABICS CA
UCP 1491, Lo, 'Caa', ; ᒑ CANADIAN SYLLABICS CAA
UCP 14A1, Lo, 'C', ; ᒡ CANADIAN SYLLABICS C
UCP 14A5, Lo, 'Mi', ; ᒥ CANADIAN SYLLABICS MI
UCP 14A6, Lo, 'Mii', ; ᒦ CANADIAN SYLLABICS MII
UCP 14A7, Lo, 'Mo', ; ᒧ CANADIAN SYLLABICS MO
UCP 14A8, Lo, 'Moo', ; ᒨ CANADIAN SYLLABICS MOO
UCP 14AA, Lo, 'Ma', ; ᒪ CANADIAN SYLLABICS MA
UCP 14AB, Lo, 'Maa', ; ᒫ CANADIAN SYLLABICS MAA
UCP 14BB, Lo, 'M', ; ᒻ CANADIAN SYLLABICS M
UCP 14C2, Lo, 'Ni', ; ᓂ CANADIAN SYLLABICS NI
UCP 14C3, Lo, 'Nii', ; ᓃ CANADIAN SYLLABICS NII
UCP 14C4, Lo, 'No', ; ᓄ CANADIAN SYLLABICS NO
UCP 14C5, Lo, 'Noo', ; ᓅ CANADIAN SYLLABICS NOO
UCP 14C7, Lo, 'Na', ; ᓇ CANADIAN SYLLABICS NA
UCP 14C8, Lo, 'Naa', ; ᓈ CANADIAN SYLLABICS NAA
UCP 14D0, Lo, 'N', ; ᓐ CANADIAN SYLLABICS N
UCP 14D5, Lo, 'Li', ; ᓕ CANADIAN SYLLABICS LI
UCP 14D6, Lo, 'Lii', ; ᓖ CANADIAN SYLLABICS LII
UCP 14D7, Lo, 'Lo', ; ᓗ CANADIAN SYLLABICS LO
UCP 14D8, Lo, 'Loo', ; ᓘ CANADIAN SYLLABICS LOO
UCP 14DA, Lo, 'La', ; ᓚ CANADIAN SYLLABICS LA
UCP 14DB, Lo, 'Laa', ; ᓛ CANADIAN SYLLABICS LAA
UCP 14EA, Lo, 'L', ; ᓪ CANADIAN SYLLABICS L
UCP 14EF, Lo, 'Si', ; ᓯ CANADIAN SYLLABICS SI
UCP 14F0, Lo, 'Sii', ; ᓰ CANADIAN SYLLABICS SII
UCP 14F1, Lo, 'So', ; ᓱ CANADIAN SYLLABICS SO
UCP 14F2, Lo, 'Soo', ; ᓲ CANADIAN SYLLABICS SOO
UCP 14F4, Lo, 'Sa', ; ᓴ CANADIAN SYLLABICS SA
UCP 14F5, Lo, 'Saa', ; ᓵ CANADIAN SYLLABICS SAA
UCP 1505, Lo, 'S', ; ᔅ CANADIAN SYLLABICS S
UCP 1528, Lo, 'Yi', ; ᔨ CANADIAN SYLLABICS YI
UCP 1529, Lo, 'Yii', ; ᔩ CANADIAN SYLLABICS YII
UCP 152A, Lo, 'Yo', ; ᔪ CANADIAN SYLLABICS YO
UCP 152B, Lo, 'Yoo', ; ᔫ CANADIAN SYLLABICS YOO
UCP 152D, Lo, 'Ya', ; ᔭ CANADIAN SYLLABICS YA
UCP 152E, Lo, 'Yaa', ; ᔮ CANADIAN SYLLABICS YAA
UCP 153E, Lo, 'Y', ; ᔾ CANADIAN SYLLABICS Y
UCP 1546, Lo, 'Ri', ; ᕆ CANADIAN SYLLABICS RI
UCP 1547, Lo, 'Rii', ; ᕇ CANADIAN SYLLABICS RII
UCP 1548, Lo, 'Ro', ; ᕈ CANADIAN SYLLABICS RO
UCP 1549, Lo, 'Roo', ; ᕉ CANADIAN SYLLABICS ROO
UCP 154B, Lo, 'Ra', ; ᕋ CANADIAN SYLLABICS RA
UCP 154C, Lo, 'Raa', ; ᕌ CANADIAN SYLLABICS RAA
UCP 1550, Lo, 'R', ; ᕐ CANADIAN SYLLABICS R
UCP 1555, Lo, 'Fi', ; ᕕ CANADIAN SYLLABICS FI
UCP 1556, Lo, 'Fii', ; ᕖ CANADIAN SYLLABICS FII
UCP 1557, Lo, 'Fo', ; ᕗ CANADIAN SYLLABICS FO
UCP 1558, Lo, 'Foo', ; ᕘ CANADIAN SYLLABICS FOO
UCP 1559, Lo, 'Fa', ; ᕙ CANADIAN SYLLABICS FA
UCP 155A, Lo, 'Faa', ; ᕚ CANADIAN SYLLABICS FAA
UCP 155D, Lo, 'F', ; ᕝ CANADIAN SYLLABICS F
UCP 157C, Lo, 'H', ; ᕼ CANADIAN SYLLABICS NUNAVUT H
UCP 157F, Lo, 'Qi', ; ᕿ CANADIAN SYLLABICS QI
UCP 1580, Lo, 'Qii', ; ᖀ CANADIAN SYLLABICS QII
UCP 1581, Lo, 'Qo', ; ᖁ CANADIAN SYLLABICS QO
UCP 1582, Lo, 'Qoo', ; ᖂ CANADIAN SYLLABICS QOO
UCP 1583, Lo, 'Qa', ; ᖃ CANADIAN SYLLABICS QA
UCP 1584, Lo, 'Qaa', ; ᖄ CANADIAN SYLLABICS QAA
UCP 1585, Lo, 'Q', ; ᖅ CANADIAN SYLLABICS Q
UCP 158F, Lo, 'Ngi', ; ᖏ CANADIAN SYLLABICS NGI
UCP 1590, Lo, 'Ngii', ; ᖐ CANADIAN SYLLABICS NGII
UCP 1591, Lo, 'Ngo', ; ᖑ CANADIAN SYLLABICS NGO
UCP 1592, Lo, 'Ngoo', ; ᖒ CANADIAN SYLLABICS NGOO
UCP 1593, Lo, 'Nga', ; ᖓ CANADIAN SYLLABICS NGA
UCP 1594, Lo, 'Ngaa', ; ᖔ CANADIAN SYLLABICS NGAA
UCP 1595, Lo, 'Ng', ; ᖕ CANADIAN SYLLABICS NG
UCP 1596, Lo, 'Nng', ; ᖖ CANADIAN SYLLABICS NNG
UCP 15A0, Lo, 'Lhi', ; ᖠ CANADIAN SYLLABICS LHI
UCP 15A1, Lo, 'Lhii', ; ᖡ CANADIAN SYLLABICS LHII
UCP 15A2, Lo, 'Lho', ; ᖢ CANADIAN SYLLABICS LHO
UCP 15A3, Lo, 'Lhoo', ; ᖣ CANADIAN SYLLABICS LHOO
UCP 15A4, Lo, 'Lha', ; ᖤ CANADIAN SYLLABICS LHA
UCP 15A5, Lo, 'Lhaa', ; ᖥ CANADIAN SYLLABICS LHAA
UCP 15A6, Lo, 'Lh', ; ᖦ CANADIAN SYLLABICS LH
UCP 1671, Lo, 'Nngi', ; ᙱ CANADIAN SYLLABICS NNGI
UCP 1672, Lo, 'Nngii', ; ᙲ CANADIAN SYLLABICS NNGII
UCP 1673, Lo, 'Nngo', ; ᙳ CANADIAN SYLLABICS NNGO
UCP 1674, Lo, 'Nngoo', ; ᙴ CANADIAN SYLLABICS NNGOO
UCP 1675, Lo, 'Nnga', ; ᙵ CANADIAN SYLLABICS NNGA
UCP 1676, Lo, 'Nngaa', ; ᙶ CANADIAN SYLLABICS NNGAA
UCP 1E02, U5, 'B', ; Ḃ LATIN CAPITAL LETTER B WITH DOT ABOVE
UCP 1E03, L5, 'b', ; ḃ LATIN SMALL LETTER B WITH DOT ABOVE
UCP 1E0A, U5, 'D', ; Ḋ LATIN CAPITAL LETTER D WITH DOT ABOVE
UCP 1E0B, L5, 'd', ; ḋ LATIN SMALL LETTER D WITH DOT ABOVE
UCP 1E1E, U5, 'F', ; Ḟ LATIN CAPITAL LETTER F WITH DOT ABOVE
UCP 1E1F, L5, 'f', ; ḟ LATIN SMALL LETTER F WITH DOT ABOVE
UCP 1E40, U5, 'M', ; Ṁ LATIN CAPITAL LETTER M WITH DOT ABOVE
UCP 1E41, L5, 'm', ; ṁ LATIN SMALL LETTER M WITH DOT ABOVE
UCP 1E56, U5, 'P', ; Ṗ LATIN CAPITAL LETTER P WITH DOT ABOVE
UCP 1E57, L5, 'p', ; ṗ LATIN SMALL LETTER P WITH DOT ABOVE
UCP 1E60, U5, 'S', ; Ṡ LATIN CAPITAL LETTER S WITH DOT ABOVE
UCP 1E61, L5, 's', ; ṡ LATIN SMALL LETTER S WITH DOT ABOVE
UCP 1E6A, U5, 'T', ; Ṫ LATIN CAPITAL LETTER T WITH DOT ABOVE
UCP 1E6B, L5, 't', ; ṫ LATIN SMALL LETTER T WITH DOT ABOVE
UCP 1E80, U5, 'W', ; Ẁ LATIN CAPITAL LETTER W WITH GRAVE
UCP 1E81, L5, 'w', ; ẁ LATIN SMALL LETTER W WITH GRAVE
UCP 1E82, U5, 'W', ; Ẃ LATIN CAPITAL LETTER W WITH ACUTE
UCP 1E83, L5, 'w', ; ẃ LATIN SMALL LETTER W WITH ACUTE
UCP 1E84, U5, 'W', ; Ẅ LATIN CAPITAL LETTER W WITH DIAERESIS
UCP 1E85, L5, 'w', ; ẅ LATIN SMALL LETTER W WITH DIAERESIS
UCP 1E9B, L5, 's', ; ẛ LATIN SMALL LETTER LONG S WITH DOT ABOVE
UCP 1E9E, U5, 'SS', ; ẞ LATIN CAPITAL LETTER SHARP S
UCP 1EF2, U5, 'Y', ; Ỳ LATIN CAPITAL LETTER Y WITH GRAVE
UCP 1EF3, L5, 'y', ; ỳ LATIN SMALL LETTER Y WITH GRAVE
UCP 2002, Zs, ' ', ensp ; EN SPACE
UCP 2003, Zs, ' ', emsp ; EM SPACE
UCP 2004, Zs, ' ', emsp13 ; THREE-PER-EM SPACE
UCP 2005, Zs, ' ', emsp14 ; FOUR-PER-EM SPACE
UCP 2007, Zs, ' ', numsp ; FIGURE SPACE
UCP 2008, Zs, ' ', puncsp ; PUNCTUATION SPACE
UCP 2009, Zs, ' ', thinsp ; THIN SPACE
UCP 200A, Zs, ' ', hairsp ; HAIR SPACE
UCP 200B, Cf, '', ; ZERO WIDTH SPACE
UCP 200C, Cf, '', zwnj ; ZERO WIDTH NON-JOINER
UCP 200D, Cf, '', zwj ; ZERO WIDTH JOINER
UCP 200E, Cf, '', lrm ; LEFT-TO-RIGHT MARK
UCP 200F, Cf, '', rlm ; RIGHT-TO-LEFT MARK
UCP 2010, Pd, '-', dash ; ‐ HYPHEN
UCP 2013, Pd, '-', ndash ; – EN DASH
UCP 2014, Pd, '-', mdash ; — EM DASH
UCP 2015, Pd, '-', horbar ; ― HORIZONTAL BAR
UCP 2016, Po, '|', Verbar ; ‖ DOUBLE VERTICAL LINE
UCP 2017, Po, '_', ; ‗ DOUBLE LOW LINE
UCP 2018, Pi, "'", lsquo ; ‘ LEFT SINGLE QUOTATION MARK
UCP 2019, Pf, "'", rsquo ; ’ RIGHT SINGLE QUOTATION MARK
UCP 201A, Ps, "'", sbquo ; ‚ SINGLE LOW-9 QUOTATION MARK
UCP 201C, Pi, '"', ldquo ; “ LEFT DOUBLE QUOTATION MARK
UCP 201D, Pf, '"', rdquo ; ” RIGHT DOUBLE QUOTATION MARK
UCP 201E, Ps, '"', bdquo ; „ DOUBLE LOW-9 QUOTATION MARK
UCP 2020, Po, '+', dagger ; † DAGGER
UCP 2021, Po, '+', Dagger ; ‡ DOUBLE DAGGER
UCP 2022, Po, '.', bull ; • BULLET
UCP 2025, Po, '..', nldr ; ‥ TWO DOT LEADER
UCP 2026, Po, '...', hellip ; … HORIZONTAL ELLIPSIS
UCP 202A, Cf, '', ; LEFT-TO-RIGHT EMBEDDING
UCP 202B, Cf, '', ; RIGHT-TO-LEFT EMBEDDING
UCP 202C, Cf, '', ; POP DIRECTIONAL FORMATTING
UCP 202D, Cf, '', ; LEFT-TO-RIGHT OVERRIDE
UCP 202E, Cf, '', ; RIGHT-TO-LEFT OVERRIDE
UCP 2030, Po, '%%', permil ; ‰ PER MILLE SIGN
UCP 2032, Po, "'", prime ; ′ PRIME
UCP 2033, Po, '"', Prime ; ″ DOUBLE PRIME
UCP 2034, Po, '"', tprime ; ‴ TRIPLE PRIME
UCP 2035, Po, '`', bprime ; ‵ REVERSED PRIME
UCP 2039, Pi, '<', lsaquo ; ‹ SINGLE LEFT-POINTING ANGLE QUOTATION MARK
UCP 203A, Pf, '>', rsaquo ; › SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
UCP 203E, Po, '_', oline ; ‾ OVERLINE
UCP 2041, So, '^', caret ; ⁁ CARET INSERTION POINT
UCP 2043, So, '.', hybull ; ⁃ HYPHEN BULLET
UCP 2044, Sm, '/', frasl ; ⁄ FRACTION SLASH
UCP 204A, Po, '@', ; ⁊ TIRONIAN SIGN ET
UCP 204F, Po, ';', bsemi ; ⁏ REVERSED SEMICOLON
UCP 2060, Cf, '', NoBreak ; WORD JOINER
UCP 2063, Cf, '', ic ; INVISIBLE SEPARATOR
UCP 207F, Lm, '`', ; ⁿ SUPERSCRIPT LATIN SMALL LETTER N
UCP 20A7, Sc, '$', ; ₧ PESETA SIGN
UCP 20AA, Sc, '$', ; ₪ NEW SHEQEL SIGN
UCP 20AB, Sc, '$', ; ₫ DONG SIGN
UCP 20AC, Sc, '$', euro ; € EURO SIGN
UCP 20AF, Sc, '$', ; ₯ DRACHMA SIGN
UCP 2105, So, '%%', incare ; ℅ CARE OF
UCP 2113, L5, 'l', ell ; ℓ SCRIPT SMALL L
UCP 2116, So, 'N', numero ; № NUMERO SIGN
UCP 2122, So, '(TM)',trade ; ™ TRADE MARK SIGN
UCP 2126, U5, 'O', ohm ; Ω OHM SIGN
UCP 2190, Sm, '<', larr ; ← LEFTWARDS ARROW
UCP 2191, Sm, '^', uarr ; ↑ UPWARDS ARROW
UCP 2192, Sm, '>', rarr ; → RIGHTWARDS ARROW
UCP 2193, Sm, 'v', darr ; ↓ DOWNWARDS ARROW
UCP 2194, Sm, '-', harr ; ↔ LEFT RIGHT ARROW
UCP 2195, So, '|', varr ; ↕ UP DOWN ARROW
UCP 21B5, So, '<', crarr ; ↵ DOWNWARDS ARROW WITH CORNER LEFTWARDS
UCP 2200, Sm, 'v', forall ; ∀ FOR ALL
UCP 2202, Sm, 'd', part ; ∂ PARTIAL DIFFERENTIAL
UCP 2203, Sm, 'E', exist ; ∃ THERE EXISTS
UCP 2205, Sm, '/', empty ; ∅ EMPTY SET
UCP 2206, Sm, '#', ; ∆ INCREMENT
UCP 2207, Sm, '.', nabla ; ∇ NABLA
UCP 2208, Sm, 'E', isin ; ∈ ELEMENT OF
UCP 2209, Sm, '/', notin ; ∉ NOT AN ELEMENT OF
UCP 220B, Sm, 'E', ni ; ∋ CONTAINS AS MEMBER
UCP 220F, Sm, '#', prod ; ∏ N-ARY PRODUCT
UCP 2211, Sm, '#', sum ; ∑ N-ARY SUMMATION
UCP 2212, Sm, '-', minus ; − MINUS SIGN
UCP 2217, Sm, '*', lowast ; ∗ ASTERISK OPERATOR
UCP 2219, Sm, '.', ; ∙ BULLET OPERATOR
UCP 221A, Sm, '#', radic ; √ SQUARE ROOT
UCP 221D, Sm, 'o', prop ; ∝ PROPORTIONAL TO
UCP 221E, Sm, '#', infin ; ∞ INFINITY
UCP 2220, Sm, '<', ang ; ∠ ANGLE
UCP 2227, Sm, '&', and ; ∧ LOGICAL AND
UCP 2228, Sm, '|', or ; ∨ LOGICAL OR
UCP 2229, Sm, '#', cap ; ∩ INTERSECTION
UCP 222A, Sm, 'U', cup ; ∪ UNION
UCP 222B, Sm, '/', int ; ∫ INTEGRAL
UCP 2234, Sm, '.', there4 ; ∴ THEREFORE
UCP 223C, Sm, '~', sim ; ∼ TILDE OPERATOR
UCP 2245, Sm, '~', cong ; ≅ APPROXIMATELY EQUAL TO
UCP 2248, Sm, '=', asymp ; ≈ ALMOST EQUAL TO
UCP 2260, Sm, '=', ne ; ≠ NOT EQUAL TO
UCP 2261, Sm, '=', equiv ; ≡ IDENTICAL TO
UCP 2264, Sm, '<', le ; ≤ LESS-THAN OR EQUAL TO
UCP 2265, Sm, '>', ge ; ≥ GREATER-THAN OR EQUAL TO
UCP 2282, Sm, '<', sub ; ⊂ SUBSET OF
UCP 2283, Sm, '>', sup ; ⊃ SUPERSET OF
UCP 2284, Sm, '/', nsub ; ⊄ NOT SUBSET OF
UCP 2286, Sm, '<=', sube ; ⊆ SUBSET OR EQUAL TO
UCP 2287, Sm, '=>', supe ; ⊇ SUPERSET OR EQUAL TO
UCP 2295, Sm, '+', oplus ; ⊕ CIRCLED PLUS
UCP 2296, Sm, '-', ominus ; ⊖ CIRCLED MINUS
UCP 2297, Sm, '.', otimes ; ⊗ CIRCLED TIMES
UCP 22A5, Sm, '_', perp ; ⊥ UP TACK
UCP 22C5, Sm, '.', sdot ; ⋅ DOT OPERATOR
UCP 2308, Sm, '|', lceil ; ⌈ LEFT CEILING
UCP 2309, Sm, '|', rceil ; ⌉ RIGHT CEILING
UCP 230A, Sm, '|', lfloor ; ⌊ LEFT FLOOR
UCP 230B, Sm, '|', rfloor ; ⌋ RIGHT FLOOR
UCP 2310, So, '^', bnot ; ⌐ REVERSED NOT SIGN
UCP 2320, Sm, '/', ; ⌠ TOP HALF INTEGRAL
UCP 2321, Sm, '/', ; ⌡ BOTTOM HALF INTEGRAL
UCP 2500, So, '-', boxh ; ─ BOX DRAWINGS LIGHT HORIZONTAL
UCP 2502, So, '|', boxv ; │ BOX DRAWINGS LIGHT VERTICAL
UCP 250C, So, '+', boxdr ; ┌ BOX DRAWINGS LIGHT DOWN AND RIGHT
UCP 2510, So, '+', boxdl ; ┐ BOX DRAWINGS LIGHT DOWN AND LEFT
UCP 2514, So, '+', boxur ; └ BOX DRAWINGS LIGHT UP AND RIGHT
UCP 2518, So, '+', boxul ; ┘ BOX DRAWINGS LIGHT UP AND LEFT
UCP 251C, So, '+', boxvr ; ├ BOX DRAWINGS LIGHT VERTICAL AND RIGHT
UCP 2524, So, '+', boxvl ; ┤ BOX DRAWINGS LIGHT VERTICAL AND LEFT
UCP 252C, So, '+', boxhd ; ┬ BOX DRAWINGS LIGHT DOWN AND HORIZONTAL
UCP 2534, So, '+', boxhu ; ┴ BOX DRAWINGS LIGHT UP AND HORIZONTAL
UCP 253C, So, '+', boxvh ; ┼ BOX DRAWINGS LIGHT VERTICAL AND HORIZONTAL
UCP 2550, So, '-', boxH ; ═ BOX DRAWINGS DOUBLE HORIZONTAL
UCP 2551, So, '|', boxV ; ║ BOX DRAWINGS DOUBLE VERTICAL
UCP 2552, So, '+', boxdR ; ╒ BOX DRAWINGS DOWN SINGLE AND RIGHT DOUBLE
UCP 2553, So, '+', boxDr ; ╓ BOX DRAWINGS DOWN DOUBLE AND RIGHT SINGLE
UCP 2554, So, '+', boxDR ; ╔ BOX DRAWINGS DOUBLE DOWN AND RIGHT
UCP 2555, So, '+', boxdL ; ╕ BOX DRAWINGS DOWN SINGLE AND LEFT DOUBLE
UCP 2556, So, '+', boxDl ; ╖ BOX DRAWINGS DOWN DOUBLE AND LEFT SINGLE
UCP 2557, So, '+', boxDL ; ╗ BOX DRAWINGS DOUBLE DOWN AND LEFT
UCP 2558, So, '+', boxuR ; ╘ BOX DRAWINGS UP SINGLE AND RIGHT DOUBLE
UCP 2559, So, '+', boxUr ; ╙ BOX DRAWINGS UP DOUBLE AND RIGHT SINGLE
UCP 255A, So, '+', boxUR ; ╚ BOX DRAWINGS DOUBLE UP AND RIGHT
UCP 255B, So, '+', boxuL ; ╛ BOX DRAWINGS UP SINGLE AND LEFT DOUBLE
UCP 255C, So, '+', boxUl ; ╜ BOX DRAWINGS UP DOUBLE AND LEFT SINGLE
UCP 255D, So, '+', boxUL ; ╝ BOX DRAWINGS DOUBLE UP AND LEFT
UCP 255E, So, '+', boxvR ; ╞ BOX DRAWINGS VERTICAL SINGLE AND RIGHT DOUBLE
UCP 255F, So, '+', boxVr ; ╟ BOX DRAWINGS VERTICAL DOUBLE AND RIGHT SINGLE
UCP 2560, So, '+', boxVR ; ╠ BOX DRAWINGS DOUBLE VERTICAL AND RIGHT
UCP 2561, So, '+', boxvL ; ╡ BOX DRAWINGS VERTICAL SINGLE AND LEFT DOUBLE
UCP 2562, So, '+', boxVl ; ╢ BOX DRAWINGS VERTICAL DOUBLE AND LEFT SINGLE
UCP 2563, So, '+', boxVL ; ╣ BOX DRAWINGS DOUBLE VERTICAL AND LEFT
UCP 2564, So, '+', boxHd ; ╤ BOX DRAWINGS DOWN SINGLE AND HORIZONTAL DOUBLE
UCP 2565, So, '+', boxhD ; ╥ BOX DRAWINGS DOWN DOUBLE AND HORIZONTAL SINGLE
UCP 2566, So, '+', boxHD ; ╦ BOX DRAWINGS DOUBLE DOWN AND HORIZONTAL
UCP 2567, So, '+', boxHu ; ╧ BOX DRAWINGS UP SINGLE AND HORIZONTAL DOUBLE
UCP 2568, So, '+', boxhU ; ╨ BOX DRAWINGS UP DOUBLE AND HORIZONTAL SINGLE
UCP 2569, So, '+', boxHU ; ╩ BOX DRAWINGS DOUBLE UP AND HORIZONTAL
UCP 256A, So, '+', boxvH ; ╪ BOX DRAWINGS VERTICAL SINGLE AND HORIZONTAL DOUBLE
UCP 256B, So, '+', boxVh ; ╫ BOX DRAWINGS VERTICAL DOUBLE AND HORIZONTAL SINGLE
UCP 256C, So, '+', boxVH ; ╬ BOX DRAWINGS DOUBLE VERTICAL AND HORIZONTAL
UCP 2580, So, '*', uhblk ; ▀ UPPER HALF BLOCK
UCP 2584, So, '*', lhblk ; ▄ LOWER HALF BLOCK
UCP 2588, So, '*', block ; █ FULL BLOCK
UCP 258C, So, '*', ; ▌ LEFT HALF BLOCK
UCP 2590, So, '*', ; ▐ RIGHT HALF BLOCK
UCP 2591, So, '*', blk14 ; ░ LIGHT SHADE
UCP 2592, So, '*', blk12 ; ▒ MEDIUM SHADE
UCP 2593, So, '*', blk34 ; ▓ DARK SHADE
UCP 25A0, So, '*', ; ■ BLACK SQUARE
UCP 25A1, So, '*', square ; □ WHITE SQUARE
UCP 25AA, So, '*', squarf ; ▪ BLACK SMALL SQUARE
UCP 25CA, So, '#', loz ; ◊ LOZENGE
UCP 2618, So, '*', ; ☘ SHAMROCK
UCP 2640, So, '*', female ; ♀ FEMALE SIGN
UCP 2642, So, '*', male ; ♂ MALE SIGN
UCP 2660, So, '*', spades ; ♠ BLACK SPADE SUIT
UCP 2663, So, '*', clubs ; ♣ BLACK CLUB SUIT
UCP 2665, So, '*', hearts ; ♥ BLACK HEART SUIT
UCP 2666, So, '*', diams ; ♦ BLACK DIAMOND SUIT
UCP 274A, So, '*', ; ❊ EIGHT TEARDROP-SPOKED PROPELLER ASTERISK
UCP F8FF, Co, '*', ; Private Use, Last
UCP FB01, L5, 'fi', ; ﬁ LATIN SMALL LIGATURE FI
UCP FB02, L5, 'fl', ; ﬂ LATIN SMALL LIGATURE FL
UCP FB2A, Lo, 'Sh', ; שׁ HEBREW LETTER SHIN WITH SHIN DOT
UCP FB2B, Lo, 'Sh', ; שׂ HEBREW LETTER SHIN WITH SIN DOT
UCP FB35, Lo, 'V', ; וּ HEBREW LETTER VAV WITH DAGESH
UCP FB4B, Lo, 'V', ; וֹ HEBREW LETTER VAV WITH HOLAM
UCP FB56, Lo, 'P', ; ﭖ ARABIC LETTER PEH ISOLATED FORM
UCP FB58, Lo, 'P', ; ﭘ ARABIC LETTER PEH INITIAL FORM
UCP FB66, Lo, 'T', ; ﭦ ARABIC LETTER TTEH ISOLATED FORM
UCP FB68, Lo, 'T', ; ﭨ ARABIC LETTER TTEH INITIAL FORM
UCP FB7A, Lo, 'Ch', ; ﭺ ARABIC LETTER TCHEH ISOLATED FORM
UCP FB7C, Lo, 'Ch', ; ﭼ ARABIC LETTER TCHEH INITIAL FORM
UCP FB84, Lo, 'D', ; ﮄ ARABIC LETTER DAHAL ISOLATED FORM
UCP FB88, Lo, 'D', ; ﮈ ARABIC LETTER DDAL ISOLATED FORM
UCP FB8A, Lo, 'J', ; ﮊ ARABIC LETTER JEH ISOLATED FORM
UCP FB8C, Lo, 'R', ; ﮌ ARABIC LETTER RREH ISOLATED FORM
UCP FB8E, Lo, 'K', ; ﮎ ARABIC LETTER KEHEH ISOLATED FORM
UCP FB92, Lo, 'G', ; ﮒ ARABIC LETTER GAF ISOLATED FORM
UCP FB94, Lo, 'G', ; ﮔ ARABIC LETTER GAF INITIAL FORM
UCP FB9E, Lo, 'N', ; ﮞ ARABIC LETTER NOON GHUNNA ISOLATED FORM
UCP FBA6, Lo, 'H', ; ﮦ ARABIC LETTER HEH GOAL ISOLATED FORM
UCP FBA8, Lo, 'H', ; ﮨ ARABIC LETTER HEH GOAL INITIAL FORM
UCP FBA9, Lo, 'H', ; ﮩ ARABIC LETTER HEH GOAL MEDIAL FORM
UCP FBAA, Lo, 'H', ; ﮪ ARABIC LETTER HEH DOACHASHMEE ISOLATED FORM
UCP FBAE, Lo, 'Ye', ; ﮮ ARABIC LETTER YEH BARREE ISOLATED FORM
UCP FBB0, Lo, 'Ye', ; ﮰ ARABIC LETTER YEH BARREE WITH HAMZA ABOVE ISOLATED FORM
UCP FBFC, Lo, 'Ye', ; ﯼ ARABIC LETTER FARSI YEH ISOLATED FORM
UCP FBFD, Lo, 'Ye', ; ﯽ ARABIC LETTER FARSI YEH FINAL FORM
UCP FBFE, Lo, 'Ye', ; ﯾ ARABIC LETTER FARSI YEH INITIAL FORM
UCP FE7C, Lo, 'Sh', ; ﹼ ARABIC SHADDA ISOLATED FORM
UCP FE7D, Lo, '`', ; ﹽ ARABIC SHADDA MEDIAL FORM
UCP FE80, Lo, "'", ; ﺀ ARABIC LETTER HAMZA ISOLATED FORM
UCP FE81, Lo, 'A', ; ﺁ ARABIC LETTER ALEF WITH MADDA ABOVE ISOLATED FORM
UCP FE82, Lo, 'A', ; ﺂ ARABIC LETTER ALEF WITH MADDA ABOVE FINAL FORM
UCP FE83, Lo, 'A', ; ﺃ ARABIC LETTER ALEF WITH HAMZA ABOVE ISOLATED FORM
UCP FE84, Lo, 'A', ; ﺄ ARABIC LETTER ALEF WITH HAMZA ABOVE FINAL FORM
UCP FE85, Lo, 'W', ; ﺅ ARABIC LETTER WAW WITH HAMZA ABOVE ISOLATED FORM
UCP FE89, Lo, 'Ye', ; ﺉ ARABIC LETTER YEH WITH HAMZA ABOVE ISOLATED FORM
UCP FE8A, Lo, 'Ye', ; ﺊ ARABIC LETTER YEH WITH HAMZA ABOVE FINAL FORM
UCP FE8B, Lo, 'Y', ; ﺋ ARABIC LETTER YEH WITH HAMZA ABOVE INITIAL FORM
UCP FE8D, Lo, 'A', ; ﺍ ARABIC LETTER ALEF ISOLATED FORM
UCP FE8E, Lo, 'A', ; ﺎ ARABIC LETTER ALEF FINAL FORM
UCP FE8F, Lo, 'B', ; ﺏ ARABIC LETTER BEH ISOLATED FORM
UCP FE91, Lo, 'B', ; ﺑ ARABIC LETTER BEH INITIAL FORM
UCP FE93, Lo, 'T', ; ﺓ ARABIC LETTER TEH MARBUTA ISOLATED FORM
UCP FE95, Lo, 'T', ; ﺕ ARABIC LETTER TEH ISOLATED FORM
UCP FE97, Lo, 'T', ; ﺗ ARABIC LETTER TEH INITIAL FORM
UCP FE99, Lo, 'Th', ; ﺙ ARABIC LETTER THEH ISOLATED FORM
UCP FE9B, Lo, 'Th', ; ﺛ ARABIC LETTER THEH INITIAL FORM
UCP FE9D, Lo, 'J', ; ﺝ ARABIC LETTER JEEM ISOLATED FORM
UCP FE9F, Lo, 'J', ; ﺟ ARABIC LETTER JEEM INITIAL FORM
UCP FEA1, Lo, 'H', ; ﺡ ARABIC LETTER HAH ISOLATED FORM
UCP FEA3, Lo, 'H', ; ﺣ ARABIC LETTER HAH INITIAL FORM
UCP FEA5, Lo, 'Kh', ; ﺥ ARABIC LETTER KHAH ISOLATED FORM
UCP FEA7, Lo, 'Kh', ; ﺧ ARABIC LETTER KHAH INITIAL FORM
UCP FEA9, Lo, 'D', ; ﺩ ARABIC LETTER DAL ISOLATED FORM
UCP FEAB, Lo, 'Dh', ; ﺫ ARABIC LETTER THAL ISOLATED FORM
UCP FEAD, Lo, 'R', ; ﺭ ARABIC LETTER REH ISOLATED FORM
UCP FEAF, Lo, 'Z', ; ﺯ ARABIC LETTER ZAIN ISOLATED FORM
UCP FEB1, Lo, 'S', ; ﺱ ARABIC LETTER SEEN ISOLATED FORM
UCP FEB3, Lo, 'S', ; ﺳ ARABIC LETTER SEEN INITIAL FORM
UCP FEB5, Lo, 'Sh', ; ﺵ ARABIC LETTER SHEEN ISOLATED FORM
UCP FEB7, Lo, 'Sh', ; ﺷ ARABIC LETTER SHEEN INITIAL FORM
UCP FEB9, Lo, 'S', ; ﺹ ARABIC LETTER SAD ISOLATED FORM
UCP FEBB, Lo, 'S', ; ﺻ ARABIC LETTER SAD INITIAL FORM
UCP FEBD, Lo, 'D', ; ﺽ ARABIC LETTER DAD ISOLATED FORM
UCP FEBF, Lo, 'D', ; ﺿ ARABIC LETTER DAD INITIAL FORM
UCP FEC1, Lo, 'T', ; ﻁ ARABIC LETTER TAH ISOLATED FORM
UCP FEC3, Lo, 'T', ; ﻃ ARABIC LETTER TAH INITIAL FORM
UCP FEC5, Lo, 'Z', ; ﻅ ARABIC LETTER ZAH ISOLATED FORM
UCP FEC7, Lo, 'Z', ; ﻇ ARABIC LETTER ZAH INITIAL FORM
UCP FEC9, Lo, "'", ; ﻉ ARABIC LETTER AIN ISOLATED FORM
UCP FECA, Lo, "'", ; ﻊ ARABIC LETTER AIN FINAL FORM
UCP FECB, Lo, "'", ; ﻋ ARABIC LETTER AIN INITIAL FORM
UCP FECC, Lo, "'", ; ﻌ ARABIC LETTER AIN MEDIAL FORM
UCP FECD, Lo, 'Gh', ; ﻍ ARABIC LETTER GHAIN ISOLATED FORM
UCP FECE, Lo, 'Gh', ; ﻎ ARABIC LETTER GHAIN FINAL FORM
UCP FECF, Lo, 'Gh', ; ﻏ ARABIC LETTER GHAIN INITIAL FORM
UCP FED0, Lo, 'Gh', ; ﻐ ARABIC LETTER GHAIN MEDIAL FORM
UCP FED1, Lo, 'F', ; ﻑ ARABIC LETTER FEH ISOLATED FORM
UCP FED3, Lo, 'F', ; ﻓ ARABIC LETTER FEH INITIAL FORM
UCP FED5, Lo, 'Q', ; ﻕ ARABIC LETTER QAF ISOLATED FORM
UCP FED7, Lo, 'Q', ; ﻗ ARABIC LETTER QAF INITIAL FORM
UCP FED9, Lo, 'K', ; ﻙ ARABIC LETTER KAF ISOLATED FORM
UCP FEDB, Lo, 'K', ; ﻛ ARABIC LETTER KAF INITIAL FORM
UCP FEDD, Lo, 'L', ; ﻝ ARABIC LETTER LAM ISOLATED FORM
UCP FEDF, Lo, 'L', ; ﻟ ARABIC LETTER LAM INITIAL FORM
UCP FEE0, Lo, 'L', ; ﻠ ARABIC LETTER LAM MEDIAL FORM
UCP FEE1, Lo, 'M', ; ﻡ ARABIC LETTER MEEM ISOLATED FORM
UCP FEE3, Lo, 'M', ; ﻣ ARABIC LETTER MEEM INITIAL FORM
UCP FEE5, Lo, 'N', ; ﻥ ARABIC LETTER NOON ISOLATED FORM
UCP FEE7, Lo, 'N', ; ﻧ ARABIC LETTER NOON INITIAL FORM
UCP FEE9, Lo, 'H', ; ﻩ ARABIC LETTER HEH ISOLATED FORM
UCP FEEB, Lo, 'H', ; ﻫ ARABIC LETTER HEH INITIAL FORM
UCP FEEC, Lo, 'H', ; ﻬ ARABIC LETTER HEH MEDIAL FORM
UCP FEED, Lo, 'W', ; ﻭ ARABIC LETTER WAW ISOLATED FORM
UCP FEEF, Lo, 'A', ; ﻯ ARABIC LETTER ALEF MAKSURA ISOLATED FORM
UCP FEF0, Lo, 'A', ; ﻰ ARABIC LETTER ALEF MAKSURA FINAL FORM
UCP FEF1, Lo, 'Y', ; ﻱ ARABIC LETTER YEH ISOLATED FORM
UCP FEF2, Lo, 'Y', ; ﻲ ARABIC LETTER YEH FINAL FORM
UCP FEF3, Lo, 'Y', ; ﻳ ARABIC LETTER YEH INITIAL FORM
UCP FEF5, Lo, 'LA', ; ﻵ ARABIC LIGATURE LAM WITH ALEF WITH MADDA ABOVE ISOLATED FORM
UCP FEF6, Lo, 'LA', ; ﻶ ARABIC LIGATURE LAM WITH ALEF WITH MADDA ABOVE FINAL FORM
UCP FEF7, Lo, 'LA', ; ﻷ ARABIC LIGATURE LAM WITH ALEF WITH HAMZA ABOVE ISOLATED FORM
UCP FEF8, Lo, 'LA', ; ﻸ ARABIC LIGATURE LAM WITH ALEF WITH HAMZA ABOVE FINAL FORM
UCP FEFB, Lo, 'LA', ; ﻻ ARABIC LIGATURE LAM WITH ALEF ISOLATED FORM
UCP FEFC, Lo, 'LA', ; ﻼ ARABIC LIGATURE LAM WITH ALEF FINAL FORM
UCP FEFF, Bm, '', ; ZERO WIDTH NO-BREAK SPACE
UCP FFFD, ??, '?', ; ? Replacement of a malformed input character.
UCP FFFE, ??, '', ; Character is undefined in output encoding.
UCP FFFF, ??, '' , ; Not a valid Unicode character