DOS) 864 IBM864 OEM Arabic; Arabic (864) 865 IBM865 OEM Nordic; Nordic (DOS) 866 cp866 OEM Russian; Cyrillic ; Cyrillic (Windows) 1252 windows-1252 ANSI Latin 1; Western European (Windows) 1253 windows-1253 ANSI Cyrillic (Mac) 10008 x-mac-chinesesimp MAC Simplified Chinese (GB 2312); Chinese Simplified (Mac) 10010 (KOI8-R) 20871 IBM871 IBM EBCDIC Icelandic 20880 IBM880 IBM EBCDIC Cyrillic Russian 20905 IBM905 IBM Serbian-Bulgarian 21027 (deprecated) 21866 koi8-u Ukrainian (KOI8-U); Cyrillic (KOI8-U) 28591 iso-8859
ISO8859-1 Western European 852 (=x0354) PC-ASCII Eastern European 855 (=x0357) PC-ASCII Cyrillic 864 (=x0360) PC-ASCII Arabic 865 (=x0361) PC-ASCII Scandinavian 866 (=x0362) PC-ASCII Cyrillic 870 (=x0366) EBCDIC Eastern Europe 871 (=x0367) EBCDIC Icelandic 872 (=x0368) PC-ASCII Cyrillic Euro 874 (=x036A) PC-ASCII Thai SBCS 875 (=x036B) EBCDIC Greek 880 (=x0370) EBCDIC Cyrillic 1006 (=x03EE) ISO Urdu 1008 (=x03F0) ASCII Arabic 8-bit ISO 1025 (=x0401) EBCDIC Cyrillic
3801 1256 Arabic - Yemen ar ar-ye 9217 2401 1256 Armenian hy hy 1067 Assamese as as 1101 Azeri - Cyrillic Kazakh kk kk 1087 1251 Khmer km km 1107 453 Konkani 1111 457 Korean ko ko 1042 412 Kyrgyz - Cyrillic 419 1251 Russian - Moldova ru ru-mo 2073 819 Sami Lappish 1083 Sanskrit sa sa 1103 Serbian - Cyrillic Turkmen tk tk 1090 442 Ukrainian uk uk 1058 422 1251 Unicode UTF-8 0 Urdu ur ur 1056 420 1256 Uzbek - Cyrillic
如: 中国 -> ㄓ ㄍ CYRILLIC = 12 汉语拼音与俄语字母对照风格,声调在各个拼音之后,用数字 [1-4] 进行表示。 如: 中国 -> чжун1 го2 CYRILLIC_FIRST = 13 汉语拼音与俄语字母对照风格,仅首字母。
, CSISO5427CYRILLIC1981, CSISO5428GREEK, CSISO10367BOX, CSISOLATIN1, CSISOLATIN2, CSISOLATIN3, CSISOLATIN4 EBCDICFR, EBCDICISFRISS, EBCDICIT, EBCDICPT, EBCDICUK, EBCDICUS, ECMA-114, ECMA-118, ECMA-128, ECMA-CYRILLIC IBM4971, IBM5347, IBM9030, IBM9066, IBM9448, IBM12712, IBM16804, IEC_P27-1, IEC_P271, INIS-8, INIS-CYRILLIC LATIN5, LATIN6, LATIN7, LATIN8, LATIN9, LATIN10, LATINGREEK, LATINGREEK1, MAC-CENTRALEUROPE, MAC-CYRILLIC MACINTOSH, MACIS, MACUK, MACUKRAINIAN, MIK, MS-ANSI, MS-ARAB, MS-CYRL, MS-EE, MS-GREEK, MS-HEBR, MS-MAC-CYRILLIC
, CSISO5427CYRILLIC1981, CSISO5428GREEK, CSISO10367BOX, CSISOLATIN1, CSISOLATIN2, CSISOLATIN3, CSISOLATIN4 EBCDICFR, EBCDICISFRISS, EBCDICIT, EBCDICPT, EBCDICUK, EBCDICUS, ECMA-114, ECMA-118, ECMA-128, ECMA-CYRILLIC IBM4971, IBM5347, IBM9030, IBM9066, IBM9448, IBM12712, IBM16804, IEC_P27-1, IEC_P271, INIS-8, INIS-CYRILLIC LATIN5, LATIN6, LATIN7, LATIN8, LATIN9, LATIN10, LATINGREEK, LATINGREEK1, MAC-CENTRALEUROPE, MAC-CYRILLIC MACINTOSH, MACIS, MACUK, MACUKRAINIAN, MIK, MS-ANSI, MS-ARAB, MS-CYRL, MS-EE, MS-GREEK, MS-HEBR, MS-MAC-CYRILLIC
部分代码页与国家/地区或语言的映射 代码页 国家/地区或语言 437 United States 850 Multilingual (Latin I) 852 Slavic (Latin II) 855 Cyrillic
+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic
, CSISO5427CYRILLIC1981, CSISO5428GREEK, CSISO10367BOX, CSISOLATIN1, CSISOLATIN2, CSISOLATIN3, CSISOLATIN4 EBCDICFR, EBCDICISFRISS, EBCDICIT, EBCDICPT, EBCDICUK, EBCDICUS, ECMA-114, ECMA-118, ECMA-128, ECMA-CYRILLIC IBM4971, IBM5347, IBM9030, IBM9066, IBM9448, IBM12712, IBM16804, IEC_P27-1, IEC_P271, INIS-8, INIS-CYRILLIC LATIN5, LATIN6, LATIN7, LATIN8, LATIN9, LATIN10, LATINGREEK, LATINGREEK1, MAC-CENTRALEUROPE, MAC-CYRILLIC MACINTOSH, MACIS, MACUK, MACUKRAINIAN, MIK, MS-ANSI, MS-ARAB, MS-CYRL, MS-EE, MS-GREEK, MS-HEBR, MS-MAC-CYRILLIC
xi1n']] >>> pinyin('中心', style=Style.BOPOMOFO) # 注音风格 [['ㄓㄨㄥ'], ['ㄒㄧㄣ']] >>> pinyin('中心', style=Style.CYRILLIC
for Information Interchange windows-1250 Cp1250 Windows Eastern European windows-1251 Cp1251 Windows Cyrillic ISO8859_2 Latin Alphabet No. 2 ISO-8859-4 ISO8859_4 Latin Alphabet No. 4 ISO-8859-5 ISO8859_5 Latin/Cyrillic Cp775 PC Baltic Cp838 IBM Thailand extended SBCS Cp850 MS-DOS Latin-1 Cp852 MS-DOS Latin-2 Cp855 IBM Cyrillic Cp964 AIX Chinese (Taiwan) Cp970 AIX Korean Cp1006 IBM AIX Pakistan (Urdu) Cp1025 IBM Multilingual Cyrillic Macintosh Arabic MacCentralEurope Macintosh Latin-2 MacCroatian Macintosh Croatian MacCyrillic Macintosh Cyrillic
兼容多字节的 8 位 Unicode ISO-8859-1 - 西欧 ISO-8859-15 - 西欧(加入欧元符号 + ISO-8859-1 中丢失的法语和芬兰语字母) cp866 - DOS 专用 Cyrillic 字符集 cp1251 - Windows 专用 Cyrillic 字符集 cp1252 - Windows 专用西欧字符集 KOI8-R - 俄语 BIG5 - 繁体中文,主要在台湾使用 GB2312
如: 中国 -> ``чжун1 го2``CYRILLIC = 12#: 汉语拼音与俄语字母对照风格,仅首字母。 如: 中国 -> ``ч г``CYRILLIC_FIRST = 13 示例: 1234 pinyin('我收集的材料散失了,散文没法写了', style=pypinyin.STYLE_TONE3
如: 中国 -> ``чжун1 го2`` CYRILLIC = 12 #: 汉语拼音与俄语字母对照风格,仅首字母。 如: 中国 -> ``ч г`` CYRILLIC_FIRST = 13 如果你的文字中,除了汉字,还有其它符号以及英文,会打印出怎么样的效果呢?
xi1n']] >>> pinyin('中心', style=Style.BOPOMOFO) # 注音风格 [['ㄓㄨㄥ'], ['ㄒㄧㄣ']] >>> pinyin('中心', style=Style.CYRILLIC
兼容多字节的 8 位 Unicode ISO-8859-1 - 西欧 ISO-8859-15 - 西欧(加入欧元符号 + ISO-8859-1 中丢失的法语和芬兰语字母) cp866 - DOS 专用 Cyrillic 字符集 cp1251 - Windows 专用 Cyrillic 字符集 cp1252 - Windows 专用西欧字符集 KOI8-R - / /俄语 BIG5 - 繁体中文,主要在台湾使用
Segoe UI包含拉丁(Latin),希腊(Greek),西里尔字母(Cyrillic)和阿拉伯(Arabic)字符,覆盖了基本的英文俄文字母、数字和一些常用符号。然而其他语言就没有了。
force-yes --no-install-recommends x11vnc x11-xkb-utils xfonts-100dpi xfonts-75dpi xfonts-scalable xfonts-cyrillic
四、总结建议 ✅ 按 Unicode 范围映射字体,不用死记语言 ✅ 给常见文字系统配对应字体(如 CJK、Cyrillic、Latin-Extended) ✅ 使用 Noto 字体家族作为 fallback : 范围名称 编码范围(十六进制) 用途 Basic Latin 0000–007F 英文、数字、标点 Latin-1 Supplement 0080–00FF 西欧语种的扩展字符(é ñ ç 等) Cyrillic
CSISO646DANISH, 58 CSISO2022CN, CSISO2022JP, CSISO2022JP2, CSISO2022KR, CSISO2033, 59 CSISO5427CYRILLIC , CSISO5427CYRILLIC1981, CSISO5428GREEK, CSISO10367BOX, 60 CSISOLATIN1, CSISOLATIN2, CSISOLATIN3, EBCDIC-CP-ROECE, EBCDIC-CP-SE, EBCDIC-CP-TR, EBCDIC-CP-US, 71 EBCDIC-CP-WT, EBCDIC-CP-YU, EBCDIC-CYRILLIC 107 IBM5347, IBM9030, IBM9066, IBM9448, IBM12712, IBM16804, IEC_P27-1, IEC_P271, 108 INIS-8, INIS-CYRILLIC , MACUK, 150 MACUKRAINIAN, MIK, MS-ANSI, MS-ARAB, MS-CYRL, MS-EE, MS-GREEK, MS-HEBR, 151 MS-MAC-CYRILLIC