Global Information Lookup Global Information

GB 2312 information


GB 2312
MIME / IANAGB_2312-80 (GB2312 for usual EUC form)
Alias(es)iso-ir-58, chinese, csGB2312, csISO58GB231280
Language(s)Simplified Chinese, English
Partial support:
Traditional Chinese, Russian, Bulgarian, Greek, Japanese, Italian, Irish, Māori
StandardGB/T 2312-1980
ClassificationISO-2022-compatible DBCS, CJK encoding
ExtensionsISO-IR-165
Encoding formats
  • EUC-CN (GB2312)
  • HZ-GB-2312
  • ISO-2022-CN
  • Shift GB
Preceded byChinese telegraph code
Succeeded byGBK, GB 18030
Other related encoding(s)JIS X 0208, KS X 1001

GB/T 2312-1980 is a key official character set of the People's Republic of China, used for Simplified Chinese characters. GB2312 is the registered internet name for EUC-CN, which is its usual encoded form. GB refers to the Guobiao standards (国家标准), whereas the T suffix (推荐; tuījiàn; 'recommendation') denotes a non-mandatory standard.[1]

GB/T 2312-1980 was originally a mandatory national standard designated GB 2312-1980. However, following a National Standard Bulletin of the People's Republic of China in 2017, GB 2312 is no longer mandatory, and its standard code is modified to GB/T 2312-1980.[2] GB/T 2312-1980 has been superseded by GBK and GB 18030, which include additional characters, but GB/T 2312 remains in widespread use as a subset of those encodings.

As of September 2022, GB2312 is the second-most popular encoding served from China and territories (after UTF-8), with 5.5% of web servers serving a page declaring it.[3] Globally, GB2312 is declared on 0.1% of all web pages.[4] However, all major web browsers decode GB2312-marked documents as if they were marked with the superset GBK encoding, except for Safari and Edge on the label GB_2312.[5]

There is an analogous character set known as GB/T 12345 Code of Chinese ideogram set for information interchange supplementary set, which supplements GB/T 2312 with traditional character forms by replacing simplified forms in their qūwèi code, and some extra 62 supplemental characters.[6][7] GB-encoded fonts often come in pairs, one with the GB/T 2312 (simplified) character set and the other with the GB/T 12345 (traditional) character set. There exists more GB supplementary encoding sets that supplements GB/T 2312, including GB/T 7589 Code of Chinese ideograms set forinformation interchange--The 2nd supplementary set and GB/T 7590 Code of Chinese ideograms set forinformation interchange--The 4th supplementary set which provides additional [Variant Chinese characters|variant characters] in the same qūwèi encoding format (later used in ISO-2022-CN), but has no relation with characters encoded in GB/T 2312.

  1. ^ Lunde, Ken (2009). CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing (2nd ed.). Sebastopol, CA: O'Reilly. pp. 94–111. ISBN 978-0-596-51447-1.
  2. ^ "2017年第7号中国国家标准公告 (China National Standard Bulletin 2017 No.7)". Standardization Administration of the People's Republic of China. Retrieved 3 July 2018.
  3. ^ "Distribution of Character Encodings among websites that use China and territories". w3techs.com. Retrieved 2022-09-04.
  4. ^ "Historical trends in the usage statistics of character encodings for websites, October 2022". w3techs.com. Retrieved 2022-10-01.
  5. ^ "Encoding: Summarized test results". www.w3.org. Retrieved 2019-11-15.
  6. ^ Lunde, Ken (1998). Appendix F: GB/T 12345 (PDF). O'Reilly Media. ISBN 9781565922242. {{cite book}}: |work= ignored (help)
  7. ^ GB12345-80 to Unicode table. Unicode Consortium. 1993-12-06. Archived from the original on 2004-06-17.

and 17 Related for: GB 2312 information

Request time (Page generated in 0.8685 seconds.)

GB 2312

Last Update:

GB/T 2312-1980 is a key official character set of the People's Republic of China, used for Simplified Chinese characters. GB2312 is the registered internet...

Word Count : 3374

Extended Unix Code

Last Update:

belonging to a 94×94 coded character set (such as GB 2312) represented in two bytes. The EUC-CN form of GB 2312 and EUC-KR are examples of such two-byte EUC...

Word Count : 5059

GB

Last Update:

handheld video game console Guobiao standards, Chinese National Standards GB 2312, an encoding scheme for rendering Simplified Chinese characters; also has...

Word Count : 489

GB 12345

Last Update:

established by China, and can be thought as the traditional counterpart of GB 2312. It is used as an encoding of traditional Chinese characters, although...

Word Count : 618

GB 12052

Last Update:

94) contain characters as follows: 01–09: identical to GB 2312, except 03-04 (¥ in GB 2312, $ in GB 12052) 16–37: modern Hangul syllables and jamo, level...

Word Count : 485

Radical symbol

Last Update:

include a short overline attached to the radical symbol, whereas the GB 2312 and GB 18030 charts do not. Additionally a "Radical Symbol Bottom" (U+23B7...

Word Count : 947

Character encoding

Last Update:

Shift_JIS-2004 EUC-JIS-2004 ISO-2022-JP-2004 Chinese Guobiao GB 2312 GBK (Microsoft Code page 936) GB 18030 Taiwan Big5 (a more famous variant is Microsoft Code...

Word Count : 3869

Chinese character encoding

Last Update:

Guobiao (GB) line of character encodings start with the Simplified Chinese charset GB 2312 published in 1980. Two encoding schemes existed for GB 2312: a one-or-two...

Word Count : 956

Chinese character sets

Last Update:

components). GB 2312 was designed for simplified characters. Traditional characters which have been simplified are not covered. GB 2312 is still in use...

Word Count : 2004

CJK characters

Last Update:

(official standard of Republic of China) EUC-JP EUC-KR GB 2312 (subset and predecessor of GB 18030) GB 18030 (mandated standard in the People's Republic of...

Word Count : 828

Mojibake

Last Update:

(zhé) in the name of singer David Tao (Chinese: 陶喆; pinyin: Táo Zhé), GB 2312's lack of the "镕" (róng) in ex-PRC Premier Zhu Rongji (Chinese: 朱镕基; pinyin:...

Word Count : 5986

Code page

Last Update:

(same without euro: 950) 1380 – IBM-PC Simplified Chinese GB PC-DATA (DBCS PC IBM GB 2312-80) 1381 – IBM-PC Simplified Chinese (1115 + 1380) 1393 – Japanese...

Word Count : 9331

Character encodings in HTML

Last Update:

related labels. Handled the same as GB 18030 for decoding purposes. For encoding purposes, labelling as GBK (or GB 2312) excludes four-byte codes, and favours...

Word Count : 2460

Pinyin

Last Update:

Letters in bopomofo. Lowercase letters with tone marks are included in GB 2312 and their uppercase counterparts are included in JIS X 0212; thus Unicode...

Word Count : 6301

Comparison of Unicode encodings

Last Update:

UTF-16, UTF-32 and UTF-EBCDIC have these important properties but UTF-7 and GB 18030 do not. Fixed-size characters can be helpful, but even if there is a...

Word Count : 2267

Windows code page

Last Update:

954 (roughly) Japanese EUC-JP Conversion 20936 5479 Chinese (simplified) GB 2312 Conversion 20949, 51949 970 Korean Wansung (8-bit with ASCII, i.e. EUC-KR)...

Word Count : 2776

JIS X 0208

Last Update:

Compare row 6 of GB 2312 and GB 12345 and row 6 of KPS 9566, which include the same Greek letters in the same layout, although GB 12345 adds vertical...

Word Count : 13276

PDF Search Engine © AllGlobal.net