Character encoded solely to maintain round trip convertibility with other standards
This article has multiple issues. Please help improve it or discuss these issues on the talk page. (Learn how and when to remove these template messages)
This article possibly contains original research. Please improve it by verifying the claims made and adding inline citations. Statements consisting only of original research should be removed.(July 2008) (Learn how and when to remove this message)
This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed. Find sources: "Unicode compatibility characters" – news · newspapers · books · scholar · JSTOR(July 2016) (Learn how and when to remove this message)
(Learn how and when to remove this message)
In Unicode and the UCS, a compatibility character is a character that is encoded solely to maintain round-trip convertibility with other, often older, standards.[1] As the Unicode Glossary says:
A character that would not have been encoded except for compatibility and round-trip convertibility with other standards[2]
Although compatibility is used in names, it is not marked as a property. However, the definition is more complicated than the glossary reveals. One of the properties given to characters by the Unicode consortium is the characters' decomposition or compatibility decomposition. Over five thousand characters do have a compatibility decomposition mapping that compatibility character to one or more other UCS characters. By setting a character's decomposition property, Unicode establishes that character as a compatibility character. The reasons for these compatibility designations are varied and are discussed in further detail below. The term decomposition sometimes confuses because a character's decomposition can, in some cases, be a singleton. In these cases the decomposition of one character is simply another approximately (but not canonically) equivalent character.
^"Chapter 2.3: Compatibility characters" (PDF). The Unicode Standard 6.0.0.
^Unicode consortium Unicode Glossary
and 23 Related for: Unicode compatibility characters information
In Unicode and the UCS, a compatibilitycharacter is a character that is encoded solely to maintain round-trip convertibility with other, often older...
often included similar or identical characters. Unicode provides two such notions, canonical equivalence and compatibility. Code point sequences that are defined...
twelve character code points in total. UCS includes thousands of characters that Unicode designates as compatibilitycharacters. These are characters that...
characters described below, but only Arabic numerals are available.) Unicode also includes a handful of vulgar fractions as compatibilitycharacters,...
Unicode has a certain amount of duplication of characters. These are pairs of single Unicode code points that are canonically equivalent. The reason for...
CJK Compatibility is a Unicode block containing square symbols (both CJK and Latin alphanumeric) encoded for compatibility with East Asian character sets...
block containing Unicode compatibilitycharacters that have specific meaning as numbers, but are constructed from other characters. They consist primarily...
Latin characters in Unicode Dead key Compose key Combining characterUnicode equivalence Complex text layout Unicodecompatibilitycharacters Alphabetic...
uncommon Unicodecharacters. Without proper rendering support, you may see question marks, boxes, or other symbols. Unicode, formally The Unicode Standard...
CJK Compatibility Ideographs is a Unicode block created to contain mostly Han characters that were encoded in multiple locations in other established character...
Character Set 2 (MES-2) subset, and some additional related characters. HTML and XML provide ways to reference Unicodecharacters when the characters...
Hangul Compatibility Jamo is a Unicode block containing Hangul characters for compatibility with the South Korean national standard KS X 1001 (formerly...
featured in Unicode. Some characters in the Letterlike Symbols block can be substituted with characters in the ASCII range. Latin script Unicode collation...
Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended...
computing, graphic characters are traditionally classed into fullwidth and halfwidth characters. Unlike monospaced fonts, a halfwidth character occupies half...
CJK Compatibility Ideographs Supplement is a Unicode block containing Han characters used only for roundtrip compatibility mapping with planes 3, 4, 5...
almost all standard characters used in mathematics. Unicode Technical Report #25 provides comprehensive information about the character repertoire, their...
characters. During the process called Han unification, the common (shared) characters were identified and named CJK Unified Ideographs. As of Unicode...
The Unicode Standard assigns various properties to each Unicodecharacter and code point. The properties can be used to handle characters (code points)...
an effort by the authors of Unicode and the Universal Character Set to map multiple character sets of the Han characters of the so-called CJK languages...
The regional indicator symbols are a set of 26 alphabetic Unicodecharacters (A–Z) intended to be used to encode ISO 3166-1 alpha-2 two-letter country...
settings) can also affect whitespace. Many of the Unicode space characters were created for compatibility with classic print typography. Even if digital...
Characters for Hong Kong The Standard Form of National Characters for Taiwan The list of jōyō kanji for Japan The Kangxi Dictionary in Korea Unicode deals...