Global Information Lookup Global Information

Unicode compatibility characters information


In Unicode and the UCS, a compatibility character is a character that is encoded solely to maintain round-trip convertibility with other, often older, standards.[1] As the Unicode Glossary says:

A character that would not have been encoded except for compatibility and round-trip convertibility with other standards[2]

Although compatibility is used in names, it is not marked as a property. However, the definition is more complicated than the glossary reveals. One of the properties given to characters by the Unicode consortium is the characters' decomposition or compatibility decomposition. Over five thousand characters do have a compatibility decomposition mapping that compatibility character to one or more other UCS characters. By setting a character's decomposition property, Unicode establishes that character as a compatibility character. The reasons for these compatibility designations are varied and are discussed in further detail below. The term decomposition sometimes confuses because a character's decomposition can, in some cases, be a singleton. In these cases the decomposition of one character is simply another approximately (but not canonically) equivalent character.

  1. ^ "Chapter 2.3: Compatibility characters" (PDF). The Unicode Standard 6.0.0.
  2. ^ Unicode consortium Unicode Glossary

and 23 Related for: Unicode compatibility characters information

Request time (Page generated in 0.8249 seconds.)

Unicode compatibility characters

Last Update:

In Unicode and the UCS, a compatibility character is a character that is encoded solely to maintain round-trip convertibility with other, often older...

Word Count : 3325

Unicode equivalence

Last Update:

often included similar or identical characters. Unicode provides two such notions, canonical equivalence and compatibility. Code point sequences that are defined...

Word Count : 1902

Universal Character Set characters

Last Update:

twelve character code points in total. UCS includes thousands of characters that Unicode designates as compatibility characters. These are characters that...

Word Count : 6987

Numerals in Unicode

Last Update:

characters described below, but only Arabic numerals are available.) Unicode also includes a handful of vulgar fractions as compatibility characters,...

Word Count : 1599

Duplicate characters in Unicode

Last Update:

Unicode has a certain amount of duplication of characters. These are pairs of single Unicode code points that are canonically equivalent. The reason for...

Word Count : 1260

CJK Compatibility

Last Update:

CJK Compatibility is a Unicode block containing square symbols (both CJK and Latin alphanumeric) encoded for compatibility with East Asian character sets...

Word Count : 202

Number Forms

Last Update:

block containing Unicode compatibility characters that have specific meaning as numbers, but are constructed from other characters. They consist primarily...

Word Count : 132

Precomposed character

Last Update:

Latin characters in Unicode Dead key Compose key Combining character Unicode equivalence Complex text layout Unicode compatibility characters Alphabetic...

Word Count : 669

Unicode

Last Update:

uncommon Unicode characters. Without proper rendering support, you may see question marks, boxes, or other symbols. Unicode, formally The Unicode Standard...

Word Count : 10732

CJK Compatibility Ideographs

Last Update:

CJK Compatibility Ideographs is a Unicode block created to contain mostly Han characters that were encoded in multiple locations in other established character...

Word Count : 695

List of Unicode characters

Last Update:

Character Set 2 (MES-2) subset, and some additional related characters. HTML and XML provide ways to reference Unicode characters when the characters...

Word Count : 1827

Hangul Compatibility Jamo

Last Update:

Hangul Compatibility Jamo is a Unicode block containing Hangul characters for compatibility with the South Korean national standard KS X 1001 (formerly...

Word Count : 93

List of precomposed Latin characters in Unicode

Last Update:

featured in Unicode. Some characters in the Letterlike Symbols block can be substituted with characters in the ASCII range. Latin script Unicode collation...

Word Count : 156

Latin script in Unicode

Last Update:

Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended...

Word Count : 488

Halfwidth and fullwidth forms

Last Update:

computing, graphic characters are traditionally classed into fullwidth and halfwidth characters. Unlike monospaced fonts, a halfwidth character occupies half...

Word Count : 605

CJK Compatibility Ideographs Supplement

Last Update:

CJK Compatibility Ideographs Supplement is a Unicode block containing Han characters used only for roundtrip compatibility mapping with planes 3, 4, 5...

Word Count : 80

Mathematical operators and symbols in Unicode

Last Update:

almost all standard characters used in mathematics. Unicode Technical Report #25 provides comprehensive information about the character repertoire, their...

Word Count : 889

CJK Unified Ideographs

Last Update:

characters. During the process called Han unification, the common (shared) characters were identified and named CJK Unified Ideographs. As of Unicode...

Word Count : 2302

Unicode character property

Last Update:

The Unicode Standard assigns various properties to each Unicode character and code point. The properties can be used to handle characters (code points)...

Word Count : 3265

Han unification

Last Update:

an effort by the authors of Unicode and the Universal Character Set to map multiple character sets of the Han characters of the so-called CJK languages...

Word Count : 6317

Regional indicator symbol

Last Update:

The regional indicator symbols are a set of 26 alphabetic Unicode characters (A–Z) intended to be used to encode ISO 3166-1 alpha-2 two-letter country...

Word Count : 1037

Whitespace character

Last Update:

settings) can also affect whitespace. Many of the Unicode space characters were created for compatibility with classic print typography. Even if digital...

Word Count : 2565

Variant Chinese characters

Last Update:

Characters for Hong Kong The Standard Form of National Characters for Taiwan The list of jōyō kanji for Japan The Kangxi Dictionary in Korea Unicode deals...

Word Count : 1483

PDF Search Engine © AllGlobal.net