The Unicode Standard assigns various properties to each Unicode character and code point.[1][2]
The properties can be used to handle characters (code points) in processes, like in line-breaking, script direction right-to-left or applying controls. Some "character properties" are also defined for code points that have no character assigned and code points that are labeled like "<not a character>". The character properties are described in Standard Annex #44.[2]
Properties have levels of forcefulness: normative, informative, contributory, or provisional. For simplicity of specification, a character property can be assigned by specifying a continuous range of code points that have the same property.[3]
^"Character Properties" (PDF). The Unicode Standard Version 15. Mountain View, CA: The Unicode Consortium. September 2022. ISBN 978-1-936213-32-0. Retrieved 2022-09-16.
^ ab"Unicode Standard Annex #44: Unicode Character Database". Unicode. 2017-06-14.
^"Unicode Standard Annex #44: Unicode Character Database, 4.2.3 Code Point Ranges". Unicode. 2022-09-02.
and 24 Related for: Unicode character property information
The Unicode Standard assigns various properties to each Unicodecharacter and code point. The properties can be used to handle characters (code points)...
article contains special characters. Without proper rendering support, you may see question marks, boxes, or other symbols. The Unicode Consortium and the ISO/IEC...
article contains special characters. Without proper rendering support, you may see question marks, boxes, or other symbols. As of Unicode version 15.1, there...
standard characters used in mathematics. Unicode Technical Report #25 provides comprehensive information about the character repertoire, their properties, and...
compatibility character to one or more other UCS characters. By setting a character's decomposition property, Unicode establishes that character as a compatibility...
uncommon Unicodecharacters. Without proper rendering support, you may see question marks, boxes, or other symbols. Unicode, formally The Unicode Standard...
with earlier character sets, such as ² or ②, and composite characters such as ½. Grouped by their numerical property as used in a text, Unicode has four values...
A Unicode block is one of several contiguous ranges of numeric character codes (code points) of the Unicodecharacter set that are defined by the Unicode...
Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended...
The Universal Coded Character Set (UCS, Unicode) is a standard set of characters defined by the international standard ISO/IEC 10646, Information technology...
Many Unicodecharacters are used to control the interpretation or display of text, but these characters themselves have no visual or spatial representation...
occupies half the width of a fullwidth character, hence the name. Halfwidth and Fullwidth Forms is also the name of a Unicode block U+FF00–FFEF, provided so that...
Edition). World Wide Web Consortium. "9.1 Whitespace". W3CHTML 4.01 Specification. World Wide Web Consortium. Property List of UnicodeCharacter Database...
In Unicode, a Private Use Area (PUA) is a range of code points that, by definition, will not be assigned characters by the Unicode Consortium. Three private...
while ? makes them greedy. Unicode defines several properties for each character. Patterns in PCRE2 can match these properties: e.g. \p{Ps}.*?\p{Pe} would...
forest General Category of a Unicode symbol, see Unicodecharacterproperty#General Category gc, the Go compiler GC (character), a fictitious cat on 1980s...
contains Unicode Braille characters. Without proper rendering support, you may see question marks, boxes, or other symbols instead of Braille characters. The...
Yi Syllables is a Unicode block containing the 1,165 characters (1,164 phonemic syllables plus 1 syllable iteration mark) of the Liangshan Standard Yi...
symbol has a code point in Unicode at U+2117 ℗ SOUND RECORDING COPYRIGHT, with the supplementary Unicodecharacterproperty names, "published" and "phonorecord...
text. Unicode defines the semantics of a character by its character identity and its normative properties, one of these being the character's general...
Unicode emoticons or emojis. Without proper rendering support, you may see question marks, boxes, or other symbols instead of the intended characters...
Combining Diacritical Marks is a Unicode block containing the most common combining characters. It also contains the character "Combining Grapheme Joiner"...