Global Information Lookup Global Information

Unicode character property information


The Unicode Standard assigns various properties to each Unicode character and code point.[1][2]

The properties can be used to handle characters (code points) in processes, like in line-breaking, script direction right-to-left or applying controls. Some "character properties" are also defined for code points that have no character assigned and code points that are labeled like "<not a character>". The character properties are described in Standard Annex #44.[2]

Properties have levels of forcefulness: normative, informative, contributory, or provisional. For simplicity of specification, a character property can be assigned by specifying a continuous range of code points that have the same property.[3]

  1. ^ "Character Properties" (PDF). The Unicode Standard Version 15. Mountain View, CA: The Unicode Consortium. September 2022. ISBN 978-1-936213-32-0. Retrieved 2022-09-16.
  2. ^ a b "Unicode Standard Annex #44: Unicode Character Database". Unicode. 2017-06-14.
  3. ^ "Unicode Standard Annex #44: Unicode Character Database, 4.2.3 Code Point Ranges". Unicode. 2022-09-02.

and 24 Related for: Unicode character property information

Request time (Page generated in 0.8585 seconds.)

Unicode character property

Last Update:

The Unicode Standard assigns various properties to each Unicode character and code point. The properties can be used to handle characters (code points)...

Word Count : 3265

Universal Character Set characters

Last Update:

article contains special characters. Without proper rendering support, you may see question marks, boxes, or other symbols. The Unicode Consortium and the ISO/IEC...

Word Count : 6987

List of Unicode characters

Last Update:

article contains special characters. Without proper rendering support, you may see question marks, boxes, or other symbols. As of Unicode version 15.1, there...

Word Count : 1827

Mathematical operators and symbols in Unicode

Last Update:

standard characters used in mathematics. Unicode Technical Report #25 provides comprehensive information about the character repertoire, their properties, and...

Word Count : 955

Unicode compatibility characters

Last Update:

compatibility character to one or more other UCS characters. By setting a character's decomposition property, Unicode establishes that character as a compatibility...

Word Count : 3325

Unicode

Last Update:

uncommon Unicode characters. Without proper rendering support, you may see question marks, boxes, or other symbols. Unicode, formally The Unicode Standard...

Word Count : 10732

Numerals in Unicode

Last Update:

with earlier character sets, such as ² or ②, and composite characters such as ½. Grouped by their numerical property as used in a text, Unicode has four values...

Word Count : 1599

Unicode block

Last Update:

A Unicode block is one of several contiguous ranges of numeric character codes (code points) of the Unicode character set that are defined by the Unicode...

Word Count : 825

Latin script in Unicode

Last Update:

Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended...

Word Count : 488

Bidirectional text

Last Update:

the character will become LTR, in an RTL document, it will become RTL). v t e Bidirectional character type (Bidi_Class Unicode character property)[1]...

Word Count : 1671

Universal Coded Character Set

Last Update:

The Universal Coded Character Set (UCS, Unicode) is a standard set of characters defined by the international standard ISO/IEC 10646, Information technology...

Word Count : 1861

Unicode control characters

Last Update:

Many Unicode characters are used to control the interpretation or display of text, but these characters themselves have no visual or spatial representation...

Word Count : 2015

Halfwidth and fullwidth forms

Last Update:

occupies half the width of a fullwidth character, hence the name. Halfwidth and Fullwidth Forms is also the name of a Unicode block U+FF00–FFEF, provided so that...

Word Count : 605

Whitespace character

Last Update:

Edition). World Wide Web Consortium. "9.1 Whitespace". W3CHTML 4.01 Specification. World Wide Web Consortium. Property List of Unicode Character Database...

Word Count : 2565

Private Use Areas

Last Update:

In Unicode, a Private Use Area (PUA) is a range of code points that, by definition, will not be assigned characters by the Unicode Consortium. Three private...

Word Count : 2996

Perl Compatible Regular Expressions

Last Update:

while ? makes them greedy. Unicode defines several properties for each character. Patterns in PCRE2 can match these properties: e.g. \p{Ps}.*?\p{Pe} would...

Word Count : 2561

GC

Last Update:

forest General Category of a Unicode symbol, see Unicode character property#General Category gc, the Go compiler GC (character), a fictitious cat on 1980s...

Word Count : 496

Braille Patterns

Last Update:

contains Unicode Braille characters. Without proper rendering support, you may see question marks, boxes, or other symbols instead of Braille characters. The...

Word Count : 951

Magnetic ink character recognition

Last Update:

- Publications". rbi.org.in. Unicode Consortium (2019-09-08). "Derived Age". Unicode Character Database: Derived Property Data. Freytag, Asmus; McGowan...

Word Count : 2004

Yi Syllables

Last Update:

Yi Syllables is a Unicode block containing the 1,165 characters (1,164 phonemic syllables plus 1 syllable iteration mark) of the Liangshan Standard Yi...

Word Count : 774

Sound recording copyright symbol

Last Update:

symbol has a code point in Unicode at U+2117 ℗ SOUND RECORDING COPYRIGHT, with the supplementary Unicode character property names, "published" and "phonorecord...

Word Count : 1005

Religious and political symbols in Unicode

Last Update:

text. Unicode defines the semantics of a character by its character identity and its normative properties, one of these being the character's general...

Word Count : 653

Miscellaneous Symbols

Last Update:

Unicode emoticons or emojis. Without proper rendering support, you may see question marks, boxes, or other symbols instead of the intended characters...

Word Count : 625

Combining Diacritical Marks

Last Update:

Combining Diacritical Marks is a Unicode block containing the most common combining characters. It also contains the character "Combining Grapheme Joiner"...

Word Count : 136

PDF Search Engine © AllGlobal.net