Non-printing format effectors and control codes included in Unicode
Many Unicode characters are used to control the interpretation or display of text, but these characters themselves have no visual or spatial representation. For example, the null character (U+0000 NULL) is used in C programming environments to indicate the end of a string of characters. As a result, such programs need only a single starting memory address for a string (as opposed to a starting address and a length), since the string ends where the program reads the null character.
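The null-termination convention described above can be sketched in Python by treating a `bytes` object as a stand-in for raw memory. The helper name `read_c_string` and the sample buffer are hypothetical and chosen only for illustration; real C code would walk the memory through a pointer rather than call a search function.

```python
def read_c_string(buffer: bytes, start: int = 0) -> bytes:
    """Return the bytes from `start` up to, but not including, the first NUL byte.

    Only a starting offset is needed: the end of the string is discovered
    by scanning for the null character, not by consulting a stored length.
    """
    end = buffer.find(b"\x00", start)
    if end == -1:
        # Hypothetical policy for this sketch: an unterminated string is an error.
        raise ValueError("no terminating NUL byte found")
    return buffer[start:end]


# Two strings stored back to back, each terminated by U+0000.
memory = b"hello\x00world\x00"

first = read_c_string(memory, 0)   # scan starts at offset 0
second = read_c_string(memory, 6)  # scan starts just after the first NUL

print(first, second)
```

The cost of this design is that finding the length of a string requires scanning it, and a missing terminator lets a read run past the intended end, which is why the sketch raises an error instead of returning the rest of the buffer.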
In the narrowest sense, a control code is a character with the general category Cc. This category comprises the C0 and C1 control codes, a concept defined in ISO/IEC 2022 and inherited by Unicode; the most common set is defined in ISO/IEC 6429. Control codes are handled differently from ordinary Unicode characters: for example, they are not assigned character names (although they are assigned normative formal aliases).[1] In a broader sense, other non-printing format characters, such as those used in bidirectional text, are also referred to as control characters by software;[2] these are mostly assigned to the general category Cf (format), which is used for format effectors introduced and defined by Unicode itself.
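The distinction between the categories Cc and Cf can be inspected with Python's standard `unicodedata` module. This is a minimal sketch: the function name `control_kind` and the chosen sample characters are assumptions made for illustration, and the results reflect whichever version of the Unicode Character Database ships with the Python interpreter in use.

```python
import unicodedata


def control_kind(ch: str) -> str:
    """Classify one character as a control code (Cc), a format character (Cf), or other."""
    category = unicodedata.category(ch)
    if category == "Cc":
        return "control code"
    if category == "Cf":
        return "format character"
    return "other"


samples = {
    "\u0000": "C0 control (NULL)",
    "\u0085": "C1 control (NEXT LINE)",
    "\u200e": "LEFT-TO-RIGHT MARK",
    "A": "ordinary letter",
}

for ch, description in samples.items():
    # Control codes have no character name, so name() falls back to the default.
    name = unicodedata.name(ch, "<no name>")
    print(f"U+{ord(ch):04X} {description}: {control_kind(ch)}, name={name}")
```

Passing a default to `unicodedata.name` avoids the `ValueError` it would otherwise raise for the Cc characters, which is a direct consequence of control codes not being assigned character names.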
[1] "Name Aliases". Unicode Character Database. Unicode Consortium.
[2] Segan, Danilo. "Towards a localised desktop". For some cases where automatic decision making doesn't work, you can manually add specific direction markers by right-clicking the text field, choosing "Insert Unicode control character" from the menu, and selecting appropriate direction mark. This would allow you, for instance, to start your RTL text with an otherwise LTR word (such as "GNOME").