Many scripts in Unicode, such as Arabic, have special orthographic rules that require certain combinations of letterforms to be combined into special ligature forms. In English, the common ampersand (&) developed from a ligature in which the handwritten Latin letters e and t (spelling et, Latin for and) were combined.[1] The rules governing ligature formation in Arabic can be quite complex, requiring special script-shaping technologies such as the Arabic Calligraphic Engine by Thomas Milo's DecoType.[2]
As of Unicode 15.1, the Arabic script is contained in the following blocks:[3]
The basic Arabic range encodes the standard letters and diacritics, but does not encode contextual forms (U+0621–U+0652 being directly based on ISO 8859-6); and also includes the most common diacritics and Arabic-Indic digits.
The Arabic Supplement range encodes letter variants mostly used for writing African (non-Arabic) languages.
The Arabic Extended-B and Arabic Extended-A ranges encode additional Qur'anic annotations and letter variants used for various non-Arabic languages.
The Arabic Presentation Forms-A range encodes contextual forms and ligatures of letter variants needed for Persian, Urdu, Sindhi and Central Asian languages.
The Arabic Presentation Forms-B range encodes spacing forms of Arabic diacritics, and more contextual letter forms.
The presentation forms are present only for compatibility with older standards, and are not currently needed for coding text.[4]
The Arabic Mathematical Alphabetical Symbols block encodes characters used in Arabic mathematical expressions.
The Indic Siyaq Numbers block contains a specialized subset of Arabic script that was used for accounting in India under the Mughal Empire by the 17th century through the middle of the 20th century.[5][6]
The Ottoman Siyaq Numbers block contains a specialized subset of Arabic script, also known as Siyakat numbers, used for accounting in Ottoman Turkish documents.[6]
^"What is the origin of the ampersand (&)?"
^unicode.org Biography: Thomas Milo - DecoType
^"UAX #24: Script data file". Unicode Character Database. The Unicode Consortium.
^"Section 9.2: Arabic, Arabic Presentation Forms-B" (PDF). The Unicode Standard. The Unicode Consortium. September 2022.
Many scriptsinUnicode, such as Arabic, have special orthographic rules that require certain combinations of letterforms to be combined into special ligature...
The Hanifi Rohingya script is a unified script for the Rohingya language. Rohingya today is written in three scripts, Hanifi, Arabic (Rohingya Fonna), and...
al-ʿarabiyyah), or Arabic abjad, is the Arabicscript as specifically codified for writing the Arabic language. It is written from right-to-left in a cursive style...
also known as the Perso-Arabicscript, is the right-to-left alphabet used for the Persian language. It is a variation of the Arabic alphabet with five additional...
taken from the Arabicscript.[better source needed] The shape of the initial marks changed in 2019 as part of the efforts for Unicode standardization...
other writing systems derived from the Arabicscript, Urdu uses the 0600–06FF Unicode range. Certain glyphs in this range appear visually similar (or...
Anshuman (2 November 2015). "L2/15-233: Proposal to encode the Makasar scriptinUnicode" (PDF). Datta, Amaresh (1987). Encyclopaedia of Indian Literature...
The Arabicscript is the writing system used for Arabic and several other languages of Asia and Africa. It is the second-most widely used alphabetic writing...
The Unicode Standard assigns various properties to each Unicode character and code point. The properties can be used to handle characters (code points)...
uncommon Unicode characters. Without proper rendering support, you may see question marks, boxes, or other symbols. Unicode, formally The Unicode Standard...
other symbols. As of Unicode version 15.1, there are 149,878 characters with code points, covering 161 modern and historical scripts, as well as multiple...
numeral (often called number inUnicode) is a character that denotes a number. The decimal number digits 0–9 are used widely in various writing systems throughout...
August 5, 2012) Unicode FAQ – Greek Language and Script alphabetic test for Greek Unicode range (Alan Wood) numeric test for Greek Unicode range Classical...
support to display the uncommon Unicode characters in this article correctly. Khudabadi (देवदेन/ Devden) was a script used to write the Sindhi language...
or other symbols. The Unicode Consortium and the ISO/IEC JTC 1/SC 2/WG 2 jointly collaborate on the list of the characters in the Universal Coded Character...
Unicode Standard in April 2008 with the release of version 5.1. In version 6.3, the support of pasangan and some characters from Old Sundanese script...
other Indic scripts or of the Arabicscript. There is no apparent logic to the order; this has been interpreted as suggesting that the script was scrambled...
need rendering support to display the uncommon Unicode characters in this article correctly. The Cham script is a Brahmic abugida used to write Cham, an...
alphabet. Library resources about Latin script Online books Resources in your library Resources in other libraries Unicode collation chart—Latin letters sorted...
display the uncommon Unicode characters in this article correctly. The Bamum scripts are an evolutionary series of six scripts created for the Bamum...