Homoglyph information


The homoglyphs U+0061 a LATIN SMALL LETTER A and U+0430 а CYRILLIC SMALL LETTER A, overlaid. In the image, both characters are set in Helvetica LT Std Roman.

In orthography and typography, a homoglyph is one of two or more graphemes, characters, or glyphs with shapes that appear identical or very similar but may have differing meaning. The designation is also applied to sequences of characters sharing these properties.

In 2008, the Unicode Consortium published its Technical Report #36[1] on a range of issues deriving from the visual similarity of characters, both within a single script and between different scripts.
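The cross-script case can be illustrated with a short Python sketch. It uses the standard `unicodedata` module to show that the Latin and Cyrillic "a" are distinct code points, and then flags strings whose letters come from more than one script. The helper names (`describe`, `scripts_used`, `looks_mixed_script`) are hypothetical, and guessing the script from the first word of a character's Unicode name is a rough heuristic assumed here for illustration, not the procedure described in UTR #36.

```python
import unicodedata


def describe(ch):
    """Return the code point and Unicode name of a single character."""
    return f"U+{ord(ch):04X} {unicodedata.name(ch)}"


def scripts_used(text):
    """Rough script guess (assumption): first word of each letter's Unicode name."""
    return {
        unicodedata.name(ch, "UNKNOWN").split()[0]
        for ch in text
        if ch.isalpha()
    }


def looks_mixed_script(text):
    """True if the letters in `text` appear to come from more than one script."""
    return len(scripts_used(text)) > 1


latin_a = "\u0061"     # LATIN SMALL LETTER A
cyrillic_a = "\u0430"  # CYRILLIC SMALL LETTER A

print(describe(latin_a))
print(describe(cyrillic_a))
print(latin_a == cyrillic_a)                # the two strings compare unequal
print(looks_mixed_script("p\u0430ypal"))    # Cyrillic "a" hidden among Latin letters
```

A check like this only catches mixing between scripts; it does nothing for look-alikes within one script, such as the digit 1 and lowercase l.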

Examples of homoglyphic symbols are (a) the diaeresis and the umlaut (both a pair of dots, with different meanings, yet encoded with the same code points); and (b) the hyphen and the minus sign (both a short horizontal stroke, with different meanings, and often encoded with the same code point). Among digits and letters, the digit 1 and lowercase l, and the digit 0 and capital O, are always encoded separately, but many typefaces give each pair very similar glyphs. Virtually every homoglyphic pair of characters can be differentiated graphically with clearly distinguishable glyphs and separate code points, but this is not always done. Typefaces that do not emphatically distinguish the one/el and zero/oh homoglyphs are considered unsuitable for writing formulas, URLs, source code, IDs and other text where characters cannot always be differentiated without context. Fonts that distinguish these glyphs, for example by means of a slashed zero, are preferred for those uses.
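When the typeface cannot be controlled, one complementary approach for IDs is to avoid the confusable characters altogether. The following Python sketch builds an alphabet without the one/el and zero/oh pairs and draws random identifiers from it. The names `AMBIGUOUS`, `SAFE_ALPHABET` and `make_id` are hypothetical, and the inclusion of capital I in the excluded set is an added assumption beyond the pairs named above.

```python
import secrets
import string

# Characters that many typefaces render almost identically.
# "0"/"O" and "1"/"l" come from the text; capital "I" is an extra assumption.
AMBIGUOUS = set("0O1lI")

# Letters and digits that remain once the confusable ones are dropped.
SAFE_ALPHABET = "".join(
    c for c in string.ascii_letters + string.digits if c not in AMBIGUOUS
)


def make_id(length=8):
    """Return a random identifier drawn only from the unambiguous alphabet."""
    return "".join(secrets.choice(SAFE_ALPHABET) for _ in range(length))


print(make_id(12))
```

The trade-off is a slightly smaller alphabet (57 symbols instead of 62), so an identifier needs marginally more characters for the same number of possible values.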

  1. "UTR #36: Unicode Security Considerations". www.unicode.org.

24 Related for: Homoglyph information

Homoglyph

In orthography and typography, a homoglyph is one of two or more graphemes, characters, or glyphs with shapes that appear identical or very similar but...

Word Count : 1948

IDN homograph attack

they are homographs, hence the term for the attack, although technically homoglyph is the more accurate term for different characters that look alike). For...

Word Count : 3779

Allograph

concept of the allograph may be compared and contrasted with that of the homoglyph – glyphs of different meaning that are visually similar. For example,...

Word Count : 642

List of Unicode characters

Combining character Compatibility characters Duplicate characters Equivalence Homoglyph Precomposed character list Z-variant Variation sequences Regional indicator...

Word Count : 1827

Unicode

this approach also has issues, requiring security measures relating to homoglyph attacks. Whether the lowercase letter I is expected to retain its tittle...

Word Count : 10727

C

halfwidth and fullwidth forms for legacy CJK font compatibility. The Cyrillic homoglyph of the Latin ⟨C⟩ has a separate encoding: U+0421 С CYRILLIC CAPITAL LETTER...

Word Count : 2454

A

fullwidth forms for legacy CJK font compatibility. The Cyrillic and Greek homoglyphs of the Latin ⟨A⟩ have separate encodings U+0410 А CYRILLIC CAPITAL LETTER...

Word Count : 2780

B

fullwidth forms for legacy CJK font compatibility. The Cyrillic and Greek homoglyphs of the Latin ⟨B⟩ have separate encodings U+0412 В CYRILLIC CAPITAL LETTER...

Word Count : 1387

Illumos

do not clearly distinguish a lowercase L from an uppercase i: Il (see homoglyph). The project name is a combination of words illuminare from Latin for...

Word Count : 1055

Duplicate characters in Unicode

characters that are rendered as identical glyphs or near-identical glyphs (homoglyphs), either because they are historically cognate (such as Greek Η vs. Latin...

Word Count : 1260

Faux Cyrillic

used with faux Cyrillic letters in lieu of their Latin counterparts. Homoglyph IDN homograph attack Foreign branding Heavy metal umlaut for a similar...

Word Count : 680

Calibri

Adobe InDesign. One potential source of confusion in Calibri is a visible homoglyph, a pair of easily confused characters: the lowercase letter L and the...

Word Count : 1693

European vehicle registration plate

Greece uses a combination of three letters (before only two) that are homoglyphs of Latin letters, i.e. A, B, E, Z, H, I, K, M, N, O, P, T, Y, X in Greek...

Word Count : 5309

Serif

Angsana UPC, Kinnari) Look up serif in Wiktionary, the free dictionary. Homoglyph Ming (typeface), a similar style in Asian typefaces The analogs of serifs...

Word Count : 6378

Oy

slang interjection used to get someone's attention Uk (Оу оу), a cyrillic homoglyph of Oy Osakeyhtiö, often abbreviated Oy, a type of Finnish limited company...

Word Count : 247

Nameprep

needed] of this being VeriSign's handling of IDNA names in .com and .net). Homoglyph Unicode Internationalization International Components for Unicode (ICU...

Word Count : 273

Leet

word. For more casual use of leet, the primary strategy is to use quasi-homoglyphs, symbols that closely resemble (to varying degrees) the letters for which...

Word Count : 3669

Slashed zero

telecommunications. It thus helps to differentiate characters that would otherwise be homoglyphs. It was commonly used during the punch card era, when programs were typically...

Word Count : 1783

Trojan Source

Retrieved 2022-01-17. "VU#999008 - Compilers permit Unicode control and homoglyph characters". www.kb.cert.org. Archived from the original on 2022-01-21...

Word Count : 951

ASCII art

art scene, Category:Artscene groups Software: AAlib, cowsay Unicode: Homoglyph, Duplicate characters in Unicode Carlson, Wayne E. (2003). "An Historical...

Word Count : 5377

Turned P

Wandl-Vogt, Eveline (2011). Revised proposal to encode "Teuthonista" phonetic characters in the UCS (PDF). Latin script P Komi De, a homoglyph of this letter...

Word Count : 268

Macedonian alphabet

on Dzělo, the eighth letter of the early Cyrillic alphabet. Although a homoglyph to the Latin letter S, the two letters are not directly related. Both...

Word Count : 2637

Atkinson Hyperlegible

including Times New Roman and Frutiger, they found that distinguishing among homoglyphs, and even among some characters that do not appear very similar to fully...

Word Count : 766

Interleukin 1 beta

many authors of scientific manuscripts make the minor error of using a homoglyph, sharp s (ß), instead of beta (β), mentions of "IL-1ß" [sic] often become...

Word Count : 3305