Garbled text as a result of incorrect character encodings
This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed. Find sources: "Mojibake" – news · newspapers · books · scholar · JSTOR(March 2023) (Learn how and when to remove this message)
This article contains special characters. Without proper rendering support, you may see question marks, boxes, or other symbols.
Mojibake (Japanese: 文字化け; IPA:[mod͡ʑibake], "character transformation") is the garbled or gibberish text that is the result of text being decoded using an unintended character encoding.[1] The result is a systematic replacement of symbols with completely unrelated ones, often from a different writing system.
This display may include the generic replacement character ("�") in places where the binary representation is considered invalid. A replacement can also involve multiple consecutive symbols, as viewed in one encoding, when the same binary code constitutes one symbol in the other encoding. This is either because of differing constant length encoding (as in Asian 16-bit encodings vs European 8-bit encodings), or the use of variable length encodings (notably UTF-8 and UTF-16).
Failed rendering of glyphs due to either missing fonts or missing glyphs in a font is a different issue that is not to be confused with mojibake. Symptoms of this failed rendering include blocks with the code point displayed in hexadecimal or using the generic replacement character. Importantly, these replacements are valid and are the result of correct error handling by the software.
^King, Ritchie (2012). "Will unicode soon be the universal code? [The Data]". IEEE Spectrum. 49 (7): 60. doi:10.1109/MSPEC.2012.6221090.
rendering support, you may see question marks, boxes, or other symbols. Mojibake (Japanese: 文字化け; IPA: [mod͡ʑibake], "character transformation") is the...
seemingly unique stories. Another way of generating meaningless text is mojibake, also called Buchstabensalat ("letter salad") in German, in which an assortment...
communication in the world's widely used languages. However, some glitches such as mojibake (incorrect display of some languages' characters) still remain. In a US...
articles detailing specific character encodings Hexadecimal representations Mojibake – character set mismap Mojikyō – a system ("glyph set") that includes over...
if a character is not in both sets); and was often not done, producing mojibake (semi-readable resulting text, often users learned how to manually decode...
RETURN are widely used in texts using Unicode. In a phenomenon known as mojibake, the C1 code points are improperly decoded according to the Windows-1252...
topics CCSID Character encodings in HTML Charset detection Han unification Hardware code page MICR code Mojibake Variable-length encoding Character sets...
Archive. AppLoc.tmp in the AppPatch folder (%windir%\apppatch) causes a Mojibake issue of Windows Installer. Unofficial solutions of this problems include...
arbitrary text data (in any 8-bit ASCII-like character encoding) via SMTP. Mojibake was still a problem due to differing character set mappings between vendors...
are legible and intelligible in one part of the world but unintelligible mojibake in another. Microsoft adopted a Unicode encoding (first the now-obsolete...
page indication will cause non-ASCII characters in file names to become Mojibake. For example, "ü" may become "³". A different OS may encounter a similar...
often incorrectly pronounced as the archaic pronoun of the same spelling. Mojibake Olde English District Sensational spelling Davis, Lauren (15 January 2015)...
abstract prose lacking concrete meaning, i.e. nonsense Metasemantic poetry Mojibake, random nonsense characters generated by foreign text Nonce word Non-lexical...
quote characters may appear as the generic replacement character � or "mojibake" (gibberish). HTML includes a set of entities for curved quotes: ‘...
interpret the UTF-8 as some other encoding such as CP-1252 and ignore the mojibake for any non-ASCII data. UTF-16 and UTF-32 do not have endianness defined...
improperly assume that a particular encoding is being used, resulting in mojibake. In Windows, the horizontal ellipsis can be inserted with Alt+0133, using...
used so that the message can be correctly displayed by the recipient (see Mojibake). If the sender's or recipient's email address contains non-ASCII characters...
Unicode typefaces Unicode Fonts on Macintosh Code2000 Arial Unicode MS Mojibake Font substitution Wichary, Marcin (September 29, 2020). "When fonts fail"...
on statistical data. In general, incorrect charset detection leads to mojibake. One of the few cases where charset detection works reliably is detecting...
spelling for the municipality of Mogi das Cruzes in São Paulo state, Brazil Mojibake, unreadable characters caused by text encoding problems This disambiguation...
topics CCSID Character encodings in HTML Charset detection Han unification Hardware code page MICR code Mojibake Variable-length encoding Character sets...