Global Information Lookup Global Information

Canonical XML information


Canonical XML is a normal form of XML, intended to allow relatively simple comparison of pairs of XML documents for equivalence; for this purpose, the Canonical XML transformation removes non-meaningful differences between the documents. Any XML document can be converted to Canonical XML.

For example, XML permits whitespace to occur at various points within start-tags, and attributes to be specified in any order. Such differences are seldom if ever used to convey meaning, and so these forms are generally considered equivalent:

   <p class="a" secure="1">
   <p     secure   = "1"
             class='a'   >

In converting an arbitrary XML document to Canonical XML, attributes are encoded in a normative order (alphabetical by name), and with normative spacing and quoting (though with all namespace declarations placed ahead of regular attributes, and namespaced attributes sorted by namespace rather than prefix or qualified name). Thus, the second form above would be converted to the first.

Canonical XML specifies a number of other details, some of which are:

  • the UTF-8 encoding is used
  • line-ends are represented using the newline character 0x0A
  • whitespace in attribute values is normalized
  • entity references and non-special character references are expanded
  • CDATA sections are replaced with their character content
  • empty elements are encoded as start/end pairs, not using the special empty-element syntax
  • default attributes are made explicit
  • superfluous namespace declarations are deleted

According to the W3C, if two XML documents have the same canonical form, then the two documents are logically equivalent within the given application context (except for limitations regarding a few unusual cases).

However, in a special context users might care about special semantics beyond the generic logical equivalence with which Canonical XML is associated. For example, a steganography system could conceal information in an XML document by varying whitespace, attribute quoting and order, the use of hexadecimal vs. decimal numeric character references, and so on. Obviously converting such a file to Canonical XML would lose those specialized semantics. On the other hand, XML files that differ in their use of upper- vs. lower-case, or that use archaic versus modern spelling, and so on, might be considered equivalent for certain purposes. Such contexts are beyond the scope of Canonical XML.

and 22 Related for: Canonical XML information

Request time (Page generated in 0.7792 seconds.)

Canonical XML

Last Update:

Canonical XML is a normal form of XML, intended to allow relatively simple comparison of pairs of XML documents for equivalence; for this purpose, the...

Word Count : 393

Canonicalization

Last Update:

the canonical form of the URL. A Canonical XML document is by definition an XML document that is in XML Canonical form, defined by The Canonical XML specification...

Word Count : 1337

Canonical

Last Update:

System record Canonical S-expressions, a binary encoding form of a subset of general S-expression Canonical XML, a normal form of XML, intended to allow...

Word Count : 602

XML Signature

Last Update:

XML Signature (also called XMLDSig, XML-DSig, XML-Sig) defines an XML syntax for digital signatures and is defined in the W3C recommendation XML Signature...

Word Count : 1290

XML

Last Update:

XML schema (XSD) defines the necessary metadata for interpreting and validating XML. (This is also referred to as the canonical schema.): 135  An XML...

Word Count : 7031

Numeronym

Last Update:

"Origin Of The Abbreviation I18n". Retrieved September 14, 2005. "Canonical XML". W3C. Retrieved November 11, 2007. "INTEROPERABILITY.net". Archived...

Word Count : 942

Canonical model

Last Update:

Information architecture List of XML schemas Service-oriented architecture Web service XML schema "Benefits of a Canonical Data Model (CDM) in a SOA environment"...

Word Count : 410

Topic map

Last Update:

include: CXTM – Canonical XML Topic Maps format (canonicalization of topic maps) CTM – a Compact Topic Maps Notation (not based on XML) GTM – a Graphical...

Word Count : 1254

Binary XML

Last Update:

binary XML format; the source code is open and well-commentated. BSON MessagePack CBOR Canonical S-expressions The performance woe of binary XML http://webservices...

Word Count : 555

Content Assembly Mechanism

Last Update:

Content Assembly Mechanism (CAM) is an XML-based standard for creating and managing information exchanges that are interoperable and deterministic descriptions...

Word Count : 1363

Canonical schema pattern

Last Update:

In software engineering, Canonical Schema is a design pattern, applied within the service-orientation design paradigm, which aims to reduce the need for...

Word Count : 884

List of web service protocols

Last Update:

Protocol CTS - Canonical Text Services Protocol E-Business XML Hessian Internet Open Trading Protocol JSON-RPC JSON-WSP SOAP - outgrowth of XML-RPC, originally...

Word Count : 106

Unicode equivalence

Last Update:

provides two such notions, canonical equivalence and compatibility. Code point sequences that are defined as canonically equivalent are assumed to have...

Word Count : 1902

Message Transmission Optimization Mechanism

Last Update:

usually used with the XOP (XML-binary Optimized Packaging). MTOM only optimizes element content that is in the canonical lexical representation of the...

Word Count : 377

Uniform Resource Identifier

Last Update:

Technical Architecture Group (TAG) published a guide to best practices and canonical URIs for publishing multiple versions of a given resource. For example...

Word Count : 4387

Jon Bosak

Last Update:

creation of the XML specification at the W3C. From 1996–2008, he worked for Sun Microsystems. Tim Bray, who was one of the editors of the XML specification...

Word Count : 657

URI fragment

Last Update:

Notably they cannot begin with a digit or hyphen. xml:id is one of the few generic XML attributes, e.g., xml:lang, which can be used without explicitly declaring...

Word Count : 2676

Entity Framework

Last Update:

XML. Visual Studio also provides the Entity Designer for visual creation of the EDM and the mapping specification. The output of the tool is the XML file...

Word Count : 4070

SciELO

Last Update:

versions) JATS Preview Stylesheets (canonical XSLT conversion), see classical (2013) conversor. eLife Lens converts NLM XML to JSON for displaying using HTML...

Word Count : 1566

Improving Web Advertising Business Group

Last Update:

Recommendations ActivityPub Activity Streams ARIA Canonical XML CDF CSS Animations Flexbox Grid DOM Geolocation API HTML HTML5 IndexedDB ITS JSON-LD Linked...

Word Count : 350

Whitespace character

Last Update:

In XML attribute values, sequences of whitespace characters are treated as a single space when the document is read by a parser. Whitespace in XML element...

Word Count : 2565

IBM App Connect Enterprise

Last Update:

map data from one format to another (for instance, Cobol Copybook to canonical XML). Other nodes evaluate content of data and route the flow differently...

Word Count : 3527

PDF Search Engine © AllGlobal.net