Canonicalisation (1)

The canonical form of an XML document is the physical representation of the document produced by the canonicalisation method, which applies the following changes.

Encoding and characters

The document is encoded in UTF-8.
Line breaks are normalized to #xA on input, before parsing.
Whitespace outside of the document element and within start and end tags is normalized.
All whitespace in character content is retained (excluding characters removed during line feed normalization).
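
The encoding and whitespace rules listed here can be observed with a short Python sketch. It assumes Python 3.8 or later, where the standard library provides `xml.etree.ElementTree.canonicalize`; that function implements Canonical XML 2.0, which may not be the exact version this material has in mind, but it is expected to behave the same way for the rules shown. The helper name `canonical_bytes` and the sample document are hypothetical, chosen only for illustration.

```python
import xml.etree.ElementTree as ET


def canonical_bytes(xml_text: str) -> bytes:
    """Return a canonical form of xml_text as UTF-8 encoded bytes.

    Hypothetical helper: canonicalize() returns text, so the UTF-8
    encoding step is applied explicitly here.
    """
    canonical_text = ET.canonicalize(xml_text)
    return canonical_text.encode("utf-8")


# Hypothetical sample with:
#   - extra whitespace inside the start tag and the empty-element tag
#   - a CRLF line break in character content
#   - trailing spaces in character content
#   - blank lines after the document element
sample = '<doc   a="1"\n  >caf\u00e9\r\nline2  <b  /></doc>\n\n'

result = canonical_bytes(sample)
print(result)
```

In the printed bytes, the whitespace inside the tags and after the document element should be gone, the CRLF should appear as a single line feed, and the two spaces after `line2` should still be present, since whitespace in character content is retained.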