The characters in the XML document can be encoded in different formats. XML uses the Unicode (UTF-8 or UTF-16) character set by default, but other encodings can be used if they are declared in the XML declaration at the beginning of the document.
Related categories 3
Sites 8
- The Unicode Consortium The official site with information and resources for programmers, implementers and others involved in computer globalization work.
- Expressing Simple Dublin Core in RDF/XML Explains how to encode the Dublin Core Metadata Element Set V1.1 (DCMES) in XML, provides a DTD to validate the documents and describes a method to link them from web pages.
- ISO 8859 Alphabet Soup A commented graphical overview of the ISO 8859 character sets by Roman Czyborra.
- Entry-Level Unicode for XML A "just enough education to perform" guide to Unicode and ISO 10646 for authors of XML parsers and other software that processes XML.
- Chinese XML FAQ (in English) FAQ about Chinese document encodings, and how they relate to XML. Also includes links and software.
- MSDN: How to Encode XML Data Article explaining how character encoding works and specifically how it works in XML and the MSXML DOM.
- What ASN.1 can offer to XML XML Encoding Rules for Abstract Syntax Notation One (ASN.1) and the ASN.1 Markup Language [ITU-T Rec. X.693 | ISO/IEC 8825-4].
- XER - XML Encoding Rules Idea to standardize rules that would allow for information described in ASN.1 (Abstract Syntax Notation) to be carried in XML.