The Unicode Character Set
- American Standard Code for Information Interchange (ASCII) 0-255 'A' - 65
- XML provides full Support for the two-byte Unicode Character Set. 0-65,535
http://www.unicode.org
- XML Documents written in:
- ASCII
- UTF-8 Compressed version of Unicode (uses 8 bits to represent characters)
<?xml version="1.0" encoding="UTF-8"?>
- XML defines character reference to encode Unicode characters.
© < π
- Universal Character System (UCS ISO 10646)
- 4 bytes per symbol
- UCS-2 and UCS-4 encoding