Unicode 5.0

  • Unicode Consortium, 2006
  • About 100,000 distinct characters
  • 75 supported scripts
  • UTF-8 Variable Length Character Encoding
    • 1-4 bytes for each character (the original design allowed a maximum of 6)
    • ASCII requires 1 byte
    • Most other alphabetic systems (e.g. Greek, Cyrillic, Hebrew, Arabic) require 2 bytes
    • Chinese-Japanese-Korean (CJK): 3 bytes (sometimes 4 bytes)
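
The byte counts listed above can be checked directly. The following is a minimal sketch in Python, not part of the original material; the helper name utf8_length and the sample characters are chosen here purely for illustration.

```python
def utf8_length(ch: str) -> int:
    """Return the number of bytes UTF-8 uses to encode one character."""
    return len(ch.encode("utf-8"))

# Illustrative samples (chosen for this sketch), one per size class.
samples = [
    ("A", "ASCII letter"),                    # U+0041
    ("\u00e9", "Latin letter with accent"),   # U+00E9
    ("\u042f", "Cyrillic letter"),            # U+042F
    ("\u4e2d", "common CJK ideograph"),       # U+4E2D
    ("\U00020000", "CJK Extension B"),        # U+20000, outside the BMP
]

for ch, description in samples:
    print(f"U+{ord(ch):04X} {description}: {utf8_length(ch)} byte(s)")
```

Characters outside the Basic Multilingual Plane (code points above U+FFFF), such as the rarer CJK ideographs, are the ones that need 4 bytes.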