Unicode

  • Single encoding
  • With Unicode 3.1, about 100,000 characters (more than 16 bits!)
  • Corresponding character property database.
  • Not a panacea
    • Encoding only one tiny bit of internationalization.
    • Using Unicode as enhanced ASCII is not enough.
    • Need to deal with legacy encodings