UTF-8

  • Encoding of Unicode
  • Can represent all 1 million+ Unicode code points (U+0000 to U+10FFFF)
  • Variable character length: 1 to 4 bytes per code point
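      /* p is a const char * to valid, NUL-terminated UTF-8 */
      while (*p)
        {
           gunichar c = g_utf8_get_char (p);  /* decode the code point at p */
           p = g_utf8_next_char (p);          /* advance by its byte length */
        }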
  • Very compatible with legacy code
    • Everything is still a char *
    • ASCII (0x00-0x7F) is unchanged; other characters use only bytes 0x80-0xFF, the same range as ISO-8859-1, etc.
    • Can use printf and related functions
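
The points above can be sketched without GLib in plain C. This is an illustration only, not how GLib implements it: utf8_seq_len, utf8_count and utf8_demo are hypothetical helper names, the input is assumed to be NUL-terminated, and overlong or otherwise invalid encodings are not fully rejected.

```c
#include <stddef.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical helper (not a GLib function): byte length of the UTF-8
   sequence introduced by lead byte b, or 0 if b cannot start a sequence. */
size_t utf8_seq_len(unsigned char b)
{
    if (b < 0x80)           return 1;  /* 0xxxxxxx: plain ASCII */
    if ((b & 0xE0) == 0xC0) return 2;  /* 110xxxxx */
    if ((b & 0xF0) == 0xE0) return 3;  /* 1110xxxx */
    if ((b & 0xF8) == 0xF0) return 4;  /* 11110xxx */
    return 0;                          /* continuation or invalid byte */
}

/* Hypothetical helper: count code points in a NUL-terminated string.
   Returns (size_t)-1 on a bad lead byte or a truncated sequence. */
size_t utf8_count(const char *s)
{
    size_t n = 0;
    while (*s) {
        size_t len = utf8_seq_len((unsigned char)*s);
        if (len == 0)
            return (size_t)-1;
        /* Every following byte must look like 10xxxxxx; this check also
           stops at the terminating NUL, so we never read past the end. */
        for (size_t i = 1; i < len; i++)
            if (((unsigned char)s[i] & 0xC0) != 0x80)
                return (size_t)-1;
        s += len;
        n++;
    }
    return n;
}

/* The string is still a char *, so strlen and printf work unchanged;
   strlen counts bytes, utf8_count counts code points. */
void utf8_demo(void)
{
    const char *s = "na\xC3\xAF" "ve";  /* "naive" with U+00EF, a 2-byte character */
    printf("%s: %zu bytes, %zu code points\n", s, strlen(s), utf8_count(s));
}
```

Calling utf8_demo() from main reports 6 bytes and 5 code points for the sample string; the text itself displays correctly only if the terminal expects UTF-8.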