15. High bits are used to indicate how many bytes are used to
represent a specific character. Software can easily read a
UTF8 stream, even starting in the middle.
http://tools.ietf.org/html/rfc3629#section-3
19. Most spoken languages can be represented in 3 bytes,
the "Basic Multilingual Plane"
http://www.siriusict.com/2010/08/06/
character-encoding-unicode-utf-8-and-a-bit-of-chauvinism-explained-for-the-masses-2/
20. http://globalmoxie.com/blog/klingon-not-spoken-here.shtml
In May 2001, the Unicode Technical Committee rejected the Klingon proposal;
however, Michael Everson created a mapping of pIqaD into the Private Use
Area of Unicode, which are listed in the ConScript Unicode Registry
(U+F8D0 to U+F8FF).
http://en.wikipedia.org/wiki/Klingon_writing_systems
21. The tengwar font has been proposed for the Unicode standard. The codepoints
are subject to change; the range U+016080 to U+0160FF in the SMP is
tentatively allocated for tengwar according to the current Unicode roadmap.
http://en.wikipedia.org/wiki/Tengwar
22. You need to have an appropriate font installed
to use unicode.
http://en.wikipedia.org/wiki/Tengwar
26. HTTP headers
• You can specify what character set you
want back when you send a form post
• This is informational for the server
• Just setting these won’t change how your
app behaves, unless your web app has code
for that