2010/12/20 Martijn van Oosterhout <kleptog@svana.org>:
> On Mon, Dec 20, 2010 at 09:03:56AM +0900, Itagaki Takahiro wrote:
>
>> UTF-8 is not a superset of all encodings.
>
> I think you mean Unicode is not a superset of all character sets. I've
> heard this before but never found what's missing. [citation needed]?
From <URL:http://en.wikipedia.org/wiki/Japanese_language_and_computers#Character_encodings>:
"Unicode is supposed to solve all encoding problems in all languages
of the world. [..] There are still controversies. For Japanese, the
kanji characters have been unified with Chinese, that is a character
considered to be the same in both Japanese and Chinese have been given
one and the same code number in Unicode, even if they look a little
different. This process, called Han unification, has caused
controversy."
For examples (my browser doesn't show any differences though, probably
because I don't have the corresponding fonts):
<URL:http://en.wikipedia.org/wiki/Han_unification#Examples_of_language_dependent_characters>
Nicolas