Обсуждение: Multiple languages in one database
I need to store multiple languages in one database (English, Spanish, Chinese, Korean, ???). At first I thought I would just us Unicode, but then I realized that it is only UTF-8 and it is my understanding that UTF-8 is insufficient for Chinese and Korean. I state this because I am starting to think that it is possible this assumption is incorrect. My question is this, is anyone storing multiple languages (including non-latin based languages such as Chinese and Korean) in one DB and if so how? My need is to store strings of text for each support language. Ex: for a given English string there would also be a Spanish, Chinese, and Korean string. This would then be used for language specific output in web pages, print (PDF?), and in email. I would really appreciate any help or pointers on this. Thanks, -Mont
On Mar 18, 2005, at 5:54 PM, Mont Rothstein wrote: > I need to store multiple languages in one database (English, Spanish, > Chinese, Korean, ???). At first I thought I would just us Unicode, > but then I realized that it is only UTF-8 and it is my understanding > that UTF-8 is insufficient for Chinese and Korean. I state this > because I am starting to think that it is possible this assumption is > incorrect. Your assumption is incorrect. The idea of unicode is to have a single character set for representing all languages. UTF-8 is a representation of unicode and is designed to encode any unicode character. See http://www.unicode.org/ and http://www.unicode.org/faq/utf_bom.html. John DeSoi, Ph.D. http://pgedit.com/ Power Tools for PostgreSQL
Thank you, that is the best news I have heard in a while :-) -Mont On Mar 19, 2005, at 6:49 PM, John DeSoi wrote: > > On Mar 18, 2005, at 5:54 PM, Mont Rothstein wrote: > >> I need to store multiple languages in one database (English, Spanish, >> Chinese, Korean, ???). At first I thought I would just us Unicode, >> but then I realized that it is only UTF-8 and it is my understanding >> that UTF-8 is insufficient for Chinese and Korean. I state this >> because I am starting to think that it is possible this assumption is >> incorrect. > > Your assumption is incorrect. The idea of unicode is to have a single > character set for representing all languages. UTF-8 is a > representation of unicode and is designed to encode any unicode > character. > > See http://www.unicode.org/ and > > http://www.unicode.org/faq/utf_bom.html. > > > > John DeSoi, Ph.D. > http://pgedit.com/ > Power Tools for PostgreSQL > > > ---------------------------(end of > broadcast)--------------------------- > TIP 3: if posting/reading through Usenet, please send an appropriate > subscribe-nomail command to majordomo@postgresql.org so that your > message can get through to the mailing list cleanly >