Re: 7.3.2 incorrectly counts characters for unicode varchar field
От | Matthew Cooper |
---|---|
Тема | Re: 7.3.2 incorrectly counts characters for unicode varchar field |
Дата | |
Msg-id | 002f01c37bae$92bcf990$6600030a@gateway01 обсуждение исходный текст |
Ответ на | 7.3.2 incorrectly counts characters for unicode varchar field ("Matthew Cooper" <matty@cloverworxs.com>) |
Список | pgsql-bugs |
Doh! It looks like its time to eat humble pie. It turns out that the guy here who has 7.3.4 and helped me to reproduce the problem did not follow our own installation instructions (that he recently re-worded!) as follows: "createdb -E UNICODE -U DB_USER -P DB_PASSWORD DB_NAME" and did not set the encoding. I, like a good boy, did on my 7.2 installation. The guys I am trying to debug the problem for are in another location and are using 7.3.4 too. Hence I narrowed it down to a version problem. I am asking them to check the encoding on their database too and will post back with huge apologies and thanks for your time when they inevitably confirm that the encoding is SQL_ANSI. Thanks, Matty. ----- Original Message ----- From: "Matthew Cooper" <matty@cloverworxs.com> To: "Tom Lane" <tgl@sss.pgh.pa.us> Cc: <pgsql-bugs@postgresql.org> Sent: Monday, September 15, 2003 9:50 AM Subject: Re: [BUGS] 7.3.2 incorrectly counts characters for unicode varchar field > Attached is the UTF-8 encoded sql file in case it got messed up in the mail > transfer. > > And here it is pasted in directly from the window that was displaying > chinese characters. > > insert into mgc values ('åéç»ä¹ åéç»ä¹ ç»ä¹ '); > > > Looking at the UTF-8 documentation, 10 chinese characters could be any > number of bytes, each character being say 2 or 3 characters. > > Matty. > ----- Original Message ----- > From: "Tom Lane" <tgl@sss.pgh.pa.us> > To: "Matthew Cooper" <matty@cloverworxs.com> > Cc: <pgsql-bugs@postgresql.org> > Sent: Saturday, September 13, 2003 5:51 PM > Subject: Re: [BUGS] 7.3.2 incorrectly counts characters for unicode varchar > field > > > > > insert into mgc values ('ÃÂ¥Ã?â? éâ?Tà ¸Ã§Ã»Ã'äù ÃÂ¥Ã?â? > éâ?Tà ¸Ã§Ã»Ã'äù çûÃ'äù '); > > > > I don't think this string is correctly unicode-encoded. Anyway "length" > > claims it is 30 characters. > > > > regards, tom lane > > >
В списке pgsql-bugs по дате отправления: