Обсуждение: Differences in Unicode handling on Mac vs Linux?
Howdy, I loaded a client's DB on my Mac to debug an unrelated bug, but I'm blocked because my Mac is rejecting SQL that works on our Linux production servers. Here's a simple case: # select * from shots where sg_poznÁmka is NULL; ERROR: column "sg_pozn�mka" does not exist LINE 1: select * from shots where sg_poznÁmka is NULL; ... as far as I can tell, all my encodings are consistent on both sides, I've checked LC_COLLATE, LC_CTYPE, client_encoding, server_encoding and the database encodings. I'm running 9.0.13 on both machines (and I tried 9.2.4 on my Mac). Anything else I could double-check? Or are there any known Mac-related Unicode issues? Thanks! Matt
Matt Daw <matt@shotgunsoftware.com> writes:
> Howdy, I loaded a client's DB on my Mac to debug an unrelated bug, but
> I'm blocked because my Mac is rejecting SQL that works on our Linux
> production servers. Here's a simple case:
> # select * from shots where sg_poznÁmka is NULL;
> ERROR: column "sg_pozn�mka" does not exist
> LINE 1: select * from shots where sg_poznÁmka is NULL;
Hm ... what does "\d shots" say about the spelling of the column name?
> Anything else I could double-check? Or are there any known Mac-related
> Unicode issues?
OS X's Unicode locales are pretty crummy. I'm suspicious that there's
some sort of case-folding inconsistency here, but it's hard to say more
(especially since you didn't actually tell us *which* locales you've
selected on each machine). If it is that, as a short-term fix it might
help to double-quote the column name.
regards, tom lane
> Hm ... what does "\d shots" say about the spelling of the column name?
\d shots is the same on both systems:
sg_poznÁmka | text
|
> OS X's Unicode locales are pretty crummy. I'm suspicious that there's
> some sort of case-folding inconsistency here, but it's hard to say more
> (especially since you didn't actually tell us *which* locales you've
> selected on each machine). If it is that, as a short-term fix it might
> help to double-quote the column name.
The locales are set to "en_US.UTF-8" and encodings to "UTF8". Double
quoting does solve the column case, but it's not helping with the
Rails generated:
SELECT a.attname, format_type(a.atttypid, a.atttypmod), d.adsrc, a.attnotnull
FROM pg_attribute a LEFT JOIN pg_attrdef d
ON a.attrelid = d.adrelid AND a.attnum = d.adnum
WHERE a.attrelid =
'asset_sg_kdo_dělá____assigned_to__connections'::regclass
AND a.attnum > 0 AND NOT a.attisdropped
ORDER BY a.attnum
... that produces:
ERROR: relation "asset_sg_kdo_d�l�____assigned_to__connections" does not exist
\d produces:
public | asset_sg_kdo_dělá____assigned_to__connections
| table | matt
For the short term, I think I'll boot up a Linux VM to troubleshoot my
production bug... but I'll submit a bug report with repro steps.
Thanks Tom!
Matt
2013/6/3 Tom Lane <tgl@sss.pgh.pa.us>: > Matt Daw <matt@shotgunsoftware.com> writes: >> Howdy, I loaded a client's DB on my Mac to debug an unrelated bug, but >> I'm blocked because my Mac is rejecting SQL that works on our Linux >> production servers. Here's a simple case: > >> # select * from shots where sg_poznÁmka is NULL; >> ERROR: column "sg_pozn�mka" does not exist >> LINE 1: select * from shots where sg_poznÁmka is NULL; > > Hm ... what does "\d shots" say about the spelling of the column name? > >> Anything else I could double-check? Or are there any known Mac-related >> Unicode issues? > > OS X's Unicode locales are pretty crummy. I'm suspicious that there's > some sort of case-folding inconsistency here, but it's hard to say more > (especially since you didn't actually tell us *which* locales you've > selected on each machine). If it is that, as a short-term fix it might > help to double-quote the column name. I can recreate something similar (OS X 10.7, 9.3beta1): postgres=# CREATE TABLE shots (id int); CREATE TABLE postgres=# SHOW client_encoding ; client_encoding ----------------- UTF8 (1 row) postgres=# select * from shots where col_ä is NULL; ERROR: column "col_�" does not exist LINE 1: select * from shots where col_ä is NULL; The corresponding log output is: ERROR: column "col_<E3><A4>" does not exist at character 27 STATEMENT: select * from shots where col_ä is NULL; Double-quoting the column name does seem to "work": postgres=# select * from shots where "col_ä" is NULL; ERROR: column "col_ä" does not exist LINE 1: select * from shots where "col_ä" is NULL; The only language/locale settings I see in my environment are: LANG=en_GB.UTF-8 __CF_USER_TEXT_ENCODING=0x1F6:0:2 Regards Ian Barwick