Re: Explicit deterministic COLLATE fails with pattern matching operations on column with non-deterministic collation

Поиск
Список
Период
Сортировка
От Tom Lane
Тема Re: Explicit deterministic COLLATE fails with pattern matching operations on column with non-deterministic collation
Дата
Msg-id 7417.1590690568@sss.pgh.pa.us
обсуждение исходный текст
Ответ на Re: Explicit deterministic COLLATE fails with pattern matchingoperations on column with non-deterministic collation  (James Lucas <jlucasdba@gmail.com>)
Ответы Re: Explicit deterministic COLLATE fails with pattern matchingoperations on column with non-deterministic collation
Список pgsql-bugs
James Lucas <jlucasdba@gmail.com> writes:
> I tried setting up a pathological test case for this, and it seems
> like at least currently, even with a non-deterministic collation
> statistics still count values as distinct, even if the default
> collation would consider them equivalent.  Not sure if that's as
> intended or not?

I experimented with this, and what I'm seeing is that ucol_strcollUTF8()
reports that 'aaa' is different from 'aAa'.  So the behavior on the
Postgres side is as-expected.  I suspect that the 'en-US-ks-level2'
ICU locale doesn't act as you think it does.  (That is, just saying
that a collation is nondeterministic doesn't make it so; it only forces
Postgres through slower code paths that allow for the possibility of
bitwise-unequal strings being reported as equal by ICU.)  Not knowing
anything about ICU, I can't say more than that.

[ Tested on libicu-60.3-2.el8_1 ]

            regards, tom lane



В списке pgsql-bugs по дате отправления:

Предыдущее
От: James Lucas
Дата:
Сообщение: Re: Explicit deterministic COLLATE fails with pattern matchingoperations on column with non-deterministic collation
Следующее
От: "Daniel Verite"
Дата:
Сообщение: Re: Explicit deterministic COLLATE fails with pattern matchingoperations on column with non-deterministic collation