Re: Question regarding UTF-8 data and "C" collation on definition of field of table

Поиск
Список
Период
Сортировка
От Tom Lane
Тема Re: Question regarding UTF-8 data and "C" collation on definition of field of table
Дата
Msg-id 2556580.1675642741@sss.pgh.pa.us
обсуждение исходный текст
Ответ на Re: Question regarding UTF-8 data and "C" collation on definition of field of table  (Dionisis Kontominas <dkontominas@gmail.com>)
Ответы Re: Question regarding UTF-8 data and "C" collation on definition of field of table  (Dionisis Kontominas <dkontominas@gmail.com>)
Re: Question regarding UTF-8 data and "C" collation on definition of field of table  (Peter Geoghegan <pg@bowt.ie>)
Список pgsql-general
Dionisis Kontominas <dkontominas@gmail.com> writes:
>    I suppose that affects the outcome of ORDER BY clauses on the field,
> along with the content of the indexes. Is this right?

Yeah.

>    Assuming that the requirement exists, to store UTF-8 characters on a
> field that can be from multiple languages, and the database default
> encoding is UTF8 which is the right thing I suppose (please verify), what
> do you think should be the values of the Collation and Ctype for the
> database to behave correctly?

Um ... so define "correct".  If you have a mishmash of languages in the
same column, it's likely that they have conflicting rules about sorting,
and there may be no ordering that's not surprising to somebody.

If there's a predominant language in the data, selecting a collation
matching that seems like your best bet.  Otherwise, maybe you should
just shrug your shoulders and stick with C collation.  It's likely
to be faster than any alternative.

            regards, tom lane



В списке pgsql-general по дате отправления:

Предыдущее
От: Dionisis Kontominas
Дата:
Сообщение: Re: Question regarding UTF-8 data and "C" collation on definition of field of table
Следующее
От: Dionisis Kontominas
Дата:
Сообщение: Re: Question regarding UTF-8 data and "C" collation on definition of field of table