Re: Question regarding UTF-8 data and "C" collation on definition of field of table
| From | Tom Lane |
|---|---|
| Subject | Re: Question regarding UTF-8 data and "C" collation on definition of field of table |
| Date | |
| Msg-id | 2556580.1675642741@sss.pgh.pa.us |
| In response to | Re: Question regarding UTF-8 data and "C" collation on definition of field of table (Dionisis Kontominas <dkontominas@gmail.com>) |
| Responses | Re: Question regarding UTF-8 data and "C" collation on definition of field of table; Re: Question regarding UTF-8 data and "C" collation on definition of field of table |
| List | pgsql-general |
Dionisis Kontominas <dkontominas@gmail.com> writes:
> I suppose that affects the outcome of ORDER BY clauses on the field,
> along with the content of the indexes. Is this right?

Yeah.

> Assuming that the requirement exists, to store UTF-8 characters on a
> field that can be from multiple languages, and the database default
> encoding is UTF8 which is the right thing I suppose (please verify), what
> do you think should be the values of the Collation and Ctype for the
> database to behave correctly?

Um ... so define "correct". If you have a mishmash of languages in the
same column, it's likely that they have conflicting rules about sorting,
and there may be no ordering that's not surprising to somebody.

If there's a predominant language in the data, selecting a collation
matching that seems like your best bet. Otherwise, maybe you should
just shrug your shoulders and stick with C collation. It's likely
to be faster than any alternative.

regards, tom lane
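
To make the trade-off above concrete, here is a minimal Python sketch of what ordering under C collation looks like for UTF-8 text. It rests on the assumption that C collation compares the raw bytes of each string, which for a UTF8-encoded database is the same as ordering by code point. The function name `c_collation_sort` and the sample words are hypothetical, chosen only for illustration; this is not PostgreSQL code.

```python
# Sketch: emulate the ordering produced by "C" collation on UTF-8 text.
# Assumption: "C" collation compares strings byte by byte, with no
# language-specific rules for case or accents.

def c_collation_sort(values):
    """Return the strings sorted by their UTF-8 byte sequences."""
    return sorted(values, key=lambda s: s.encode("utf-8"))

# Hypothetical sample data mixing English, French, and Greek.
words = ["banana", "Zebra", "éclair", "apple", "ωμέγα", "Άλφα"]

print(c_collation_sort(words))
# ['Zebra', 'apple', 'banana', 'éclair', 'Άλφα', 'ωμέγα']
```

Uppercase ASCII sorts before lowercase, and accented or non-Latin characters sort after all ASCII letters. The result is stable and cheap to compute, but it is not what a speaker of any one of these languages would expect from a dictionary ordering, which is the sense in which no single ordering is likely to be unsurprising to everyone.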