Unicode normalization SQL functions

From	Peter Eisentraut
Subject	Unicode normalization SQL functions
Date	12 December 2019, 14:46:21
Msg-id	c1909f27-c269-2ed9-12f8-3ab72c8caf7a@2ndquadrant.com
Responses	Re: Unicode normalization SQL functions ("Daniel Verite" <daniel@manitou-mail.org>)
List	pgsql-hackers

Here are patches to add support for Unicode normalization to SQL, per 
the SQL standard:

     normalize($string [, form])
     $string is [form] normalized

(comment about silly SQL syntax here)

We already have all the infrastructure for Unicode normalization for the 
SASLprep functionality.  The first patch extends the internal APIs to 
support all four normal forms instead of only NFKC used by SASLprep. 
The second patch adds the SQL layer on top of it.
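
For readers less familiar with the four normal forms (NFC, NFD, NFKC, 
NFKD), here is a minimal sketch of how they differ.  It uses Python's 
standard unicodedata module purely as an illustration; it is not the 
patch's internal C API, and the helper name normalize_all is made up 
for this example.

```python
import unicodedata

FORMS = ("NFC", "NFD", "NFKC", "NFKD")

def normalize_all(s):
    """Return a dict mapping each normal form name to the normalized string."""
    return {form: unicodedata.normalize(form, s) for form in FORMS}

# "e" followed by U+0301 COMBINING ACUTE ACCENT (a decomposed "é").
accent = normalize_all("e\u0301")

# U+FB01 LATIN SMALL LIGATURE FI has only a compatibility decomposition,
# so the canonical forms (NFC, NFD) leave it alone and the compatibility
# forms (NFKC, NFKD) replace it with the two letters "fi".
fi = normalize_all("\ufb01")

for form in FORMS:
    # Print code points so that combining characters stay visible.
    print(form,
          [hex(ord(c)) for c in accent[form]],
          [hex(ord(c)) for c in fi[form]])
```

The composed forms (NFC, NFKC) turn the accent pair into the single 
code point U+00E9, while the decomposed forms (NFD, NFKD) keep it as 
two code points.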

This could be used to preprocess or check strings before using them with 
deterministic collations or locale implementations that don't deal with 
non-NFC data correctly, perhaps using triggers, generated columns, or 
domains.  The NFKC and NFKD normalizations could also be used for 
general data cleaning, similar to what SASLprep does.
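
To illustrate why such preprocessing can matter, the sketch below 
shows two canonically equivalent spellings of the same word that 
differ byte for byte, and that compare equal once both are converted 
to NFC.  It is written in Python with the standard unicodedata module 
as a stand-in for the proposed SQL functions; the helper name 
nfc_equal and the sample strings are assumptions made for this 
example.

```python
import unicodedata

def nfc_equal(a, b):
    """Compare two strings after converting both to NFC."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)

precomposed = "caf\u00e9"   # "café" using the single code point U+00E9
combining = "cafe\u0301"    # "café" using "e" plus U+0301 combining accent

# A bytewise comparison sees two different strings.
bytewise_equal = precomposed.encode("utf-8") == combining.encode("utf-8")

# After normalization they are the same string.
normalized_equal = nfc_equal(precomposed, combining)

# NFKC as a data-cleaning step: fullwidth letters fold to plain ASCII.
cleaned = unicodedata.normalize("NFKC", "\uff21\uff22\uff23")

print(bytewise_equal, normalized_equal, cleaned)
```

In a database, the same idea would presumably be applied once at 
write time (for example in a trigger or a domain check, as suggested 
above) rather than on every comparison.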

As a future idea, I think we could also hook Unicode normalization into 
the protocol-level encoding conversion.

Also, there is a way to optimize the "is normalized" test for common 
cases, described in UTR #15.  For that we'll need an additional data 
file from Unicode.  In order to simplify that, I would like my patch 
"Add support for automatically updating Unicode derived files" 
integrated first.
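
As a rough illustration of what a fast path for the "is normalized" 
test can look like, here is a simplified sketch in Python.  Note that 
this is not the quick-check algorithm from UTR #15, which relies on 
per-code-point properties from the additional Unicode data file; the 
sketch only uses the fact that pure ASCII text is unchanged by all 
four normal forms, and otherwise falls back to normalizing and 
comparing.  The function name is_normalized is chosen for this 
example.

```python
import unicodedata

def is_normalized(s, form="NFC"):
    """Return True if s is already in the given normal form."""
    # Fast path: ASCII-only text is unaffected by NFC, NFD, NFKC and NFKD,
    # so no normalization pass is needed.
    if all(ord(c) < 128 for c in s):
        return True
    # Slow path: normalize the whole string and compare with the input.
    return unicodedata.normalize(form, s) == s

print(is_normalized("hello"))           # ASCII only, takes the fast path
print(is_normalized("e\u0301"))         # decomposed "é", checked against NFC
print(is_normalized("e\u0301", "NFD"))  # same string, checked against NFD
```

A real quick check would also avoid the slow path for most non-ASCII 
input, which is where the additional data file comes in.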

-- 
Peter Eisentraut              http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services
