Re: BUG #13440: unaccent does not remove all diacritics

From	Léonard Benedetti
Subject	Re: BUG #13440: unaccent does not remove all diacritics
Date	10 March 2016, 14:44:23
Msg-id	56E18841.2000406@mlpo.fr
In reply to	Re: BUG #13440: unaccent does not remove all diacritics (Léonard Benedetti <benedetti@mlpo.fr>)
Responses	Re: BUG #13440: unaccent does not remove all diacritics
List	pgsql-bugs

10/03/2016 15:35, Léonard Benedetti wrote:
> On 10/03/2016 14:46, Teodor Sigaev wrote:
>>> On the other hand, UTF-8 encoding for source code is *a feature of
>>> Python 3* (to quote the documentation: “The default encoding for Python
>>> source code is UTF-8”) so there is no possible ambiguity, and it will
>>> not be a problem. That said, some non-ASCII characters may be removed
>>> without prejudice from the source code of the script (I am thinking in
>>> particular of "“" and "”"). Nevertheless, for some comments, it would be
>>> unfortunate (e.g. “# RegEx to parse rules (e.g. “Đ → D ; […]”)” or “# ℃
>>> °C”).
>> Ok, I didn't know that.
>>
>>
>>> Thus, I propose to adapt the code to Python 3 (the encoding of the
>>> script does not seem to be a problem for the above reasons). I will try to
>>> do it shortly.
>> We are waiting...
>>
> Sorry for the delay, adaptation to Python 3 was very easy (the code is
> almost identical).
>
> As usual, you will find attached the new version of the script and the
> generated output for convenience.
>
> Léonard Benedetti
Here is the patch, attached.

Léonard Benedetti
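
For readers without the attachment, here is a minimal sketch of the point made above about non-ASCII characters in Python 3 source. It is not the attached script: the names RULE_RE, parse_rule and strip_combining_marks are hypothetical, and the rule format is assumed from the "Đ → D ;" comment quoted earlier. Because Python 3 reads source files as UTF-8 by default, the "→" in the pattern needs no encoding declaration.

```python
import re
import unicodedata

# Hypothetical pattern for a rule written as "Đ → D ;" (format assumed
# from the comment quoted in the thread, not taken from the real script).
RULE_RE = re.compile(r"^(?P<src>\S+)\s*→\s*(?P<dst>\S+)\s*;")


def parse_rule(line):
    """Return (source, target) for a rule line, or None if it does not match."""
    match = RULE_RE.match(line.strip())
    if match is None:
        return None
    return match.group("src"), match.group("dst")


def strip_combining_marks(text):
    """Decompose text (NFD) and drop combining marks (category Mn)."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")


if __name__ == "__main__":
    print(parse_rule("Đ → D ; […]"))
    # "é" decomposes into "e" plus a combining accent, so the accent is dropped.
    print(strip_combining_marks("\u00e9"))
    # "Đ" has no canonical decomposition, so it comes back unchanged.
    print(strip_combining_marks("Đ"))
```

The last call suggests why explicit rules such as "Đ → D" are useful: characters like "Đ" carry no canonical decomposition, so decomposition alone leaves them unchanged.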

Attachments

improve-unaccent-default-rules-generation-script-v3.patch
