Unaccent characters

Поиск

Список

Период

Сортировка

От	Thom Brown
Тема	Unaccent characters
Дата	20 апреля 2012 г. 08:16:39
Msg-id	CAA-aLv5YxXQ9JHSRBsT5kNB5Yn6_v8RXbt+HHNuYdsKm=dNf2g@mail.gmail.com обсуждение исходный текст
Ответы	Re: Unaccent characters
Список	pgsql-general

Дерево обсуждения

Hi,

I had a look at the unaccent.rules file and noticed the following
characters aren't properly converted:

ß (U+00DF)  An eszett represents a double-s "SS" but this replaces it
with one "S".  Shouldn't this be replace with "SS"?

Æ (U+00C6) and æ (U+00E6) These doesn't have an accent, diacritic or
anything added to a single latin character.  It's simply a ligature of
"A" and "E" or "a" and "e".  If someone has the text "æther", I would
imagine they'd be surprised at it being converted to "ather" instead
of "aether".

Œ (U+0152) and œ (U+0153). Same as above.  This is a ligature of "O"
and "E" or "o" and "e".  Except this time the unaccent module chooses
the 2nd character instead of the 1st which is confusing.

If these were properly converted it would change the length of the
text, so I'm wondering if that's the reason for not properly
converting them.  Could someone elaborate?

--
Thom

В списке pgsql-general по дате отправления:

Предыдущее

От: Michael Paquier
Дата: 20 апреля 2012 г., 04:01:20
Сообщение: Re: Compile docs on ArchLinux

Следующее

От: Condor
Дата: 20 апреля 2012 г., 08:39:45
Сообщение: Problem with reading data from standby server?

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Unaccent characters

Предыдущее

Следующее