Re: unaccent extension missing some accents

Поиск

Список

Период

Сортировка

От	Bruce Momjian
Тема	Re: unaccent extension missing some accents
Дата	10 ноября 2011 г. 20:15:47
Msg-id	201111102115.pAALFYx26956@momjian.us обсуждение исходный текст
Ответ на	Re: unaccent extension missing some accents (Tom Lane <tgl@sss.pgh.pa.us>)
Ответы	Re: unaccent extension missing some accents (Tom Lane <tgl@sss.pgh.pa.us>)
Список	pgsql-hackers

Дерево обсуждения

Tom Lane wrote:
> J Smith <dark.panda+lists@gmail.com> writes:
> > I've attached a patch against master for unaccent.c that uses swscanf
> > along with char2wchar and wchar2char instead of sscanf directly to
> > initialize the unaccent extension and it appears to fix the problem in
> > both the master and 9.1 branches.
> 
> swscanf doesn't seem like an acceptable approach: it's a function that
> is relied on nowhere else in PG, so it adds new portability risks of its
> own.  It doesn't exist on some platforms that we support (like the one
> I'm typing this message on) and there's no real good reason to assume
> that it's not broken in its own ways on others.
> 
> If you really want to pursue this, I'd suggest parsing the line
> manually, perhaps via strchr searches for \t and \n.  It likely wouldn't
> be very many more lines than what you've got here.
> 
> However, the bigger picture is that OS X's UTF8 locales are broken
> through-and-through, and most of their other problems are not feasible
> to work around.  So basically you can't use them for anything
> interesting, and it's not clear that it's worth putting any time into
> solving individual problems.  In the particular case here, the issue
> presumably is that sscanf is relying on isspace() ... but we rely on
> isspace() directly, in quite a lot of places, so how much is it going
> to fix to dodge it right here?

If Apple's low-level code came from FreeBSD and NetBSD, how did they get
so broken?

--  Bruce Momjian  <bruce@momjian.us>        http://momjian.us EnterpriseDB
http://enterprisedb.com
 + It's impossible for everything to be true. +

В списке pgsql-hackers по дате отправления:

Предыдущее

От: Bruce Momjian
Дата: 10 ноября 2011 г., 20:10:59
Сообщение: Re: const correctness

Следующее

От: Tom Lane
Дата: 10 ноября 2011 г., 20:19:00
Сообщение: Re: unaccent extension missing some accents

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: unaccent extension missing some accents

Предыдущее

Следующее