Re: A thought about regex versus multibyte character sets

From Alvaro Herrera
Subject Re: A thought about regex versus multibyte character sets
Date
Msg-id 20091201214611.GH5013@alvh.no-ip.org
In response to A thought about regex versus multibyte character sets  (Tom Lane <tgl@sss.pgh.pa.us>)
Responses Re: A thought about regex versus multibyte character sets  (Tom Lane <tgl@sss.pgh.pa.us>)
List pgsql-hackers
Tom Lane wrote:

> I just spent a bit of time considering what we might do to fix this.
> The idea mentioned in the above thread was to switch over to using
> wchar_t in the regex code, but that seems to have a number of problems.
> One showstopper is that on some platforms wchar_t is only 16 bits and
> can't represent the full range of Unicode characters.  I don't want to
> fix case-folding only to break regexes for other uses.

We have a TODO item about having a regex-specific data type.  Would
implementing that solve this problem?

-- 
Alvaro Herrera                                http://www.CommandPrompt.com/
PostgreSQL Replication, Consulting, Custom Development, 24x7 support

