Re: PostgreSQL SQL Tricks: faster urldecode

Поиск
Список
Период
Сортировка
От Marc Mamin
Тема Re: PostgreSQL SQL Tricks: faster urldecode
Дата
Msg-id B6F6FD62F2624C4C9916AC0175D56D880CE1C178@jenmbs01.ad.intershop.net
обсуждение исходный текст
Ответ на Re: PostgreSQL SQL Tricks: faster urldecode  (Merlin Moncure <mmoncure@gmail.com>)
Список pgsql-general
> Von: Merlin Moncure [mmoncure@gmail.com]
> Gesendet: Freitag, 20. September 2013 17:43
>
> >  On Fri, Sep 20, 2013 at 10:26 AM, Marc Mamin <M.Mamin@intershop.de> wrote:
> > Hi,
> > here is a function which is about 8 x faster than the one described in the PostgreSQL SQL Tricks
> > ( http://postgres.cz/wiki/PostgreSQL_SQL_Tricks#Function_for_decoding_of_url_code )
> >
> > The idea is to handle each encoded/not_encoded parts in bulk rather than spliting on each character.
> >
> > urldecode_arr:
> > Seq Scan on lt_referrer  (actual time=1.966..17623.979 rows=65717 loops=1)
> >
> > urldecode:
> > Seq Scan on lt_referrer  (actual time=4.846..144445.292 rows=65717 loops=1)
>
> very nice.  Basically it comes down to this: all non-trivial regex
> replacements require decomposition of the string into an array because
> regexp_replace() is unable to do any kind of transformation on the
> string.  This is a crippling limitation relative to first-class regex
> languages like perl; postgres string translation functions are
> invisible to the regex engine.  I have no idea if this is fixable (I
> dimly recall Tom explaining why it might not be).
>
> merlin

yes, a possible(?) assistance for such problems would be a new variant of regexp_split_to_table
that would return two columns:
- the splitted parts (as currently)
- the separator matches (new)

Marc




В списке pgsql-general по дате отправления:

Предыдущее
От: Jeff Janes
Дата:
Сообщение: Re: Partitioning V schema
Следующее
От: Gregory Haase
Дата:
Сообщение: Re: Partitioning V schema