Re: Should we optimize the `ORDER BY random() LIMIT x` case?
From: Aleksander Alekseev
Subject: Re: Should we optimize the `ORDER BY random() LIMIT x` case?
Msg-id: CAJ7c6TOgEX02tofvoMaZHAO4E5pe=pZs2VNHa9m5aEpM9mwviw@mail.gmail.com
In reply to: Re: Should we optimize the `ORDER BY random() LIMIT x` case? (Andrei Lepikhov <lepihov@gmail.com>)
List: pgsql-hackers
Andrei,

> > ```
> > -- imagine replacing inefficient array_sample(array_agg(t), 10)
> > -- with more efficient array_sample_reservoir(t, 10)
> > SELECT (unnest(agg)).* AS k FROM
> >   ( SELECT array_sample(array_agg(t), 10) AS agg FROM (
> >     ... here goes the subquery ...
> >   ) AS t
> > );
> > ```
> >
> > ... if only we supported such a column expansion for not registered
> > records. Currently such a query fails with:
> >
> > ```
> > ERROR:  record type has not been registered
> > ```
>
> I know about this issue. Having resolved it in a limited number of local
> cases (like FDW push-down of row types), I still do not have a universal
> solution worth proposing upstream. Do you have any public implementation
> of array_sample_reservoir to play with?

array_sample_reservoir() is purely a figment of my imagination at the
moment. Semantically it does the same as array_sample(array_agg(t), N),
except that array_sample(..., N) requires the array to have at least N
items. You can experiment with array_sample(array_agg(...), N) as long as
the subquery returns many more than N rows.

--
Best regards,
Aleksander Alekseev
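[Editor's note: the semantics implied for the hypothetical array_sample_reservoir() above match classic reservoir sampling (Vitter's Algorithm R): draw a uniform sample of N items in one pass without materializing the whole input, returning fewer than N items only when the input itself is shorter. A minimal Python sketch of that algorithm, for illustration only; the function name and behavior here are an assumption, not PostgreSQL code:]

```python
import random

def reservoir_sample(items, k, rng=None):
    """One-pass uniform sample of up to k items (Algorithm R).

    Uses O(k) memory; returns all items when the input has
    fewer than k elements, mirroring the relaxed requirement
    described for the imagined array_sample_reservoir().
    """
    rng = rng or random.Random()
    reservoir = []
    for i, item in enumerate(items):
        if i < k:
            reservoir.append(item)        # fill the reservoir first
        else:
            j = rng.randrange(i + 1)      # j uniform over [0, i]
            if j < k:
                reservoir[j] = item       # keep item with prob. k/(i+1)
    return reservoir
```

Unlike array_sample(array_agg(t), N), which must aggregate every row into an array before sampling, this approach never holds more than k items at once, which is the efficiency gain the quoted query comment alludes to.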