Re: Slow recursive CTE query questions, with row estimate and n_distinct issues

Поиск
Список
Период
Сортировка
От Michael Lewis
Тема Re: Slow recursive CTE query questions, with row estimate and n_distinct issues
Дата
Msg-id CAHOFxGo2qtDSbBh6xqQ-DZpCRSR=d0sXdU9xAOyTDxH-iPVo4A@mail.gmail.com
обсуждение исходный текст
Ответ на Slow recursive CTE query questions, with row estimate and n_distinct issues  (Christopher Baines <mail@cbaines.net>)
Ответы Re: Slow recursive CTE query questions, with row estimate and n_distinct issues  (Christopher Baines <mail@cbaines.net>)
Список pgsql-performance
On Mon, Dec 28, 2020 at 7:51 AM Christopher Baines <mail@cbaines.net> wrote:
derivation_inputs:
  COUNT(*):  285422539
  reltuples: 285422528

  derivation_id:
    COUNT(DISTINCT): 7508610
    n_distinct:      4336644 (~57% of the true value)

  derivation_output_id:
    COUNT(DISTINCT): 5539406
    n_distinct:      473762 (~8% of the true value)

If you expect the ratio of distinct of derivation_output_id values to be roughly linear going forward, you can set a custom value for n_distinct on the column (currently seems like -.0194, aka distinct count of derivation_output_id divided by reltuples of the table). You could also do this analysis every month or six and set the custom value as needed.


I am not sure if it will resolve your query problems though.

В списке pgsql-performance по дате отправления:

Предыдущее
От: Christopher Baines
Дата:
Сообщение: Slow recursive CTE query questions, with row estimate and n_distinct issues
Следующее
От: Greg Spiegelberg
Дата:
Сообщение: Re: Slow recursive CTE query questions, with row estimate and n_distinct issues