Re: 10x rowcount mis-estimation favouring merge over nestloop

Поиск
Список
Период
Сортировка
От Tom Lane
Тема Re: 10x rowcount mis-estimation favouring merge over nestloop
Дата
Msg-id 16547.1163139324@sss.pgh.pa.us
обсуждение исходный текст
Ответ на 10x rowcount mis-estimation favouring merge over nestloop  (Abhijit Menon-Sen <ams@oryx.com>)
Ответы Re: 10x rowcount mis-estimation favouring merge over nestloop  (Abhijit Menon-Sen <ams@oryx.com>)
Список pgsql-performance
Abhijit Menon-Sen <ams@oryx.com> writes:
> The header_fields table contains 13.5M rows, of which only ~250K match
> the where condition. I created an index like this:
>     create index hffpv on header_fields(field)
>     where field<=12 and (part!='' or value ilike '%,%')

> Note the 2M estimated rowcount in the bitmap index scan on header_fields
> vs. the actual number (264180).

I think this is basically a lack-of-column-correlation-stats problem.
The planner is estimating this on the basis of the overall selectivity
of the "field<=12" condition, but it seems that "field<=12" is true for
a much smaller fraction of the rows satisfying (part!='' or value ilike '%,%')
than for the general population of rows in the header_fields table.

There's been some speculation about obtaining stats on partial indexes
as a substitute for solving the general problem of correlation stats,
but I for one don't have a very clear understanding of how it'd work.

            regards, tom lane

В списке pgsql-performance по дате отправления:

Предыдущее
От: Abhijit Menon-Sen
Дата:
Сообщение: 10x rowcount mis-estimation favouring merge over nestloop
Следующее
От: Abhijit Menon-Sen
Дата:
Сообщение: Re: 10x rowcount mis-estimation favouring merge over nestloop