On Mon, 12 Apr 2004, Jeremy Dunn wrote:
> explain analyze select count(*) from xxx where cid=6223341;
> Aggregate (cost=74384.19..74384.19 rows=1 width=0) (actual
> time=11614.89..11614.89 rows=1 loops=1)
> -> Index Scan using xxx_cid on emailrcpts (cost=0.00..74329.26
> rows=21974 width=0) (actual time=35.75..11582.10 rows=20114 loops=1)
> Total runtime: 11615.05 msec
>
> However for the values that have > 20,000 rows, the plan changes to a
> sequential scan, which is proportionately much slower.
>
> explain analyze select count(*) from xxx where cid=7191032;
> Aggregate (cost=97357.61..97357.61 rows=1 width=0) (actual
> time=46427.81..46427.82 rows=1 loops=1)
> -> Seq Scan on xxx (cost=0.00..97230.62 rows=50792 width=0)
> (actual time=9104.45..46370.27 rows=37765 loops=1)
> Total runtime: 46428.00 msec
>
> The question: why does the planner consider a sequential scan to be
> better for these top 10 values? In terms of elapsed time it is more
> than twice as slow as an index scan for a comparable number of rows.
One thing to try is to set enable_seqscan=off, rerun the queries, and
compare the estimated costs with the actual times. It may then be
possible to lower random_page_cost to a still-reasonable value in order
to move the crossover point at which the planner switches to a seqscan.
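A minimal psql session sketching that advice (the cid value is taken from the query above; the random_page_cost setting shown is illustrative and should be tuned against real timings, not copied):

```sql
-- Disable sequential scans for this session only, forcing the index plan
SET enable_seqscan = off;

-- Re-run the problem query; compare the estimated cost with the actual time
EXPLAIN ANALYZE SELECT count(*) FROM xxx WHERE cid = 7191032;

-- If the forced index scan is genuinely cheaper, re-enable seqscans and
-- lower random_page_cost (default 4.0) so the planner prefers the index
-- scan on its own
SET enable_seqscan = on;
SET random_page_cost = 2.0;  -- illustrative value, not a recommendation
EXPLAIN ANALYZE SELECT count(*) FROM xxx WHERE cid = 7191032;
```

Both settings apply only to the current session; once a value of random_page_cost gives sensible plans across representative queries, it can be made permanent in postgresql.conf.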