Re: tricky query

Поиск
Список
Период
Сортировка
От John A Meinel
Тема Re: tricky query
Дата
Msg-id 42C17A14.3040100@arbash-meinel.com
обсуждение исходный текст
Ответ на Re: tricky query  ("Merlin Moncure" <merlin.moncure@rcsonline.com>)
Список pgsql-performance
Merlin Moncure wrote:

>John Meinel wrote:
>
>
>>See my follow up post, which enables an index scan. On my system with
>>90k rows, it takes no apparent time.
>>(0.000ms)
>>John
>>=:->
>>
>>
>
>Confirmed.  Hats off to you, the above some really wicked querying.
>IIRC I posted the same question several months ago with no response and
>had given up on it.  I think your solution (smallest X1 not in X) is a
>good candidate for general bits, so I'm passing this to varlena for
>review :)
>
>SELECT t1.id+1 as id_new FROM id_test t1
>    WHERE NOT EXISTS
>        (SELECT t2.id FROM id_test t2 WHERE t2.id = t1.id+1)
>    ORDER BY t1.id LIMIT 1;
>
>Merlin
>
>
Just be aware that as your table fills it's holes, this query gets
slower and slower.
I've been doing some testing. And it starts at 0.00 when the first entry
is something like 3, but when you start getting to 16k it starts taking
more like 200 ms.

So it kind of depends how your table fills (and empties I suppose).

The earlier query was slower overall (since it took 460ms to read in the
whole table).
I filled up the table such that 63713 is the first empty space, and it
takes 969ms to run.
So actually if your table is mostly full, the first form is better.

But if you are going to have 100k rows, with basically random
distribution of empties, then the NOT EXISTS works quite well.

Just be aware of the tradeoff. I'm pretty sure the WHERE NOT EXISTS will
always use a looping structure, and go through the index in order.

John
=:->


Вложения

В списке pgsql-performance по дате отправления:

Предыдущее
От: "Karl O. Pinc"
Дата:
Сообщение: Re: Poor index choice -- multiple indexes of the same
Следующее
От: Tom Lane
Дата:
Сообщение: Re: Too slow querying a table of 15 million records