Re: query question

Поиск
Список
Период
Сортировка
От Josh Berkus
Тема Re: query question
Дата
Msg-id 200212061638.18529.josh@agliodbs.com
обсуждение исходный текст
Ответ на query question  (Laurette Cisneros <laurette@nextbus.com>)
Список pgsql-performance
Laurette,

> This query:
> select distinct x, y
>   from table1 t
>   join table2 t2
>  using (col1)
> order by x;
>
> is *slower* than this query:
>
> select disting x, y
>   from table1
>  where col1 = (select col1 from table2)
> ORDER BY x;
>
> Is this because in the latter case the select col1 is cached?

Yes.   For all of the following structures:

where x = (select col from table)
where x IN (select col from table)
where x NOT IN (select col from table)
where x != ANY(select col from table)
etc.,

... Postgres must process the full subquery, return the results, and compare
all of the results as individual values against the reference column.

However, if you re-wrote the query as:

 select distint x, y
   from table1
  where EXISTS (select col1 from table2
    where table2.col1 = table1.col1)
 ORDER BY x;

... then Postgres would be able to use JOIN optimizations to evaluate the
subquery and pull a subset of relevant records or even use an index, making
the query *much* faster.

> Ooo, I would love to have a web page full of these tidbits (along with how
> to get around the max and min aggregates and why as an example..., etc.)!

Um:

http://techdocs.postgresql.org/guides/

Add your own Wiki page!


--
-Josh Berkus

______AGLIO DATABASE SOLUTIONS___________________________
                                        Josh Berkus
   Complete information technology     josh@agliodbs.com
    and data management solutions     (415) 565-7293
   for law firms, small businesses      fax 621-2533
    and non-profit organizations.     San Francisco


В списке pgsql-performance по дате отправления:

Предыдущее
От: Laurette Cisneros
Дата:
Сообщение: query question
Следующее
От: Josh Berkus
Дата:
Сообщение: Re: Speeding up aggregates