Re: Netflix Prize data

Поиск
Список
Период
Сортировка
От Heikki Linnakangas
Тема Re: Netflix Prize data
Дата
Msg-id 4524C3B5.8030206@enterprisedb.com
обсуждение исходный текст
Ответ на Netflix Prize data  ("Mark Woodward" <pgsql@mohawksoft.com>)
Список pgsql-hackers
Mark Woodward wrote:
> 
> I tried to cluster the data along a particular index but had to cancel it
> after 3 hours.

If the data is in random order, it's faster to do

SELECT * INTO foo_sorted FROM foo ORDER BY bar

then CREATE INDEX, than to run CLUSTER.

That's because CLUSTER does a full index scan of the table, which is 
slower than a seqscan + sort if the table is not already clustered.

--   Heikki Linnakangas  EnterpriseDB   http://www.enterprisedb.com


В списке pgsql-hackers по дате отправления:

Предыдущее
От: David Fetter
Дата:
Сообщение: Re: Digging gram.y
Следующее
От: Zdenek Kotala
Дата:
Сообщение: Re: workaround for buggy strtod is not necessary