On Fri, 02 Apr 2004 18:06:12 -0500, Tom Lane <tgl@sss.pgh.pa.us> wrote:
>You should not need to use the Vitter algorithm for the block-level
>selection, since you can know the number of blocks in the table in
>advance. You can just use the traditional method of choosing each block
>or not with probability (k/K), where k = number of sample blocks still
>needed, K = number of blocks from here to the end.
Sounds reasonable. I have to play around a bit more to get a feeling
for where the Vitter method becomes more efficient.
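For the archives, here is a minimal sketch of that traditional method
(it is Knuth's Algorithm S); sample_blocks() and the demo driver are
invented for illustration, not taken from any patch:

    #include <stdio.h>
    #include <stdlib.h>

    /*
     * Knuth's Algorithm S (selection sampling): choose exactly n of
     * the N blocks 0..N-1, in order, by accepting each block with
     * probability k/K, where k = sample blocks still needed and
     * K = blocks from here to the end.
     */
    static void
    sample_blocks(int N, int n, int *out)
    {
        int     k = n;          /* sample blocks still needed */
        int     K = N;          /* blocks from here to the end */
        int     blkno;

        for (blkno = 0; blkno < N && k > 0; blkno++, K--)
        {
            /* u uniform in [0,1): accept iff u < k/K */
            if ((double) rand() / ((double) RAND_MAX + 1.0) * K < k)
            {
                out[n - k] = blkno;
                k--;
            }
        }
    }

    int
    main(void)
    {
        int     sample[10];
        int     i;

        srand(42);
        sample_blocks(1000, 10, sample);
        for (i = 0; i < 10; i++)
            printf("%d\n", sample[i]);
        return 0;
    }

The invariant is that at each step exactly k of the remaining K blocks
still have to be chosen, so the loop always ends with exactly n blocks
selected, and only one pass over the block numbers is needed.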
> You'd run the Vitter
>algorithm separately to decide whether to keep or discard each live row
>you find in the blocks you read.
You mean once a block is sampled we inspect it in any case? This was
not the way I had planned to do it, but I'll keep this idea in mind.
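In case it helps the discussion, here is the keep/discard part in its
simplest form (Algorithm R); Vitter's Algorithm Z only adds a way to
compute how many incoming rows to skip, so the random number generator
is consulted less often, but the decision logic is the same idea.
reservoir_add() and the demo stream are invented for illustration:

    #include <stdio.h>
    #include <stdlib.h>

    /*
     * Plain reservoir sampling (Algorithm R): maintain a uniform
     * sample of n rows from a stream of unknown length.
     */
    typedef int Row;

    static void
    reservoir_add(Row *reservoir, int n, long *seen, Row row)
    {
        long    pos;

        (*seen)++;
        if (*seen <= n)
        {
            /* the first n rows always go into the reservoir */
            reservoir[*seen - 1] = row;
            return;
        }
        /* later rows replace a random slot with probability n/seen */
        pos = (long) ((double) rand() /
                      ((double) RAND_MAX + 1.0) * (double) *seen);
        if (pos < n)
            reservoir[pos] = row;
    }

    int
    main(void)
    {
        Row     reservoir[5];
        long    seen = 0;
        Row     row;
        int     i;

        srand(42);
        for (row = 0; row < 1000; row++)    /* stand-in for live rows */
            reservoir_add(reservoir, 5, &seen, row);
        for (i = 0; i < 5; i++)
            printf("%d\n", reservoir[i]);
        return 0;
    }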
>Question: if the table size is less than N blocks, are you going to read
>every block or try to reduce the number of blocks sampled?
Don't know yet.
>people are setting the stats target to 100 which means a sample size of
>30000 --- how do the page-access counts look in that case?
 rel size |  reads
----------+--------
      300 |    300
     3000 |   3000
     5000 |   4999
      10K |   9.9K
      30K |  25.8K
     300K |    85K
       1M |   120K
      10M |   190K
     100M |   260K
       1G |   330K
This is exactly the table I posted before (for sample size 3000) with
every entry multiplied by ten. Well, not quite exactly, but the
differences show up only several decimal places in. So for our
purposes, for a given relation size the number of pages accessed is
proportional to the sample size.
Servus
 Manfred