Re: Most efficient way to insert without duplicates

Поиск
Список
Период
Сортировка
От Jeff Janes
Тема Re: Most efficient way to insert without duplicates
Дата
Msg-id CAMkU=1xkN0zRNoU4o4SwgFTEjc6fQqA3-X7Z=FyHdB-gnRdMwg@mail.gmail.com
обсуждение исходный текст
Ответ на Re: Most efficient way to insert without duplicates  (François Beausoleil <francois@teksol.info>)
Ответы Re: Most efficient way to insert without duplicates
Re: Most efficient way to insert without duplicates
Список pgsql-general
On Wed, Apr 17, 2013 at 4:26 AM, François Beausoleil <francois@teksol.info> wrote:


 Insert on public.persona_followers  (cost=139261.12..20483497.65 rows=6256498 width=16) (actual time=4729255.535..4729255.535 rows=0 loops=1)
   Buffers: shared hit=33135295 read=4776921
   ->  Subquery Scan on t1  (cost=139261.12..20483497.65 rows=6256498 width=16) (actual time=562265.156..578844.999 rows=6819520 loops=1)


It looks like 12% of the time is being spent figuring out what rows to insert, and 88% actually doing the insertions.

So I think that index maintenance is killing you.  You could try adding a sort to your select so that rows are inserted in index order, or inserting in batches in which the batches are partitioned by service_id (which is almost the same thing as sorting, since service_id is the lead column)

Cheers,

Jeff

В списке pgsql-general по дате отправления:

Предыдущее
От: Thomas Munro
Дата:
Сообщение: Re: Roadmap for Postgres on AIX
Следующее
От: Aleksey Tsalolikhin
Дата:
Сообщение: Re: How large can a PostgreSQL database get?