Re: Postgres bulk insert/ETL performance on high speed servers - test results

Поиск
Список
Период
Сортировка
От Claudio Freire
Тема Re: Postgres bulk insert/ETL performance on high speed servers - test results
Дата
Msg-id CAGTBQpa+go_gT2=H7JJROTY-qpo8x12JLNP-L46kaRMuk14zZg@mail.gmail.com
обсуждение исходный текст
Ответ на Postgres bulk insert/ETL performance on high speed servers - test results  ("Mike Sofen" <msofen@runbox.com>)
Ответы Re: Postgres bulk insert/ETL performance on high speed servers - test results  ("Mike Sofen" <msofen@runbox.com>)
Список pgsql-performance
On Thu, Sep 1, 2016 at 11:30 PM, Mike Sofen <msofen@runbox.com> wrote:
> PASS 2:
> Process:      Transform/Load (all work local to the server - read,
> transform, write as a single batch)
> Num Source Rows:      10,554,800 (one batch from just a single source table
> going to a single target table)
> Avg Rowcount Compression:      31.5 (jsonb row compression resulting in
> 31.5x fewer rows)
> AWS Time in Secs:      2,493 (41.5 minutes)
> Cisco Time in Secs:      661 (10 minutes)
> Difference:      3.8x
> Comment:AWS:  4.2k rows/sec   Cisco:  16k rows/sec
>
> It's obvious the size of the batch exceeded the AWS server memory, resulting
> in a profoundly slower processing time.  This was a true, apples to apples
> comparison between Pass 1 and Pass 2: average row lengths were within 7% of
> each other (1121 vs 1203) using identical table structures and processing
> code, the only difference was the target server.
>
> I'm happy to answer questions about these results.


Are you sure it's a memory thing and not an EBS bandwidth thing?

EBS has significantly less bandwidth than direct-attached flash.


В списке pgsql-performance по дате отправления:

Предыдущее
От: "Mike Sofen"
Дата:
Сообщение: Postgres bulk insert/ETL performance on high speed servers - test results
Следующее
От: "Mike Sofen"
Дата:
Сообщение: Re: Postgres bulk insert/ETL performance on high speed servers - test results