Re: how to monitor the progress of really large bulk operations?

Поиск
Список
Период
Сортировка
От Pavel Stehule
Тема Re: how to monitor the progress of really large bulk operations?
Дата
Msg-id CAFj8pRBP+qacndYRa9P5str2KaydtSXDbcMBCzb2m-N1FsMQQQ@mail.gmail.com
обсуждение исходный текст
Ответ на Re: how to monitor the progress of really large bulk operations?  (Pavel Stehule <pavel.stehule@gmail.com>)
Ответы Re: how to monitor the progress of really large bulk operations?  ("Mike Sofen" <msofen@runbox.com>)
Список pgsql-general


2016-09-28 6:13 GMT+02:00 Pavel Stehule <pavel.stehule@gmail.com>:
Hi

2016-09-27 23:03 GMT+02:00 Mike Sofen <msofen@runbox.com>:

Hi gang,

 

On PG 9.5.1, linux, I’m running some large ETL operations, migrate data from a legacy mysql system into PG, upwards of 250m rows in a transaction (it’s on a big box).  It’s always a 2 step operation – extract raw mysql data and pull it to the target big box into staging tables that match the source, the second step being read the landed dataset and transform it into the final formats, linking to newly generated ids, compressing big subsets into jsonb documents, etc.

 

While I could break it into smaller chunks, it hasn’t been necessary, and it doesn’t eliminate my need:  how to view the state of a transaction in flight, seeing how many rows have been read or inserted (possible for a transaction in flight?), memory allocations across the various PG processes, etc.

 

Possible or a hallucination?

 

Mike Sofen (Synthetic Genomics)



pltoolbox has counter function https://github.com/okbob/pltoolbox/blob/master/utils.c

pavel=# insert into omega2 select (x.xx).*           from (select pst.counter(omega,200000, true) xx                    from omega               ) x;
NOTICE:  processed 200000 rows, current value is '(5,8)'
NOTICE:  processed 200000 rows, current value is '(5,8)'
NOTICE:  processed 400000 rows, current value is '(6,8)'
NOTICE:  processed 400000 rows, current value is '(6,8)'
NOTICE:  processed 600000 rows, current value is '(7,8)'
NOTICE:  processed 600000 rows, current value is '(7,8)'
NOTICE:  processed 800000 rows, current value is '(1,8)'
NOTICE:  processed 800000 rows, current value is '(1,8)'
NOTICE:  processed 1000000 rows, current value is '(5,8)'
NOTICE:  processed 1000000 rows, current value is '(5,8)'

 


Regards

Pavel


В списке pgsql-general по дате отправления:

Предыдущее
От: Pavel Stehule
Дата:
Сообщение: Re: how to monitor the progress of really large bulk operations?
Следующее
От: Adir Shaban
Дата:
Сообщение: Database fixed size