Re: BUG #17716: walsender process hang while decoding 'DROP PUBLICATION' XLOG

From	shveta malik
Subject	Re: BUG #17716: walsender process hang while decoding 'DROP PUBLICATION' XLOG
Date	20 December 2022, 12:02:47
Msg-id	CAJpy0uDD4k=GU6YjRKt83cZDNuvkTbOV8OL6dZiRtvFQbhMJGA@mail.gmail.com
In reply to	Re: BUG #17716: walsender process hang while decoding 'DROP PUBLICATION' XLOG (shveta malik <shveta.malik@gmail.com>)
Responses	Re: BUG #17716: walsender process hang while decoding 'DROP PUBLICATION' XLOG
List	pgsql-bugs

Hello,
I tried to reproduce the lag with a larger-scale test case: I added more
tables to pub_t_large to increase the number of command IDs, and added a huge
number of tables to the working publication pub_t to increase the number of
entries in the rel-cache, but no luck.
No noticeable lag was observed on HEAD with the new invalidation mechanism.

thanks
Shveta


On Tue, Dec 20, 2022 at 11:40 AM shveta malik <shveta.malik@gmail.com> wrote:
>
> Hello,
> The idea looks good to me. For the relation schema cache (the pgoutput one), on
> receiving an invalidation message for one hash value, we invalidate the complete
> cache, as there is no way to find the entry corresponding to that hash value, and
> thus your proposed fix will make a good difference. But I feel it makes sense on
> HEAD as well.
>
> This complete cache invalidation happens multiple times even on HEAD (10k times
> for the given case). This cache is mostly empty in the given test case, but consider
> the case where we have a huge number of publications and subscriptions (so that
> this cache has a huge number of entries) and then we try to drop one large
> publication with, say, 40k-50k tables. In that case we might see slowness while
> traversing and invalidating the concerned cache on HEAD as well. The test case with
> increased magnitude can be tried on HEAD once to see whether we need it on HEAD or not.
>
> thanks
> Shveta
>
>
> On Mon, Dec 19, 2022 at 5:52 PM Bowen Shi <zxwsbg12138@gmail.com> wrote:
>>
>> Hello,
>> Thanks for your advice. I ran some tests, and this problem cannot be
>> reproduced in PG 14 and later. I think adding a new XLOG type will help
>> resolve this problem. But I think the following patch may be helpful
>> for PG 13.
>>
>> The invalidation consists of two parts: pgoutput and relfilenodeMap. We
>> have no way to optimize the relfilenodeMap part, as discussed in
>> previous mails:
>> https://www.postgresql.org/message-id/CANDwggKYveEtXjXjqHA6RL3AKSHMsQyfRY6bK+NqhAWJyw8psQ@mail.gmail.com
>>
>> However, I'd like to contribute a patch to fix the pgoutput part. We can skip
>> invalidating the caches after the first time by using a lazy tag, and this works.
>> It almost doubles the walsender performance while decoding this XLOG.
>>
>> I used the test from the last email and reduced the number of relations in
>> publications to 1000. The test results are as follows:
>>
>> Before optimization: 76 min
>> After optimization: 35 min
>>
>> Though the result is not good enough, I think this patch is still worthwhile.
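
The two ideas discussed in this thread (a full cache walk for every invalidation
message, and a "lazy tag" that skips repeated walks) can be illustrated with a toy
model. This is only a sketch under stated assumptions, not the proposed patch and
not PostgreSQL code: the class, its fields, and the rule that the tag is cleared
when an entry is rebuilt are all hypothetical names and choices made for
illustration.

```python
class RelSyncCacheModel:
    """Toy model of a per-relation cache (hypothetical, not PostgreSQL code)."""

    def __init__(self, relids, lazy):
        # relid -> "is this entry currently valid?"
        self.entries = {relid: True for relid in relids}
        self.lazy = lazy            # enable the hypothetical "lazy tag"
        self.all_invalid = False    # the tag: True once every entry is invalid
        self.entries_visited = 0    # work counter for comparison

    def on_invalidation(self, hash_value):
        # The hash value cannot be mapped back to a single entry, so the
        # whole cache has to be marked invalid.
        if self.lazy and self.all_invalid:
            return  # nothing was revalidated since the last full walk
        for relid in self.entries:
            self.entries[relid] = False
            self.entries_visited += 1
        self.all_invalid = True

    def lookup(self, relid):
        # Rebuilding an entry makes it valid again, so the tag must be
        # cleared (assumption made for this sketch).
        if not self.entries.get(relid, False):
            self.entries[relid] = True
            self.all_invalid = False
        return relid


def simulate(n_entries, n_messages, lazy):
    """Return how many entries were visited for n_messages invalidations."""
    cache = RelSyncCacheModel(range(n_entries), lazy)
    for hash_value in range(n_messages):
        cache.on_invalidation(hash_value)
    return cache.entries_visited


if __name__ == "__main__":
    print("eager:", simulate(100, 1000, lazy=False))
    print("lazy: ", simulate(100, 1000, lazy=True))
```

In this model the eager variant does work proportional to entries times messages,
while the lazy variant walks the cache once per burst of invalidations. It says
nothing about the relfilenodeMap part, which the thread describes as not
optimizable this way.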
