Re: Improving spin-lock implementation on ARM.

Поиск

Список

Период

Сортировка

От	Tom Lane
Тема	Re: Improving spin-lock implementation on ARM.
Дата	30 ноября 2020 г. 09:08:27
Msg-id	1158478.1606716507@sss.pgh.pa.us обсуждение исходный текст
Ответ на	Re: Improving spin-lock implementation on ARM. (Krunal Bauskar <krunalbauskar@gmail.com>)
Ответы	Re: Improving spin-lock implementation on ARM. (Krunal Bauskar <krunalbauskar@gmail.com>) Re: Improving spin-lock implementation on ARM. (Alexander Korotkov <aekorotkov@gmail.com>)
Список	pgsql-hackers

Дерево обсуждения

Krunal Bauskar <krunalbauskar@gmail.com> writes:
> On Mon, 30 Nov 2020 at 10:14, Tom Lane <tgl@sss.pgh.pa.us> wrote:
>> The results I posted at [1] seem to contradict this for Apple's new
>> machines.

> For the results you saw on Mac-Mini was LSE enabled by default.

Hmm, I don't know how to get Apple's clang to admit what its default
settings are ... anybody?

However, it does accept "-march=armv8-a+lse", and that seems to
not be the default, because I get different results from my spinlock-
pounding test than I did yesterday.  Abbreviating into a table:

                --- CFLAGS=-O2 ---      --- CFLAGS="-O2 -march=armv8-a+lse" ---

TPS             HEAD    CAS patch       HEAD    CAS patch

clients=1       2127    2174            2612    2722
clients=2       1816    859             892     950
clients=4       714     519             610     468
clients=8       -       -               108     185

Unfortunately, that still doesn't lead me to think that either LSE
or CAS are net wins on this hardware.  It's quite clear that LSE
makes the uncontended case a good bit faster, but the contended case
is a lot worse, so is that really a tradeoff we want?

> * I would also suggest if possible try with higher scalability (more than 4
> to check if with increase scalability CAS out-perform).

As I said yesterday, running more than 4 processes is just going
to bring the low-performance cores into the equation, which is likely
to swamp any interesting comparison.  I did run the test with "-c 8"
today, as shown in the right-hand columns, and the results seem
to bear that out.

            regards, tom lane

В списке pgsql-hackers по дате отправления:

Предыдущее

От: Fujii Masao
Дата: 30 ноября 2020 г., 09:02:03
Сообщение: Re: Feature improvement for pg_stat_statements

Следующее

От: Krunal Bauskar
Дата: 30 ноября 2020 г., 09:19:25
Сообщение: Re: Improving spin-lock implementation on ARM.

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: Improving spin-lock implementation on ARM.

Предыдущее

Следующее