CRC32C Parallel Computation Optimization on ARM

Поиск
Список
Период
Сортировка
От Xiang Gao
Тема CRC32C Parallel Computation Optimization on ARM
Дата
Msg-id DB9PR08MB6991329A73923BF8ED4B3422F5DBA@DB9PR08MB6991.eurprd08.prod.outlook.com
обсуждение исходный текст
Ответы Re: CRC32C Parallel Computation Optimization on ARM  (Michael Paquier <michael@paquier.xyz>)
Список pgsql-hackers

Hi all

 

This patch uses a parallel computing optimization algorithm to improve crc32c computing performance on ARM. The algorithm comes from Intel whitepaper: crc-iscsi-polynomial-crc32-instruction-paper. Input data is divided into three equal-sized blocks.Three parallel blocks (crc0, crc1, crc2) for 1024 Bytes.One Block: 42(BLK_LENGTH) * 8(step length: crc32c_u64) bytes

 

Crc32c unitest: https://gist.github.com/gaoxyt/138fd53ca1eead8102eeb9204067f7e4

Crc32c benchmark: https://gist.github.com/gaoxyt/4506c10fc06b3501445e32c4257113e9

It gets ~2x speedup compared to linear Arm crc32c instructions.

 

I'll create a CommitFests ticket for this submission.

Any comments or feedback are welcome.

 

IMPORTANT NOTICE: The contents of this email and any attachments are confidential and may also be privileged. If you are not the intended recipient, please notify the sender immediately and do not disclose the contents to any other person, use it for any purpose, or store or copy the information in any medium. Thank you.
Вложения

В списке pgsql-hackers по дате отправления:

Предыдущее
От: Laurenz Albe
Дата:
Сообщение: Re: Fix output of zero privileges in psql
Следующее
От: Michael Paquier
Дата:
Сообщение: Re: CRC32C Parallel Computation Optimization on ARM