Re: Optimize Arm64 crc32c implementation in Postgresql
От | Heikki Linnakangas |
---|---|
Тема | Re: Optimize Arm64 crc32c implementation in Postgresql |
Дата | |
Msg-id | ec487192-f6aa-509a-cacb-6642dad14209@iki.fi обсуждение исходный текст |
Ответ на | Re: Optimize Arm64 crc32c implementation in Postgresql (Andres Freund <andres@anarazel.de>) |
Ответы |
Re: Optimize Arm64 crc32c implementation in Postgresql
|
Список | pgsql-hackers |
On 01/04/18 20:32, Andres Freund wrote: > On 2018-03-06 02:44:35 +0800, Heikki Linnakangas wrote: >> * I tested this on Linux, with gcc and clang, on an ARM64 virtual machine >> that I had available (not an emulator, but a VM on a shared ARM64 server). > > Have you seen actual postgres performance benefits with the patch? I just ran a small test with pg_waldump, similar to what Abhijit Menon-Sen ran with the Slicing-by-8 and Intel SSE patches, when we added those (https://www.postgresql.org/message-id/20141119155811.GA32492%40toroid.org). I ran pgbench, with scale factor 5, until it had generated about 1 GB of WAL, and then I ran pg_waldump -z on that WAL. With slicing-by-8, it took about 7 s, and with the special CPU instructions, about 5 s. 'perf' showed that the CRC computation took about 30% of the CPU time before, and about 12% after, which sounds about right. That's not as big a speedup as we saw with the corresponding Intel SSE instructions back in 2014, but still quite worthwhile. - Heikki
В списке pgsql-hackers по дате отправления: