Re: refactor architecture-specific popcount code
| От | Greg Burd |
|---|---|
| Тема | Re: refactor architecture-specific popcount code |
| Дата | |
| Msg-id | 3a3ca14a-dada-4920-a614-42f2e5eb3f5d@app.fastmail.com обсуждение исходный текст |
| Ответ на | Re: refactor architecture-specific popcount code (Nathan Bossart <nathandbossart@gmail.com>) |
| Ответы |
Re: refactor architecture-specific popcount code
|
| Список | pgsql-hackers |
On Mon, Feb 2, 2026, at 5:51 PM, Nathan Bossart wrote: > On Mon, Feb 02, 2026 at 09:16:42PM +0700, John Naylor wrote: >> It might be a good idea to do a little new testing, and I see a use >> for a special 8-byte path independent of AVX512: v6 seems to regress a >> little for single-words. But, it turns out that when gcc turns >> __builtin_popcountl into a single instruction, it's inline, but if it >> emits portable bitwise ops, it does so in a function called >> __popcountdi2(). That can be avoided by hand-coding in C for normal >> builds (and for 32-bit looks cleaner anyway), as in the attached 0005. > > Oh, interesting. I looked into this a little more [0]. Both gcc and clang > generate cnt instructions for aarch64, so we're good there. However, clang > on x86-64 generates the bit-twiddling version, and gcc on x86-64 generates > a call to __popcountdi2() (which I imagine does something similar). It's > not until you provide a compiler flag like -march=x86-64-v2 that gcc/clang > start generating popcnt instructions for x86-64, which makes sense. 0005 > seems like the correct move to me... > > [0] https://godbolt.org/z/he3WozG3E > > -- > nathan Nathan, John, Thanks for the focus on this area of the code. I've been looking into what to do with popcnt when building Win11/ARM64/MSVC. I know that when _MSC_VER and _M_ARM64 are defined we can make use of the __popcnt(unsigned int) and __popcnt64(unsigned__int64) intrinsics which have been available since VS 2022 17.11+. I thought I'd check that combo outand it turns out that it is identical to clang/gcc on that platform [0]. I'll wait for your work to land before proposing a patch to add these unless it is really easy to fit it and you feel likegiving it a go. :) best. -greg [0] https://godbolt.org/z/TrxjzcPGE
В списке pgsql-hackers по дате отправления: