Re: refactor architecture-specific popcount code
| From | John Naylor |
|---|---|
| Subject | Re: refactor architecture-specific popcount code |
| Date | |
| Msg-id | CANWCAZa3BRPossmbAQoPb1zdQ7iq9TnUvZhqpmt4AQKhdPocxg@mail.gmail.com |
| In response to | Re: refactor architecture-specific popcount code (Nathan Bossart <nathandbossart@gmail.com>) |
| List | pgsql-hackers |
On Thu, Feb 5, 2026 at 4:43 AM Nathan Bossart <nathandbossart@gmail.com> wrote:
> Sure. I'm tempted to suggest that we only use the plain C version here,
> too. The SSE4.2 bms_num_members() test I did yesterday used it and showed
> improvement at one word. If we do that, we can rip out even more code
> since we no longer need the popcount built-ins.

Unlike the 32-bit case, people do run production on 64-bit platforms
that are not Arm/x86, so that would require effort to see if the
builtins are worth it for them. That seems like a separate effort. I
can help with that, but let's get the tested stuff in first.

> * tests plain C version on an Apple M3 *
>
> Yeah, the plain C version might be marginally slower than the built-in
> version for that test, but it still seems quite a bit faster than HEAD.
>
>     HEAD    v8      v10
>     40      25      29

That's good to know, and maybe it'll be true elsewhere.

--
John Naylor
Amazon Web Services
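
For readers following the thread, the difference between a "plain C version" and the popcount built-ins can be sketched roughly as below. This is only an illustration under stated assumptions: the function names popcount64_plain and popcount64_builtin are hypothetical, the bit-twiddling shown is a common portable formulation and not necessarily what the patch versions (v8, v10) contain, and __builtin_popcountll is the GCC/Clang built-in, which may or may not compile to a single instruction depending on the target and compiler flags.

```c
#include <stdint.h>

/*
 * Hypothetical "plain C" popcount for one 64-bit word, using the common
 * SWAR (parallel bit-summing) technique. Illustrative only; this is not
 * taken from the patch under discussion.
 */
static inline int
popcount64_plain(uint64_t x)
{
	/* Sum adjacent bits into 2-bit fields. */
	x = x - ((x >> 1) & UINT64_C(0x5555555555555555));
	/* Sum adjacent 2-bit fields into 4-bit fields. */
	x = (x & UINT64_C(0x3333333333333333)) +
		((x >> 2) & UINT64_C(0x3333333333333333));
	/* Sum adjacent 4-bit fields into per-byte counts. */
	x = (x + (x >> 4)) & UINT64_C(0x0F0F0F0F0F0F0F0F);
	/* Add all byte counts together; the total lands in the top byte. */
	return (int) ((x * UINT64_C(0x0101010101010101)) >> 56);
}

/*
 * Hypothetical built-in variant. Assumes GCC or Clang; other compilers
 * fall back to the plain C path.
 */
static inline int
popcount64_builtin(uint64_t x)
{
#if defined(__GNUC__) || defined(__clang__)
	return __builtin_popcountll(x);
#else
	return popcount64_plain(x);
#endif
}
```

Both functions return the same count for any input, so the choice between them is purely about speed on a given platform, which is why the question of dropping the built-ins on other 64-bit architectures needs measurement.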