Re: add AVX2 support to simd.h

Поиск
Список
Период
Сортировка
От Nathan Bossart
Тема Re: add AVX2 support to simd.h
Дата
Msg-id 20240319031601.GA828141@nathanxps13
обсуждение исходный текст
Ответ на Re: add AVX2 support to simd.h  (John Naylor <johncnaylorls@gmail.com>)
Ответы Re: add AVX2 support to simd.h  (John Naylor <johncnaylorls@gmail.com>)
Список pgsql-hackers
On Tue, Mar 19, 2024 at 10:03:36AM +0700, John Naylor wrote:
> I took a brief look, and 0001 isn't quite what I had in mind. I can't
> quite tell what it's doing with the additional branches and "goto
> retry", but I meant something pretty simple:

Do you mean 0002?  0001 just adds a 2-register loop for remaining elements
once we've exhausted what can be processed with the 4-register loop.

> - if short, do one element at a time and return

0002 does this.

> - if long, do one block unconditionally, then round the start pointer
> up so that "end - start" is an exact multiple of blocks, and loop over
> them

0002 does the opposite of this.  That is, after we've completed as many
blocks as possible, we move the iterator variable back to "end -
block_size" and do one final iteration to cover all the remaining elements.

-- 
Nathan Bossart
Amazon Web Services: https://aws.amazon.com



В списке pgsql-hackers по дате отправления:

Предыдущее
От: John Naylor
Дата:
Сообщение: Re: add AVX2 support to simd.h
Следующее
От: Masahiko Sawada
Дата:
Сообщение: Re: [PoC] Improve dead tuple storage for lazy vacuum