Re: Speed up COPY FROM text/CSV parsing using SIMD

Поиск

Список

Период

Сортировка

От	Nathan Bossart
Тема	Re: Speed up COPY FROM text/CSV parsing using SIMD
Дата	20 октября 2025 г. 20:04:03
Msg-id	aPZrg6lxb5bgy_px@nathan обсуждение
Ответ на	Re: Speed up COPY FROM text/CSV parsing using SIMD (Andrew Dunstan <andrew@dunslane.net>)
Ответы	Re: Speed up COPY FROM text/CSV parsing using SIMD
Список	pgsql-hackers

Дерево обсуждения

On Mon, Oct 20, 2025 at 10:02:23AM -0400, Andrew Dunstan wrote:
> On 2025-10-16 Th 10:29 AM, Nazir Bilal Yavuz wrote:
>> With this heuristic the regression is limited by %2 in the worst case.
> 
> My worry is that the worst case is actually quite common. Sparse data sets
> dominated by a lot of null values (and hence lots of special characters) are
> very common. Are people prepared to accept a 2% regression on load times for
> such data sets?

Without knowing how common it is, I think it's difficult to judge whether
2% is a reasonable trade-off.  If <5% of workloads might see a small
regression while the other >95% see double-digit percentage improvements,
then I might argue that it's fine.  But I'm not sure we have any way to
know those sorts of details at the moment.

I'm also at least a little skeptical about the 2% number.  IME that's
generally within the noise range and can vary greatly between machines and
test runs.

-- 
nathan

В списке pgsql-hackers по дате отправления:

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: Speed up COPY FROM text/CSV parsing using SIMD