Re: Speed up COPY FROM text/CSV parsing using SIMD
| От | Nathan Bossart | 
|---|---|
| Тема | Re: Speed up COPY FROM text/CSV parsing using SIMD | 
| Дата | |
| Msg-id | aPZrg6lxb5bgy_px@nathan обсуждение исходный текст  | 
		
| Ответ на | Re: Speed up COPY FROM text/CSV parsing using SIMD (Andrew Dunstan <andrew@dunslane.net>) | 
| Ответы | 
                	
            		Re: Speed up COPY FROM text/CSV parsing using SIMD
            		
            		 | 
		
| Список | pgsql-hackers | 
On Mon, Oct 20, 2025 at 10:02:23AM -0400, Andrew Dunstan wrote: > On 2025-10-16 Th 10:29 AM, Nazir Bilal Yavuz wrote: >> With this heuristic the regression is limited by %2 in the worst case. > > My worry is that the worst case is actually quite common. Sparse data sets > dominated by a lot of null values (and hence lots of special characters) are > very common. Are people prepared to accept a 2% regression on load times for > such data sets? Without knowing how common it is, I think it's difficult to judge whether 2% is a reasonable trade-off. If <5% of workloads might see a small regression while the other >95% see double-digit percentage improvements, then I might argue that it's fine. But I'm not sure we have any way to know those sorts of details at the moment. I'm also at least a little skeptical about the 2% number. IME that's generally within the noise range and can vary greatly between machines and test runs. -- nathan
В списке pgsql-hackers по дате отправления: