Re: [HACKERS] [PATCH] Incremental sort
От | Tomas Vondra |
---|---|
Тема | Re: [HACKERS] [PATCH] Incremental sort |
Дата | |
Msg-id | cc04594b-8a0d-dc24-77d8-df6f4f6279b1@2ndquadrant.com обсуждение исходный текст |
Ответ на | Re: [HACKERS] [PATCH] Incremental sort (Alexander Korotkov <a.korotkov@postgrespro.ru>) |
Ответы |
Re: [HACKERS] [PATCH] Incremental sort
|
Список | pgsql-hackers |
Hi, I have started reviewing the patch and doing some testing, and I have pretty quickly ran into a segfault. Attached is a simple reproducer and an backtrace. AFAICS the bug seems to be somewhere in the tuplesort changes, likely resetting a memory context too soon or something like that. I haven't investigated it further, but it matches my hunch that tuplesort is likely where the bugs will be. Otherwise the patch seems fairly complete. A couple of minor things that I noticed while eyeballing the changes in a diff editor. 1) On a couple of places the new code has this comment /* even when not parallel-aware */ while all the immediately preceding blocks use /* even when not parallel-aware, for EXPLAIN ANALYZE */ I suggest using the same comment, otherwise it kinda suggests it's not because of EXPLAIN ANALYZE. 2) I think the purpose of sampleSlot should be explicitly documented (and I'm not sure "sample" is a good term here, as is suggest some sort of sampling (for example nodeAgg uses grp_firstTuple). 3) skipCols/SkipKeyData seems a bit strange too, I think. I'd use PresortedKeyData or something like that. 4) In cmpSortSkipCols, when checking if the columns changed, the patch does this: n = ((IncrementalSort *) node->ss.ps.plan)->skipCols; for (i = 0; i < n; i++) { ... check i-th key ... } My hunch is that checking the keys from the last one, i.e. for (i = (n-1); i >= 0; i--) { .... } would be faster. The reasoning is that with "ORDER BY a,b" the column "b" changes more often. But I've been unable to test this because of the segfault crashes. 5) The changes from if (pathkeys_contained_in(...)) to n = pathkeys_common(pathkeys, subpath->pathkeys); if (n == 0) seem rather inconvenient to me, as it makes the code unnecessarily verbose. I wonder if there's a better way to deal with this. regards -- Tomas Vondra http://www.2ndQuadrant.com PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services
Вложения
В списке pgsql-hackers по дате отправления: