Re: Minor performance improvement in transition to external sort
От | Jeremy Harris |
---|---|
Тема | Re: Minor performance improvement in transition to external sort |
Дата | |
Msg-id | 530CF51C.1030208@wizmail.org обсуждение исходный текст |
Ответ на | Re: Minor performance improvement in transition to external sort (Robert Haas <robertmhaas@gmail.com>) |
Ответы |
Re: Minor performance improvement in transition to external sort
|
Список | pgsql-hackers |
On 24/02/14 17:38, Robert Haas wrote: > On Thu, Feb 20, 2014 at 7:27 PM, Jeremy Harris <jgh@wizmail.org> wrote: >> Run under cachegrind, it takes about N/10 last-level cache misses, >> all for the new item being introduced to the heap. The existing >> code takes none at all. > > Can you explain this further? This seems like an important clue that > could be useful when trying to optimize this code, but I'm a little > unclear which part of the operation has more cache misses with your > changes and why. In the patched version, for the heapify operation the outer loop starts at the last heap-parent tuple and works backward to the start of the tuples array. A copy is taken of the SortTuple being operated on for the inner loop to use. This copy suffers cache misses. (The inner loop operates on elements between the copy source and the end of the array). In the original, the outer loop runs the array in increasing index order. Again a copy is taken of the SortTuple for the inner loop to use. This copy does not appear to take significant cache misses. (The inner loop operates on elements between the copy source and the start of the array). -- Cheers, Jeremy
В списке pgsql-hackers по дате отправления: