Re: ML-based indexing ("The Case for Learned Index Structures", a paper from Google)
От | Peter Geoghegan |
---|---|
Тема | Re: ML-based indexing ("The Case for Learned Index Structures", a paper from Google) |
Дата | |
Msg-id | CAH2-WzmHq5JrJrJtumj3q_U9SfTkNRyDnpUhXMCL6J3pAhZ=UA@mail.gmail.com обсуждение исходный текст |
Ответ на | Re: ML-based indexing ("The Case for Learned Index Structures", a paper from Google) (Andrey Borodin <x4mmm@yandex-team.ru>) |
Ответы |
Re: ML-based indexing ("The Case for Learned Index Structures", a paper from Google)
|
Список | pgsql-hackers |
On Tue, Apr 20, 2021 at 11:18 AM Andrey Borodin <x4mmm@yandex-team.ru> wrote: > BTW take a look into PGM [0]. I'm slowly working on implementing it. > I think it is kind of straightforward to implement it as extension. > I've started from forking B-tree[1]. I've removed support of anything that is not int4. > Then I plan to remove opclass\comparator abstraction layer. > And then add interpolation search over the page before binsearch. FWIW I'm a skeptic of learned indexes. There are lots of reasons why I don't think that they're practical, but it boils down to this: in general data structures that work well don't need anybody to make a case for them. They simply work well for the task they were designed for. Anything is possible, but I strongly doubt that learned indexes are the exception to that general rule. Why hasn't anybody been able to apply the techniques in any real world database system to date? I'm sure that it's possible to make things much faster if it's possible to make certain wide-reaching assumptions. They talk about big improvements in space utilization compared to B-Trees as if there was some very fixed idea of how that works in B-Trees. Why haven't they compared their model to grotty heuristics that practical experience has shown to work, such as the rightmost leaf page split heuristic? Any paper about learned indexes that I've ever read doesn't even acknowledge the existence of these heuristics. -- Peter Geoghegan
В списке pgsql-hackers по дате отправления: