Re: ML-based indexing ("The Case for Learned Index Structures", a paper from Google)
От | Peter Geoghegan |
---|---|
Тема | Re: ML-based indexing ("The Case for Learned Index Structures", a paper from Google) |
Дата | |
Msg-id | CAH2-Wzkwopr028p+3d=kNjd6hoA3V+Zy6hbou-LcHaV8FymH9Q@mail.gmail.com обсуждение исходный текст |
Ответ на | Re: ML-based indexing ("The Case for Learned Index Structures", a paper from Google) ("Jonah H. Harris" <jonah.harris@gmail.com>) |
Ответы |
Re: ML-based indexing ("The Case for Learned Index Structures", a paper from Google)
|
Список | pgsql-hackers |
On Tue, Apr 20, 2021 at 12:51 PM Jonah H. Harris <jonah.harris@gmail.com> wrote: >> Maybe I'll be wrong about learned indexes - who knows? But the burden >> of proof is not mine. I prefer to spend my time on things that I am >> reasonably confident will work out well ahead of time. > > > Agreed on all of your takes, Peter. In time, they will probably be more realistic. A big problem when critically evaluating any complicated top-down model in the abstract is that it's too easy for the designer to hide *risk* (perhaps inadvertently). If you are allowed to make what amounts to an assumption that you have perfect foreknowledge of the dataset, then sure, you can do a lot with that certainty. You can easily find a way to make things faster or more space efficient by some ridiculous multiple that way (like 10x, 100x, whatever). None of these papers ever get around to explaining why what they've come up with is not simply fool's gold. The assumption that you can have robust foreknowledge of the dataset seems incredibly fragile, even if your model is *almost* miraculously good. I have no idea how fair that is. But my job is to make Postgres better, not to judge papers. My mindset is very matter of fact and practical. -- Peter Geoghegan
В списке pgsql-hackers по дате отправления: