Re: Learned Index
От | Laurenz Albe |
---|---|
Тема | Re: Learned Index |
Дата | |
Msg-id | 1513070199.2564.16.camel@cybertec.at обсуждение исходный текст |
Ответ на | Learned Index (Deepak Balasubramanyam <deepak.balu@gmail.com>) |
Ответы |
Re: Learned Index
|
Список | pgsql-hackers |
Deepak Balasubramanyam wrote: > I came across this paper making a case for indices that use machine learning to optimise search. > > https://arxiv.org/pdf/1712.01208.pdf > > The gist seems to be to use a linear regression model or feed a tensor flow model when a more complicated distributionis needed for the data and allow SIMD instructions working on top of GPUs / TPUs to speed up lookups. The speedupobserved is anywhere from 40-60%. > > That result looks impressive but I don't have enough context on say rebuilding a neural net on every DML operation. Theequivalent operation that I can relate to on PG would be to rebalance the B-tree for DML operations. > > In your opinion, would a ML model work for a table whose operations are both write and read heavy? I'd love to hear yourthoughts on the paper. I have read into the paper. This may be interesting or not, but the paper is very vague about its concepts and algorithms, so it's hard to tell. I'd say that the paper does not meet publication standards. For example, they say that their results were generated by comparing a B-tree implementation with "learned indexes using a 2-stage RMI model and different second-stage sizes (i.e., 10k, 50k, 100k, and 200k)", but they don't say exactly what the neural network in these stages is (at least it is not obvious to me). Their "Learning Index Framework" (LIF) is described with a few vague sentences and a reference to the literature saying that is where they got some ideas from. There is also no clear concept of how these indexes should handle data modifications, so I think that there are some loose ends to be tied up before it is ready for implementation. Finally, I don't see any clear statement as to the error guarantees that the neural network prediction can give, and if it is possible that it may degrade to scanning relevant parts of the table in some cases. Yours, Laurenz Albe
В списке pgsql-hackers по дате отправления: