Re: gaussian distribution pgbench
От | KONDO Mitsumasa |
---|---|
Тема | Re: gaussian distribution pgbench |
Дата | |
Msg-id | 52D4DEA1.3050900@lab.ntt.co.jp обсуждение исходный текст |
Ответ на | Re: gaussian distribution pgbench (Gregory Smith <gregsmithpgsql@gmail.com>) |
Ответы |
Re: gaussian distribution pgbench
|
Список | pgsql-hackers |
Hi, I revise my gaussian pgbench patch which wss requested from community. * Changes - Support custom script. - "\setgaussian" is generating gaussian distribute random number. - ex) \setgaussian [var] [min] [max] [stddev_threshold] - We can use mixture model in multiple custom scripts. - Delete short option "-g", and add long options ”--gaussian" - Refactoring getrand() interface > - getrand(TState *thread, int64 min, int64 max) > + getrand(TState *thread, int64 min, int64 max, DistType dist_type, double value1) - We can easy to add other random distribution algorithms. Please see detail design in attached patch. Febien COELHO wrote: >> From a probabilistic point of view, it seems to me that a randomized > (discretized) exponential would be more significant to model a server load. > > \setexp var min max lambda... I can create randomized exponential distribution under following. It is very easy. double rand_exp( double lambda ){ return -log(Uniform(0,1))/lambda; } If community wants this, I will add this function in my patch. Gavin Flower wrote: > Curious, wouldn't the common usage pattern tend to favour a skewed distribution, > such as the Poisson Distribution (it has been over 40 years since I studied > this area, so there may be better candidates). The difference between Poisson distribution and Gaussian distribution is discrete or not. In my gaussian algorithm, first generating continuos gaussian distribution, next projection to integer values which are each record, it will be discrete value. Therefore, it will be almost simular with Poisson distribution. And when we set larger standard deviations(higher 10), it will be created better approximation of Poisson distribution. Attached sql files are for custom scripts which are different distribution. It realize mixture distribuion benchmark. And attached graph is the result. [example command] $pgbench -f file1.sql file2.sql If you have more some comment, please send me. Regards, -- Mitsumasa KONDO NTT Open Source Software Center
Вложения
В списке pgsql-hackers по дате отправления: