Re: [HACKERS] multivariate statistics (v19)

Поиск
Список
Период
Сортировка
От Tomas Vondra
Тема Re: [HACKERS] multivariate statistics (v19)
Дата
Msg-id fcd7e80f-6c5c-5ae4-8ed6-1496b364e673@2ndquadrant.com
обсуждение исходный текст
Ответ на Re: [HACKERS] multivariate statistics (v19)  (Dean Rasheed <dean.a.rasheed@gmail.com>)
Ответы Re: [HACKERS] multivariate statistics (v19)  (Dean Rasheed <dean.a.rasheed@gmail.com>)
[HACKERS] Multivariate statistics and expression indexes  (Bruce Momjian <bruce@momjian.us>)
Список pgsql-hackers
On 02/08/2017 07:40 PM, Dean Rasheed wrote:
> On 8 February 2017 at 16:09, David Fetter <david@fetter.org> wrote:
>> Combinations are n!/(k! * (n-k)!), so computing those is more
>> along the lines of:
>>
>> unsigned long long
>> choose(unsigned long long n, unsigned long long k) {
>>     if (k > n) {
>>         return 0;
>>     }
>>     unsigned long long r = 1;
>>     for (unsigned long long d = 1; d <= k; ++d) {
>>         r *= n--;
>>         r /= d;
>>     }
>>     return r;
>> }
>>
>> which greatly reduces the chance of overflow.
>>
>
> Hmm, but that doesn't actually prevent overflows, since it can
> overflow in the multiplication step, and there is no protection
> against that.
>
> In the algorithm I presented, the inputs and the intermediate result
> are kept below INT_MAX, so the multiplication step cannot overflow the
> 64-bit integer, and it will only raise an overflow error if the actual
> result won't fit in a 32-bit int. Actually a crucial part of that,
> which I failed to mention previously, is the first step replacing k
> with min(k, n-k). This is necessary for inputs like (100,99), which
> should return 100, and which must be computed as 100 choose 1, not 100
> choose 99, otherwise it will overflow internally before getting to the
> final result.
>

Thanks for the feedback, I'll fix this. I've allowed myself to be a bit 
sloppy because the number of attributes in the statistics is currently 
limited to 8, so the overflows are currently not an issue. But it 
doesn't hurt to make it future-proof, in case we change that mostly 
artificial limit sometime in the future.

regards

-- 
Tomas Vondra                  http://www.2ndQuadrant.com
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services



В списке pgsql-hackers по дате отправления:

Предыдущее
От: Tomas Vondra
Дата:
Сообщение: Re: [HACKERS] PATCH: two slab-like memory allocators
Следующее
От: Andres Freund
Дата:
Сообщение: Re: [HACKERS] PATCH: two slab-like memory allocators