Re: tsvector pg_stats seems quite a bit off.

From: Tom Lane
Subject: Re: tsvector pg_stats seems quite a bit off.
Date:
Msg-id: 20653.1275151111@sss.pgh.pa.us
In reply to: Re: tsvector pg_stats seems quite a bit off.  (Jan Urbański <wulczer@wulczer.org>)
List: pgsql-hackers
Jan Urbański <wulczer@wulczer.org> writes:
> [ e of ] s/2 or s/3 look reasonable.

The examples in the LC paper seem to all use e = s/10.  Note the stated
assumption e << s.

> So, should I just write a patch that sets the bucket width and pruning
> count using 0.07 as the assumed frequency of the most common word and
> epsilon equal to s/2 or s/3?

I'd go with s = 0.07 / desired-MCE-count and e = s / 10, at least for
a first cut to experiment with.
        regards, tom lane
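
Editor's note: the arithmetic suggested above can be sketched as follows. This is only an illustration under stated assumptions, not the patch that was eventually written. The function name, its parameters, and the example MCE count of 100 are hypothetical. The bucket width of ceil(1/e) follows the usual description of the Lossy Counting algorithm, and it is assumed here that this is what "bucket width" refers to in the thread.

```python
import math


def lossy_counting_params(mce_count, top_freq=0.07, eps_divisor=10):
    """Derive Lossy Counting parameters from a desired MCE count.

    Hypothetical helper for illustration only:
      s = top_freq / mce_count   (support threshold, as suggested in the email)
      e = s / eps_divisor        (error bound, e << s; the email suggests s / 10)
      bucket_width = ceil(1 / e) (standard Lossy Counting bucket width; assumed
                                  to be the "bucket width" meant in the thread)
    """
    if mce_count <= 0:
        raise ValueError("mce_count must be positive")
    s = top_freq / mce_count
    e = s / eps_divisor
    bucket_width = math.ceil(1 / e)
    return s, e, bucket_width


# Example: 100 desired most-common elements (an arbitrary illustrative value).
s, e, bucket_width = lossy_counting_params(100)
print(f"s = {s:.6f}, e = {e:.7f}, bucket width = {bucket_width}")
```

With 100 desired elements this gives s = 0.0007 and e = 0.00007, so the bucket width comes out to 14286 lexemes. Using s/2 or s/3 for e instead would make the buckets several times narrower, at the cost of weaker error guarantees.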

