Re: [PATCHES] a tsearch2 (8.2.4) dictionary that only filters out stopwords

Поиск
Список
Период
Сортировка
От Tom Lane
Тема Re: [PATCHES] a tsearch2 (8.2.4) dictionary that only filters out stopwords
Дата
Msg-id 25604.1195060645@sss.pgh.pa.us
обсуждение исходный текст
Ответ на Re: [PATCHES] a tsearch2 (8.2.4) dictionary that only filters out stopwords  (Oleg Bartunov <oleg@sai.msu.su>)
Ответы Re: [PATCHES] a tsearch2 (8.2.4) dictionary that only filters out stopwords  (Oleg Bartunov <oleg@sai.msu.su>)
Re: [PATCHES] a tsearch2 (8.2.4) dictionary that only filters out stopwords  (Bruce Momjian <bruce@momjian.us>)
Список pgsql-hackers
Oleg Bartunov <oleg@sai.msu.su> writes:
> Let's consider one example - removing accents.
> In the past I always recommend people to use regex functions before
> to_tsvector conversion to remove accents, but recently I was noticed that
> such trick doesn't work with headline(). So, the only way is to have
> special dictionary dict_remove_accent before, which  works as a filter.

> I don't remember why do we left this for future releases, though.

That would require a system-to-dictionary API change (to be able to
modify the token under inspection), no?  So it's certainly something
I'd say is too late for 8.3.

One thought that came to mind is that the option name should be just
"Accept" not "AcceptAll".  To me "All" implies that it would accept
*everything* ... including stopwords.

            regards, tom lane

В списке pgsql-hackers по дате отправления:

Предыдущее
От: Oleg Bartunov
Дата:
Сообщение: Re: [PATCHES] a tsearch2 (8.2.4) dictionary that only filters out stopwords
Следующее
От: Oleg Bartunov
Дата:
Сообщение: Re: [PATCHES] a tsearch2 (8.2.4) dictionary that only filters out stopwords