Re: tsearch2: stop words and stemming separate?

Поиск
Список
Период
Сортировка
От Oleg Bartunov
Тема Re: tsearch2: stop words and stemming separate?
Дата
Msg-id Pine.LNX.4.64.0801270824470.26876@sn.sai.msu.ru
обсуждение исходный текст
Ответ на tsearch2: stop words and stemming separate?  (Sushant Sinha <sushant354@gmail.com>)
Список pgsql-general
On Sat, 26 Jan 2008, Sushant Sinha wrote:

> I want to remove stop words but do not want to stem the words.  Is there
> an interface in tsearch2 that allows me to do this?
>
> Basically I am trying to implement spelling corrections and do not want
> to correct stop words.

Create custom dictionary using simple (or just add stop words to simple)
and use it before english stemmer, which has NO stop words !

=# insert into pg_ts_dict
               (SELECT 'remove_stopwords', dict_init,
                        'contrib/english.stop',
                        dict_lexize,
                        'simple dictionary with stop words'
                 FROM pg_ts_dict
                 WHERE dict_name = 'simple');

insert into pg_ts_dict
               (SELECT 'en_stem_no_stopwords', dict_init,
                        '',
                        dict_lexize,
                        'english stemmer without stop words'
                 FROM pg_ts_dict
                 WHERE dict_name = 'en_stem');


>
> Thanks,
> -Sushant.
>
>
> ---------------------------(end of broadcast)---------------------------
> TIP 2: Don't 'kill -9' the postmaster
>

     Regards,
         Oleg
_____________________________________________________________
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
Sternberg Astronomical Institute, Moscow University, Russia
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83

В списке pgsql-general по дате отправления:

Предыдущее
От: Sushant Sinha
Дата:
Сообщение: tsearch2: stop words and stemming separate?
Следующее
От: mljv@planwerk6.de
Дата:
Сообщение: Very long execution time of "select nextval('..');"