Re: Tsearch2 Dutch snowball stemmer in PG8.1

Поиск
Список
Период
Сортировка
От Alban Hertroys
Тема Re: Tsearch2 Dutch snowball stemmer in PG8.1
Дата
Msg-id 4703A1C5.4020601@magproductions.nl
обсуждение исходный текст
Ответ на Re: Tsearch2 Dutch snowball stemmer in PG8.1  (Alban Hertroys <a.hertroys@magproductions.nl>)
Ответы Re: Tsearch2 Dutch snowball stemmer in PG8.1
Список pgsql-general
Alban Hertroys wrote:
> The only odd thing is that to_tsvector('dutch', 'some dutch text') now
> returns '|' for stop words...
>
> For example:
>  select to_tsvector('nederlands', 'De beste stuurlui staan aan wal');
>                   to_tsvector
> ------------------------------------------------
>  '|':1,5 'bes':2 'wal':6 'staan':4 'stuurlui':3

I found the cause. The stop words list I found contained comments
prefixed by '|' signs. Removing the contents and recreating the database
solved the problem. Just updating the reference didn't seem to help...

There's undoubtedly some cleaner way to replace the stop words list, but
at the current stage of our project this was the simplest to achieve.

--
Alban Hertroys
a.hertroys@magproductions.nl

magproductions b.v.

T: ++31(0)534346874
F: ++31(0)534346876
M:
I: www.magproductions.nl
A: Postbus 416
   7500 AK Enschede

// Integrate Your World //

В списке pgsql-general по дате отправления:

Предыдущее
От: Alban Hertroys
Дата:
Сообщение: Re: Tsearch2 Dutch snowball stemmer in PG8.1
Следующее
От: Tom Lane
Дата:
Сообщение: Re: PITR Recovery and out-of-sync indexes