Re: Wich hardware suits best for large full-text indexed

From: Oleg Bartunov
Subject: Re: Wich hardware suits best for large full-text indexed
Date:
Msg-id: Pine.GSO.4.58.0404011401120.11543@ra.sai.msu.su
In reply to: Re: Wich hardware suits best for large full-text indexed  (Diogo Biazus <diogo@ikono.com.br>)
List: pgsql-general
On Wed, 31 Mar 2004, Diogo Biazus wrote:

> Oleg Bartunov wrote:
>
> >On Tue, 30 Mar 2004, Diogo Biazus wrote:
> >
> >
> >
> >>Hi folks,
> >>
> >>I have a database using tsearch2 to index 300 000 documents.
> >>I've already optimized the queries, and the database is vacuumed on
> >>a daily basis.
> >>The stat function tells me that my index has approx. 460 000 unique words
> >>(I'm using a stemmer and a nice stopword list).
> >>
> >>
> >
> >460 000 unique words is a lot! Have you looked at them? Sometimes it's
> >very useful to analyze what you have indexed and whether you want all of it.
> >I suggest you use an ispell dictionary and, if you index numbers
> >(look at the statistics), use the special dictionaries for integer and decimal numbers
> >http://www.sai.msu.su/~megera/postgres/gist/tsearch/V2/dicts/README.intdict
> >
> >
> I'll try the ispell dictionaries and dicts for numbers too ;)
> Could the synonym dictionary help me with this (reducing unique words)?

Why not? It is useful for words that are not stemmed correctly.
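
As a rough illustration of the idea only (this is plain Python, not tsearch2
itself, and the word list below is made up for the example): a synonym table
maps variant spellings onto one canonical form before indexing, so several
distinct words collapse into a single index entry.

```python
# Hypothetical synonym table: variant spelling -> canonical form.
# In a real setup these pairs would come from your own analysis of
# the words the stemmer handles badly.
SYNONYMS = {
    "postgres": "postgresql",
    "pgsql": "postgresql",
    "colour": "color",
}


def normalize(tokens, synonyms=SYNONYMS):
    """Lower-case each token and replace it with its canonical form, if listed."""
    return [synonyms.get(t.lower(), t.lower()) for t in tokens]


def unique_words(tokens):
    """Count distinct words in an iterable of tokens."""
    return len(set(tokens))


tokens = "Postgres pgsql PostgreSQL colour color index".split()

# Distinct words with lower-casing only, then with synonym mapping applied.
before = unique_words(t.lower() for t in tokens)
after = unique_words(normalize(tokens))

print(before, after)
```

How much this helps depends on how many of the 460 000 words are really
variants of each other, which is why it is worth looking at the statistics
first.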

>
> thanks,
>
>

    Regards,
        Oleg
_____________________________________________________________
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83
