processing urls with tsearch2

Поиск
Список
Период
Сортировка
От Laimonas Simutis
Тема processing urls with tsearch2
Дата
Msg-id 2b3e22740709131135o63e2d281k2efe27ebaf20a715@mail.gmail.com
обсуждение исходный текст
Ответы Re: processing urls with tsearch2  (Oleg Bartunov <oleg@sai.msu.su>)
Список pgsql-general
Hey guys,

maybe anyone using tsearch2 could advise on this. With the default installation, url, host and some other tokens are processed with the simple dictionary. Thus term like mywebsite.com gets stored as 'mywebsite.com'. The parser correctly assigns token id of type host to the term, but then the dictionary the terms gets routed through is simple and what gets stored is mywebsite.com

The questions are:

1) is there a dictionary available that I could utilize that will remove .com, .net, .org, etc? I could write one myself, but after seeing some sample dictionary implementations and C code I try to avoid, I got scared a bit.

2) has anyone else dealt with this maybe in a different way?


Thanks for any suggestions and help,

Laimis

В списке pgsql-general по дате отправления:

Предыдущее
От: Marco Colombo
Дата:
Сообщение: Re: Cannot declare record members NOT NULL
Следующее
От: Jeff Davis
Дата:
Сообщение: pg_standby observation