improved support for compounds in tsearch2

Поиск
Список
Период
Сортировка
От Oleg Bartunov
Тема improved support for compounds in tsearch2
Дата
Msg-id Pine.GSO.4.62.0501252013020.6363@ra.sai.msu.su
обсуждение исходный текст
Ответы Re: improved support for compounds in tsearch2  (Christopher Kings-Lynne <chriskl@familyhealth.com.au>)
Список pgsql-hackers
Hi there,

we just submitted to CVS several changes to tsearch2:

1. change struct {} WordEntryPos to typedef uint16,  for details see   http://www.pgsql.ru/db/mw/msg.html?mid=2035188
2. improved support for compound words
  "A compound is a word containing a stem that is made up of more than one root"   to_tsquery() now make use of roots
ifdictionary  (should support 'compoundwords' flag, check .aff    file) returns them for compound word. Example:
 


regression=# select to_tsquery( 'fotballklubber');                   to_tsquery
------------------------------------------------ 'fotball' & 'klubb' | 'fot' & 'ball' & 'klubb' 
(1 row)
 Bad thing is that API to tsearch2 dictionaries was changed ! See http://www.pgsql.ru/db/mw/msg.html?mid=2039406 for
detailsand  http://www.sai.msu.su/~megera/oddmuse/index.cgi/Tsearch_V2_compound_words for introduction about compounds
supportin tsearch2.
 

We're badly needed testers of compounds support (german, norway,... languages),
patch for V8.0 release is available 
http://www.sai.msu.su/~megera/postgres/gist/tsearch/V2/expand_query_8.0.patch.gz

    Regards,        Oleg
_____________________________________________________________
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83


В списке pgsql-hackers по дате отправления:

Предыдущее
От: Christopher Browne
Дата:
Сообщение: Re: Much Ado About COUNT(*)
Следующее
От: Nicolai Tufar
Дата:
Сообщение: Re: Goals for 8.1