[to_tsvector] German Compound Words

Поиск
Список
Период
Сортировка
От Sven R. Kunze
Тема [to_tsvector] German Compound Words
Дата
Msg-id 5566F66C.3070701@tbz-pariv.de
обсуждение исходный текст
Ответы Re: [to_tsvector] German Compound Words
Список pgsql-general
Hi everybody,

what do I need to do in order to enable compound word handling in
PostgreSQL tsvector implementation?

I run an Ubuntu 14.04 machine, PostgreSQL 9.3, have installed package
hunspell-de-de and already created a new dictionary as described here:
http://www.postgresql.org/docs/9.3/static/textsearch-dictionaries.html#TEXTSEARCH-ISPELL-DICTIONARY

CREATE TEXT SEARCH DICTIONARY german_hunspell (
     TEMPLATE = ispell,
     DictFile = de_de,
     AffFile = de_de,
     StopWords = german
);

Furthermore, created a new test text search configuration (copied from german) and updated all parser parts where the
german_stemdictionary is used so that it uses german_hunspell first and then german_stem. 

However, ts_vector still does not work for the compound words such as:

wasserkraft -> wasserkraft, kraft
schifffahrt -> schifffahrt, fahrt
blindflansch -> blindflansch, flansch

etc.


What have I done wrong here?

--
Sven R. Kunze
TBZ-PARIV GmbH, Bernsdorfer Str. 210-212, 09126 Chemnitz
Tel: +49 (0)371 33714721, Fax: +49 (0)371 5347920
e-mail: srkunze@tbz-pariv.de
web: www.tbz-pariv.de

Geschäftsführer: Dr. Reiner Wohlgemuth
Sitz der Gesellschaft: Chemnitz
Registergericht: Chemnitz HRB 8543



В списке pgsql-general по дате отправления:

Предыдущее
От: Chris Mair
Дата:
Сообщение: Re: 9.4.2 -> 9.4.3
Следующее
От: Robert Haas
Дата:
Сообщение: Re: [HACKERS] Re: 9.4.1 -> 9.4.2 problem: could not access status of transaction 1