[to_tsvector] German Compound Words
| From | Sven R. Kunze |
|---|---|
| Subject | [to_tsvector] German Compound Words |
| Date | |
| Msg-id | 5566F66C.3070701@tbz-pariv.de |
| Responses | Re: [to_tsvector] German Compound Words |
| List | pgsql-general |
Hi everybody,

what do I need to do in order to enable compound word handling in PostgreSQL's tsvector implementation?

I run an Ubuntu 14.04 machine with PostgreSQL 9.3, have installed the package hunspell-de-de, and have already created a new dictionary as described here:
http://www.postgresql.org/docs/9.3/static/textsearch-dictionaries.html#TEXTSEARCH-ISPELL-DICTIONARY

    CREATE TEXT SEARCH DICTIONARY german_hunspell (
        TEMPLATE = ispell,
        DictFile = de_de,
        AffFile = de_de,
        StopWords = german
    );

Furthermore, I created a new test text search configuration (copied from german) and updated all parser parts where the german_stem dictionary is used so that they use german_hunspell first and then german_stem.

However, to_tsvector still does not work for compound words such as:

    wasserkraft -> wasserkraft, kraft
    schifffahrt -> schifffahrt, fahrt
    blindflansch -> blindflansch, flansch
    etc.

What have I done wrong here?

--
Sven R. Kunze
TBZ-PARIV GmbH, Bernsdorfer Str. 210-212, 09126 Chemnitz
Tel: +49 (0)371 33714721, Fax: +49 (0)371 5347920
e-mail: srkunze@tbz-pariv.de
web: www.tbz-pariv.de
Managing director: Dr. Reiner Wohlgemuth
Registered office: Chemnitz
Register court: Chemnitz HRB 8543
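
For reference, the configuration steps described above might look roughly like the following. This is only a sketch under assumptions: the configuration name german_compound is a placeholder (the post does not say what the test configuration was called), and the list of token types assumes the usual word-like types that the stock german configuration maps to german_stem. The two queries at the end are one way to check whether the dictionary itself, rather than the configuration, is failing to split compounds.

```sql
-- Placeholder name: the post does not state what the test configuration is called.
CREATE TEXT SEARCH CONFIGURATION german_compound (COPY = german);

-- Send the word-like token types through german_hunspell first,
-- falling back to german_stem when the ispell dictionary does not know the word.
ALTER TEXT SEARCH CONFIGURATION german_compound
    ALTER MAPPING FOR asciiword, asciihword, hword_asciipart,
                      word, hword, hword_part
    WITH german_hunspell, german_stem;

-- Ask the dictionary directly, bypassing the parser and the configuration.
SELECT ts_lexize('german_hunspell', 'wasserkraft');

-- Run the full pipeline on the words from the post.
SELECT to_tsvector('german_compound', 'wasserkraft schifffahrt blindflansch');

-- Show, per token, which dictionary recognized it and which lexemes it produced.
SELECT alias, token, dictionary, lexemes
FROM ts_debug('german_compound', 'wasserkraft');
```

If ts_lexize already returns only the unsplit word, the configuration mapping is not the part to look at; the dictionary and affix files would be.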