Re: Use of ISpell dictionaries with tsearch2 - what is the point?
От | Don Walker |
---|---|
Тема | Re: Use of ISpell dictionaries with tsearch2 - what is the point? |
Дата | |
Msg-id | 001901c66b00$90c35410$dbd849c6@donxp обсуждение исходный текст |
Ответ на | Use of ISpell dictionaries with tsearch2 - what is the point? ("Don Walker" <don.walker@versaterm.com>) |
Список | pgsql-general |
I realized shortly after I sent this email that I could use a synonym dictionary to solve problem #2. To construct it myself I'd have to determine the common misspellings and create synonyms for them. So I have two more questions: 2.1 Are there any canned synonym dictionaries available the deal with misspellings in English and/or French? 2.2 Are there any clever linguistic algorithms that can partly solve the same problem? -----Original Message----- From: pgsql-general-owner@postgresql.org [mailto:pgsql-general-owner@postgresql.org] On Behalf Of Don Walker Sent: April 28, 2006 15:11 To: pgsql-general@postgresql.org Subject: [GENERAL] Use of ISpell dictionaries with tsearch2 - what is the point? I'm new to using tsearch2 and am trying to understand why I would want to use an ISpell dictionary before the stemming dictionary. I'd originally hoped that ISpell would suggest corrections for misspelled words as the documents that I will be indexing will contain a lot of spelling mistakes. From what I now understand ISpell dictionaries only recognize properly spelled words. This means that any misspelled word will be handled by the stemming dictionary and usually just passed as is. 1. If I am correct about this then what is the point of using the ISpell dictionary in the first place? 2. Is there a solution for correcting spelling mistakes in the documents you index? I have seen the readme files for pg_trgm, http://www.sai.msu.su/~megera/postgres/gist/, which would allow me to suggest other terms for a query if the misspellings were common enough. I'd rather fix the problem at index time so that querying with the proper term would find any misspelled terms (within reason). ---------------------------(end of broadcast)--------------------------- TIP 6: explain analyze is your friend
В списке pgsql-general по дате отправления: