Re: Fuzzy matching

Поиск
Список
Период
Сортировка
От Joe Conway
Тема Re: Fuzzy matching
Дата
Msg-id 025b01c11c57$00280600$48d210ac@jecw2k1
обсуждение исходный текст
Ответы Re: Re: Fuzzy matching
Re: Re: Fuzzy matching
Re: Re: Fuzzy matching
Список pgsql-patches
> > Our usual practice with stuff of uncertain usefulness has been to
> > stick
> > it in contrib for awhile and see if anyone uses it.  If there's
> > sufficient interest, we'll promote it to mainstream in a future
> > release.
>
> Makes sense to me.  Go, Joe!
>

Per this discussion, here's a patch to implement both levenshtein() and
metaphone() in a contrib. There seem to be a fair number of different
approaches to both of these algorithms. I used the simplest case for
levenshtein which has a cost  of 1 for any character insertion, deletion, or
substitution. For metaphone, I adapted the same code from CPAN that the PHP
folks did.

A couple of questions:
1. Does it make sense to fold the soundex contrib together with this one?

2. I was debating trying to add multibyte support to levenshtein (it would
make no sense at all for metaphone), but a quick search through the contrib
directory found no hits on the word MULTIBYTE. Should worry about adding
multibyte support to levenshtein()?

Thanks,

Joe


Вложения

В списке pgsql-patches по дате отправления:

Предыдущее
От: Bruce Momjian
Дата:
Сообщение: Re: Current cvs does not compile jdbc1 driver
Следующее
От: Tom Lane
Дата:
Сообщение: Re: Re: Fuzzy matching