Re: Can pg_trgm handle non-alphanumeric characters?

From: Tom Lane
Subject: Re: Can pg_trgm handle non-alphanumeric characters?
Date:
Msg-id 22869.1336677117@sss.pgh.pa.us
In response to: Re: Can pg_trgm handle non-alphanumeric characters?  (Fujii Masao <masao.fujii@gmail.com>)
Responses: Re: Can pg_trgm handle non-alphanumeric characters?  (Fujii Masao <masao.fujii@gmail.com>)
List: pgsql-hackers
Fujii Masao <masao.fujii@gmail.com> writes:
> On Fri, May 11, 2012 at 12:07 AM, MauMau <maumau307@gmail.com> wrote:
>> Thanks for your explanation. Although I haven't understood it well yet, I'll
>> consider what you taught. And I'll consider if the tentative measure of
>> removing KEEPONLYALNUM is correct for someone who wants to use pg_trgm
>> against Japanese text.

> In Japanese, it's common to do a text search with two characters keyword.
> But since pg_trgm is 3-gram, you basically would not be able to use index
> for such text search. So you might need something like pg_bigm or pg_unigm
> for Japanese text search.

I believe the trigrams are three *bytes* not three characters.  So a
couple of kanji should work just fine for this.
        regards, tom lane
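
The distinction drawn above, between trigrams counted in bytes and trigrams counted in characters, can be illustrated with a minimal Python sketch. This is not pg_trgm's extraction code: it ignores word padding and any build or encoding settings, assumes UTF-8, and the function names `char_ngrams` and `byte_ngrams` are hypothetical. It only shows why a two-kanji keyword would yield no 3-grams if the unit is the character, but several if the unit is the byte.

```python
def char_ngrams(text, n=3):
    """Return all n-grams of `text`, counted in characters (code points)."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def byte_ngrams(text, n=3, encoding="utf-8"):
    """Return all n-grams of `text`, counted in bytes of its encoded form."""
    data = text.encode(encoding)
    return [data[i:i + n] for i in range(len(data) - n + 1)]


# A two-kanji keyword: 2 characters, but 6 bytes in UTF-8 (3 bytes each).
keyword = "東京"

print(len(char_ngrams(keyword)))  # 0: too short for a character 3-gram
print(len(byte_ngrams(keyword)))  # 4: a 6-byte string has 4 byte 3-grams
```

Which of the two units pg_trgm actually uses is exactly the point under discussion in this thread, so the sketch takes no position on it.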

