Re: BUG #13964: unexpected result from to_tsvector

From: Artur Zakirov
Subject: Re: BUG #13964: unexpected result from to_tsvector
Msg-id: 56C5A2B7.60006@postgrespro.ru
In response to: BUG #13964: unexpected result from to_tsvector  (ruxandra.durus@vauban.ro)
Responses: Re: BUG #13964: unexpected result from to_tsvector
           Re: BUG #13964: unexpected result from to_tsvector
List: pgsql-bugs

On 17.02.2016 11:00, ruxandra.durus@vauban.ro wrote:
>
>    My version of PostgreSQL is:
> "PostgreSQL 9.5beta1 on x86_64-pc-linux-gnu, compiled by gcc (GCC) 4.4.7
> 20120313 (Red Hat 4.4.7-16), 64-bit"
>
> More details about the operating system:
> Linux javatesting 2.6.32-573.7.1.el6.x86_64 #1 SMP Tue Sep 22 22:00:00 UTC
> 2015 x86_64 x86_64 x86_64 GNU/Linux
>
>     I am using pgAdmin version 1.20.0 to query the database.
>
>     I am using your full text search (which works great), but i have a small
> problem:
> SELECT to_tsvector('simple', 'test@vauban-reg.ro');
>
> returns "'test@vauban-reg.ro':1"
>
> which is exactly what I need.
>
>
> But when I run :
>
> SELECT to_tsvector('simple', 'test@123-reg.ro');
>
> I get:
> "'123':2 'reg.ro':3 'test':1"
>
> instead of "'test@123-reg.ro':1"
>
> From the documentation here
> http://www.postgresql.org/docs/current/static/pgtrgm.html , point F.30.4. I
> understood that with "simple" option only space is a separator for the
> stems. Is it a bug or am I doing something wrong?
>
> Thank you for your time,
> Ruxandra Durus
>

Hi,

This looks like a text search parser issue. ts_debug shows how each address is tokenized:

=> SELECT * FROM ts_debug('simple', 'test@vauban-reg.ro');
 alias |  description  |       token        | dictionaries | dictionary |       lexemes
-------+---------------+--------------------+--------------+------------+----------------------
 email | Email address | test@vauban-reg.ro | {simple}     | simple     | {test@vauban-reg.ro}
(1 row)

=> SELECT * FROM ts_debug('simple', 'test@123-reg.ro');
   alias   |   description    | token  | dictionaries | dictionary | lexemes
-----------+------------------+--------+--------------+------------+----------
 asciiword | Word, all ASCII  | test   | {simple}     | simple     | {test}
 blank     | Space symbols    | @      | {}           |            |
 uint      | Unsigned integer | 123    | {simple}     | simple     | {123}
 blank     | Space symbols    | -      | {}           |            |
 host      | Host             | reg.ro | {simple}     | simple     | {reg.ro}
(5 rows)


The attached patch fixes it. Is this a bug? Should I add an entry to
the commitfest?

The patch also lets the parser handle the email addresses
'123@123-reg.ro' and 'test@123_reg.ro' correctly.
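
To compare how the parser splits all of the addresses mentioned in this thread, something like the query below might be useful. It is only a sketch: it assumes the built-in 'default' parser (the one the 'simple' configuration uses), and the names a, p, tt and addr are arbitrary aliases. Row order is not guaranteed without an ORDER BY.

```sql
-- Tokenize each address with the 'default' parser and show the
-- token type alias the parser assigns to every token.
SELECT a.addr, tt.alias, p.token
FROM (VALUES ('test@vauban-reg.ro'),
             ('test@123-reg.ro'),
             ('123@123-reg.ro'),
             ('test@123_reg.ro')) AS a(addr)
CROSS JOIN LATERAL ts_parse('default', a.addr) AS p
JOIN ts_token_type('default') AS tt ON tt.tokid = p.tokid;
```

An address that the parser recognizes as a whole should come back as a single row with alias 'email'; one that it splits will produce several rows, as in the second ts_debug output above.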

--
Artur Zakirov
Postgres Professional: http://www.postgrespro.com
Russian Postgres Company
