Обсуждение: group by

Поиск

Список

Период

Сортировка

group by

От

YourSoft

Дата:

13 мая 2006 г., 10:53:14

Dear Developers,

There is a possible bug in 'select ... group by' SQL.
I reported it on the bugreport form on the web. (I think it number is
2416?). But no any reaction to it.
It is not problem for me, I make my results in other way. But it is
possible problem for other pepoples.

I reproduced it in smaller database table in other way. (with 180000
records) (postgresql 8.03 on linux)
e.g.:
there is a table:
stat-# \d summary
            Table "public.summary"
  colum   |         Type          | MÃ³dosÃtÃ³
-----------+------------------------+----------
 kifejezes | character varying(300) | not null
 cnt       | integer                | not null
 talalat   | integer                |
Indexes:
    "idx_summary_cnt" btree (cnt) CLUSTER
    "idx_summary_kifejezes" btree (kifejezes text_pattern_ops)

1)
select kifejezes, count(kifejezes) from summary group by kifejezes
having count(kifejezes)>1;
the result is:
        kifejezes        | count
-------------------------+-------
 csÃºcscsajok             |     2
 jÃ¡szszentandrÃ¡s         |     3
 kullancscsÃpÃ©s          |     2
 magannyugdijpenztar     |     2
 magÃ¡nnyugdijpÃ©nztÃ¡r     |     2
 magÃ¡nnyugdÃjpÃ©nztÃ¡r     |     3
 magÃ¡nnyugdÃjpÃ©nztÃ¡rak   |     2
 mÅ±velÅdÃ©sszociolÃ³gia    |     2
 otp magÃ¡nnyugdÃjpÃ©nztÃ¡r |     2
(9 rows)

2)
select * from summary where kifejezes like 'jegygy%';
  kifejezes  | cnt | talalat
------------+-----+---------
 jegygyÃ¼rÅ±  |   4 |       0
 jegygyÅ±rÅ±  |   5 |       0
 jegygyÅ±rÅ±  |   7 |       0
 jegygyÅ±rÅ±  |  12 |       0
 jegygyÅ±rÅ±k |   3 |       0
(5 rows)


Why not is in the first query results the "jegygyÅ±rÅ±" (second query
rows )?

Re: group by

От

Tom Lane

Дата:

13 мая 2006 г., 14:52:14

YourSoft <yoursoft@freemail.hu> writes:
> 1)
> select kifejezes, count(kifejezes) from summary group by kifejezes
> having count(kifejezes)>1;
> the result is:
>         kifejezes        | count
> -------------------------+-------
>  csÃºcscsajok             |     2
>  jÃ¡szszentandrÃ¡s         |     3
>  kullancscsÃpÃ©s          |     2
>  magannyugdijpenztar     |     2
>  magÃ¡nnyugdijpÃ©nztÃ¡r     |     2
>  magÃ¡nnyugdÃjpÃ©nztÃ¡r     |     3
>  magÃ¡nnyugdÃjpÃ©nztÃ¡rak   |     2
>  mÅ±velÅdÃ©sszociolÃ³gia    |     2
>  otp magÃ¡nnyugdÃjpÃ©nztÃ¡r |     2
> (9 rows)

> 2)
> select * from summary where kifejezes like 'jegygy%';
>   kifejezes  | cnt | talalat
> ------------+-----+---------
>  jegygyÃ¼rÅ±  |   4 |       0
>  jegygyÅ±rÅ±  |   5 |       0
>  jegygyÅ±rÅ±  |   7 |       0
>  jegygyÅ±rÅ±  |  12 |       0
>  jegygyÅ±rÅ±k |   3 |       0
> (5 rows)

> Why not is in the first query results the "jegygyÅ±rÅ±" (second query
> rows )?

We've seen problems like this occur when you have mismatched locale and
encoding specifications --- that can confuse strcoll() to the point that
it gives inconsistent results, and since all PG character comparisons
depend on strcoll(), you get all sorts of bizarre behavior.  Check the
LC_COLLATE and LC_CTYPE settings of the database, and make sure that you
have selected a database encoding that matches them.

Also, if you're using Hungarian locale, you probably need to update to
PG 8.0.6 or later.  See bug fix list at
http://developer.postgresql.org/docs/postgres/release-8-0-6.html

            regards, tom lane

Re: group by

От

YourSoft

Дата:

14 мая 2006 г., 05:44:43

Dear Tom,

Thanks for suggestion. I upgrade my database to 8.1.3 and it is solve
the problem :-)

Regards,
   Ferenc

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Обсуждение: group by

group by

Re: group by

Re: group by