Обсуждение: Execution plan Question
hi all,
I have a strange problem and really wish to get some help in here.
I have the following tables
create table shr_objects(objectid int8 not nullcompanyid int4 not nullname varchar(255)
description varchar(255)
)
primary key : object id
foreign key on companyid references shr_companies(companyid)
Index on companyid
Number of rows ~ 1,410,000
create table smb_contacts{contactid int4 not nullobjectid int8 not nullfirstname varchar(255)lastname
varchar(255)
)
primary key : contactid
foreign key on objectid references shr_objects9objectid)
index on : objectid
Number of rows ~ 10,000
I am trying to execute a query that joins the 2 tables on object id , it
works fine but when i add an order clause the performance is degarded
dramatically. I have included both quiries with their excution plan.
Regards,
--ObjectZ Maker
Objectz wrote:
> hi all,
>
> I have a strange problem and really wish to get some help in here.
>
> I have the following tables
>
> create table shr_objects(
> objectid int8 not null
> companyid int4 not null
> name varchar(255)
> description varchar(255)
> )
> primary key : object id
> foreign key on companyid references shr_companies(companyid)
> Index on companyid
> Number of rows ~ 1,410,000
>
> create table smb_contacts{
> contactid int4 not null
> objectid int8 not null
> firstname varchar(255)
> lastname varchar(255)
> )
> primary key : contactid
> foreign key on objectid references shr_objects9objectid)
> index on : objectid
> Number of rows ~ 10,000
>
> I am trying to execute a query that joins the 2 tables on object id , it
> works fine but when i add an order clause the performance is degarded
> dramatically. I have included both quiries with their excution plan.
>
> Regards,
> --ObjectZ Maker
Well, it looks like you didn't include neither queries nor there
execution plan. Don't forget to vacuum analyze before explain analyze
your queries.
Anyway, how many rows do you want to be sorted?
Regards,
Tomasz Myrta
Oops .. Here they are
========================================================================
=====
intranet=# explain analyze SELECT obj.companyid, obj.name,
obj.description, intranet-# cnt.firstname, cnt.lastname intranet-# FROM
smb_contacts cnt JOIN shr_objects obj ON cnt.objectid = obj.objectid
intranet-# order by obj.companyid intranet-# limit 90;
NOTICE: QUERY PLAN:
Limit (cost=44459.46..44459.46 rows=90 width=566) (actual
time=14426.92..14427.26 rows=90 loops=1) -> Sort (cost=44459.46..44459.46 rows=10101 width=566) (actual
time=14426.91..14427.05 rows=91 loops=1) -> Merge Join (cost=853.84..41938.61 rows=10101 width=566)
(actual time=123.25..14396.31 rows=10101 loops=1) -> Index Scan using shr_objects_pk on shr_objects obj
(cost=0.00..37386.55 rows=1418686 width=544) (actual time=6.19..11769.85
rows=1418686 loops=1) -> Sort (cost=853.84..853.84 rows=10101 width=22)
(actual time=117.02..134.60 rows=10101 loops=1) -> Seq Scan on smb_contacts cnt (cost=0.00..182.01
rows=10101 width=22) (actual time=0.03..27.14 rows=10101 loops=1) Total
runtime: 14435.77 msec
EXPLAIN
========================================================================
======
intranet=#
intranet=# explain analyze SELECT obj.companyid, obj.name,
obj.description, intranet-# cnt.firstname, cnt.lastname intranet-# FROM
smb_contacts cnt JOIN shr_objects obj ON cnt.objectid = obj.objectid
intranet-# limit 90;
NOTICE: QUERY PLAN:
Limit (cost=0.00..382.72 rows=90 width=566) (actual time=15.87..25.39
rows=90 loops=1) -> Merge Join (cost=0.00..42954.26 rows=10101 width=566) (actual
time=15.86..25.08 rows=91 loops=1) -> Index Scan using objectid_fk on smb_contacts cnt
(cost=0.00..1869.48 rows=10101 width=22) (actual time=15.76..16.32
rows=91 loops=1) -> Index Scan using shr_objects_pk on shr_objects obj
(cost=0.00..37386.55 rows=1418686 width=544) (actual time=0.09..7.81
rows=193 loops=1) Total runtime: 25.60 msec
EXPLAIN
========================================================================
======
It is obvious that in the order by query the company index is not used
and also it had to go thru all records in shr_objects.
Can someone please tell me how is this happening and how to fix it.
Objectz wrote:
> hi all,
>
> I have a strange problem and really wish to get some help in here.
>
> I have the following tables
>
> create table shr_objects(
> objectid int8 not null
> companyid int4 not null
> name varchar(255)
> description varchar(255)
> )
> primary key : object id
> foreign key on companyid references shr_companies(companyid)
> Index on companyid
> Number of rows ~ 1,410,000
>
> create table smb_contacts{
> contactid int4 not null
> objectid int8 not null
> firstname varchar(255)
> lastname varchar(255)
> )
> primary key : contactid
> foreign key on objectid references shr_objects9objectid) index on :
> objectid Number of rows ~ 10,000
>
> I am trying to execute a query that joins the 2 tables on object id ,
> it works fine but when i add an order clause the performance is
> degarded dramatically. I have included both quiries with their
> excution plan.
>
> Regards,
> --ObjectZ Maker
Well, it looks like you didn't include neither queries nor there
execution plan. Don't forget to vacuum analyze before explain analyze
your queries.
Anyway, how many rows do you want to be sorted?
Regards,
Tomasz Myrta
---------------------------(end of broadcast)---------------------------
TIP 5: Have you checked our extensive FAQ?
http://www.postgresql.org/docs/faqs/FAQ.html
Objectz wrote:> Oops .. Here they are>> ========================================================================> =====>>intranet=# explain analyze SELECT obj.companyid, obj.name,> obj.description, intranet-# cnt.firstname, cnt.lastnameintranet-# FROM> smb_contacts cnt JOIN shr_objects obj ON cnt.objectid = obj.objectid> intranet-# order by obj.companyidintranet-# limit 90;> NOTICE: QUERY PLAN:>> Limit (cost=44459.46..44459.46 rows=90 width=566) (actual> time=14426.92..14427.26rows=90 loops=1)> -> Sort (cost=44459.46..44459.46 rows=10101 width=566) (actual> time=14426.91..14427.05rows=91 loops=1)> -> Merge Join (cost=853.84..41938.61 rows=10101 width=566)> (actual time=123.25..14396.31rows=10101 loops=1)> -> Index Scan using shr_objects_pk on shr_objects obj> (cost=0.00..37386.55rows=1418686 width=544) (actual time=6.19..11769.85> rows=1418686 loops=1)> -> Sort (cost=853.84..853.84rows=10101 width=22)> (actual time=117.02..134.60 rows=10101 loops=1)> -> Seq Scanon smb_contacts cnt (cost=0.00..182.01> rows=10101 width=22) (actual time=0.03..27.14 rows=10101 loops=1) Total> runtime:14435.77 msec>> EXPLAIN> ========================================================================> ======> intranet=#>intranet=# explain analyze SELECT obj.companyid, obj.name,> obj.description, intranet-# cnt.firstname, cnt.lastnameintranet-# FROM> smb_contacts cnt JOIN shr_objects obj ON cnt.objectid = obj.objectid> intranet-# limit 90;>NOTICE: QUERY PLAN:>> Limit (cost=0.00..382.72 rows=90 width=566) (actual time=15.87..25.39> rows=90 loops=1)> -> Merge Join (cost=0.00..42954.26 rows=10101 width=566) (actual> time=15.86..25.08 rows=91 loops=1)> -> IndexScan using objectid_fk on smb_contacts cnt> (cost=0.00..1869.48 rows=10101 width=22) (actual time=15.76..16.32> rows=91loops=1)> -> Index Scan using shr_objects_pk on shr_objects obj> (cost=0.00..37386.55 rows=1418686 width=544)(actual time=0.09..7.81> rows=193 loops=1) Total runtime: 25.60 msec>> EXPLAIN> ========================================================================>======> It is obvious that in the order by querythe company index is not used> and also it had to go thru all records in shr_objects.> Can someone please tell me howis this happening and how to fix it. Well - it's not an "order by" problem, but combination of "order by and limit" Look at your execution plan without order by. Postgres thinks it has to result 10000 rows (cost 0.00..1869), but you have "limit 90" and it stops working after 90 rows. It doesn't have more than 200 rows to work. The case with order by is much more complicated. Postgres have to retrieve all 10000 rows , sort all of them and after all give you first 90 rows. In this case there are up to 1400000 rows to work. Try to rewrite your query to return less rows (for example 1000) before sorting/limiting them. Taking 90 of 1000000 rows will be allways a performance leak. Regards, Tomasz Myrta
i was able to fix the problem but i still dont know the reason. I
discovered the column objectid in table smb_contacts was int4 and not
int8. obviously all values were less then max(int4) dso there was no
problems in that. However I still need to know why the order by trigger
this to happen. I am joining between the 2 tables on the objectid which
of different datatypes in this case (int8 and int4) but without the
order by clause it was pretty fast.
Now after i changed the objectid cilumn in smb_contacts the performance
is as quick as before.
here is the execution plan after i ran the same query (with order
caluse) after changing datatypes
Limit (cost=52044.46..52044.46 rows=90 width=1970) (actual
time=395.81..396.16 rows=90 loops=1) -> Sort (cost=52044.46..52044.46 rows=10101 width=1970) (actual
time=395.80..395.96 rows=91 loops=1) -> Nested Loop (cost=0.00..30752.64 rows=10101 width=1970)
(actual time=0.11..360.99 rows=10104 loops=1) -> Seq Scan on smb_contacts cnt (cost=0.00..187.01
rows=10101 width=1398) (actual time=0.01..48.57 rows=10104 loops=1) -> Index Scan using shr_objects_pk on
shr_objectsobj
(cost=0.00..3.01 rows=1 width=572) (actual time=0.02..0.02 rows=1
loops=10104)
Total runtime: 438.96 msec
EXPLAIN
i need to know what happened in here
On Tue, 2003-03-11 at 07:09, Objectz wrote:
> Oops .. Here they are
>
> ========================================================================
> =====
>
> intranet=# explain analyze SELECT obj.companyid, obj.name,
> obj.description, intranet-# cnt.firstname, cnt.lastname intranet-# FROM
> smb_contacts cnt JOIN shr_objects obj ON cnt.objectid = obj.objectid
> intranet-# order by obj.companyid intranet-# limit 90;
> NOTICE: QUERY PLAN:
>
> Limit (cost=44459.46..44459.46 rows=90 width=566) (actual
> time=14426.92..14427.26 rows=90 loops=1)
> -> Sort (cost=44459.46..44459.46 rows=10101 width=566) (actual
> time=14426.91..14427.05 rows=91 loops=1)
> -> Merge Join (cost=853.84..41938.61 rows=10101 width=566)
> (actual time=123.25..14396.31 rows=10101 loops=1)
> -> Index Scan using shr_objects_pk on shr_objects obj
> (cost=0.00..37386.55 rows=1418686 width=544) (actual time=6.19..11769.85
> rows=1418686 loops=1)
> -> Sort (cost=853.84..853.84 rows=10101 width=22)
> (actual time=117.02..134.60 rows=10101 loops=1)
> -> Seq Scan on smb_contacts cnt (cost=0.00..182.01
> rows=10101 width=22) (actual time=0.03..27.14 rows=10101 loops=1) Total
> runtime: 14435.77 msec
>
> EXPLAIN
> ========================================================================
> ======
> intranet=#
> intranet=# explain analyze SELECT obj.companyid, obj.name,
> obj.description, intranet-# cnt.firstname, cnt.lastname intranet-# FROM
> smb_contacts cnt JOIN shr_objects obj ON cnt.objectid = obj.objectid
> intranet-# limit 90;
> NOTICE: QUERY PLAN:
>
> Limit (cost=0.00..382.72 rows=90 width=566) (actual time=15.87..25.39
> rows=90 loops=1)
> -> Merge Join (cost=0.00..42954.26 rows=10101 width=566) (actual
> time=15.86..25.08 rows=91 loops=1)
> -> Index Scan using objectid_fk on smb_contacts cnt
> (cost=0.00..1869.48 rows=10101 width=22) (actual time=15.76..16.32
> rows=91 loops=1)
> -> Index Scan using shr_objects_pk on shr_objects obj
> (cost=0.00..37386.55 rows=1418686 width=544) (actual time=0.09..7.81
> rows=193 loops=1) Total runtime: 25.60 msec
>
> EXPLAIN
> ========================================================================
> ======
> It is obvious that in the order by query the company index is not used
> and also it had to go thru all records in shr_objects.
> Can someone please tell me how is this happening and how to fix it.
>
> Objectz wrote:
> > hi all,
> >
> > I have a strange problem and really wish to get some help in here.
> >
> > I have the following tables
> >
> > create table shr_objects(
> > objectid int8 not null
> > companyid int4 not null
> > name varchar(255)
> > description varchar(255)
> > )
> > primary key : object id
> > foreign key on companyid references shr_companies(companyid)
> > Index on companyid
> > Number of rows ~ 1,410,000
> >
> > create table smb_contacts{
> > contactid int4 not null
> > objectid int8 not null
> > firstname varchar(255)
> > lastname varchar(255)
> > )
> > primary key : contactid
> > foreign key on objectid references shr_objects9objectid) index on :
> > objectid Number of rows ~ 10,000
> >
> > I am trying to execute a query that joins the 2 tables on object id ,
> > it works fine but when i add an order clause the performance is
> > degarded dramatically. I have included both quiries with their
> > excution plan.
> >
> > Regards,
> > --ObjectZ Maker
> Well, it looks like you didn't include neither queries nor there
> execution plan. Don't forget to vacuum analyze before explain analyze
> your queries.
>
> Anyway, how many rows do you want to be sorted?
>
> Regards,
> Tomasz Myrta
>
>
>
> ---------------------------(end of broadcast)---------------------------
> TIP 5: Have you checked our extensive FAQ?
>
> http://www.postgresql.org/docs/faqs/FAQ.html
>
>
> ---------------------------(end of broadcast)---------------------------
> TIP 5: Have you checked our extensive FAQ?
>
> http://www.postgresql.org/docs/faqs/FAQ.html