Обсуждение: problem with volatile functions in subselects ?
Hello Hackers, I see the very strange behaviour with the following set of queries: wsdb=# select na,nb, na::double precision as da, nb::double precision as db from ( select random()::numeric as na,random()::numericas nb from generate_series(1,2)) as xx; na | nb | da | db -------------------+-------------------+-------------------+------------------- 0.756045001445359 | 0.505602368389071 | 0.283893094995941| 0.160685719065687 0.792114335015469 | 0.416411793053342 | 0.342387438445532 | 0.531201674850286 (2 rows) On my understanding that should produce the "na" column equal to "da" ? When I do the same with the select from the table the result is similar: wsdb=# select na,nb, na::double precision as da, nb::double precision as db from ( select random()::numeric as na,random()::numericas nb from pg_proc) as xx; na | nb | da | db -----------------------+-----------------------+----------------------+---------------------- 0.125243402610181 | 0.620239329347498 | 0.64666960465101 | 0.257827353318141 0.934299875951512 | 0.0322264223509591 | 0.96565025298188| 0.0439542480949099 ........ But when I limit the select, I get the expected result. wsdb=# select na,nb, na::double precision as da, nb::double precision as db from ( select random()::numeric as na,random()::numericas nb from pg_proc limit 2) as xx; na | nb | da | db -------------------+-------------------+-------------------+------------------- 0.543030349324937 | 0.925069289712733 | 0.543030349324937| 0.925069289712733 0.934251406665077 | 0.292522935332974 | 0.934251406665077 | 0.292522935332974 (2 rows) Is that a bug, or I'm missing something ? PG version is 8.1.4 or 8.2dev. Regards, Sergey ******************************************************************* Sergey E. Koposov Max Planck Institute for Astronomy/Sternberg Astronomical Institute Tel: +49-6221-528-349 Web: http://lnfm1.sai.msu.ru/~math E-mail: math@sai.msu.ru
"Sergey E. Koposov" <math@sai.msu.ru> writes: > I see the very strange behaviour with the following set of queries: > wsdb=# select na,nb, na::double precision as da, nb::double precision as db from ( select random()::numeric as na,random()::numericas nb from generate_series(1,2)) as xx; The planner "flattens" this query to a single level of SELECT, so what you effectively have is select random()::numeric as na, random()::numeric as nb, random()::numeric::double precision as da, random()::numeric::doubleprecision as db from generate_series(1,2) as xx; There's been some talk about prohibiting flattening if there are any volatile functions in the subselect's targetlist, but nothing's been done about that. Disabling flattening is a sufficiently big hit to the planner's optimization ability that it shouldn't be done lightly. regards, tom lane
On 7/30/06, Tom Lane <tgl@sss.pgh.pa.us> wrote: > "Sergey E. Koposov" <math@sai.msu.ru> writes: > > I see the very strange behaviour with the following set of queries: > > > wsdb=# select na,nb, na::double precision as da, nb::double precision as db from ( select random()::numeric as na,random()::numericas nb from generate_series(1,2)) as xx; > > The planner "flattens" this query to a single level of SELECT, so what > you effectively have is > > select random()::numeric as na, > random()::numeric as nb, > random()::numeric::double precision as da, > random()::numeric::double precision as db > from generate_series(1,2) as xx; > > There's been some talk about prohibiting flattening if there are any > volatile functions in the subselect's targetlist, but nothing's been > done about that. Disabling flattening is a sufficiently big hit to > the planner's optimization ability that it shouldn't be done lightly. > > regards, tom lane > http://archives.postgresql.org/pgsql-hackers/2005-10/msg00396.php maybe a guc? -- regards, Jaime Casanova "Programming today is a race between software engineers striving to build bigger and better idiot-proof programs and the universe trying to produce bigger and better idiots. So far, the universe is winning." Richard Cook
> > There's been some talk about prohibiting flattening if there are any > > volatile functions in the subselect's targetlist, but nothing's been > > done about that. Disabling flattening is a sufficiently big hit to > > the planner's optimization ability that it shouldn't be done lightly. > > > > regards, tom lane > > > > http://archives.postgresql.org/pgsql-hackers/2005-10/msg00396.php > > maybe a guc? > is this worth trying? maybe it's fair to say is to late for 8.2 but maybe it can be qeued for 8.3... i think i can do this in 1 or 2 days... BTW, can you think in a good name for a GUC for this? -- regards, Jaime Casanova "Programming today is a race between software engineers striving to build bigger and better idiot-proof programs and the universe trying to produce bigger and better idiots. So far, the universe is winning." Richard Cook
"Jaime Casanova" <systemguards@gmail.com> writes: >>> There's been some talk about prohibiting flattening if there are any >>> volatile functions in the subselect's targetlist, but nothing's been >>> done about that. > BTW, can you think in a good name for a GUC for this? I'm not in favor of a GUC for this; we should either do it or not. If we do it, basically the response to anyone who complains about loss of performance should be "fix your function to be marked stable or immutable, as appropriate". regards, tom lane
On 8/13/06, Tom Lane <tgl@sss.pgh.pa.us> wrote: > "Jaime Casanova" <systemguards@gmail.com> writes: > >>> There's been some talk about prohibiting flattening if there are any > >>> volatile functions in the subselect's targetlist, but nothing's been > >>> done about that. > > > BTW, can you think in a good name for a GUC for this? > > I'm not in favor of a GUC for this; we should either do it or not. > me neither, the idea came because seems there wasn't enough consensus... my opinion always was we have to return right results and then think on performance... if someone cares, this is the patch i use for avoiding pulling up of subqueries containing volatile functions (at least it has worked for me :)... -- regards, Jaime Casanova "Programming today is a race between software engineers striving to build bigger and better idiot-proof programs and the universe trying to produce bigger and better idiots. So far, the universe is winning." Richard Cook
Вложения
Based on how small this patch is and the demonstrated desire for this behavior, can we consider putting this in 8.2, even though we're past the deadline? On Sun, Aug 13, 2006 at 10:03:30PM -0500, Jaime Casanova wrote: > On 8/13/06, Tom Lane <tgl@sss.pgh.pa.us> wrote: > >"Jaime Casanova" <systemguards@gmail.com> writes: > >>>> There's been some talk about prohibiting flattening if there are any > >>>> volatile functions in the subselect's targetlist, but nothing's been > >>>> done about that. > > > >> BTW, can you think in a good name for a GUC for this? > > > >I'm not in favor of a GUC for this; we should either do it or not. > > > > me neither, the idea came because seems there wasn't enough > consensus... my opinion always was we have to return right results and > then think on performance... > > if someone cares, this is the patch i use for avoiding pulling up of > subqueries containing volatile functions (at least it has worked for > me :)... > > -- > regards, > Jaime Casanova > > "Programming today is a race between software engineers striving to > build bigger and better idiot-proof programs and the universe trying > to produce bigger and better idiots. > So far, the universe is winning." > Richard Cook > Index: src/backend/optimizer/path/allpaths.c > =================================================================== > RCS file: /projects/cvsroot/pgsql/src/backend/optimizer/path/allpaths.c,v > retrieving revision 1.151 > diff -c -p -r1.151 allpaths.c > *** src/backend/optimizer/path/allpaths.c 10 Aug 2006 02:36:28 -0000 1.151 > --- src/backend/optimizer/path/allpaths.c 14 Aug 2006 02:19:08 -0000 > *************** make_one_rel_by_joins(PlannerInfo *root, > *** 737,742 **** > --- 737,745 ---- > * component queries to see if any of them have different output types; > * differentTypes[k] is set true if column k has different type in any > * component. > + * > + * 4. If the subquery has any volatile functions may not be safe to push > + * down any quals. > */ > static bool > subquery_is_pushdown_safe(Query *subquery, Query *topquery, > *************** subquery_is_pushdown_safe(Query *subquer > *** 769,774 **** > --- 772,782 ---- > topop->colTypes, > differentTypes); > } > + > + /* Check point 4 */ > + if (contain_volatile_functions(subquery->targetList)) > + return false; > + > return true; > } > > Index: src/backend/optimizer/prep/prepjointree.c > =================================================================== > RCS file: /projects/cvsroot/pgsql/src/backend/optimizer/prep/prepjointree.c,v > retrieving revision 1.42 > diff -c -p -r1.42 prepjointree.c > *** src/backend/optimizer/prep/prepjointree.c 12 Aug 2006 20:05:55 -0000 1.42 > --- src/backend/optimizer/prep/prepjointree.c 14 Aug 2006 02:19:11 -0000 > *************** is_simple_subquery(Query *subquery) > *** 657,662 **** > --- 657,669 ---- > return false; > > /* > + * Don't pull up a subquery that has any volatile functions in its > + * targetlist. > + */ > + if (contain_volatile_functions((Node *) subquery->targetList)) > + return false; > + > + /* > * Hack: don't try to pull up a subquery with an empty jointree. > * query_planner() will correctly generate a Result plan for a jointree > * that's totally empty, but I don't think the right things happen if an > Index: src/include/optimizer/prep.h > =================================================================== > RCS file: /projects/cvsroot/pgsql/src/include/optimizer/prep.h,v > retrieving revision 1.56 > diff -c -p -r1.56 prep.h > *** src/include/optimizer/prep.h 5 Mar 2006 15:58:57 -0000 1.56 > --- src/include/optimizer/prep.h 14 Aug 2006 02:19:17 -0000 > *************** > *** 17,23 **** > #include "nodes/plannodes.h" > #include "nodes/relation.h" > > - > /* > * prototypes for prepjointree.c > */ > --- 17,22 ---- > > ---------------------------(end of broadcast)--------------------------- > TIP 5: don't forget to increase your free space map settings -- Jim C. Nasby, Sr. Engineering Consultant jnasby@pervasive.com Pervasive Software http://pervasive.com work: 512-231-6117 vcard: http://jim.nasby.net/pervasive.vcf cell: 512-569-9461
Tom Lane wrote: > "Jaime Casanova" <systemguards@gmail.com> writes: > >>> There's been some talk about prohibiting flattening if there are any > >>> volatile functions in the subselect's targetlist, but nothing's been > >>> done about that. > > > BTW, can you think in a good name for a GUC for this? > > I'm not in favor of a GUC for this; we should either do it or not. > > If we do it, basically the response to anyone who complains about loss > of performance should be "fix your function to be marked stable or > immutable, as appropriate". Agreed. Are we doing this, or is it a TODO? -- Bruce Momjian bruce@momjian.us EnterpriseDB http://www.enterprisedb.com + If your life is a hard drive, Christ can be your backup. +
Bruce Momjian <bruce@momjian.us> writes: >> If we do it, basically the response to anyone who complains about loss >> of performance should be "fix your function to be marked stable or >> immutable, as appropriate". > Agreed. Are we doing this, or is it a TODO? It's done: http://archives.postgresql.org/pgsql-committers/2006-08/msg00358.php regards, tom lane