Обсуждение: Named / preparsed / preplaned(prepared) queries - Feature proposal

Поиск
Список
Период
Сортировка

Named / preparsed / preplaned(prepared) queries - Feature proposal

От
NTPT
Дата:
Hi all

I have an idea how to extend postgresql feature set.


I sugest named / preparsed / preplamned query  mechanizm where named query is a simpliest form and preplaned query is
themost sofisticated (implementation wise). Ie to have ability to  assign a global and may be persistant NAME (or
alias)for some query for some user. Similar how PREPARE related stuff does, but AFAIK, prepared statement are limited
forthe current user on current connection only.  

with syntax like this:

NAMEDQUERY "select foo from bar where blah=$1" AS "myquery" [PREPARSED | PREPLANED  REPLAN EVERY condition]
[PERSISTENT][FINAL] [FOR USER user1,user2] 


And then  modify user privileges to add privileges to create a NAMEDQUERY, to create it for others -  typicaly  useful
fordatabase administrator and the  privilege of executing NAMED QUERY ONLY for some user - for example guest. 


PERSISTANT tell the server that this named query should survive server restart.

FINAL: if Administrator set a named query as FINAL, no other user is able to override it.

PREPARSED: Rationale behind PREPARSED queries is that in most scenarios on web applications there is a simple not
complicatedselect of data often in heap or memory or in the in memory index where actually fetching that data is faster
thenparsing a query string. So I thing a global cache of preparsed  NAMEDQUERY statements is a good idea and on some
loadscan boost performance significantly. 

PREPLANED: same as the preparsed, but the query plan is cached too. So in frequent simple queries (like common in
ajax/cometapplications where it ofen do some simple select to query a status of session etc), I thing it can remove
significantburden (parsing and planning a query) from CPU . 

yes, I know that a query plan that is superior  now, could be inferior few minutes later. So I suggest a  REPLAN EVERY
condition,where condition can be whatever user like - time interval,   changed N percent of tuples etc etc - Let user
decidewhat mechanizm is best for him.   


This feature can lead to two interested things. First, there is a decoupuling a database from application level and
keepthis two realm relatively separate, if underlying db structure is changed, just redefine named queries and do not
touchapplication. It create some sort of easy to use abstraction layer. It is a bit "objectish" approach, because you
cancreate a selfcontained database with data and method for its manipulation (if named query is  set as PERSISTENT,
should dump/restore and replication solutions do their job on persistant named queries as well)   


Second thing is a powerful, simple, yet easy to use security feature with as much granularity as is possible. For
exampleuser guest, who have set a NAMED QUERY ONLY user privilege, will be limited only to queries that admin of the
databasedefined for him, no matter what.I thing that it could be easy to use privilege sepration mechanizm.  

I use postgresql  in my projects since 6.x branch and I am happy with it, but proposed feature is on my Postgresql
wishlist. 
Any comments wellcome.

PS: Execuse my wrong english.

Re: Named / preparsed / preplaned(prepared) queries - Feature proposal

От
John R Pierce
Дата:
On 11/06/11 6:07 AM, NTPT wrote:
> I use postgresql  in my projects since 6.x branch and I am happy with it, but proposed feature is on my Postgresql
wishlist.
> Any comments wellcome.

is this based on any existing feature in the SQL specification, or in
other popular database engines?


--
john r pierce                            N 37, W 122
santa cruz ca                         mid-left coast


Re: Re: [GENERAL] Named / preparsed / preplaned(prepared) queries - Feature proposal

От
NTPT
Дата:
I do not know.  Should it be ?

  I think that features are proposed  because user(s) want  it, need it, because it can be useful for other peoples
too,because with this can people solve their problems and/or improve performance and security of their applications.
Notbecause someone else have something similar... 

# ------------ Původní zpráva ------------
# Od: John R Pierce <pierce@hogranch.com>
# Předmět: Re: [GENERAL] Named / preparsed / preplaned(prepared) queries - Feature
# proposal
# Datum: 06.11.2011 17:35:57
# ----------------------------------------
# On 11/06/11 6:07 AM, NTPT wrote:
# > I use postgresql  in my projects since 6.x branch and I am happy with it, but
# proposed feature is on my Postgresql wishlist.
# > Any comments wellcome.
#
# is this based on any existing feature in the SQL specification, or in
# other popular database engines?
#
#
# --
# john r pierce                            N 37, W 122
# santa cruz ca                         mid-left coast
#
#
# --
# Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
# To make changes to your subscription:
# http://www.postgresql.org/mailpref/pgsql-general
#
#
#

Re: Re: Re: [GENERAL] Named / preparsed / preplaned(prepared) queries - Feature proposal

От
John R Pierce
Дата:
On 11/06/11 1:52 PM, NTPT wrote:
>  I think that features are proposed  because user(s) want  it, need
> it, because it can be useful for other peoples too, because with this
> can people solve their problems and/or improve performance and
> security of their applications. Not because someone else have
> something similar...

well, the functionality you describe could be implemented externally to
postgres in as a binding layer, at least for a specific programming
environment, such as Perl DBI, Java JDBC, python, etc etc.   most of
these bindings already have the idea of prepared statements, so adding
an API to fetch a named statement from a table of such shouldn't be that
difficult. Implementing it this way would be a good way to demonstrate
its utility, iron out design issues, and so forth.     If in fact it
proved to be of general interest, it could then be re-implemented as a
contributed project, or perhaps in the postgres core.

generally, built in functionality in PostgreSQL closely follows existing
SQL database standards and practices, I've not seen or heard of any such
'persistent named prepared statements' before in any database system, so
I suspect the core developers aren't going to go anywhere with this
suggestion, but of course, I can't speak for them.



--
john r pierce                            N 37, W 122
santa cruz ca                         mid-left coast


Re: Named / preparsed / preplaned(prepared) queries - Feature proposal

От
Tomas Vondra
Дата:
Hi,

Dne 6.11.2011 15:07, NTPT napsal(a):
> Hi all
>
> I have an idea how to extend postgresql feature set.
>
>
> I sugest named / preparsed / preplamned query  mechanizm where named
> query is a simpliest form and preplaned query is the most
> sofisticated (implementation wise). Ie to have ability to  assign a
> global and may be persistant NAME (or alias) for some query for some
> user. Similar how PREPARE related stuff does, but AFAIK, prepared
> statement are limited for the current user on current connection
> only.
>
> with syntax like this:
>
> NAMEDQUERY "select foo from bar where blah=$1" AS "myquery"
> [PREPARSED | PREPLANED  REPLAN EVERY condition] [PERSISTENT] [FINAL]
> [FOR USER user1,user2]
>
> And then  modify user privileges to add privileges to create a
> NAMEDQUERY, to create it for others -  typicaly  useful for database
> administrator and the  privilege of executing NAMED QUERY ONLY for
> some user - for example guest.

Are you aware of SQL functions? It seems to me it's almost exactly what
you're describing. See this:

  http://www.postgresql.org/docs/current/static/xfunc-sql.html

You can grant privileges on them, it survives server restart, it
provides the decoupling etc. Plus there are other languages (PL/pgSQL)
that may be handy in some cases.

> PERSISTANT tell the server that this named query should survive
> server restart.
>
> FINAL: if Administrator set a named query as FINAL, no other user is
> able to override it.
>
> PREPARSED: Rationale behind PREPARSED queries is that in most
> scenarios on web applications there is a simple not complicated
> select of data often in heap or memory or in the in memory index
> where actually fetching that data is faster then parsing a query
> string. So I thing a global cache of preparsed  NAMEDQUERY statements
> is a good idea and on some loads can boost performance
> significantly.
>
> PREPLANED: same as the preparsed, but the query plan is cached too.
> So in frequent simple queries (like common in ajax/comet applications
> where it ofen do some simple select to query a status of session
> etc), I thing it can remove significant burden (parsing and planning
> a query) from CPU .
>
> yes, I know that a query plan that is superior  now, could be
> inferior few minutes later. So I suggest a  REPLAN EVERY condition,
> where condition can be whatever user like - time interval,   changed
> N percent of tuples etc etc - Let user decide what mechanizm is best
> for him.

Handling the execution plans is a bit tricky, though. The plans are not
forgotten on disconnect, IIRC, but need to be invalidated in some cases
etc. Leaving this up to the developer is not a good idea, and REPLAN
EVERY actually does not fix the problem, because it's not just about
collecting fresh stats but about values (common vs. uncommon values etc.).

Anyway parsing a query is not a problem, once the needed records is
cache - planning it is much more time consuming I guess. So the PREPARSE
does not make much sense to me and the PREPLANED is already available.

> This feature can lead to two interested things. First, there is a
> decoupuling a database from application level and keep this two realm
> relatively separate, if underlying db structure is changed, just
> redefine named queries and do not touch application. It create some
> sort of easy to use abstraction layer. It is a bit "objectish"
> approach, because you can create a selfcontained database with data
> and method for its manipulation (if named query is  set as
> PERSISTENT, should  dump/restore and replication solutions do their
> job on persistant named queries as well)

As I said before, you can this can be achieved with stored procedures
(SQL, PL/pgSQL, PL/whatever). The procedures are dumped/restored,
replicated etc.

> Second thing is a powerful, simple, yet easy to use security feature
> with as much granularity as is possible. For example user guest, who
> have set a NAMED QUERY ONLY user privilege, will be limited only to
> queries that admin of the database defined for him, no matter what.I
> thing that it could be easy to use privilege sepration mechanizm.

Again. Grant the user privileges on stored procedures, not the tables,
and use SECURITY DEFINER when creating the procedures.

Tomas

Re: Re: Re: [GENERAL] Named / preparsed / preplaned(prepared) queries - Feature proposal

От
Tom Lane
Дата:
John R Pierce <pierce@hogranch.com> writes:
> generally, built in functionality in PostgreSQL closely follows existing
> SQL database standards and practices, I've not seen or heard of any such
> 'persistent named prepared statements' before in any database system, so
> I suspect the core developers aren't going to go anywhere with this
> suggestion, but of course, I can't speak for them.

There have been previous proposals for a shared plan cache, which is
somewhat close to what the OP is suggesting.  The trouble with the idea
is that maintenance of the cache would be a mess, and contention for it
would take a big bite out of whatever performance improvement might be
hoped for.  We do already have session-local plan caching; it's unclear
that it makes sense to go further than that.

            regards, tom lane