Обсуждение: Some questions on user defined types and functions.
I was wondering if anyone could help me with the following questions.
They are all related to user defined types and functions.
1. Environment variables in function pathname. We would like to
have multiple environments (i.e. production, backup, reporting, test
etc) with each environment totally independent of the other
environments. One thing that stands in the way of this is the absolute
pathname requirement in the CREATE FUNCTION syntax. Obtimally we would
like the following syntax to work:
CREATE FUNCTION myfunc(mytype) RETURNS text AS
'$HOME/lib/libmyso.so' LANGUAGE 'c':
and have the environment variable $HOME "lazy" evaluated. I
have looked at the fmgr code and this doesn't look too difficult to add
as long as I could get the $HOME past the parser. Has anyone thought
about this before? Is there another, better way to do this?
2. tid assignment - We make extensive use of user defined types.
One, fairly painful, thing we have noticed is the following sequence
really doesn't work very well:
CREATE FUNCTION mytype_in(opaque) RETURNS mytype AS
'/lib/libmyso.so' LANGUAGE 'c';
CREATE FUNCTION mytype_out(opaque) RETURNS opaque AS
'/lib/libmyso.so' LANGUAGE 'c';
CREATE TYPE mytype (internallength = VARIABLE,
input=mytype_in, output=mytype_out);
CREATE TABLE mytable (t mytype);
DROP TYPE mytype;
then create the type again
The reason for possibly wanting to do this is to fix a problem
with the implementation of the type mytype. The reason this doesn't
seem to work is that the definition of mytable "knows" the TypeID of
mytype and, because it got dropped and recreated, the TypeID is now
different. So the question is, is there a way to modify the definition
of a type without dropping and recreating all of the tables that use the
type.
3. fid assignment - Basically the same question as above but with
functions instead of types. If there is an index that uses a function,
it appears that you can't drop and re-create the function without
blowing away the index definition.
Thank you,
Jeff Collins
Jeffery Collins wrote: > I was wondering if anyone could help me with the following questions. > They are all related to user defined types and functions. > > 1. Environment variables in function pathname. We would like to > have multiple environments (i.e. production, backup, reporting, test > etc) with each environment totally independent of the other > environments. One thing that stands in the way of this is the absolute > pathname requirement in the CREATE FUNCTION syntax. Obtimally we would > like the following syntax to work: > > CREATE FUNCTION myfunc(mytype) RETURNS text AS > '$HOME/lib/libmyso.so' LANGUAGE 'c': > > and have the environment variable $HOME "lazy" evaluated. I > have looked at the fmgr code and this doesn't look too difficult to add > as long as I could get the $HOME past the parser. Has anyone thought > about this before? Is there another, better way to do this? > > 2. tid assignment - We make extensive use of user defined types. > One, fairly painful, thing we have noticed is the following sequence > really doesn't work very well: > > CREATE FUNCTION mytype_in(opaque) RETURNS mytype AS > '/lib/libmyso.so' LANGUAGE 'c'; > CREATE FUNCTION mytype_out(opaque) RETURNS opaque AS > '/lib/libmyso.so' LANGUAGE 'c'; > CREATE TYPE mytype (internallength = VARIABLE, > input=mytype_in, output=mytype_out); > CREATE TABLE mytable (t mytype); > DROP TYPE mytype; > then create the type again > > The reason for possibly wanting to do this is to fix a problem > with the implementation of the type mytype. The reason this doesn't > seem to work is that the definition of mytable "knows" the TypeID of > mytype and, because it got dropped and recreated, the TypeID is now > different. So the question is, is there a way to modify the definition > of a type without dropping and recreating all of the tables that use the > type. > > 3. fid assignment - Basically the same question as above but with > functions instead of types. If there is an index that uses a function, > it appears that you can't drop and re-create the function without > blowing away the index definition. > Never mind on questions 2 and 3. I figured out the answer. It seems the way to do this is to update the system tables (pg_attributes, pg_index, etc.) to refer to the new oids instead of old oids. I would still appreciate any thoughts on using environment variables in function paths. > > Thank you, > Jeff Collins
Jeffery Collins <collins@onyx-technologies.com> writes:
> like the following syntax to work:
> CREATE FUNCTION myfunc(mytype) RETURNS text AS
> '$HOME/lib/libmyso.so' LANGUAGE 'c':
> and have the environment variable $HOME "lazy" evaluated. I
> have looked at the fmgr code and this doesn't look too difficult to add
> as long as I could get the $HOME past the parser.
The parser doesn't know a thing about that, it's just seeing a string
literal. I think hacking in dfmgr.c would be sufficient. Whether it's
a good idea is another question --- you realize you'd be dealing with
postmaster environment variables, right, not those of the connected
user? The way we handle this in the distribution is by substituting
appropriate strings into a script before it's handed to psql; see the
regression tests directory for examples.
> 3. fid assignment - Basically the same question as above but with
> functions instead of types. If there is an index that uses a function,
> it appears that you can't drop and re-create the function without
> blowing away the index definition.
I think it would be a really bad idea to allow recycling of type and
function OIDs for what might be completely incompatible objects. But
something that's been on the TODO list for a while is to create an ALTER
FUNCTION command that would replace the body of an existing function
without changing the declared signature (parameters and return type).
That seems relatively safe, and it'd be awfully handy. Want to have a
go at it?
regards, tom lane
PS: you do realize that revising the function on which an index is
based probably renders the index useless anyway? Unless you can
guarantee that none of the stored values change...
Tom Lane wrote: > Jeffery Collins <collins@onyx-technologies.com> writes: > > like the following syntax to work: > > > CREATE FUNCTION myfunc(mytype) RETURNS text AS > > '$HOME/lib/libmyso.so' LANGUAGE 'c': > > > and have the environment variable $HOME "lazy" evaluated. I > > have looked at the fmgr code and this doesn't look too difficult to add > > as long as I could get the $HOME past the parser. > > The parser doesn't know a thing about that, it's just seeing a string > literal. I think hacking in dfmgr.c would be sufficient. Whether it's > a good idea is another question --- you realize you'd be dealing with > postmaster environment variables, right, not those of the connected > user? The way we handle this in the distribution is by substituting > appropriate strings into a script before it's handed to psql; see the > regression tests directory for examples. > Yes, I want the backend's environment to be used, so this is what I want. It looks like a pretty simple change. I'm going to give it a shot. Whether or not it is something y'all want for the main distribution is, of course, up to you. > > > 3. fid assignment - Basically the same question as above but with > > functions instead of types. If there is an index that uses a function, > > it appears that you can't drop and re-create the function without > > blowing away the index definition. > > I think it would be a really bad idea to allow recycling of type and > function OIDs for what might be completely incompatible objects. But > something that's been on the TODO list for a while is to create an ALTER > FUNCTION command that would replace the body of an existing function > without changing the declared signature (parameters and return type). > That seems relatively safe, and it'd be awfully handy. Want to have a > go at it? > > regards, tom lane > > PS: you do realize that revising the function on which an index is > based probably renders the index useless anyway? Unless you can > guarantee that none of the stored values change... I agree it is a really bad idea, but sometimes bad things happen to good databases. I am really just attempting to figure out how to recover if someone accidently types a DROP TABLE or DROP FUNCTION command or (as actually did happen) needs to change the function's path. Assuming the reCREATEd TABLE and/or FUNCTION matches the old ones, it looks like updating the oid references in the relevant system tables to point to the new TABLE and/or FUNCTION oids works just fine. As far as looking at the ALTER FUNCTION command, I can't promise anything (I have a lot of work commitments and a vacation coming up), but I will take a look at it. Thank you, Jeff
Jeffery Collins wrote:
> I was wondering if anyone could help me with the following questions.
> They are all related to user defined types and functions.
>
> 1. Environment variables in function pathname. We would like to
> [...]
Create your SQL scripts that define the functions in a make
step, invoking sed(1) to substitute a constant string with
the content of an environment variable. This worked for the
past 20 years and I'm sure it's extremely portable.
> 2. tid assignment - We make extensive use of user defined types.
> One, fairly painful, thing we have noticed is the following sequence
> really doesn't work very well:
>
> [...]
>
> The reason for possibly wanting to do this is to fix a problem
> with the implementation of the type mytype.
You don't need to DROP and reCREATE the functions and type if
you just fixed some bug in the C coding. Recompile it,
replace the shared object and reconnect to the database. The
new backend (you get at reconnect) will load in the new
shared module and use the new code.
> 3. fid assignment - Basically the same question as above but with
> functions instead of types. If there is an index that uses a function,
> it appears that you can't drop and re-create the function without
> blowing away the index definition.
Same as 2.
Jan
--
#======================================================================#
# It's easier to get forgiveness for being wrong than for being right. #
# Let's break this rule - forgive me. #
#================================================== JanWieck@Yahoo.com #
Jeffery Collins <collins@onyx-technologies.com> writes:
>>>> like the following syntax to work:
>>
>>>> CREATE FUNCTION myfunc(mytype) RETURNS text AS
>>>> '$HOME/lib/libmyso.so' LANGUAGE 'c':
>>
>>>> and have the environment variable $HOME "lazy" evaluated. I
>>>> have looked at the fmgr code and this doesn't look too difficult to add
>>>> as long as I could get the $HOME past the parser.
> I have made the changes necessary to allow environment variables to be
> entered and expanded in file names. Two files had to be changed
> backend/commands/define.c and backend/utils/fmgr/dfmgr.c. Assuming you are
> interested in the change,
Well, that's a good question. Does anyone else have an opinion on
whether this would be a good/bad/indifferent feature? We've seen
problems in the past caused by depending on postmaster environment
variables (restart the postmaster with different environment than
usual, things mysteriously break). So I'm inclined to feel that adding
more dependence on them isn't such a hot idea. But I'm not going to
veto it if there's interest in the feature from other people.
> what is the proper way to build a patch file that
> contains the changes? I have never done this before.
"diff -c" against current sources, done so that the correct file
pathnames are visible in the diff output; that is, cd to top level
of distribution tree and do something like
diff -c src/backend/utils/fmgr/dfmgr.c.orig src/backend/utils/fmgr/dfmgr.c
Don't forget to include diffs for documentation updates, as well.
regards, tom lane
It would seem that it wouldn't break anyone's existing setup, since you couldn't have an env variable in there anyway. (No one really has a directory called $HOME, I hope!) So, perhaps it could just be something in the documentation that has a stern warning about watching your consistency. Caveat hacker and all that. On 26 Jul 2000, at 17:50, Tom Lane wrote: > Jeffery Collins <collins@onyx-technologies.com> writes: > >>>> like the following syntax to work: > >> > >>>> CREATE FUNCTION myfunc(mytype) RETURNS text AS > >>>> '$HOME/lib/libmyso.so' LANGUAGE 'c': > >> > >>>> and have the environment variable $HOME "lazy" evaluated. I have > >>>> looked at the fmgr code and this doesn't look too difficult to > >>>> add as long as I could get the $HOME past the parser. > > > I have made the changes necessary to allow environment variables to > > be entered and expanded in file names. Two files had to be changed > > backend/commands/define.c and backend/utils/fmgr/dfmgr.c. Assuming > > you are interested in the change, > > Well, that's a good question. Does anyone else have an opinion on > whether this would be a good/bad/indifferent feature? We've seen > problems in the past caused by depending on postmaster environment > variables (restart the postmaster with different environment than > usual, things mysteriously break). So I'm inclined to feel that > adding more dependence on them isn't such a hot idea. But I'm not > going to veto it if there's interest in the feature from other people. > > > what is the proper way to build a patch file that > > contains the changes? I have never done this before. > > "diff -c" against current sources, done so that the correct file > pathnames are visible in the diff output; that is, cd to top level of > distribution tree and do something like diff -c > src/backend/utils/fmgr/dfmgr.c.orig src/backend/utils/fmgr/dfmgr.c > Don't forget to include diffs for documentation updates, as well. > > regards, tom lane -- Joel Burton, Director of Information Systems -*- jburton@scw.org Support Center of Washington (www.scw.org)
Tom Lane wrote:
> Jeffery Collins <collins@onyx-technologies.com> writes:
> > what is the proper way to build a patch file that
> > contains the changes? I have never done this before.
>
> "diff -c" against current sources, done so that the correct file
> pathnames are visible in the diff output; that is, cd to top level
> of distribution tree and do something like
For the past years I do a
cp -R src src.orig
after the configure step. Whatever I do in the sources, a
diff -cr src src.orig
in the toplevel directory gives me a patch I can apply to my
CVS checkout. With this I can never forget a single source
file touched.
Jan
--
#======================================================================#
# It's easier to get forgiveness for being wrong than for being right. #
# Let's break this rule - forgive me. #
#================================================== JanWieck@Yahoo.com #
> Well, that's a good question. Does anyone else have an opinion on
> whether this would be a good/bad/indifferent feature? We've seen
> problems in the past caused by depending on postmaster environment
> variables (restart the postmaster with different environment than
> usual, things mysteriously break). So I'm inclined to feel that adding
> more dependence on them isn't such a hot idea. But I'm not going to
> veto it if there's interest in the feature from other people.
As usual, I would like to see *more* support for environment variables
etc. This would fall into that category. You can choose to use it, or
choose to not, but the system has *more* flexibility when all is said
and done.
There is code in the postmaster which does the same thing, nearly. You
might want to check out the implementation there...
- Thomas