Обсуждение: Make StringInfo available to frontend code.
Hi,
This patch, in a slightly rougher form, was submitted as part of [1],
but it seems worth bringing up separately, rather than just committing
hearing no objections.
From the commit message:
Make StringInfo available to frontend code.
There's plenty places in frontend code that could benefit from a
string buffer implementation. Some because it yields simpler and
faster code, and some others because of the desire to share code
between backend and frontend.
While there is a string buffer implementation available to frontend
code, libpq's PQExpBuffer, it is clunkier than stringinfo, it
introduces a libpq dependency, doesn't allow for sharing between
frontend and backend code, and has a higher API/ABI stability
requirement due to being exposed via libpq.
Therefore it seems best to just making StringInfo being usable by
frontend code. There's not much to do for that, except for rewriting
two subsequent elog/ereport calls into others types of error
reporting, and deciding on a maximum string length.
For the maximum string size I decided to privately define MaxAllocSize
to the same value as used in the backend. It seems likely that we'll
want to reconsider this for both backend and frontend code in the not
too far away future.
For now I've left stringinfo.h in lib/, rather than common/, to reduce
the likelihood of unnecessary breakage. We could alternatively decide
to provide a redirecting stringinfo.h in lib/, or just not provide
compatibility.
I'm still using stringinfo in the aforementioned thread, and I also want
to use it in a few more places. On the more ambitious side I really
would like to have a minimal version of elog.h available in the backend,
and that would really be a lot easier with stringinfo available.
I also would like to submit a few patches expanding stringinfo's
capabilities and performance, and it seems to me it'd be better to do so
after moving (lest they introduce new FE vs BE compat issues).
This allows us to remove compat.c hackery providing some stringinfo
functionality for pg_waldump (which now actually needs to pass in a
StringInfo...). I briefly played with converting more code in
pg_waldump.c than just that one call to StringInfo, but it seems that'd
be best done separately.
Comments?
Greetings,
Andres Freund
[1] https://postgr.es/m/20190920051857.2fhnvhvx4qdddviz@alap3.anarazel.de
Вложения
At Tue, 29 Oct 2019 17:10:01 -0700, Andres Freund <andres@anarazel.de> wrote in
> Hi,
>
> This patch, in a slightly rougher form, was submitted as part of [1],
> but it seems worth bringing up separately, rather than just committing
> hearing no objections.
..
> I'm still using stringinfo in the aforementioned thread, and I also want
> to use it in a few more places. On the more ambitious side I really
> would like to have a minimal version of elog.h available in the backend,
> and that would really be a lot easier with stringinfo available.
>
> I also would like to submit a few patches expanding stringinfo's
> capabilities and performance, and it seems to me it'd be better to do so
> after moving (lest they introduce new FE vs BE compat issues).
>
>
> This allows us to remove compat.c hackery providing some stringinfo
> functionality for pg_waldump (which now actually needs to pass in a
> StringInfo...). I briefly played with converting more code in
> pg_waldump.c than just that one call to StringInfo, but it seems that'd
> be best done separately.
>
> Comments?
It uses different form for the same message for FE and BE.
common/stringinfo.c:289-
> BE: ereport(ERROR,
> (errcode(ERRCODE_PROGRAM_LIMIT_EXCEEDED),
> errmsg("out of memory"),
> errdetail("Cannot enlarge string buffer containing %d
> bytes by %d more bytes.",
>
> FE: + _("out of memory\n\nCannot enlarge string buffer containing %d
> bytes by %d more bytes.\n"),
.po files will be smaller and more stable if we keep the same
translation unit for the same messages. That being said it doesn't
matter if it is tentative and the minimal elog.h for frontend comes
soon.
> /* It's possible we could use a different value for this in frontend code */
> #define MaxAllocSize ((Size) 0x3fffffff) /* 1 gigabyte - 1 */
The same symbol is defined for frontend in psprint.c. Isn't it better
merge them and place it in postgres_fe.h?
regards.
--
Kyotaro Horiguchi
NTT Open Source Software Center
Hi,
On 2019-10-30 10:58:59 +0900, Kyotaro Horiguchi wrote:
> At Tue, 29 Oct 2019 17:10:01 -0700, Andres Freund <andres@anarazel.de> wrote in
> > Hi,
> >
> > This patch, in a slightly rougher form, was submitted as part of [1],
> > but it seems worth bringing up separately, rather than just committing
> > hearing no objections.
> ..
> > I'm still using stringinfo in the aforementioned thread, and I also want
> > to use it in a few more places. On the more ambitious side I really
> > would like to have a minimal version of elog.h available in the backend,
> > and that would really be a lot easier with stringinfo available.
> >
> > I also would like to submit a few patches expanding stringinfo's
> > capabilities and performance, and it seems to me it'd be better to do so
> > after moving (lest they introduce new FE vs BE compat issues).
> >
> >
> > This allows us to remove compat.c hackery providing some stringinfo
> > functionality for pg_waldump (which now actually needs to pass in a
> > StringInfo...). I briefly played with converting more code in
> > pg_waldump.c than just that one call to StringInfo, but it seems that'd
> > be best done separately.
> >
> > Comments?
>
> It uses different form for the same message for FE and BE.
> common/stringinfo.c:289-
> > BE: ereport(ERROR,
> > (errcode(ERRCODE_PROGRAM_LIMIT_EXCEEDED),
> > errmsg("out of memory"),
> > errdetail("Cannot enlarge string buffer containing %d
> > bytes by %d more bytes.",
> >
> > FE: + _("out of memory\n\nCannot enlarge string buffer containing %d
> > bytes by %d more bytes.\n"),
>
> .po files will be smaller and more stable if we keep the same
> translation unit for the same messages. That being said it doesn't
> matter if it is tentative and the minimal elog.h for frontend comes
> soon.
I'm inclined to think that the contortions necessary to allow reusing
the translation strings here would be more work than worthwhile. Also,
do we even try to share the translations between backend and frontend?
> > /* It's possible we could use a different value for this in frontend code */
> > #define MaxAllocSize ((Size) 0x3fffffff) /* 1 gigabyte - 1 */
>
> The same symbol is defined for frontend in psprint.c. Isn't it better
> merge them and place it in postgres_fe.h?
No, I don't think so. I'd rather have less than more code depend on it.
Greetings,
Andres Freund
Hello.
At Tue, 29 Oct 2019 19:06:38 -0700, Andres Freund <andres@anarazel.de> wrote in
> Hi,
>
> On 2019-10-30 10:58:59 +0900, Kyotaro Horiguchi wrote:
> > At Tue, 29 Oct 2019 17:10:01 -0700, Andres Freund <andres@anarazel.de> wrote in
> > > Hi,
> > >
> > > This patch, in a slightly rougher form, was submitted as part of [1],
> > > but it seems worth bringing up separately, rather than just committing
> > > hearing no objections.
> > ..
> > > I'm still using stringinfo in the aforementioned thread, and I also want
> > > to use it in a few more places. On the more ambitious side I really
> > > would like to have a minimal version of elog.h available in the backend,
> > > and that would really be a lot easier with stringinfo available.
> > >
> > > I also would like to submit a few patches expanding stringinfo's
> > > capabilities and performance, and it seems to me it'd be better to do so
> > > after moving (lest they introduce new FE vs BE compat issues).
> > >
> > >
> > > This allows us to remove compat.c hackery providing some stringinfo
> > > functionality for pg_waldump (which now actually needs to pass in a
> > > StringInfo...). I briefly played with converting more code in
> > > pg_waldump.c than just that one call to StringInfo, but it seems that'd
> > > be best done separately.
> > >
> > > Comments?
> >
> > It uses different form for the same message for FE and BE.
>
> > common/stringinfo.c:289-
> > > BE: ereport(ERROR,
> > > (errcode(ERRCODE_PROGRAM_LIMIT_EXCEEDED),
> > > errmsg("out of memory"),
> > > errdetail("Cannot enlarge string buffer containing %d
> > > bytes by %d more bytes.",
> > >
> > > FE: + _("out of memory\n\nCannot enlarge string buffer containing %d
> > > bytes by %d more bytes.\n"),
> >
> > .po files will be smaller and more stable if we keep the same
> > translation unit for the same messages. That being said it doesn't
> > matter if it is tentative and the minimal elog.h for frontend comes
> > soon.
>
> I'm inclined to think that the contortions necessary to allow reusing
> the translation strings here would be more work than worthwhile. Also,
Maybe so for doing that in this stage. So I expect that elog.h for FE
comes.
> the translation strings here would be more work than worthwhile. Also,
> do we even try to share the translations between backend and frontend?
No. I don't mean that. FE and BE have their own .po files, anyway.
> > > /* It's possible we could use a different value for this in frontend code */
> > > #define MaxAllocSize ((Size) 0x3fffffff) /* 1 gigabyte - 1 */
> >
> > The same symbol is defined for frontend in psprint.c. Isn't it better
> > merge them and place it in postgres_fe.h?
>
> No, I don't think so. I'd rather have less than more code depend on it.
Ok, understoold. Thanks for the reply.
regards.
> On 30 Oct 2019, at 01:10, Andres Freund <andres@anarazel.de> wrote: > Make StringInfo available to frontend code. I’ve certainly wanted just that on multiple occasions, so +1 on this. > Therefore it seems best to just making StringInfo being usable by > frontend code. There's not much to do for that, except for rewriting > two subsequent elog/ereport calls into others types of error > reporting, and deciding on a maximum string length. Skimming (but not testing) the patch, it seems a reasonable approach. + * StringInfo provides an extensible string data type. It can be used to It might be useful to point out the upper bound on the extensibility in the rewrite of this sentence, and that it’s not guaranteed to be consistent between frontend and backend. > I'm still using stringinfo in the aforementioned thread, and I also want > to use it in a few more places. On the more ambitious side I really > would like to have a minimal version of elog.h available in the backend, > and that would really be a lot easier with stringinfo available. s/backend/frontend/? cheers ./daniel
Hi, On 2019-11-01 23:19:33 +0100, Daniel Gustafsson wrote: > > On 30 Oct 2019, at 01:10, Andres Freund <andres@anarazel.de> wrote: > > > Make StringInfo available to frontend code. > > I’ve certainly wanted just that on multiple occasions, so +1 on this. Cool. > + * StringInfo provides an extensible string data type. It can be used to > > It might be useful to point out the upper bound on the extensibility in the > rewrite of this sentence, and that it’s not guaranteed to be consistent between > frontend and backend. Hm. Something like 'Currently the maximum length of a StringInfo is 1GB.'? I don't really think it's worth pointing out that they may not be consistent, when they currently are... And I suspect we should just fix the length limit to be higher for both, perhaps somehow allowing to limit the length for the backend cases where we want to error out if a string gets too long (possibly adding a separate initialization API that allows to specify the memory allocation flags or such). > > I'm still using stringinfo in the aforementioned thread, and I also want > > to use it in a few more places. On the more ambitious side I really > > would like to have a minimal version of elog.h available in the backend, > > and that would really be a lot easier with stringinfo available. > > s/backend/frontend/? Indeed. Thanks for looking, Andres Freund
> On 2 Nov 2019, at 03:21, Andres Freund <andres@anarazel.de> wrote: > On 2019-11-01 23:19:33 +0100, Daniel Gustafsson wrote: >> + * StringInfo provides an extensible string data type. It can be used to >> >> It might be useful to point out the upper bound on the extensibility in the >> rewrite of this sentence, and that it’s not guaranteed to be consistent between >> frontend and backend. > > Hm. Something like 'Currently the maximum length of a StringInfo is > 1GB.’? Something along those lines (or the define/mechanism with which the upper boundary controlled, simply stating the limit seems more straightforward.) > I don't really think it's worth pointing out that they may not be > consistent, when they currently are… Good point. > And I suspect we should just fix the length limit to be higher for both, > perhaps somehow allowing to limit the length for the backend cases where > we want to error out if a string gets too long (possibly adding a > separate initialization API that allows to specify the memory allocation > flags or such). Sounds reasonable, maybe even where one can set errhint/errdetail on the “out of memory” error to get better reporting on failures. cheers ./daniel
Hi, On 2019-11-02 23:57:06 +0100, Daniel Gustafsson wrote: > > On 2 Nov 2019, at 03:21, Andres Freund <andres@anarazel.de> wrote: > > On 2019-11-01 23:19:33 +0100, Daniel Gustafsson wrote: > > >> + * StringInfo provides an extensible string data type. It can be used to > >> > >> It might be useful to point out the upper bound on the extensibility in the > >> rewrite of this sentence, and that it’s not guaranteed to be consistent between > >> frontend and backend. > > > > Hm. Something like 'Currently the maximum length of a StringInfo is > > 1GB.’? > > Something along those lines (or the define/mechanism with which the upper > boundary controlled, simply stating the limit seems more straightforward.) Pushed now, with a variation of my suggestion above. As nobody commented on that, I've not adjusted the location of stringinfo.h. If somebody has feelings about it being in the wrong place, and whether to put a redirecting header in place, or whether to just break extensions using stringinfo, ... > > And I suspect we should just fix the length limit to be higher for both, > > perhaps somehow allowing to limit the length for the backend cases where > > we want to error out if a string gets too long (possibly adding a > > separate initialization API that allows to specify the memory allocation > > flags or such). > > Sounds reasonable, maybe even where one can set errhint/errdetail on the “out > of memory” error to get better reporting on failures. That seems too much customization for the caller / too much added complexity, for the little gain it'd provide. Normally if a caller wants something like that, they can push something onto the error context stack, and get context information out that way. Greetings, Andres Freund