Обсуждение: [GENERAL] Is float8 a reference type?

Поиск

Список

Период

Сортировка

[GENERAL] Is float8 a reference type?

От

Paul A Jungwirth

Дата:

23 сентября 2017 г., 05:52:33

The docs say that a Datum can be 4 bytes or 8 depending on the machine:

https://www.postgresql.org/docs/9.5/static/sql-createtype.html

Is a Datum always 8 bytes for 64-bit architectures?

And if so, can my C extension skip a loop like this when compiling
there, and just do a memcpy (or even a cast)?:
   float8 *floats;   Datum *datums;
   datums = palloc(arrlen * sizeof(Datum));   for (i = 0; i < arrlen; i++) {     datums[i] = Float8GetDatum(floats[i]);
 }
 

Thanks!
Paul


-- 
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

Re: [GENERAL] Is float8 a reference type?

От

Pavel Stehule

Дата:

23 сентября 2017 г., 06:05:05

2017-09-23 4:52 GMT+02:00 Paul A Jungwirth <pj@illuminatedcomputing.com>:

The docs say that a Datum can be 4 bytes or 8 depending on the machine:

https://www.postgresql.org/docs/9.5/static/sql-createtype.html

Is a Datum always 8 bytes for 64-bit architectures?

And if so, can my C extension skip a loop like this when compiling
there, and just do a memcpy (or even a cast)?:

yes, it is 8 bytes on 64-bit.

I don't think so it is good idea to write 64bit only extensions.

float8 *floats;
Datum *datums;

datums = palloc(arrlen * sizeof(Datum));
for (i = 0; i < arrlen; i++) {
datums[i] = Float8GetDatum(floats[i]);
}

Thanks!
Paul

--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

Re: [GENERAL] Is float8 a reference type?

От

Paul A Jungwirth

Дата:

23 сентября 2017 г., 06:05:11

On Fri, Sep 22, 2017 at 7:52 PM, Paul A Jungwirth
<pj@illuminatedcomputing.com> wrote:
> Is a Datum always 8 bytes for 64-bit architectures?

Never mind, I found this in `pg_config.h`:
   /* float8, int8, and related values are passed by value if 'true', by      reference if 'false' */   #define
FLOAT8PASSBYVALtrue
 

Paul


-- 
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

Re: [GENERAL] Is float8 a reference type?

От

Paul A Jungwirth

Дата:

23 сентября 2017 г., 06:10:31

On Fri, Sep 22, 2017 at 8:05 PM, Pavel Stehule <pavel.stehule@gmail.com> wrote:
> yes, it is 8 bytes on 64-bit.

Thanks!

> I don't think so it is good idea to write 64bit only extensions.

I agree, but how about this?:
   if (FLOAT8PASSBYVAL) {     datums = (Datum *)floats;   } else {     datums = palloc0(arrlen * sizeof(Datum));
for(i = 0; i < arrlen; i++) {       datums[i] = Float8GetDatum(floats[i]);     }   }
 

Thanks,
Paul


-- 
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

Re: [GENERAL] Is float8 a reference type?

От

Pavel Stehule

Дата:

23 сентября 2017 г., 06:28:11

2017-09-23 5:10 GMT+02:00 Paul A Jungwirth <pj@illuminatedcomputing.com>:

On Fri, Sep 22, 2017 at 8:05 PM, Pavel Stehule <pavel.stehule@gmail.com> wrote:
> yes, it is 8 bytes on 64-bit.

Thanks!

> I don't think so it is good idea to write 64bit only extensions.

I agree, but how about this?:

if (FLOAT8PASSBYVAL) {
datums = (Datum *)floats;
} else {
datums = palloc0(arrlen * sizeof(Datum));
for (i = 0; i < arrlen; i++) {
datums[i] = Float8GetDatum(floats[i]);
}
}

it can work.

You have to solve deallocation in only one path. palloc0 is not necessary in this case.

Thanks,
Paul

Re: [GENERAL] Is float8 a reference type?

От

Tom Lane

Дата:

23 сентября 2017 г., 06:38:06

Paul A Jungwirth <pj@illuminatedcomputing.com> writes:
> On Fri, Sep 22, 2017 at 8:05 PM, Pavel Stehule <pavel.stehule@gmail.com> wrote:
>> I don't think so it is good idea to write 64bit only extensions.

> I agree, but how about this?:

"Premature optimization is the root of all evil".  Do you have good reason
to think that it's worth your time to write unsafe/unportable code?  Do
you know that your compiler doesn't turn Float8GetDatum into a no-op
already?  (Mine does, on a 64-bit machine.)
        regards, tom lane


--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

Re: [GENERAL] Is float8 a reference type?

От

Paul A Jungwirth

Дата:

23 сентября 2017 г., 18:41:03

On Fri, Sep 22, 2017 at 8:38 PM, Tom Lane <tgl@sss.pgh.pa.us> wrote:
> "Premature optimization is the root of all evil".  Do you have good reason
> to think that it's worth your time to write unsafe/unportable code?  Do
> you know that your compiler doesn't turn Float8GetDatum into a no-op
> already?  (Mine does, on a 64-bit machine.)

Ha ha, thank you for keeping me honest! But can you explain what is
unsafe about the cast? For a little more context: I've loaded a float8
array from a file, but I need to pass a Datum array to
construct_md_array. With an 8-byte Datum, I can just pass the original
float array, right? But with smaller Datums I need to go through the
array and convert each element. (I'm not really worried about these
files being moved between machines, so I'm willing to make the on-disk
format the same as the in-memory format.)

Since I'm expecting ~10 million elements per array, it seems like
skipping the conversion will have a real effect. I checked the
assembly and do see a difference (on both Mac+clang and Linux+gcc).
Here is the Mac command line:
   platter:floatfile paul$ clang -Wall -Wmissing-prototypes
-Wpointer-arith -Wdeclaration-after-statement -Wendif-labels
-Wmissing-format-attribute -Wformat-security -fno-strict-aliasing
-fwrapv -Wno-unused-command-line-argument -O2  -I. -I./
-I/usr/local/Cellar/postgresql@9.6/9.6.3/include/server
-I/usr/local/Cellar/postgresql@9.6/9.6.3/include/internal
-I/usr/local/opt/gettext/include -I/usr/local/opt/openldap/include
-I/usr/local/opt/openssl/include -I/usr/local/opt/readline/include
-I/usr/local/opt/tcl-tk/include -g -S -o floatfile.s floatfile.c

Here is the assembly for the cast:
     .loc    2 391 23 is_stmt 1      ## floatfile.c:391:23     movq    -48(%rbp), %r15   Ltmp176:     ##DEBUG_VALUE:
load_floatfile:datums<- %R15
 

Here is the assembly for the loop (after just changing the code to `if
(FLOAT8PASSBYVAL && false)`):
     .loc    2 393 21 is_stmt 1      ## floatfile.c:393:21     movslq    %r15d, %r13     .loc    2 393 28 is_stmt 0
## floatfile.c:393:28     leaq    (,%r13,8), %rdi     .loc    2 393 14                ## floatfile.c:393:14     callq
_palloc     movq    %rax, %r12   Ltmp177:     ##DEBUG_VALUE: load_floatfile:i <- 0     .loc    2 394 19 is_stmt 1
discriminator1 ## floatfile.c:394:19     testl    %r15d, %r15d   Ltmp178:     .loc    2 394 5 is_stmt 0 discriminator 1
##floatfile.c:394:5     je    LBB7_11   Ltmp179:   ## BB#9:     ##DEBUG_VALUE: load_floatfile:arrlen <- %R15D
##DEBUG_VALUE:load_floatfile:nulls <- [%RBP+-80]     ##DEBUG_VALUE: load_floatfile:floats <- [%RBP+-72]
##DEBUG_VALUE:load_floatfile:filename <- %RBX     .loc    2 0 5 discriminator 1   ## floatfile.c:0:5     movq
-72(%rbp),%rbx   Ltmp180:     ##DEBUG_VALUE: load_floatfile:floats <- %RBX     xorl    %r14d, %r14d   Ltmp181:
.p2align   4, 0x90   LBB7_10:                                ## =>This Inner Loop Header: Depth=1     ##DEBUG_VALUE:
load_floatfile:floats<- %RBX     ##DEBUG_VALUE: load_floatfile:arrlen <- %R15D     ##DEBUG_VALUE: load_floatfile:nulls
<-[%RBP+-80]     .loc    2 395 34 is_stmt 1      ## floatfile.c:395:34     movsd    (%rbx,%r14,8), %xmm0    ## xmm0 =
mem[0],zero    .loc    2 395 19 is_stmt 0      ## floatfile.c:395:19     callq    _Float8GetDatum     .loc    2 395 17
             ## floatfile.c:395:17     movq    %rax, (%r12,%r14,8)   Ltmp182:     .loc    2 394 30 is_stmt 1
discriminator2 ## floatfile.c:394:30     incq    %r14     .loc    2 394 19 is_stmt 0 discriminator 1 ##
floatfile.c:394:19    cmpq    %r13, %r14   Ltmp183:     .loc    2 394 5 discriminator 1 ## floatfile.c:394:5     jl
LBB7_10  Ltmp184:   LBB7_11:     ##DEBUG_VALUE: load_floatfile:arrlen <- %R15D     ##DEBUG_VALUE: load_floatfile:nulls
<-[%RBP+-80]
 

I get the same results on gcc too: the palloc, the loop, and even
`call Float8GetDatum@PLT`.

I'll do some timing of each version too, but it doesn't look like a
pointless optimization. I'd still like to know what is unsafe about it
though.

Thanks!
Paul


-- 
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

Re: [GENERAL] Is float8 a reference type?

От

Tom Lane

Дата:

23 сентября 2017 г., 19:40:24

Paul A Jungwirth <pj@illuminatedcomputing.com> writes:
> Since I'm expecting ~10 million elements per array, it seems like
> skipping the conversion will have a real effect. I checked the
> assembly and do see a difference (on both Mac+clang and Linux+gcc).

I wonder whether you're using up-to-date Postgres headers (ones
where Float8GetDatum is a static inline function).  For me, both
of those platforms recognize it as a no-op --- in fact, clang
turns a loop like
   for (i = 0; i < n; i++) {       datums[i] = Float8GetDatum(floats[i]);   }

into something that looks suspiciously like an inlined, loop-unrolled
memcpy().
        regards, tom lane

-- 
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

Re: [GENERAL] Is float8 a reference type?

От

Paul A Jungwirth

Дата:

23 сентября 2017 г., 20:09:08

On Sat, Sep 23, 2017 at 9:40 AM, Tom Lane <tgl@sss.pgh.pa.us> wrote:
> I wonder whether you're using up-to-date Postgres headers (ones
> where Float8GetDatum is a static inline function).

I'm building against 9.6.3 on both machines. I'm not doing anything
special to change the compilation options. Here is my whole Makefile:
   MODULES = floatfile   EXTENSION = floatfile   EXTENSION_VERSION = 1.0.0   DATA =
floatfile--$(EXTENSION_VERSION).sql
   PG_CONFIG = pg_config   PGXS := $(shell $(PG_CONFIG) --pgxs)   include $(PGXS)

But what I'm really interested in is this: What are the bad things
that can happen if I do `datums = (Datum *)floats`, as long as it's
only when Datums are 8 bytes wide? Is there a platform with
pass-by-val float8s where that won't work?

Thanks,
Paul

-- 
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Обсуждение: [GENERAL] Is float8 a reference type?