Обсуждение: Function execution consuming lot of memory and eventually making server unresponsive
Function execution consuming lot of memory and eventually making server unresponsive
От
"Gnanakumar"
Дата:
Hi,
We're using PostgreSQL v8.2.3 on RHEL5.
I'm developing a PostgreSQL plpgsql function for one of our application
report. When I try to run the function multiple times (even twice or
thrice), I'm seeing considerable amount of memory being taken up by
PostgreSQL and thereby after sometime, complete server itself comes to
standstill and not responding at all, even am not able to login to my server
using PuTTY client. I then end up physically restarting the server.
Pasted below the function which I'm developing.
Is there something am doing differently in the function that would cause
PostgreSQL to consume lot of memory? In my experience, I can say, this is
the first time I'm seeing PostgreSQL consuming/eating lot of memory and
causing severe performance issue and eventually making server come to
standstill. Also, I can say that another 2 functions which I'm calling from
within this function ("get_hours_worked" and
"convert_hours_n_minutes_to_decimal") do not have any performance issues,
since those 2 functions we're already using in some other reports and have
not found any performance issues.
Experts suggestions/recommendations on this are highly appreciated.
For example, I would call this function like: SELECT
hours_worked_day_wise_breakup(90204,23893,38921, '01-01-2010 00:00:00',
'12-31-2010 23:59:59');
Output of this function will be like this:
8.00-typ1,4.25-typ2,0.00-typ5,6.00-typ3,8.00-typ4
Logic of this function: Given any 2 dates and filter inputs (input1, input2,
input3), it would return hours worked for each day (along with a suffix -
typ[1,2,3,4]) in comma separated form. In above example, I'm trying to run
this function for one year.
CREATE or replace FUNCTION hours_worked_day_wise_breakup(numeric, numeric,
numeric, varchar, varchar) RETURNS VARCHAR AS '
DECLARE
p_input1 ALIAS FOR $1;
p_input2 ALIAS FOR $2;
p_input3 ALIAS FOR $3;
p_startdate ALIAS FOR $4;
p_enddate ALIAS FOR $5;
v_loopingdate VARCHAR;
v_cur_start_date VARCHAR;
v_cur_end_date VARCHAR;
v_hours_in_decimal NUMERIC := 0.00;
v_returnvalue VARCHAR := '''';
BEGIN
v_loopingdate := TO_CHAR(DATE(p_startdate), ''mm-dd-yyyy'');
WHILE (DATE(v_loopingdate) <= DATE(p_enddate)) LOOP
v_cur_start_date := v_loopingdate || '' 00:00:00'';
v_cur_end_date := v_loopingdate || '' 23:59:59'';
IF (LENGTH(TRIM(v_returnvalue)) >0) THEN
v_returnvalue := v_returnvalue || '','';
END IF;
v_hours_in_decimal :=
convert_hours_n_minutes_to_decimal(get_hours_worked(p_input1, p_input2, 7,
1, -1, p_input3, v_cur_start_date, v_cur_end_date));
IF (v_hours_in_decimal > 0) THEN
v_returnvalue := v_returnvalue || v_hours_in_decimal
|| ''-typ1'';
v_loopingdate := TO_CHAR((DATE(v_loopingdate) +
interval ''1 day''), ''mm-dd-yyyy'');
CONTINUE;
END IF;
v_hours_in_decimal :=
convert_hours_n_minutes_to_decimal(get_hours_worked(p_input1, p_input2, 6,
1, -1, p_input3, v_cur_start_date, v_cur_end_date));
IF (v_hours_in_decimal > 0) THEN
v_returnvalue := v_returnvalue || v_hours_in_decimal
|| ''-typ2'';
v_loopingdate := TO_CHAR((DATE(v_loopingdate) +
interval ''1 day''), ''mm-dd-yyyy'');
CONTINUE;
END IF;
v_hours_in_decimal :=
convert_hours_n_minutes_to_decimal(get_hours_worked(p_input1, p_input2, 4,
1, -1, p_input3, v_cur_start_date, v_cur_end_date));
IF (v_hours_in_decimal > 0) THEN
v_returnvalue := v_returnvalue || v_hours_in_decimal
|| ''-typ3'';
v_loopingdate := TO_CHAR((DATE(v_loopingdate) +
interval ''1 day''), ''mm-dd-yyyy'');
CONTINUE;
END IF;
v_hours_in_decimal :=
convert_hours_n_minutes_to_decimal(get_hours_worked(p_input1, p_input2, 3,
1, -1, p_input3, v_cur_start_date, v_cur_end_date));
IF (v_hours_in_decimal > 0) THEN
v_returnvalue := v_returnvalue || v_hours_in_decimal
|| ''-typ4'';
v_loopingdate := TO_CHAR((DATE(v_loopingdate) +
interval ''1 day''), ''mm-dd-yyyy'');
CONTINUE;
END IF;
v_hours_in_decimal := 0.00;
v_returnvalue := v_returnvalue || v_hours_in_decimal ||
''-typ5'';
v_loopingdate := TO_CHAR((DATE(v_loopingdate) + interval ''1
day''), ''mm-dd-yyyy'');
END LOOP;
RETURN v_returnvalue;
END ;'
LANGUAGE 'plpgsql';
Regards,
Gnanam
Re: Function execution consuming lot of memory and eventually making server unresponsive
От
Pavel Stehule
Дата:
Hello
It hard to say where can be a problem
I see a some risks
a) v8.2.3 isn't last version of 8.2 line
b) why you use a varchar data type for v_loopingdate,
v_cur_start_date, v_cur_end_date - it's bad idea? - you have to do
cast between date and varchar - all operation are slower and needs
more memory - it should be timestamp
Regards
Pavel Stehule
2011/2/24 Gnanakumar <gnanam@zoniac.com>:
> Hi,
>
> We're using PostgreSQL v8.2.3 on RHEL5.
>
> I'm developing a PostgreSQL plpgsql function for one of our application
> report. When I try to run the function multiple times (even twice or
> thrice), I'm seeing considerable amount of memory being taken up by
> PostgreSQL and thereby after sometime, complete server itself comes to
> standstill and not responding at all, even am not able to login to my server
> using PuTTY client. I then end up physically restarting the server.
>
> Pasted below the function which I'm developing.
>
> Is there something am doing differently in the function that would cause
> PostgreSQL to consume lot of memory? In my experience, I can say, this is
> the first time I'm seeing PostgreSQL consuming/eating lot of memory and
> causing severe performance issue and eventually making server come to
> standstill. Also, I can say that another 2 functions which I'm calling from
> within this function ("get_hours_worked" and
> "convert_hours_n_minutes_to_decimal") do not have any performance issues,
> since those 2 functions we're already using in some other reports and have
> not found any performance issues.
>
> Experts suggestions/recommendations on this are highly appreciated.
>
> For example, I would call this function like: SELECT
> hours_worked_day_wise_breakup(90204,23893,38921, '01-01-2010 00:00:00',
> '12-31-2010 23:59:59');
> Output of this function will be like this:
> 8.00-typ1,4.25-typ2,0.00-typ5,6.00-typ3,8.00-typ4
> Logic of this function: Given any 2 dates and filter inputs (input1, input2,
> input3), it would return hours worked for each day (along with a suffix -
> typ[1,2,3,4]) in comma separated form. In above example, I'm trying to run
> this function for one year.
>
> CREATE or replace FUNCTION hours_worked_day_wise_breakup(numeric, numeric,
> numeric, varchar, varchar) RETURNS VARCHAR AS '
>
> DECLARE
> p_input1 ALIAS FOR $1;
> p_input2 ALIAS FOR $2;
> p_input3 ALIAS FOR $3;
> p_startdate ALIAS FOR $4;
> p_enddate ALIAS FOR $5;
>
> v_loopingdate VARCHAR;
> v_cur_start_date VARCHAR;
> v_cur_end_date VARCHAR;
> v_hours_in_decimal NUMERIC := 0.00;
> v_returnvalue VARCHAR := '''';
>
> BEGIN
> v_loopingdate := TO_CHAR(DATE(p_startdate), ''mm-dd-yyyy'');
>
> WHILE (DATE(v_loopingdate) <= DATE(p_enddate)) LOOP
> v_cur_start_date := v_loopingdate || '' 00:00:00'';
> v_cur_end_date := v_loopingdate || '' 23:59:59'';
>
> IF (LENGTH(TRIM(v_returnvalue)) >0) THEN
> v_returnvalue := v_returnvalue || '','';
> END IF;
>
> v_hours_in_decimal :=
> convert_hours_n_minutes_to_decimal(get_hours_worked(p_input1, p_input2, 7,
> 1, -1, p_input3, v_cur_start_date, v_cur_end_date));
> IF (v_hours_in_decimal > 0) THEN
> v_returnvalue := v_returnvalue || v_hours_in_decimal
> || ''-typ1'';
> v_loopingdate := TO_CHAR((DATE(v_loopingdate) +
> interval ''1 day''), ''mm-dd-yyyy'');
> CONTINUE;
> END IF;
>
> v_hours_in_decimal :=
> convert_hours_n_minutes_to_decimal(get_hours_worked(p_input1, p_input2, 6,
> 1, -1, p_input3, v_cur_start_date, v_cur_end_date));
> IF (v_hours_in_decimal > 0) THEN
> v_returnvalue := v_returnvalue || v_hours_in_decimal
> || ''-typ2'';
> v_loopingdate := TO_CHAR((DATE(v_loopingdate) +
> interval ''1 day''), ''mm-dd-yyyy'');
> CONTINUE;
> END IF;
>
> v_hours_in_decimal :=
> convert_hours_n_minutes_to_decimal(get_hours_worked(p_input1, p_input2, 4,
> 1, -1, p_input3, v_cur_start_date, v_cur_end_date));
> IF (v_hours_in_decimal > 0) THEN
> v_returnvalue := v_returnvalue || v_hours_in_decimal
> || ''-typ3'';
> v_loopingdate := TO_CHAR((DATE(v_loopingdate) +
> interval ''1 day''), ''mm-dd-yyyy'');
> CONTINUE;
> END IF;
>
> v_hours_in_decimal :=
> convert_hours_n_minutes_to_decimal(get_hours_worked(p_input1, p_input2, 3,
> 1, -1, p_input3, v_cur_start_date, v_cur_end_date));
> IF (v_hours_in_decimal > 0) THEN
> v_returnvalue := v_returnvalue || v_hours_in_decimal
> || ''-typ4'';
> v_loopingdate := TO_CHAR((DATE(v_loopingdate) +
> interval ''1 day''), ''mm-dd-yyyy'');
> CONTINUE;
> END IF;
>
> v_hours_in_decimal := 0.00;
> v_returnvalue := v_returnvalue || v_hours_in_decimal ||
> ''-typ5'';
> v_loopingdate := TO_CHAR((DATE(v_loopingdate) + interval ''1
> day''), ''mm-dd-yyyy'');
> END LOOP;
>
> RETURN v_returnvalue;
>
> END ;'
> LANGUAGE 'plpgsql';
>
> Regards,
> Gnanam
>
>
> --
> Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
> To make changes to your subscription:
> http://www.postgresql.org/mailpref/pgsql-performance
>
Re: Function execution consuming lot of memory and eventually making server unresponsive
От
Merlin Moncure
Дата:
On Thu, Feb 24, 2011 at 3:52 AM, Gnanakumar <gnanam@zoniac.com> wrote:
> Hi,
>
> We're using PostgreSQL v8.2.3 on RHEL5.
>
> I'm developing a PostgreSQL plpgsql function for one of our application
> report. When I try to run the function multiple times (even twice or
> thrice), I'm seeing considerable amount of memory being taken up by
> PostgreSQL and thereby after sometime, complete server itself comes to
> standstill and not responding at all, even am not able to login to my server
> using PuTTY client. I then end up physically restarting the server.
>
> Pasted below the function which I'm developing.
>
> Is there something am doing differently in the function that would cause
> PostgreSQL to consume lot of memory? In my experience, I can say, this is
> the first time I'm seeing PostgreSQL consuming/eating lot of memory and
> causing severe performance issue and eventually making server come to
> standstill. Also, I can say that another 2 functions which I'm calling from
> within this function ("get_hours_worked" and
> "convert_hours_n_minutes_to_decimal") do not have any performance issues,
> since those 2 functions we're already using in some other reports and have
> not found any performance issues.
>
> Experts suggestions/recommendations on this are highly appreciated.
>
> For example, I would call this function like: SELECT
> hours_worked_day_wise_breakup(90204,23893,38921, '01-01-2010 00:00:00',
> '12-31-2010 23:59:59');
> Output of this function will be like this:
> 8.00-typ1,4.25-typ2,0.00-typ5,6.00-typ3,8.00-typ4
> Logic of this function: Given any 2 dates and filter inputs (input1, input2,
> input3), it would return hours worked for each day (along with a suffix -
> typ[1,2,3,4]) in comma separated form. In above example, I'm trying to run
> this function for one year.
>
> CREATE or replace FUNCTION hours_worked_day_wise_breakup(numeric, numeric,
> numeric, varchar, varchar) RETURNS VARCHAR AS '
>
> DECLARE
> p_input1 ALIAS FOR $1;
> p_input2 ALIAS FOR $2;
> p_input3 ALIAS FOR $3;
> p_startdate ALIAS FOR $4;
> p_enddate ALIAS FOR $5;
>
> v_loopingdate VARCHAR;
> v_cur_start_date VARCHAR;
> v_cur_end_date VARCHAR;
> v_hours_in_decimal NUMERIC := 0.00;
> v_returnvalue VARCHAR := '''';
>
> BEGIN
> v_loopingdate := TO_CHAR(DATE(p_startdate), ''mm-dd-yyyy'');
>
> WHILE (DATE(v_loopingdate) <= DATE(p_enddate)) LOOP
> v_cur_start_date := v_loopingdate || '' 00:00:00'';
> v_cur_end_date := v_loopingdate || '' 23:59:59'';
>
> IF (LENGTH(TRIM(v_returnvalue)) >0) THEN
> v_returnvalue := v_returnvalue || '','';
> END IF;
>
> v_hours_in_decimal :=
> convert_hours_n_minutes_to_decimal(get_hours_worked(p_input1, p_input2, 7,
> 1, -1, p_input3, v_cur_start_date, v_cur_end_date));
> IF (v_hours_in_decimal > 0) THEN
> v_returnvalue := v_returnvalue || v_hours_in_decimal
> || ''-typ1'';
> v_loopingdate := TO_CHAR((DATE(v_loopingdate) +
> interval ''1 day''), ''mm-dd-yyyy'');
> CONTINUE;
> END IF;
>
> v_hours_in_decimal :=
> convert_hours_n_minutes_to_decimal(get_hours_worked(p_input1, p_input2, 6,
> 1, -1, p_input3, v_cur_start_date, v_cur_end_date));
> IF (v_hours_in_decimal > 0) THEN
> v_returnvalue := v_returnvalue || v_hours_in_decimal
> || ''-typ2'';
> v_loopingdate := TO_CHAR((DATE(v_loopingdate) +
> interval ''1 day''), ''mm-dd-yyyy'');
> CONTINUE;
> END IF;
>
> v_hours_in_decimal :=
> convert_hours_n_minutes_to_decimal(get_hours_worked(p_input1, p_input2, 4,
> 1, -1, p_input3, v_cur_start_date, v_cur_end_date));
> IF (v_hours_in_decimal > 0) THEN
> v_returnvalue := v_returnvalue || v_hours_in_decimal
> || ''-typ3'';
> v_loopingdate := TO_CHAR((DATE(v_loopingdate) +
> interval ''1 day''), ''mm-dd-yyyy'');
> CONTINUE;
> END IF;
>
> v_hours_in_decimal :=
> convert_hours_n_minutes_to_decimal(get_hours_worked(p_input1, p_input2, 3,
> 1, -1, p_input3, v_cur_start_date, v_cur_end_date));
> IF (v_hours_in_decimal > 0) THEN
> v_returnvalue := v_returnvalue || v_hours_in_decimal
> || ''-typ4'';
> v_loopingdate := TO_CHAR((DATE(v_loopingdate) +
> interval ''1 day''), ''mm-dd-yyyy'');
> CONTINUE;
> END IF;
>
> v_hours_in_decimal := 0.00;
> v_returnvalue := v_returnvalue || v_hours_in_decimal ||
> ''-typ5'';
> v_loopingdate := TO_CHAR((DATE(v_loopingdate) + interval ''1
> day''), ''mm-dd-yyyy'');
> END LOOP;
>
> RETURN v_returnvalue;
>
> END ;'
> LANGUAGE 'plpgsql';
It's a pretty safe bet you are stuck in the loop (either infinite, or
very long) using string concatenation operator on the return code. ||
is not designed for extremely heavy use on large strings in a loop.
Your entire function could probably be reduced to one SQL expression
with some thought.
merlin
Merlin Moncure <mmoncure@gmail.com> writes:
> Your entire function could probably be reduced to one SQL expression
> with some thought.
Or if not that, at least try to get rid of the use of varchar. All
those forced varchar-to-date-and-back conversions are expensive.
I'm also more than a tad worried by this:
> v_loopingdate := TO_CHAR(DATE(p_startdate), ''mm-dd-yyyy'');
>
> WHILE (DATE(v_loopingdate) <= DATE(p_enddate)) LOOP
There's nothing here guaranteeing that DATE() will think its input
is in mm-dd-yyyy format. If DateStyle is set to something else,
the logic would at least be wrong, and very possibly that explains
your infinite loop.
Learn to use PG's type system instead of fighting it. Your code
will be shorter, clearer, faster, and less error-prone.
regards, tom lane