Re: postgres slowdown question

От: Shachindra Agarwal
Тема: Re: postgres slowdown question
Дата: ,
Msg-id: 5A5058425C97FA409CE98C9B87B0CF3805FE11@marnewexchange.mardomain.local
(см: обсуждение, исходный текст)
Ответ на: postgres slowdown question  ("Shachindra Agarwal")
Ответы: Re: postgres slowdown question  (John A Meinel)
Список: pgsql-performance

Скрыть дерево обсуждения

postgres slowdown question  ("Shachindra Agarwal", )
 Re: postgres slowdown question  (John A Meinel, )
 Re: postgres slowdown question  ("Shachindra Agarwal", )
  Re: postgres slowdown question  (John A Meinel, )

Thanks for the note. Please see my responses below:

-----Original Message-----
From: John A Meinel [mailto:]
Sent: Wednesday, April 20, 2005 3:48 PM
To: Shachindra Agarwal
Cc: 
Subject: Re: [PERFORM] postgres slowdown question

Shachindra Agarwal wrote:

> Dear Postgres Masters:
>
> We are using postgres 7.4 in our java application on RedHat linux. The

> Java application connects to Postgres via JDBC. The application goes
> through a 'discovery' phase, whereas it adds large amount of data into

> postgres. Typically, we are adding about a million records in various
> tables. The application also issues multiple queries to the database
> at the same time. We do not delete any records during the discovery
> phase. Both the java application and the postgres are installed on the

> same machine.
>
> At the beginning, the application is able to add in the order of 100
> record per minute. Gradually (after several hours), it slows down to
> less than 10 records per minute. At this time, postgres processes take

> between 80-99% of CPU. When we reindex the database, the speed bumps
> up to about 30 records per minute. Now, postgres server takes between
> 50-70% CPU.
>
> We have the following in the postgresql.conf :
>
> max_fsm_pages = 500000
>
> fsync = false
>
> We certainly can not live with this kind of performance. I believe
> postgres should be able to handle much larger datasets but I can not
> point my finger as to what are we doing wrong. Can somebody please
> point me to the right direction.
>
> With kind regards,
>
> -- Shachindra Agarwal.
>
A few questions first. How are you loading the data? Are you using
INSERT or COPY? Are you using a transaction, or are you autocommitting
each row?

You really need a transaction, and preferably use COPY. Both can help
performance a lot. (In some of the tests, single row inserts can be
10-100x slower than doing it in bulk.)

>> We are using JDBC which supports 'inserts' and 'transactions'. We are
using both. The business logic adds one business object at a time. Each
object is added within its own transaction. Each object add results in 5
records in various tables in the the database. So, a commit is performed
after every 5 inserts.

Also, it sounds like you have a foreign key issue. That as things fill
up, the foreign key reference checks are slowing you down.
Are you using ANALYZE as you go? A lot of times when you only have <1000

rows a sequential scan is faster than using an index, and if you don't
inform postgres that you have more rows, it might still use the old
seqscan.

>> This could be the issue. I will start 'analyze' in a cron job. I will
update you with the results.

There are other possibilities, but it would be nice to know about your
table layout, and possibly an EXPLAIN ANALYZE of the inserts that are
going slow.

John
=:->

PS> I don't know if JDBC supports COPY, but it certainly should support
transactions.




В списке pgsql-performance по дате сообщения:

От: William Yu
Дата:
Сообщение: Re: How to improve db performance with $7K?
От: Richard van den Berg
Дата:
Сообщение: Re: When are index scans used over seq scans?