Re: strange buildfarm failures

Поиск
Список
Период
Сортировка
От Alvaro Herrera
Тема Re: strange buildfarm failures
Дата
Msg-id 20070426030445.GC10666@alvh.no-ip.org
обсуждение исходный текст
Ответ на Re: strange buildfarm failures  (Tom Lane <tgl@sss.pgh.pa.us>)
Ответы Re: strange buildfarm failures  (Stefan Kaltenbrunner <stefan@kaltenbrunner.cc>)
Re: strange buildfarm failures  (Stefan Kaltenbrunner <stefan@kaltenbrunner.cc>)
Список pgsql-hackers
Tom Lane wrote:
> Stefan Kaltenbrunner <stefan@kaltenbrunner.cc> writes:
> > Stefan Kaltenbrunner wrote:
> >> two of my buildfarm members had different but pretty weird looking
> >> failures lately:
> >> http://www.pgbuildfarm.org/cgi-bin/show_log.pl?nm=quagga&dt=2007-04-25%2002:03:03
> >> and
> >> 
> >> http://www.pgbuildfarm.org/cgi-bin/show_log.pl?nm=emu&dt=2007-04-24%2014:35:02
> >> 
> >> any ideas on what might causing those ?

Just for the record, quagga and emu failures don't seem related to the
report below.  They don't crash; the regression.diffs contains data that
suggests that there may be data corruption of some sort.

INSERT INTO INET_TBL (c, i) VALUES ('192.168.1.2/30', '192.168.1.226');
ERROR:  invalid cidr value: "%{"

This doesn't seem to make much sense.


> > lionfish just failed too:
> 
> > http://www.pgbuildfarm.org/cgi-bin/show_log.pl?nm=lionfish&dt=2007-04-25%2005:30:09
> 
> And had a similar failure a few days ago.  The curious thing is that
> what we get in the postmaster log is
> 
> LOG:  server process (PID 23405) was terminated by signal 6: Aborted
> LOG:  terminating any other active server processes
> 
> You would think SIGABRT would come from an assertion failure, but
> there's no preceding assertion message in the log.  The other
> characteristic of these crashes is that *all* of the failing regression
> instances report "terminating connection because of crash of another
> server process", which suggests strongly that the crash was in an
> autovacuum process (if it were bgwriter or stats collector the
> postmaster would've said so).  So I think the recent autovac patches
> are at fault.  I spent a bit of time trolling for a spot where the code
> might abort() without having printed anything, but didn't find one.

Hmm.  I kept an eye on the buildfarm for a few days, but saw nothing
that could be connected to autovacuum so I neglected it.

This is the other failure:

http://www.pgbuildfarm.org/cgi-bin/show_log.pl?nm=lionfish&dt=2007-04-20%2005:30:14

It shows the same pattern.  I am baffled -- I don't understand how it
can die without reporting the error.

Apparently it crashes rather frequently, so it shouldn't be too
difficult to reproduce on manual runs.  If we could get it to run with a
higher debug level, it might prove helpful to further pinpoint the
problem.

The core file would be much better obviously (first and foremost to
confirm that it's autovacuum that's crashing ... )

-- 
Alvaro Herrera                                http://www.CommandPrompt.com/
PostgreSQL Replication, Consulting, Custom Development, 24x7 support


В списке pgsql-hackers по дате отправления:

Предыдущее
От: ITAGAKI Takahiro
Дата:
Сообщение: Re: autovacuum does not start in HEAD
Следующее
От: Stefan Kaltenbrunner
Дата:
Сообщение: Re: strange buildfarm failures