Re: MultiXactId error after upgrade to 9.3.4

Поиск
Список
Период
Сортировка
От Alvaro Herrera
Тема Re: MultiXactId error after upgrade to 9.3.4
Дата
Msg-id 20140331130215.GV9567@eldon.alvh.no-ip.org
обсуждение исходный текст
Ответ на Re: MultiXactId error after upgrade to 9.3.4  (Andres Freund <andres@2ndquadrant.com>)
Ответы Re: MultiXactId error after upgrade to 9.3.4  (Stephen Frost <sfrost@snowman.net>)
Список pgsql-hackers
Andres Freund wrote:
> On 2014-03-31 09:19:12 -0300, Alvaro Herrera wrote:
> > Andres Freund wrote:
> > > On 2014-03-31 08:54:53 -0300, Alvaro Herrera wrote:
> > > > My conclusion here is that some part of the code is failing to examine
> > > > XMAX_INVALID before looking at the value stored in xmax itself.  There
> > > > ought to be a short-circuit.  Fortunately, this bug should be pretty
> > > > harmless.
> > > > 
> > > > .. and after looking, I'm fairly sure the bug is in
> > > > heap_tuple_needs_freeze.
> > > 
> > > heap_tuple_needs_freeze() isn't *allowed* to look at
> > > XMAX_INVALID. Otherwise it could miss freezing something still visible
> > > on a standby or after an eventual crash.
> > 
> > Ah, you're right.  It even says so on the comment at the top (no
> > caffeine yet.)  But what it's doing is still buggy, per this report, so
> > we need to do *something* ...
> 
> Are you sure needs_freeze() is the problem here?
> 
> IIRC it already does some checks for allow_old? Why is the check for
> that not sufficient?

GetMultiXactIdMembers has this:
if (MultiXactIdPrecedes(multi, oldestMXact)){    ereport(allow_old ? DEBUG1 : ERROR,
(errcode(ERRCODE_INTERNAL_ERROR),    errmsg("MultiXactId %u does no longer exist -- apparent wraparound",
multi)));   return -1;}
 
if (!MultiXactIdPrecedes(multi, nextMXact))    ereport(ERROR,            (errcode(ERRCODE_INTERNAL_ERROR),
errmsg("MultiXactId%u has not been created yet -- apparent wraparound",                    multi)));
 

I guess I wasn't expecting that too-old values would last longer than a
full wraparound cycle.  Maybe the right fix is just to have the second
check also conditional on allow_old.

Anyway, it's not clear to me why this database has a multixact value of
6 million when the next multixact value is barely above one million.
Stephen said a wraparound here is not likely.

-- 
Álvaro Herrera                http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Training & Services



В списке pgsql-hackers по дате отправления:

Предыдущее
От: Michael Paquier
Дата:
Сообщение: Re: New parameter RollbackError to control rollback behavior on error
Следующее
От: Stephen Frost
Дата:
Сообщение: Re: MultiXactId error after upgrade to 9.3.4