Обсуждение: [GENERAL] Isolation of schema renames
I'm wondering if I can/should expect schema renames to be isolated.
For example, I have two schemas "test" and "test_new". Each with a "test" table (with same columns, but different data).BEGIN;
ALTER SCHEMA test RENAME TO test_old;
ALTER SCHEMA test_new RENAME TO test;
COMMIT;
In another transaction I do:
BEGIN;
SELECT * FROM test.test;
<first transaction occurs here on a different session>
SELECT * FROM test.test;
COMMIT;
My expectation is that both "SELECT" would return the same data (or that the transaction performing the rename is blocked until the select transaction is complete).
In testing this on 9.5.7 the behaviour I see is that the select "SELECT" returns different data. (Regardless of isolation level chosen).
Is it possible to perform this rename in any way?
If not (which I suspect is the case) is this documented somewhere, I couldn't find it (but that is probably me not looking hard enough / in the right place).
Thanks,
Ben
Ben Leslie <benno@benno.id.au> writes:
> I'm wondering if I can/should expect schema renames to be isolated.
Nope, you should not.
This is not an especially easy thing to fix, because to have the system
behave as you wish it did, your second transaction would have to be
ignoring already-committed DDL changes, and it is very easy to show
examples where that would be fatal.  For example, consider
S1                S2
create table t (f1 int, f2 bigint);
                begin;
                insert into t values(42, 43);  -- ok
begin;
alter table t add check (f1 < 100);
commit;
                insert into t values(1000, 44); -- ok?
begin;
alter table t alter column f2 type int;
commit;
                insert into t values(1, 100000000000); -- ok?
                commit;
Now, in practice this specific example doesn't go through anyway, because
after its first insert, S2's transaction is holding a lock on t that
precludes the ALTER TABLE steps.  But I only wrote it this way for
pedagogic purposes.  If S2 had done some things but not touched t2 yet,
the concurrent ALTERs would succeed, and then S2 has no choice but to
respect the results of that DDL.
There are other ways we could define the results of this sort of thing,
but they generally would end up failing one or both of your transactions.
Letting it "just work" is not in the picture.
            regards, tom lane
			
		On Thu, Aug 10, 2017 at 2:14 PM, Tom Lane <tgl@sss.pgh.pa.us> wrote: > Ben Leslie <benno@benno.id.au> writes: >> I'm wondering if I can/should expect schema renames to be isolated. > > Nope, you should not. > > This is not an especially easy thing to fix, because to have the system > behave as you wish it did, your second transaction would have to be > ignoring already-committed DDL changes, and it is very easy to show > examples where that would be fatal. As mentioned on bug #14691[1], I think it might make theoretical sense to do SSI checks on system catalogue tables + associated caches (though I don't claim that's easy). Then a SERIALIZABLE transaction would abort if you created a conflict cycle with some other session that has moved your cheese and it was also running in SERIALIZABLE isolation. [1] https://www.postgresql.org/message-id/flat/20170605191104.1442.24999%40wrigleys.postgresql.org -- Thomas Munro http://www.enterprisedb.com
On 10 August 2017 at 12:14, Tom Lane <tgl@sss.pgh.pa.us> wrote:
Thanks for the quick response, I appreciate it.
> I'm wondering if I can/should expect schema renames to be isolated.
Nope, you should not.
 
 
			
		Ben Leslie <benno@benno.id.au> writes:
Thanks for the quick response, I appreciate it.
> I'm wondering if I can/should expect schema renames to be isolated.
Nope, you should not.
That's fine. I think I can achieve what I want using an alternative approach.
This is not an especially easy thing to fix, because to have the system
behave as you wish it did, your second transaction would have to be
ignoring already-committed DDL changes, and it is very easy to show
examples where that would be fatal. For example, consider
This is a bit confusing to me, as it seems like for other similar DDL changes it does behave as I expect, and it's just the "ALTER SCHEMA" is a bit of an outlier.
Now, in practice this specific example doesn't go through anyway, because
after its first insert, S2's transaction is holding a lock on t that
precludes the ALTER TABLE steps.
Right, this was I guess my expectation, that ALTER SCHEMA would block in a similar way to how ALTER TABLE blocks.
But I only wrote it this way for
pedagogic purposes. If S2 had done some things but not touched t2 yet,
the concurrent ALTERs would succeed, and then S2 has no choice but to
respect the results of that DDL.
If I have:
S1                        S2
begin;                    begin;
                          select * from test.test; (takes AccessShareLock on test.test)
alter table test.test rename to test_old;   <--- blocks getting AccessExclusiveLock on test.test
select * from test.test; (returns same data as first select)
select * from test.test; (returns same data as first select)
By contrast:
S1                        S2
begin;                    begin;
                          select * from test.test; (takes AccessShareLock on test.test)
alter schema test rename to test_old;        <---- does not block
commit;            
                          select * from test.test; (fails )I'm not saying anything should change, but from a naive point of view, I feel like scenario one above is "doing the right thing" and S2 is getting repeatable reads as it would expect, and the second scenario seems to be "doing the wrong thing" because it is no longer getting a repeatable read. 
From my point of view it seems like some DDL changes are isolated, but schema changes are not isolated. [Maybe I'm not using 'isolated' in a formally correct way here, in this specific case I just mean that S2 doesn't see the changes done in S1, not that both S1 and S2 can make progress].
I don't know the PostgreSQL internals, but I'm guessing that there must not be locks on schemas as there are on tables, and/or taking schemas would be too much overhead to avoid the case shown above (which is fine, I just wanted to verify this is intended behaviour and make sure I wasn't doing something too stupid).
Cheers,
Ben