Re: [HACKERS] Issues with logical replication
От | Andres Freund |
---|---|
Тема | Re: [HACKERS] Issues with logical replication |
Дата | |
Msg-id | 20171129234036.rddarwqoqly2a2ni@alap3.anarazel.de обсуждение исходный текст |
Ответ на | Re: [HACKERS] Issues with logical replication (Petr Jelinek <petr.jelinek@2ndquadrant.com>) |
Ответы |
Re: [HACKERS] Issues with logical replication
|
Список | pgsql-hackers |
On 2017-11-30 00:25:58 +0100, Petr Jelinek wrote: > Yes that helps thanks. Now that I reproduced it I understand, I was > confused by the backtrace that said xid was 0 on the input to > XactLockTableWait() but that's not the case, it's what xid is changed to > in the inner loop. > So what happens is that we manage to do LogStandbySnapshot(), decode the > logged snapshot, and run SnapBuildWaitSnapshot() for a transaction in > between GetNewTransactionId() and XactLockTableInsert() calls in > AssignTransactionId() for that same transaction. > > I guess the probability of this happening is increased by the fact that > GetRunningTransactionData() acquires XidGenLock so if there is > GetNewTransactionId() running in parallel it will wait for it to finish > and we'll log immediately after that. > > Hmm that means that Robert's loop idea will not help and ProcArrayLock > will not save us either. Maybe we could either rewrite XactLockTableWait > or create another version of it with SubTransGetParent() call replaced > by SubTransGetTopmostTransaction() as that will return the same top > level xid in case the input xid wasn't a subxact. That would make it > safe to be called on transactions that didn't acquire lock on themselves > yet. I've not really looked into this deeply, but afair we can just make this code accept that edgecase be done with it. As the comment says: * Iterate through xids in record, wait for all older than the cutoff to* finish. Then, if possible, log a new xl_running_xactsrecord.* --- highlight ---* This isn't required for the correctness of decoding, but to: --- highlight ---* a) allow isolationtester to notice that we're currently waiting for* something.* b) log a new xl_running_xactsrecord where it'd be helpful, without having* to write for bgwriter or checkpointer. Greetings, Andres Freund
В списке pgsql-hackers по дате отправления: