RE: Re: Parallel scan with SubTransGetTopmostTransaction assert coredump

Поиск
Список
Период
Сортировка
От Pengchengliu
Тема RE: Re: Parallel scan with SubTransGetTopmostTransaction assert coredump
Дата
Msg-id 000901d74b84$fa262890$ee7279b0$@tju.edu.cn
обсуждение исходный текст
Ответ на Re: Re: Parallel scan with SubTransGetTopmostTransaction assert coredump  (Greg Nancarrow <gregn4422@gmail.com>)
Ответы Re: Re: Parallel scan with SubTransGetTopmostTransaction assert coredump  (Pavel Borisov <pashkin.elfe@gmail.com>)
Re: Re: Parallel scan with SubTransGetTopmostTransaction assert coredump  (Greg Nancarrow <gregn4422@gmail.com>)
Список pgsql-hackers
Hi Greg,
  > I actually think that the Assert in SubTransGetTopmostTransaction() is correct, but in the parallel-worker case,
thesnapshots are not being setup correctly. 

   I agree with you that Assert in SubTransGetTopmostTransaction() is correct.  The root cause is Transaction Xmin are
notbeing setup correctly in the parallel-worker. 

   Actually I am very confused about ActiveSnapshot and TransactionSnapshot. I don't know why main process send
ActiveSnapshotand TransactionSnapshot separately.  And what is exact difference between them? 
   If you know that, could you explain that for me? It will be very appreciated.
   Before we know them exactly, I think we should not modify the TransactionSnapshot to ActiveSnapshot in main process.
Ifit is, the main process should send ActiveSnapshot only.   

Thanks
Pengcheng



-----Original Message-----
From: Greg Nancarrow <gregn4422@gmail.com>
Sent: 2021年5月17日 20:59
To: 刘鹏程 <pengchengliu@tju.edu.cn>
Cc: Andres Freund <andres@anarazel.de>; PostgreSQL-development <pgsql-hackers@postgresql.org>
Subject: Re: Re: Parallel scan with SubTransGetTopmostTransaction assert coredump

On Sat, May 15, 2021 at 12:37 PM 刘鹏程 <pengchengliu@tju.edu.cn> wrote:
>
>
>   BTW, I test it in a high performance server. It is verly easily be reproduced. My colleague and me use different
environmentboth can reproduce it. 
>

Hi Pengcheng,

Although the issue won't reproduce easily in my system, I can certainly see how, for the snapshots used in the parallel
workercase, the Active snapshot used is potentially an earlier snapshot that the Transaction snapshot. I don't know why
itis getting a newer Transaction snapshot in InitializeParallelDSM(), when it has previously pushed the return value of
GetTransactionSnapshot()as the Active snapshot. 

So I too hope Tom or Robert can explain what is going on here and how to resolve it (as you requested them to, in your
otherpost). 

I actually think that the Assert in SubTransGetTopmostTransaction() is correct, but in the parallel-worker case, the
snapshotsare not being setup correctly. 

Can you try the trivial change below and see if it prevents the coredump?

Regards,
Greg Nancarrow
Fujitsu Australia


diff --git a/src/backend/access/transam/parallel.c
b/src/backend/access/transam/parallel.c
index 14a8690019..870889053f 100644
--- a/src/backend/access/transam/parallel.c
+++ b/src/backend/access/transam/parallel.c
@@ -216,7 +216,7 @@ InitializeParallelDSM(ParallelContext *pcxt)
  int i;
  FixedParallelState *fps;
  dsm_handle session_dsm_handle = DSM_HANDLE_INVALID;
- Snapshot transaction_snapshot = GetTransactionSnapshot();
+ Snapshot transaction_snapshot = GetActiveSnapshot();
  Snapshot active_snapshot = GetActiveSnapshot();

  /* We might be running in a very short-lived memory context. */



В списке pgsql-hackers по дате отправления:

Предыдущее
От: Michael Paquier
Дата:
Сообщение: Re: Move pg_attribute.attcompression to earlier in struct for reduced size?
Следующее
От: David Rowley
Дата:
Сообщение: Re: Skip partition tuple routing with constant partition key