Re: PG_DUMP very slow because of STDOUT ??
От | Andras Fabian |
---|---|
Тема | Re: PG_DUMP very slow because of STDOUT ?? |
Дата | |
Msg-id | B1A1AD14D5F9D647BD2A00988C53B8220ACA3000@atradaex03.nbg.atrada.net обсуждение исходный текст |
Ответ на | Re: PG_DUMP very slow because of STDOUT ?? (Scott Marlowe <scott.marlowe@gmail.com>) |
Ответы |
Re: PG_DUMP very slow because of STDOUT ??
|
Список | pgsql-general |
Hi Scott, Although I can't guarantee for 100% that there was no RAID rebuild at some point, I am almost sure that it wasn't the case.Two machines - the ones which were already in production - exhibited this problem. Both of them were already up forsome weeks. Now, the reboot rather "fixed" one of them instead of making it worse (as your theory goes this way) the problem"disappeared" (but I don't know for how long). Now, only one of the production machines has the issue ... the onewhich wasn't rebooted. Strange, strange. Nevertheless thank you for your idea ... this is exactly the way I try to approachthe problem, by making some theories and trying to prove or disapprove them :-) Now I will try to further investigate along the tips from Craig and Greg. Andras Fabian -----Ursprüngliche Nachricht----- Von: Scott Marlowe [mailto:scott.marlowe@gmail.com] Gesendet: Dienstag, 13. Juli 2010 03:43 An: Andras Fabian Cc: Tom Lane; pgsql-general@postgresql.org Betreff: Re: [GENERAL] PG_DUMP very slow because of STDOUT ?? On Mon, Jul 12, 2010 at 7:03 AM, Andras Fabian <Fabian@atrada.net> wrote: > This STDOU issue gets even weirder. Now I have set up our two new servers (identical hw/sw) as I would have needed to doso anyways. After having PG running, I also set up the same test scenario as I have it on our problematic servers, andstarted the COPY-to-STDOUT experiment. And you know what? Both new servers are performing well. No hanging, and the 3GByte test dump was written in around 3 minutes (as expected). To make things even more complicated ... I went back to ourproduction servers. Now, the first one - which I froze up with oprofile this morning and needed a REBOOT - is performingwell too! It needed 3 minutes for the test case ... WTF? BUT, the second production server, which did not havea reboot, is still behaving badly. I'm gonna take a scientific wild-assed guess that your machine was rebuilding RAID arrays when you started out, and you had massive IO contention underneath the OS level resulting in such a slow down. Note that you mentioned ~5% IO Wait. That's actually fairly high if you've got 8 to 16 cores or something like that. It's much better to use iostat -xd 60 or something like that and look for IO Utilization at the end of the lines. Again, just a guess.
В списке pgsql-general по дате отправления: