Re: emergency outage requiring database restart
От | Jim Nasby |
---|---|
Тема | Re: emergency outage requiring database restart |
Дата | |
Msg-id | 6d8cc07c-8504-0d2a-d088-fe47a407228f@BlueTreble.com обсуждение исходный текст |
Ответ на | Re: emergency outage requiring database restart (Merlin Moncure <mmoncure@gmail.com>) |
Ответы |
Re: emergency outage requiring database restart
|
Список | pgsql-hackers |
On 10/28/16 8:23 AM, Merlin Moncure wrote: > On Thu, Oct 27, 2016 at 6:39 PM, Greg Stark <stark@mit.edu> wrote: >> On Thu, Oct 27, 2016 at 9:53 PM, Merlin Moncure <mmoncure@gmail.com> wrote: >>> I think we can rule out faulty storage >> >> Nobody ever expects the faulty storage LOL > Believe me, I know. But the evidence points elsewhere in this case; > this is clearly application driven. FWIW, just because it's triggered by specific application behavior doesn't mean there isn't a storage bug. That's what makes data corruption bugs such a joy to figure out. BTW, if you haven't already, I would reset all your storage related options and GUCs to safe defaults... plain old FSYNC, no cute journal / FS / mount options, etc. Maybe this is related to the app, but the most helpful thing right now is to find some kind of safe config so you can start bisecting. I would also consider alternatives to plsh, just to rule it out if nothing else. I'd certainly look at some way to get sqsh out of the loop (again, just to get something that doesn't crash). First idea that comes to mind is a stand-alone shell script that watches a named pipe for a filename; when it gets that file it runs it with sqsh and does something to signal completion. -- Jim Nasby, Data Architect, Blue Treble Consulting, Austin TX Experts in Analytics, Data Architecture and PostgreSQL Data in Trouble? Get it in Treble! http://BlueTreble.com 855-TREBLE2 (855-873-2532) mobile: 512-569-9461
В списке pgsql-hackers по дате отправления: