Обсуждение: pg_restore crashes passing NULL to strcmp (9.4 rc1)
I'm not sure what's wrong with this dump file; I remember copying it with rsync
some months ago, experiencing the crash, and recopying it, not able to break
off and go into debugging postgres at that point.
$ dpkg-query -W postgresql-client-9.4
postgresql-client-9.4 9.4~rc1-1
(gdb) r
Starting program: /usr/lib/postgresql/9.4/bin/pg_restore /home/pryzbyj/clientdb/ptci.d/ptci.segv
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1".
Program received signal SIGSEGV, Segmentation fault.
ReadToc (AH=AH@entry=0x55555577b3b0)
at /build/buildd/postgresql-9.4-9.4~rc1/build/../src/bin/pg_dump/pg_backup_archiver.c:2426
2426 /build/buildd/postgresql-9.4-9.4~rc1/build/../src/bin/pg_dump/pg_backup_archiver.c: No such file or directory.
2422 te->owner = ReadStr(AH);
2423 if (AH->version >= K_VERS_1_9)
2424 {
2425 if (strcmp(ReadStr(AH), "true") == 0)
2426 te->withOids = true;
2427 else
2428 te->withOids = false;
2429 }
2430 else
2431 te->withOids = true;
(gdb) p ReadStr(AH)
$1 = 0x0
On 12/06/2014 08:11 AM, Justin Pryzby wrote:
> I'm not sure what's wrong with this dump file; I remember copying it with rsync
> some months ago, experiencing the crash, and recopying it, not able to break
> off and go into debugging postgres at that point.
>
> $ dpkg-query -W postgresql-client-9.4
> postgresql-client-9.4 9.4~rc1-1
>
> (gdb) r
> Starting program: /usr/lib/postgresql/9.4/bin/pg_restore /home/pryzbyj/clientdb/ptci.d/ptci.segv
> [Thread debugging using libthread_db enabled]
> Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1".
>
> Program received signal SIGSEGV, Segmentation fault.
> ReadToc (AH=AH@entry=0x55555577b3b0)
> at /build/buildd/postgresql-9.4-9.4~rc1/build/../src/bin/pg_dump/pg_backup_archiver.c:2426
> 2426 /build/buildd/postgresql-9.4-9.4~rc1/build/../src/bin/pg_dump/pg_backup_archiver.c: No such file or
directory.
>
> 2422 te->owner = ReadStr(AH);
> 2423 if (AH->version >= K_VERS_1_9)
> 2424 {
> 2425 if (strcmp(ReadStr(AH), "true") == 0)
> 2426 te->withOids = true;
> 2427 else
> 2428 te->withOids = false;
> 2429 }
> 2430 else
> 2431 te->withOids = true;
Hmm. It's a bit hard to believe that the "te->withOids = true"
assignment could've segfaulted. So the line number probably isn't 100%
accurate.
A corrupted dump file can certainly cause a segfault in pg_restore, so
the dump file was probably somehow damaged while copying it with rsync.
Can't say more than that, I'm afraid, without the dump file itself.
> (gdb) p ReadStr(AH)
> $1 = 0x0
That tries reads the *next* string from the dump file, so it's not very
surprising that that fails. The previous ReadStr() call probably failed
already, and it would be more interesting to to see what went wrong there.
- Heikki
Heikki Linnakangas <hlinnakangas@vmware.com> writes:
> On 12/06/2014 08:11 AM, Justin Pryzby wrote:
>> I'm not sure what's wrong with this dump file; I remember copying it with rsync
>> some months ago, experiencing the crash, and recopying it, not able to break
>> off and go into debugging postgres at that point.
> A corrupted dump file can certainly cause a segfault in pg_restore, so
> the dump file was probably somehow damaged while copying it with rsync.
> Can't say more than that, I'm afraid, without the dump file itself.
Yeah, I think this sounds like "rsync screwed up".
> That tries reads the *next* string from the dump file, so it's not very
> surprising that that fails. The previous ReadStr() call probably failed
> already, and it would be more interesting to to see what went wrong there.
AFAICS, the only thing that would cause ReadStr to return NULL (rather
than failing internally) would be having read a negative length word
from the file. So it's hard to come to any conclusion other than that
the file is corrupt. There are various subcases of that but it all
comes out to the same place in the end --- especially if recopying
the original file produced a valid readable file.
Possibly this bug report should be interpreted as "we should strive
to make pg_restore yield an appropriate error message rather than ever
dumping core, no matter what's wrong with the input file". It would not
be too hard to fix this specific case, but trying to ensure that we've
gotten rid of every other case would probably be a huge amount of work
for not much reward :-(
regards, tom lane