Re: BUG #18991: NUMA test fails on machine with two NUMA nodes when building 18beta2
От | Maximilian Bosch |
---|---|
Тема | Re: BUG #18991: NUMA test fails on machine with two NUMA nodes when building 18beta2 |
Дата | |
Msg-id | DBYOXT4OXK6C.HZEDWO892EXN@mbosch.me обсуждение исходный текст |
Ответ на | BUG #18991: NUMA test fails on machine with two NUMA nodes when building 18beta2 (PG Bug reporting form <noreply@postgresql.org>) |
Список | pgsql-bugs |
Hi! Quick follow-up from my side: I'd like to retract the bugreport, this is entirely an issue on our side. The sandbox hides sysfs, so PostgreSQL doesn't know about the NUMA topology at build-time, however the DB running in the test still sees pages from several nodes which is essentially what broke the test. Apologies for the unnecessary bugreport, I know it can be frustrating for upstreams to get reports that are essential "the distro messed it up". I only do this when I'm absolutely certain that a distro bug can be ruled out, this time I was wrong unfortunately. Have a nice Sunday Ma27 On Sat Jul 19, 2025 at 11:43 AM CEST, PG Bug reporting form wrote: > The following bug has been logged on the website: > > Bug reference: 18991 > Logged by: Maximilian Bosch > Email address: maximilian@mbosch.me > PostgreSQL version: 18beta2 > Operating system: NixOS 25.11 / unstable > Description: > > Hi! > > On NixOS master/unstable we provide betas of PostgreSQL to enable people > such as extension devs to quickly identify problems with it. > > When building 18beta2 with `--with-libnuma` and linking against libnuma.so > from numactl 2.0.18, I get the following test failure while building the > package on a machine with two NUMA nodes: > > ``` > diff -U3 /build/source/src/test/regress/expected/numa.out > /build/source/src/test/regress/results/numa.out > --- /build/source/src/test/regress/expected/numa.out 1970-01-01 > 00:00:01.000000000 +0000 > +++ /build/source/src/test/regress/results/numa.out 2025-07-19 > 08:44:02.793368816 +0000 > @@ -6,8 +6,4 @@ > -- switch to superuser > \c - > SELECT COUNT(*) >= 0 AS ok FROM pg_shmem_allocations_numa; > - ok > ----- > - t > -(1 row) > - > +ERROR: invalid NUMA node id outside of allowed range [0, 0]: 1 > ``` > > As mentioned above, the machine I used to build things has two NUMA nodes. > `numactl --hardware` gives > > ``` > node 0 cpus: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 > 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 > 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 > 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 192 193 > 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 > 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 > 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 > 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 > 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 > node 0 size: 773538 MB > node 0 free: 603763 MB > node 1 cpus: 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 > 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 > 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 > 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 > 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 > 189 190 191 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 > 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 > 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 > 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 > 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 > 380 381 382 383 > node 1 size: 767886 MB > node 1 free: 667997 MB > node distances: > node 0 1 > 0: 10 32 > 1: 32 10 > ``` > > I'm aware that there was already discussion about this in > https://www.postgresql.org/message-id/flat/E1u1tr8-003BbN-2E%40gemulon.postgresql.org, > but it seems like this was a different issue given that an error-code was > given instead of an actual NUMA id? Also, all of the discussion including a > fix seem to have happened well before 18beta2, hence I'm filing a new > report. > > Relevant downstream discussions: > * https://github.com/NixOS/nixpkgs/pull/411958#issuecomment-3031660150 > * https://github.com/NixOS/nixpkgs/pull/426118 > > Let me know if I can help with something, e.g. testing potential fixes. > > Best > Ma27
В списке pgsql-bugs по дате отправления: