[Pacemaker] Possible bug?

Borislav Borisov borislav.v.borisov at gmail.com
Mon Sep 10 09:43:40 EDT 2012


Hi all,

I am experiencing a very strange issue. On two test boxes I have a setup
that should server NFS and iSCSI Targets (SCST).

When I create couple of iSCSI Target groups, composed of Target/Lun, and I
decide to remove one an error occurs:

  Sep 10 15:24:40 Cluster-Server-1 cib: [48709]: WARN: Managed
write_cib_contents process 54564 killed by signal 6 [SIGABRT - Abort].
  Sep 10 15:24:40 Cluster-Server-1 cib: [48709]: ERROR: Managed
write_cib_contents process 54564 dumped core
  Sep 10 15:24:40 Cluster-Server-1 cib: [48709]: ERROR:
cib_diskwrite_complete: Disk write failed: status=134, signo=6, exitcode=0
  Sep 10 15:24:40 Cluster-Server-1 cib: [48709]: ERROR:
cib_diskwrite_complete: Disabling disk writes after write failure

I then executed 'killall -PIPE cib' on both boxes to force a reload. More
resources were added, this time NFS ones, composed FS/ExportFS.
When I delete any of those there is no problem, however, the second that I
remove an iSCSI one the Disk writes are disabled with the error above.

A core file gets generated when that problem occurs, but it did not help me
much:
Reading symbols from /usr/libexec/pacemaker/cib...done.
[New LWP 54564]

warning: Can't read pathname for load map: Input/output error.
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1".
Core was generated by `/usr/libexec/pacemaker/cib'.
Program terminated with signal 6, Aborted.
#0  0x00007fb3d07e7475 in raise () from /lib/x86_64-linux-gnu/libc.so.6
(gdb) backtrace
#0  0x00007fb3d07e7475 in raise () from /lib/x86_64-linux-gnu/libc.so.6
#1  0x00007fb3d07ea6f0 in abort () from /lib/x86_64-linux-gnu/libc.so.6
#2  0x00007fb3d2578b2b in crm_abort (file=0x411323 "io.c",
function=0x412170 "write_cib_contents", line=662, assert_condition=0x412120
"retrieveCib(tmp1, tmp2, FALSE) != NULL", do_core=1, do_fork=0) at
utils.c:1659
#3  0x0000000000406d59 in write_cib_contents (p=0x0) at io.c:662
#4  0x00007fb3d1511863 in TempProcessTrigger (ginfo=0x6637a0) at
GSource.c:1792
#5  0x00007fb3d1510822 in G_TRIG_dispatch (source=0x664960, callback=0,
user_data=0x0) at GSource.c:1403
#6  0x00007fb3cfef94a3 in g_main_context_dispatch () from
/lib/libglib-2.0.so.0
#7  0x00007fb3cfef9c80 in ?? () from /lib/libglib-2.0.so.0
#8  0x00007fb3cfefa2f2 in g_main_loop_run () from /lib/libglib-2.0.so.0
#9  0x000000000040e905 in cib_init () at main.c:561
#10 0x000000000040df60 in main (argc=1, argv=0x7fff7f780ff8) at main.c:247

Something appears to be very wrong and I just can't figure out what. Any
help is appreciated.

Cheers.

P.S. I have attached the hb_report.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.clusterlabs.org/pipermail/pacemaker/attachments/20120910/880b292c/attachment-0002.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: hb_report-Mon-10-Sep-2012.tar.bz2
Type: application/x-bzip2
Size: 323822 bytes
Desc: not available
URL: <http://lists.clusterlabs.org/pipermail/pacemaker/attachments/20120910/880b292c/attachment-0002.bz2>


More information about the Pacemaker mailing list