[Pacemaker] starting resources: Interrupted system call

Bernd Schubert bs_lists at aakef.fastmail.fm
Thu Jul 1 13:11:52 UTC 2010


Hi all,

there seems to be a new regression in pacemaker-1.0.8 (or cluster-glue 
or whatever, really difficult to differentiate the layers).

ul 01 15:04:37 phys-oss2 lustre_server[8571]: [8602]: INFO: Running start for /dev/mapper/ost_demofs_8 on /lustre/demofs/ost_8
Jul 01 15:04:38 phys-oss2 lustre_server[8571]: [8620]: INFO: Running mount -t lustre  /dev/mapper/ost_demofs_8 /lustre/demofs/ost_8
Jul 01 15:04:41 phys-oss2 cib: [8901]: info: write_cib_contents: Archived previous version as /var/lib/heartbeat/crm/cib-57.raw
Jul 01 15:04:42 phys-oss2 cib: [8901]: info: write_cib_contents: Wrote version 0.638.0 of the CIB to disk (digest: 13003c55305a60f2178e455f50
9530df)
Jul 01 15:04:43 phys-oss2 cib: [8901]: info: retrieveCib: Reading cluster configuration from: /var/lib/heartbeat/crm/cib.KkBEWh (digest: /var
/lib/heartbeat/crm/cib.QRl46D)
Jul 01 15:04:46 phys-oss2 cib: [8946]: info: write_cib_contents: Archived previous version as /var/lib/heartbeat/crm/cib-58.raw
Jul 01 15:04:48 phys-oss2 cib: [8946]: info: write_cib_contents: Wrote version 0.639.0 of the CIB to disk (digest: 0d88dd459c597542895265d5f8
bedb3a)
Jul 01 15:04:49 phys-oss2 cib: [8946]: info: retrieveCib: Reading cluster configuration from: /var/lib/heartbeat/crm/cib.qsk62F (digest: /var
/lib/heartbeat/crm/cib.dhAidm)
Jul 01 15:04:50 phys-oss2 cib: [8948]: info: write_cib_contents: Archived previous version as /var/lib/heartbeat/crm/cib-59.raw
Jul 01 15:04:51 phys-oss2 cib: [8948]: info: write_cib_contents: Wrote version 0.640.0 of the CIB to disk (digest: 8103866a901b72c8ff77443eb5
d2ae0f)
Jul 01 15:04:51 phys-oss2 cib: [8948]: info: retrieveCib: Reading cluster configuration from: /var/lib/heartbeat/crm/cib.6xOswj (digest: /var
/lib/heartbeat/crm/cib.BHtHgd)
Jul 01 15:04:52 phys-oss2 cib: [8950]: info: write_cib_contents: Archived previous version as /var/lib/heartbeat/crm/cib-60.raw
Jul 01 15:04:53 phys-oss2 cib: [8950]: info: write_cib_contents: Wrote version 0.641.0 of the CIB to disk (digest: 
dce9a7bf8b8d4f3fe53c6ca2ce399fb4)
Jul 01 15:04:54 phys-oss2 cib: [8950]: info: retrieveCib: Reading cluster configuration from: /var/lib/heartbeat/crm/cib.VaA9y7 (digest: 
/var/lib/heartbeat/crm/cib.x5WXTb)
Jul 01 15:04:54 phys-oss2 cib: [7987]: WARN: G_SIG_dispatch: Dispatch function for SIGCHLD was delayed 240 ms (> 100 ms) before being called 
(GSource: 0xef73700)
Jul 01 15:04:54 phys-oss2 cib: [7987]: info: G_SIG_dispatch: started at 486502645 should have started at 486502621
Jul 01 15:04:55 phys-oss2 cib: [8951]: info: write_cib_contents: Archived previous version as /var/lib/heartbeat/crm/cib-61.raw
Jul 01 15:04:56 phys-oss2 cib: [8951]: info: write_cib_contents: Wrote version 0.642.0 of the CIB to disk (digest: 
72358cbb47103129ea2ec4db0ca09fa5)
Jul 01 15:04:57 phys-oss2 cib: [8951]: info: retrieveCib: Reading cluster configuration from: /var/lib/heartbeat/crm/cib.AwzNsn (digest: 
/var/lib/heartbeat/crm/cib.U8PVzD)
Jul 01 15:05:08 phys-oss2 lustre_server[8571]: [8957]: ERROR: cmd "mount -t lustre  /dev/mapper/ost_demofs_8 /lustre/demofs/ost_8" failed: 
mount.lustre: mount /dev/mapper/ost_demofs_8 at /lustre/demofs/ost_8 failed: Interrupted system call
Jul 01 15:05:09 phys-oss2 crmd: [7991]: info: process_lrm_event: LRM operation ost_demofs_8_start_0 (call=103, rc=1, cib-update=220, 
confirmed=true) unknown error


Now the start timeout is set to 600s, so it I don't see why it should abort the mount command:

primitive ost_demofs_8 ocf:ddn:lustre_server \
        params device="/dev/mapper/ost_demofs_8" directory="/lustre/demofs/ost_8" \
        op monitor interval="120" timeout="600" \
        op start interval="0" timeout="700" \
        op stop interval="0" timeout="300" \
        meta resource-stickiness="0" target-role="Started" is-managed="true"



Shall I open a bug entry and attach hb_report or is it a know issue?


Thanks,
Bernd

-- 
Bernd Schubert
DataDirect Networks



More information about the Pacemaker mailing list