[Pacemaker] starting resources: Interrupted system call

Dejan Muhamedagic dejanmm at fastmail.fm
Thu Jul 1 15:45:22 UTC 2010


Hi,

On Thu, Jul 01, 2010 at 03:37:57PM +0200, Bernd Schubert wrote:
> Never mind, seems to be fixed in 1.0.9

I have no idea what was going on in there. The pacemaker bits
shouldn't make a difference.

Thanks,

Dejan

> Thanks,
> Bernd
> 
> On Thursday, July 01, 2010, Bernd Schubert wrote:
> > Hi all,
> > 
> > there seems to be a new regression in pacemaker-1.0.8 (or cluster-glue
> > or whatever, really difficult to differentiate the layers).
> > 
> > ul 01 15:04:37 phys-oss2 lustre_server[8571]: [8602]: INFO: Running start
> > for /dev/mapper/ost_demofs_8 on /lustre/demofs/ost_8 Jul 01 15:04:38
> > phys-oss2 lustre_server[8571]: [8620]: INFO: Running mount -t lustre 
> > /dev/mapper/ost_demofs_8 /lustre/demofs/ost_8 Jul 01 15:04:41 phys-oss2
> > cib: [8901]: info: write_cib_contents: Archived previous version as
> > /var/lib/heartbeat/crm/cib-57.raw Jul 01 15:04:42 phys-oss2 cib: [8901]:
> > info: write_cib_contents: Wrote version 0.638.0 of the CIB to disk
> > (digest: 13003c55305a60f2178e455f50 9530df)
> > Jul 01 15:04:43 phys-oss2 cib: [8901]: info: retrieveCib: Reading cluster
> > configuration from: /var/lib/heartbeat/crm/cib.KkBEWh (digest: /var
> > /lib/heartbeat/crm/cib.QRl46D)
> > Jul 01 15:04:46 phys-oss2 cib: [8946]: info: write_cib_contents: Archived
> > previous version as /var/lib/heartbeat/crm/cib-58.raw Jul 01 15:04:48
> > phys-oss2 cib: [8946]: info: write_cib_contents: Wrote version 0.639.0 of
> > the CIB to disk (digest: 0d88dd459c597542895265d5f8 bedb3a)
> > Jul 01 15:04:49 phys-oss2 cib: [8946]: info: retrieveCib: Reading cluster
> > configuration from: /var/lib/heartbeat/crm/cib.qsk62F (digest: /var
> > /lib/heartbeat/crm/cib.dhAidm)
> > Jul 01 15:04:50 phys-oss2 cib: [8948]: info: write_cib_contents: Archived
> > previous version as /var/lib/heartbeat/crm/cib-59.raw Jul 01 15:04:51
> > phys-oss2 cib: [8948]: info: write_cib_contents: Wrote version 0.640.0 of
> > the CIB to disk (digest: 8103866a901b72c8ff77443eb5 d2ae0f)
> > Jul 01 15:04:51 phys-oss2 cib: [8948]: info: retrieveCib: Reading cluster
> > configuration from: /var/lib/heartbeat/crm/cib.6xOswj (digest: /var
> > /lib/heartbeat/crm/cib.BHtHgd)
> > Jul 01 15:04:52 phys-oss2 cib: [8950]: info: write_cib_contents: Archived
> > previous version as /var/lib/heartbeat/crm/cib-60.raw Jul 01 15:04:53
> > phys-oss2 cib: [8950]: info: write_cib_contents: Wrote version 0.641.0 of
> > the CIB to disk (digest: dce9a7bf8b8d4f3fe53c6ca2ce399fb4)
> > Jul 01 15:04:54 phys-oss2 cib: [8950]: info: retrieveCib: Reading cluster
> > configuration from: /var/lib/heartbeat/crm/cib.VaA9y7 (digest:
> > /var/lib/heartbeat/crm/cib.x5WXTb)
> > Jul 01 15:04:54 phys-oss2 cib: [7987]: WARN: G_SIG_dispatch: Dispatch
> > function for SIGCHLD was delayed 240 ms (> 100 ms) before being called
> > (GSource: 0xef73700)
> > Jul 01 15:04:54 phys-oss2 cib: [7987]: info: G_SIG_dispatch: started at
> > 486502645 should have started at 486502621 Jul 01 15:04:55 phys-oss2 cib:
> > [8951]: info: write_cib_contents: Archived previous version as
> > /var/lib/heartbeat/crm/cib-61.raw Jul 01 15:04:56 phys-oss2 cib: [8951]:
> > info: write_cib_contents: Wrote version 0.642.0 of the CIB to disk
> > (digest: 72358cbb47103129ea2ec4db0ca09fa5)
> > Jul 01 15:04:57 phys-oss2 cib: [8951]: info: retrieveCib: Reading cluster
> > configuration from: /var/lib/heartbeat/crm/cib.AwzNsn (digest:
> > /var/lib/heartbeat/crm/cib.U8PVzD)
> > Jul 01 15:05:08 phys-oss2 lustre_server[8571]: [8957]: ERROR: cmd "mount -t
> > lustre  /dev/mapper/ost_demofs_8 /lustre/demofs/ost_8" failed:
> > mount.lustre: mount /dev/mapper/ost_demofs_8 at /lustre/demofs/ost_8
> > failed: Interrupted system call Jul 01 15:05:09 phys-oss2 crmd: [7991]:
> > info: process_lrm_event: LRM operation ost_demofs_8_start_0 (call=103,
> > rc=1, cib-update=220, confirmed=true) unknown error
> > 
> > 
> > Now the start timeout is set to 600s, so it I don't see why it should abort
> > the mount command:
> > 
> > primitive ost_demofs_8 ocf:ddn:lustre_server \
> >         params device="/dev/mapper/ost_demofs_8"
> > directory="/lustre/demofs/ost_8" \ op monitor interval="120" timeout="600"
> > \
> >         op start interval="0" timeout="700" \
> >         op stop interval="0" timeout="300" \
> >         meta resource-stickiness="0" target-role="Started"
> > is-managed="true"
> > 
> > 
> > 
> > Shall I open a bug entry and attach hb_report or is it a know issue?
> > 
> > 
> > Thanks,
> > Bernd
> 
> 
> _______________________________________________
> Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
> 
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker



More information about the Pacemaker mailing list