[Pacemaker] pacemakerd does not daemonize?
Andrew Beekhof
andrew at beekhof.net
Wed Apr 17 05:36:12 UTC 2013
On 10/04/2013, at 2:21 PM, Andrei Belov <defanator at gmail.com> wrote:
>
> On Apr 10, 2013, at 2:06 , Andrew Beekhof <andrew at beekhof.net> wrote:
>
>>
>> On 09/04/2013, at 4:13 PM, Andrei Belov <defanator at gmail.com> wrote:
>>
>>> Hello pacemaker users,
>>>
>>> I noticed that neither "-p" nor "-f" option does not make any sense for pacemakerd -
>>> pid_file is never used, and "f" option marked as "Legacy".
>>>
>>> Is the ability to run as a daemon disappeared completely?
>>
>> Is "pacemakerd &" insufficient?
>> This is what the init script uses.
>
> That's ok, I just was a little confused by meaningless options in "pacemakerd --help".
I've updated it to:
[03:27 PM] beekhof at f17 ~/Development/sources/pacemaker/devel ☺ # mcp/pacemakerd --help
pacemakerd - Start/Stop Pacemaker
Usage: pacemakerd mode [options]
Options:
-?, --help This text
-$, --version Version information
-V, --verbose Increase debug output
-S, --shutdown Instruct Pacemaker to shutdown on this machine
-F, --features Display the full version and list of features Pacemaker was built with
Additional Options:
-f, --foreground (Ignored) Pacemaker always runs in the foreground
-p, --pid-file=value (Ignored) Daemon pid file location
Report bugs to pacemaker at oss.clusterlabs.org
>
>
>>> Also I'd like to know if there are any reasons to worry about the following:
>>
>> Absolutely... four processes crashed/aborted.
>>
>>>
>>> Apr 08 19:54:20 [6025] pacemakerd: info: pcmk_child_exit: Child process crmd exited (pid=6031, rc=0)
>>> Apr 08 19:54:20 [6025] pacemakerd: info: pcmk_child_exit: Child process pengine exited (pid=6030, rc=0)
>>> Apr 08 19:54:24 [6025] pacemakerd: notice: pcmk_child_exit: Child process attrd terminated with signal 6 (pid=6029, core=128)
>>> Apr 08 19:54:29 [6025] pacemakerd: notice: pcmk_child_exit: Child process lrmd terminated with signal 6 (pid=6028, core=128)
>>> Apr 08 19:54:33 [6025] pacemakerd: notice: pcmk_child_exit: Child process stonith-ng terminated with signal 6 (pid=6027, core=128)
>>> Apr 08 19:54:38 [6025] pacemakerd: notice: pcmk_child_exit: Child process cib terminated with signal 6 (pid=6026, core=128)
>>>
>>> Why some helper daemons could be terminated using abort() ?
>>
>> Something _really_ bad happened.
>
> I suspect something wrong with pacemaker + libqb and QB_IPC_SOCKET.
> Would appreciate any advices - my knowledge of pacemaker/libqb internals
> is very limited.
>
> It looks like the reason for abort() is somewhere in qb_ipcs_connection_unref():
This is on non-linux right?
I think Angus was of the opinion that $thing_i_cant_remember did reference counting a bit differently on non-linux.
I'm not sure he made much progress with it. Can you confirm which arch this is before we continue?
>
> Core was generated by `/opt/local/libexec/pacemaker/attrd'.
> Program terminated with signal 6, Aborted.
> #0 0xfffffd7fff0e061a in _lwp_kill () from /lib/64/libc.so.1
> (gdb) bt
> #0 0xfffffd7fff0e061a in _lwp_kill () from /lib/64/libc.so.1
> #1 0xfffffd7fff0d4ddd in thr_kill () from /lib/64/libc.so.1
> #2 0xfffffd7fff06a971 in raise () from /lib/64/libc.so.1
> #3 0xfffffd7fff0400a1 in abort () from /lib/64/libc.so.1
> #4 0xfffffd7fff0403f5 in _assert () from /lib/64/libc.so.1
> #5 0xfffffd7fc021274e in qb_ipcs_connection_unref () from /opt/local/lib/libqb.so.0
> #6 0x00000000004044f9 in main ()
>
> Core was generated by `/opt/local/libexec/pacemaker/cib'.
> Program terminated with signal 6, Aborted.
> #0 0xfffffd7fff0f061a in _lwp_kill () from /lib/64/libc.so.1
> (gdb) bt
> #0 0xfffffd7fff0f061a in _lwp_kill () from /lib/64/libc.so.1
> #1 0xfffffd7fff0e4ddd in thr_kill () from /lib/64/libc.so.1
> #2 0xfffffd7fff07a971 in raise () from /lib/64/libc.so.1
> #3 0xfffffd7fff0500a1 in abort () from /lib/64/libc.so.1
> #4 0xfffffd7fff0503f5 in _assert () from /lib/64/libc.so.1
> #5 0xfffffd7fc021274e in qb_ipcs_connection_unref () from /opt/local/lib/libqb.so.0
> #6 0x0000000000410438 in cib_shutdown ()
> #7 0xfffffd7fbfc5533f in crm_signal_dispatch (source=0x49be80, callback=<optimized out>, userdata=<optimized out>)
> at mainloop.c:203
> #8 0xfffffd7fc555f9e0 in g_main_context_dispatch () from /opt/local/lib/libglib-2.0.so.0
> #9 0xfffffd7fc555fd40 in g_main_context_iterate.isra.24 () from /opt/local/lib/libglib-2.0.so.0
> #10 0xfffffd7fc5560152 in g_main_loop_run () from /opt/local/lib/libglib-2.0.so.0
> #11 0x0000000000411056 in cib_init ()
> #12 0x000000000041163e in main ()
>
> Core was generated by `/opt/local/libexec/pacemaker/lrmd'.
> Program terminated with signal 6, Aborted.
> #0 0xfffffd7fff0e061a in _lwp_kill () from /lib/64/libc.so.1
> (gdb) bt
> #0 0xfffffd7fff0e061a in _lwp_kill () from /lib/64/libc.so.1
> #1 0xfffffd7fff0d4ddd in thr_kill () from /lib/64/libc.so.1
> #2 0xfffffd7fff06a971 in raise () from /lib/64/libc.so.1
> #3 0xfffffd7fff0400a1 in abort () from /lib/64/libc.so.1
> #4 0xfffffd7fff0403f5 in _assert () from /lib/64/libc.so.1
> #5 0xfffffd7fc021274e in qb_ipcs_connection_unref () from /opt/local/lib/libqb.so.0
> #6 0xfffffd7fc02128a4 in qb_ipcs_disconnect () from /opt/local/lib/libqb.so.0
> #7 0xfffffd7fc0212995 in qb_ipcs_unref () from /opt/local/lib/libqb.so.0
> #8 0xfffffd7fc02129c7 in qb_ipcs_destroy () from /opt/local/lib/libqb.so.0
> #9 0xfffffd7fbfc55a3f in mainloop_del_ipc_server (server=<optimized out>) at mainloop.c:517
> #10 0x00000000004041cd in lrmd_shutdown ()
> #11 0xfffffd7fbfc5533f in crm_signal_dispatch (source=0x48ad40, callback=<optimized out>, userdata=<optimized out>)
> at mainloop.c:203
> #12 0xfffffd7fc555f9e0 in g_main_context_dispatch () from /opt/local/lib/libglib-2.0.so.0
> #13 0xfffffd7fc555fd40 in g_main_context_iterate.isra.24 () from /opt/local/lib/libglib-2.0.so.0
> #14 0xfffffd7fc5560152 in g_main_loop_run () from /opt/local/lib/libglib-2.0.so.0
> #15 0x00000000004045e3 in main ()
>
> Core was generated by `/opt/local/libexec/pacemaker/stonithd'.
> Program terminated with signal 6, Aborted.
> #0 0xfffffd7fff11061a in _lwp_kill () from /lib/64/libc.so.1
> (gdb) bt
> #0 0xfffffd7fff11061a in _lwp_kill () from /lib/64/libc.so.1
> #1 0xfffffd7fff104ddd in thr_kill () from /lib/64/libc.so.1
> #2 0xfffffd7fff09a971 in raise () from /lib/64/libc.so.1
> #3 0xfffffd7fff0700a1 in abort () from /lib/64/libc.so.1
> #4 0xfffffd7fff0703f5 in _assert () from /lib/64/libc.so.1
> #5 0xfffffd7fc021274e in qb_ipcs_connection_unref () from /opt/local/lib/libqb.so.0
> #6 0xfffffd7fc02128a4 in qb_ipcs_disconnect () from /opt/local/lib/libqb.so.0
> #7 0xfffffd7fc0212995 in qb_ipcs_unref () from /opt/local/lib/libqb.so.0
> #8 0xfffffd7fc02129c7 in qb_ipcs_destroy () from /opt/local/lib/libqb.so.0
> #9 0x0000000000405e60 in ?? ()
> #10 0x0000000000407d28 in main ()
>
>
>
>>> Here's the backtrace for cib:
>>>
>>> Core was generated by `/opt/local/libexec/pacemaker/cib'.
>>> Program terminated with signal 6, Aborted.
>>> #0 0xfffffd7fff0f061a in _lwp_kill () from /lib/64/libc.so.1
>>> (gdb) bt
>>> #0 0xfffffd7fff0f061a in _lwp_kill () from /lib/64/libc.so.1
>>> #1 0xfffffd7fff0e4ddd in thr_kill () from /lib/64/libc.so.1
>>> #2 0xfffffd7fff07a971 in raise () from /lib/64/libc.so.1
>>> #3 0xfffffd7fff0500a1 in abort () from /lib/64/libc.so.1
>>> #4 0xfffffd7fff0503f5 in _assert () from /lib/64/libc.so.1
>>> #5 0xfffffd7fc021274e in qb_ipcs_connection_unref () from /opt/local/lib/libqb.so.0
>>> #6 0x0000000000410438 in cib_shutdown ()
>>> #7 0xfffffd7fbff2533f in crm_signal_dispatch (source=0x49be80, callback=<optimized out>, userdata=<optimized out>)
>>> at mainloop.c:203
>>> #8 0xfffffd7fc555f9e0 in g_main_context_dispatch () from /opt/local/lib/libglib-2.0.so.0
>>> #9 0xfffffd7fc555fd40 in g_main_context_iterate.isra.24 () from /opt/local/lib/libglib-2.0.so.0
>>> #10 0xfffffd7fc5560152 in g_main_loop_run () from /opt/local/lib/libglib-2.0.so.0
>>> #11 0x0000000000411056 in cib_init ()
>>> #12 0x000000000041163e in main ()
>>> (gdb)
>>>
>>>
>>> I'm using pacemaker 1.1.8 under SunOS (Joyent cloud).
>>>
>>>
>>> Best regards,
>>> Andrei
>
>
> _______________________________________________
> Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs: http://bugs.clusterlabs.org
More information about the Pacemaker
mailing list