[Pacemaker] ERROR: te_graph_trigger: Transition failed: terminated pacemaker's problem or mine?
Andrew Beekhof
andrew at beekhof.net
Wed May 2 01:42:13 UTC 2012
On Mon, Apr 30, 2012 at 10:44 PM, Lars Ellenberg
<lars.ellenberg at linbit.com> wrote:
> On Mon, Apr 30, 2012 at 01:00:11PM +1000, Andrew Beekhof wrote:
>> On Sat, Apr 28, 2012 at 5:40 AM, Lars Ellenberg
>> <lars.ellenberg at linbit.com> wrote:
>> > On Fri, Apr 27, 2012 at 11:31:23AM +0100, Tim Small wrote:
>> >> Hi,
>> >>
>> >> I'm trying to get to the bottom of a problem I'm seeing with a cluster.
>> >> At this stage I'm unclear as to whether the issue is with the config or
>> >> not - the generated error messages seem unclear. So I'm not sure
>> >> whether I should be staring at the config or the source code at this
>> >> point, and would appreciate a clue!
>> >>
>> >> I'm running with some of the (live) resources in an unmanaged state
>> >> whilst testing fail-over with other (non-dependant) resources.
>> >>
>> >> The managed resources are a number of OpenVZ virtual machines (each
>> >> comprising 3 primitives - file-system + OpenVZ VE + SendArp). The
>> >> filesystems are on LVM volume groups, and the single LVM PV for each
>> >> volume group resides on a DRBD volume. There are n virtual machines per
>> >> DRBD volume.
>> >>
>> >> I'm running pacemaker 1.0.9.1+hg15626-1 on Debian 6.0. Here are some of
>> >> the messages (configuration follows at the end of the email):
>> >
>> > Upgrading to 1.0.12, or 1.1.7, may get you a little further.
>> > It would not solve the "I need to stop that resource first, but I can
>> > not as it is unmanaged" dependency problem you apparently have here.
>>
>> There's really not a lot the cluster can do in this situation, there's
>> a 50% chance of getting it wrong no matter what we do.
>> In the most recent versions we now log as loudly as possible
>> (LOG_CRIT) that we cant shutdown because something depends on an
>> unmanaged resource.
>
> That's in fact what I meant ;-)
>
> Not only the cryptic "ERROR: te_graph_trigger: Transition failed: terminated"
> but "Hey you fool, I cannot do that because you told me not to manage
> that resource, but the other ones depend on it".
>
> Though, you still have to spot that line in the flood...
We're working on that part too :-)
More information about the Pacemaker
mailing list