Errors with fence_scsi unfencing on Centos7 Pacemaker Cluster -
i attempting implement 2 node pacemaker cluster in centos 7, using pcs. have run apparent brick wall stonith fencing configuration , have been far unable resolve it.
our cluster running on 2 hyper-v virtual machines using vhds disk sets being presented shared scsi devices. attempting use 1 such shared disk fence_scsi fencing agent
my stonith resources configured thusly:
resource: fence_1 (class=stonith type=fence_scsi) attributes: pcmk_host_list="node_1 node_2" pcmk_monitor_action=metadata pcmk_reboot_action=off devices=/dev/disk/by-id/wwn-0x60.......... pcmk_host_check=static-list nodename=node_2 meta attrs: provides=unfencing resource: fence_2 (class=stonith type=fence_scsi) attributes: pcmk_host_list="node_1 node_2" pcmk_monitor_action=metadata pcmk_reboot_action=off devices=/dev/disk/by-id/wwn-0x60.......... pcmk_host_check=static-list nodename=node_1 meta attrs: provides=unfencing
i have tried various combinations of resources , parameters, including using 1 stonith resource , using more minimal set of parameters
when attempting test fencing agents usinf stonith_admin or pcs, node in question appears being fenced, in far node removed cluster, not power off or reboot, , resources running on fenced node stay in stopped state , not attempt move other node.
the corosync.log shows following errors each time fencing attempt made
jul 25 08:45:58 [64970] node_1 stonith-ng: error: remote_op_done operation on of node_2 <no-one> crmd.64947@node_1.79747baa: no such devices jul 25 08:45:58 [64974] node_1 crmd: error: tengine_stonith_notify: unfencing of node_2 <anyone> failed: no such device (-19)
i not have access red hat documentation , support articles behind subscription paywall , have got far using mixture of articles clusterlabs, centos , various blogs have been unable fencing / unfencing working.
any advice appreciated.
Comments
Post a Comment