Symptom & Impact
PowerHA reports a node as DOWN, and the resource group fails to come online (acquire) on the standby node.
Environment & Reproduction
An IBM AIX 7.2 LPAR running a PowerHA (formerly HACMP) cluster shows failover problems under standard PowerVM workloads. Capture the current resource group and cluster manager state:
clRGinfo
lssrc -ls clstrmgrES
Root Cause Analysis
Either the heartbeat network is lost, or a resource group (RG) dependency cycle prevents the standby node from acquiring the group.
Quick Triage
Confirm scope with errpt, recent changes, and subsystem state via lssrc.
errpt | grep -i CLUSTER
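lssrc -ls topsvcs # PowerHA 6.1 and earlier; CAA-based 7.x releases have no topsvcs subsystem, use lscluster -m there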
Step-by-Step Diagnosis
Trace the clstrmgr debug log and verify the network and disk heartbeat paths.
clcmd lssrc -ls clstrmgrES
tail /var/hacmp/log/clstrmgr.debug
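When the diagnosis is scripted, the cluster manager state can be pulled out of the lssrc output instead of being read by eye. The following is a minimal sketch, assuming the output contains a line of the form "Current state: ST_STABLE"; the function name cm_state is hypothetical and not part of PowerHA.

```shell
#!/bin/sh
# cm_state: read `lssrc -ls clstrmgrES` output on stdin and print the state.
# Assumption: the state appears on a line starting with "Current state:".
# Exits non-zero when no such line is found.
cm_state() {
    awk -F': *' '
        /^[ \t]*Current state:/ {
            sub(/[ \t]+$/, "", $2)   # drop trailing whitespace
            print $2
            found = 1
            exit
        }
        END { if (!found) exit 1 }
    '
}
```

Typical use would be `lssrc -ls clstrmgrES | cm_state`, or `clcmd lssrc -ls clstrmgrES` saved to a file and inspected per node. Any value other than ST_STABLE suggests the cluster manager is still processing an event or has not joined.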

Solution – Primary Fix
Force the RG to move to the healthy node and clear any sticky locations.
clRGmove -g rg1 -n node2 -m
clRGinfo -v
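A resource group move is not instantaneous, so a scripted fix may want to poll until the group is actually online on the target node. The sketch below assumes that `clRGinfo -s` prints colon-separated lines whose first three fields are group, state, and node; verify that against your PowerHA level before relying on it. The helper names wait_until and rg_is_online are hypothetical.

```shell
#!/bin/sh
# rg_is_online GROUP NODE: succeed if GROUP is ONLINE on NODE.
# Assumption: `clRGinfo -s` emits "group:state:node:..." lines.
rg_is_online() {
    clRGinfo -s 2>/dev/null | awk -F: -v rg="$1" -v node="$2" '
        $1 == rg && $2 == "ONLINE" && $3 == node { ok = 1 }
        END { exit !ok }
    '
}

# wait_until TRIES INTERVAL CMD [ARGS...]: rerun CMD every INTERVAL
# seconds until it succeeds or TRIES attempts have been made.
wait_until() {
    tries=$1
    interval=$2
    shift 2
    while [ "$tries" -gt 0 ]; do
        "$@" && return 0
        tries=$((tries - 1))
        [ "$tries" -gt 0 ] && sleep "$interval"
    done
    return 1
}
```

For example, `wait_until 30 10 rg_is_online rg1 node2` would check every 10 seconds for up to 30 attempts and return non-zero if rg1 never comes online on node2. The attempt count and interval are illustrative values, not recommendations.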

Solution – Alternative Approaches
Alternatively, correct any configuration drift between the nodes, then run cluster verification and synchronisation.
Verification & Acceptance Criteria
Confirm that the resource group is ONLINE on the intended node, the cluster manager reports a stable state (ST_STABLE), and errpt shows no new entries.
clRGinfo
lssrc -ls clstrmgrES
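To check for new errpt entries, one option is to save the errpt summary before applying the fix and compare it with the summary afterwards. This is a rough sketch under that assumption; the function name new_entries and the file paths in the usage note are hypothetical.

```shell
#!/bin/sh
# new_entries BEFORE AFTER: print lines that appear in AFTER but not in
# BEFORE (whole-line, fixed-string comparison). Exits non-zero when
# there are no new lines, which is the passing case for acceptance.
new_entries() {
    grep -F -x -v -f "$1" "$2"
}
```

A possible workflow: run `errpt > /tmp/errpt.before` ahead of the fix, `errpt > /tmp/errpt.after` once the cluster has settled, then `new_entries /tmp/errpt.before /tmp/errpt.after`. Because the summary timestamp has limited resolution, a repeat of an identical entry within the same timestamp would not show up as new, so treat an empty result as a strong hint rather than proof.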
Rollback Plan
Restore the prior configuration from a mksysb backup or an alternate-disk clone (alt_disk_copy, which supersedes alt_disk_install on current AIX levels) if the fix regresses.
clRGmove -g rg1 -n node1 -m # only after root cause cleared
Prevention & Hardening
Encode the fix in NIM customisation scripts and monitor via topas/nmon.
smit clverify # schedule verification weekly
Related Errors & Cross-Refs
errpt entries with LABEL=NETWORK_DOWN; topsvcs daemon stops.
References & Further Reading
IBM PowerHA SystemMirror Administration