If the cluster fails to start, what can you do and what should you
check?
Initial system start
If this is the first time you start the cluster and it fails,
check the following:
- Network - Check /etc/hosts
You should have (on the line for localhost)
127.0.0.1 localhost
and nothing else! The normal IP address to host name mappings follow after that line.
Red Hat and other distributions can add a lot of other entries to that line, with the result that the nodes cannot connect to each other (they will be stuck in phase 0 or phase 1).
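The /etc/hosts check above can also be automated. The following is only a minimal sketch in Python: the function name extra_localhost_names and the sample host name and IP address (ndb1, 192.168.0.10) are made up for illustration and are not part of any cluster tooling. It reports every name other than localhost that appears on a 127.0.0.1 line.

```python
def extra_localhost_names(hosts_text):
    """Return the names other than 'localhost' found on 127.0.0.1 lines."""
    extras = []
    for raw in hosts_text.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        fields = line.split()
        if fields[0] == "127.0.0.1":
            # Everything after the address is a host name or alias.
            extras.extend(name for name in fields[1:] if name != "localhost")
    return extras


# Hypothetical hosts file content with extra names on the localhost line.
sample = (
    "127.0.0.1 localhost localhost.localdomain ndb1\n"
    "192.168.0.10 ndb1\n"
)
print(extra_localhost_names(sample))  # ['localhost.localdomain', 'ndb1']
```

To check a real machine, pass the content of the file instead of the sample, e.g. extra_localhost_names(open("/etc/hosts").read()). An empty list means the 127.0.0.1 line contains only localhost.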
- Network - Check if you can ping the machines
- Network - Check if you have any firewalls enabled (e.g. check
with /sbin/iptables -L)
Disable the firewall in that case. Exactly how depends on OS and Linux distribution.
On Red Hat systems, SELinux might also be enabled. Googling "disable firewall <your …