Setup:
Manually terminated the node in 1c via the AWS console.
A new node comes up shortly after. Priam detects in its log that the old 1c node is dead.
is_replace_token returns true, and get_replaced_ip returns the IP of the dead node.
The old node is marked as '-dead' in AWS SDB and shows as DN in nodetool status.
The new node starts Priam and TRIES to start Cassandra, but Priam keeps telling Cassandra that the old 1c node is still in gossip. Cassandra cannot connect to the downed node and aborts on startup.
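For anyone trying to reproduce this, the two values above can be read off the new node with a small script. This is only a sketch: the base URL (host, port 8080, and the /Priam/REST/v1/cassconfig path) is an assumption about how Priam is deployed under Tomcat and may differ in your setup, and the helper names http_get and replacement_state are made up for illustration.

```python
import urllib.request

# ASSUMPTION: Priam's REST API is served by Tomcat on localhost:8080 under
# this base path. Adjust to match the actual deployment.
PRIAM_BASE = "http://localhost:8080/Priam/REST/v1/cassconfig"


def http_get(url, timeout=5):
    """Fetch a URL and return the response body as stripped text."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8").strip()


def replacement_state(fetch=http_get, base=PRIAM_BASE):
    """Return what Priam reports about token replacement on this node.

    `fetch` is injectable so the logic can be exercised without a live Priam.
    """
    is_replace = fetch(f"{base}/is_replace_token").lower() == "true"
    # Only ask for the replaced IP when Priam says it is replacing a token.
    replaced_ip = fetch(f"{base}/get_replaced_ip") if is_replace else None
    return {"is_replace_token": is_replace, "replaced_ip": replaced_ip or None}
```

Calling replacement_state() once right after the new node boots and once after the restart should show the flip from true to false described in this report.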
Fix:
Running service tomcat8 restart on the new node fixes the problem.
On the restart, is_replace_token returns false, so no IP is replaced and no gossip with dead nodes occurs.
After the restart, nodetool status on the other nodes shows the new node in place of the 'DN' node.
Questions:
Why is Cassandra not able to replace the dead node?
Why is Cassandra able to start successfully after the Priam restart, ignoring the dead node?
I think the protocol used by Priam is incorrect:
If is_replace = true and Priam is attempting to replace a downed node, that node might be unavailable altogether. Priam has explicitly marked the downed node as dead, so no communication with it should be expected.
Cassandra, when started in replace mode, attempts to talk to the downed node and fails whenever the node doesn't exist in gossip. Hence the replace can never happen without manual intervention.
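To make the claim concrete, here is a toy model of the startup decision as it is described in this thread. It is not Priam or Cassandra code: the function name startup_mode, its parameters, and the returned strings are all hypothetical, and it only encodes the behaviour reported above.

```python
def startup_mode(is_replace_token, replaced_ip, gossip_members):
    """Toy model of the reported behaviour (hypothetical, not real Priam code).

    is_replace_token: what Priam reports for this node
    replaced_ip:      IP of the dead node Priam wants to replace
    gossip_members:   set of IPs the cluster's gossip currently knows about
    """
    if not is_replace_token:
        # Restart case from the report: nothing is replaced, normal start.
        return "normal"
    if replaced_ip in gossip_members:
        # Replace mode can only proceed if gossip still knows the dead node.
        return "replace"
    # Replace mode requested for a node that gossip no longer knows: abort.
    return "abort"
```

In this model the first boot (is_replace_token true, dead node missing from gossip) ends in "abort", and the boot after the Tomcat restart (is_replace_token false) ends in "normal", which matches what was observed.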
@arunagrawal84 thx for helping out in the past, could you comment on this?