All times are shown in UTC
One of our Cassandra nodes in us-east-1 is down due to an underlying hardware fault. This caused transient errors on the portion of our realtime service nodes that were connected to the faulty node. The server has now been isolated.
The faulty node has now been fully removed from the cluster, and all data has been successfully replicated to a new healthy node.
Resolved in 28 minutes