Unplanned failover

Failover of application traffic to the DR target

Unplanned failover is the process of switching application traffic to the Target cluster when the Source cluster becomes unavailable. A common cause is an outage of the primary region.

Perform failover

If the Source becomes unavailable, use the following procedure to perform an unplanned failover to the Target and resume applications:

  1. Stop the application traffic to ensure no more updates are attempted.

  2. Navigate to the Disaster Recovery tab of your Source cluster.

  3. Note the Potential data loss on failover value to understand how much data may be lost as a result of the outage, and determine whether that loss is acceptable for your situation.

    • The potential data loss is computed as the safe time lag that existed at the current safe time on the Target.
    • Use the Tables tab to understand which specific tables have the highest safe time lag and replication lag.

    For more information on replication metrics, refer to Replication.

  4. To proceed, click Switchover and choose Failover.

  5. Enter the name of the Target and click Failover.

  6. Click Restart Replication.

  7. Resume the application traffic on the new Source.

At this point, the DR configuration is halted and needs to be repaired.
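The assessment in step 3 can be sketched in code. The following is a minimal illustration only, assuming you have copied the Potential data loss on failover value and the per-table lag figures from the Tables tab by hand. The names TableLag, worst_tables, and failover_acceptable are hypothetical and are not part of any product API.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class TableLag:
    """Per-table lag figures, as read manually from the Tables tab (hypothetical type)."""
    name: str
    safe_time_lag_ms: int
    replication_lag_ms: int


def worst_tables(tables: List[TableLag], top_n: int = 3) -> List[TableLag]:
    """Return the tables with the highest safe time lag, largest first."""
    return sorted(tables, key=lambda t: t.safe_time_lag_ms, reverse=True)[:top_n]


def failover_acceptable(potential_data_loss_ms: int, tolerance_ms: int) -> bool:
    """Compare the reported potential data loss with your own tolerance.

    The tolerance is a business decision (assumption); it is not supplied by the product.
    """
    return potential_data_loss_ms <= tolerance_ms


if __name__ == "__main__":
    # Illustrative numbers only, not real measurements.
    tables = [
        TableLag("orders", 1200, 900),
        TableLag("users", 300, 250),
        TableLag("events", 4500, 4000),
    ]
    for t in worst_tables(tables, top_n=2):
        print(t.name, t.safe_time_lag_ms)
    print(failover_acceptable(potential_data_loss_ms=4500, tolerance_ms=5000))
```

Listing the worst tables first shows which data is most at risk before you commit to the failover.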

Repair DR after failover

There are two options for repairing a DR configuration that has failed over:

  • If the original Source has recovered and is fully functional with no active alerts, you can configure DR to use the cluster as a Target.
  • If the original Source cannot be recovered, create a new cluster to act as the Target (see Prerequisites).

In both cases, repairing DR involves making a full copy of the databases through the backup-restore process.

To repair DR, do the following:

  1. Navigate to the Disaster Recovery tab of your new Source cluster (the former Target).

  2. Click Repair DR to display the Repair DR dialog.

  3. If the current Target (formerly the Source) has recovered and is fully functional with no active alerts, choose Reuse the current Target.

    To use a new cluster as the Target, choose Select a new cluster as Target and select the cluster.

  4. Click Initiate Repair.

After the repair is complete, if you want the Target (that is, the former Source if you chose to reuse it, or the new cluster you added as the Target) to become the Source, follow the steps for Planned switchover.

Important

Do not attempt a switchover until you have repaired DR.
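
The ordering described on this page (failover halts DR, repair restores it, and only then is a switchover allowed) can be modeled as a small state machine. This is a conceptual sketch, not the product's implementation. The DrState and DrConfig names, and the cluster names in the example, are hypothetical.

```python
from enum import Enum
from typing import Optional


class DrState(Enum):
    REPLICATING = "replicating"
    HALTED = "halted"


class DrConfig:
    """Toy model of a DR configuration (hypothetical, for illustration only)."""

    def __init__(self, source: str, target: str) -> None:
        self.source = source
        self.target = target
        self.state = DrState.REPLICATING

    def failover(self) -> None:
        # The Target is promoted to Source; the old Source becomes the
        # (not yet usable) Target, and DR is halted until it is repaired.
        self.source, self.target = self.target, self.source
        self.state = DrState.HALTED

    def repair(self, new_target: Optional[str] = None) -> None:
        if self.state is not DrState.HALTED:
            raise RuntimeError("DR is not halted; nothing to repair")
        # Either reuse the current Target or substitute a new cluster.
        if new_target is not None:
            self.target = new_target
        self.state = DrState.REPLICATING

    def switchover(self) -> None:
        # Guard that mirrors the note above: repair DR before switching over.
        if self.state is not DrState.REPLICATING:
            raise RuntimeError("Repair DR before attempting a switchover")
        self.source, self.target = self.target, self.source


if __name__ == "__main__":
    dr = DrConfig(source="cluster-a", target="cluster-b")
    dr.failover()    # cluster-b is now the Source; DR is halted
    dr.repair()      # reuse cluster-a as the Target
    dr.switchover()  # cluster-a is the Source again
    print(dr.source, dr.target, dr.state.value)
```

In this model, calling switchover while the state is HALTED raises an error, which mirrors the requirement to repair DR first.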