Every disaster-recovery plan comes down to two targets: RTO and RPO. Get them right and you spend the appropriate amount protecting each system; get them wrong and you either overspend or discover the gap during an outage.
What is RTO (recovery time objective)?
RTO is how quickly a system must be back online after a disruption — the maximum tolerable downtime. A payment system might have an RTO of minutes; an internal reporting tool might tolerate a day.
What is RPO (recovery point objective)?
RPO is how much data you can afford to lose, measured in time — the maximum acceptable gap between your last good backup and the moment of failure. An RPO of 15 minutes means you must be able to recover to within 15 minutes of the incident.
A worked example
Suppose your order database fails at 2:14 p.m. With an RPO of 15 minutes, you can restore to no earlier than 1:59 p.m. (so backups/replication must run at least that often). With an RTO of one hour, the database must be serving orders again by 3:14 p.m. Those two numbers dictate your backup frequency and your recovery architecture.
How to set them
Start with a business impact analysis: for each system, quantify what an hour of downtime or an hour of lost data actually costs in revenue, compliance, and trust. Tighter targets cost more, so set them per system rather than applying one blanket number — reserve near-zero RTO/RPO for the systems that truly warrant it.
Matching a DR strategy to your targets
- Backup & restore — lowest cost, RTO/RPO in hours.
- Pilot light — core kept warm, faster restore.
- Warm standby — a scaled-down running copy, RTO in minutes.
- Hot standby / active-active — near-zero RTO and RPO, highest cost.
And remember: a plan you have never tested is a guess. RHC Solutions designs and tests these plans as part of business continuity and disaster recovery.