Commitment Discounts
Commitment Discounts
On-demand pricing is the cloud's retail price — it is what you pay when you walk in without a reservation. Every major cloud provider offers a parallel purchasing system that exchanges schedule flexibility for dramatically lower rates: AWS Savings Plans and Reserved Instances, GCP Committed Use Discounts (CUDs), and Azure Reserved VM Instances and Savings Plans. At scale these are not optional features; they are the single highest-leverage lever in FinOps. A mature programme running 70–80% coverage can reduce compute spend by 30–45% with zero architectural change. The discipline is in the strategy: knowing when to commit, how much to commit, and how to layer the instruments correctly.
The Three Instruments
Reserved Instances (RIs) are the original commitment product. On AWS you purchase capacity for a specific instance family, size, region (or zone), and operating system for 1 or 3 years. Standard RIs offer the deepest discounts — up to 72% off on-demand — but are rigid: you cannot change the instance type family. Convertible RIs allow exchanges at the cost of a shallower discount (around 54–66%). RIs are most appropriate for workloads where the instance type is unlikely to change: database servers, long-lived application servers, and baseline Kubernetes node pools.
Savings Plans (AWS, Azure) decouple the discount from a specific resource. Instead of buying a specific instance, you commit to a dollar-per-hour spend floor. AWS Compute Savings Plans apply automatically to any EC2 instance regardless of family, size, region, OS, or tenancy, and also cover Lambda and Fargate. EC2 Instance Savings Plans are narrower (one family, one region) but discount deeper — approaching RI rates. Because they follow the workload rather than the resource, Savings Plans are the right default for almost everything except databases.
GCP Committed Use Discounts come in two flavours: resource-based CUDs commit vCPU and memory amounts for 1 or 3 years at up to 57% off; flexible CUDs (spend-based, available for certain services) commit a dollar amount per hour similar to AWS Savings Plans. GCP also grants automatic Sustained Use Discounts for any instance that runs more than 25% of a month — no action required — which serves as a safety floor before you layer CUDs on top.
Coverage Strategy: Laddering
A naive approach buys RIs for whatever is running today at full commitment. The correct approach is laddering: spreading commitment terms and sizes across time so that you are never all-in on a prediction more than 12 months into the future.
The ladder model works as follows. Look at your trailing 90-day minimum baseline — the lowest hourly compute spend observed in the period, excluding incident-driven spikes. That floor represents the safest 3-year commitment candidate because the workload has never dropped below it. The next tier — trailing 90-day average minus the floor — is a reasonable 1-year commitment. Anything above the average stays on-demand (or Spot, covered in lesson 6). Each quarter you re-run the analysis and layer in new commitments as workloads stabilise.
In practice, most AWS teams target 70–80% coverage of their stable baseline with Compute Savings Plans (1-year, no-upfront or partial-upfront), and overlay standard or zonal RIs on top of the highest-utilisation node pools where the instance type has not changed in 18+ months. GCP teams follow a similar pattern: resource CUDs for the proven baseline, flexible CUDs for services like Cloud Run or GKE Autopilot where resource shapes change.
Purchasing in Practice — AWS CLI
Before buying, validate the recommendation with Cost Explorer's programmatic recommendation engine. Always review the UpfrontCost, EstimatedMonthlySavings, and EstimatedROI fields before any commitment.
get-savings-plans-utilization and pages the FinOps channel if utilisation drops below 85%. An underutilised Savings Plan that nobody notices is money thrown away — you are paying the committed rate but not consuming that much compute. This happens after a workload is decommissioned or right-sized mid-term.
Payment Options and Cash Flow
AWS and Azure offer three payment modes for RIs and Savings Plans: All Upfront, Partial Upfront, and No Upfront. All Upfront yields the deepest discount (typically 3–5% more than No Upfront) but ties up working capital. At $1M/year of compute, the all-upfront saving over no-upfront is roughly $30–50k — meaningful, but weigh it against cash flow needs and the company's cost of capital. Most growth-stage companies default to No Upfront or Partial Upfront and accept the slightly shallower discount in exchange for predictable monthly cash flow.
GCP CUDs are always billed monthly over the commitment term — there is no upfront option, simplifying the decision but removing the cash-flow arbitrage.
Handling Architecture Changes Mid-Term
The most common FinOps incident is buying a 3-year RI for an instance type that gets deprecated or replaced by a new generation within 18 months. AWS provides three escape hatches: Convertible RI exchanges (swap to a different instance family at equivalent or greater value), the RI Marketplace (sell unused Standard RIs to other AWS customers), and Savings Plans which simply follow your workload automatically. The GCP equivalent is that resource CUDs are flexible across most instance types within a family; if the workload disappears entirely, GCP offers no secondary market, and you are on the hook for the remaining term.
Terraform-Managed RI Purchases
At scale, managing hundreds of RIs manually through the console is error-prone. Codify commitments in Terraform so that RI purchases, tagging, and portfolio reviews are version-controlled and auditable.
The FinOps Review Cadence
Commitment portfolios are not fire-and-forget. A well-run programme runs a monthly FinOps review with three agenda items: coverage rate (target 70–80% of stable baseline), utilisation rate (target above 90%), and upcoming expiry calendar (what RIs expire in the next 90 days and should they be renewed, exchanged, or allowed to lapse). Pair this with quarterly ladder analysis to decide whether new commitments are warranted as workloads mature. Assign a clear DRI (directly responsible individual) — typically a senior platform engineer or a dedicated FinOps analyst — who owns the commitment calendar the way an SRE owns an SLO budget.