How It Works
A GCP Compute Engine CUD is a 1-year or 3-year commitment to consume a defined amount of compute resources, expressed in vCPUs and memory, within a specific region. In return, Google Cloud applies a discounted rate to that usage automatically, without requiring you to specify a particular machine type. The discount applies to Compute Engine virtual machines (VMs), GKE (Google Kubernetes Engine) clusters, and Cloud Run workloads. You pay the committed rate whether or not you consume the full amount, so accurate sizing is essential. On AWS, the equivalent mechanism is Reserved Instances or Savings Plans. On Azure, it is Reservations.
Why It Matters for Cloud Cost
Compute Engine is typically the largest line item on a GCP bill for companies running workloads at scale. Without a CUD in place, every hour of VM usage is billed at on-demand rates, which are the most expensive option Google offers. The discount available through CUDs is material: GCP Committed Use Discounts can reach up to 57% versus on-demand rates. The risk is on the other side: if you commit to more than your workloads actually consume, you pay for capacity you do not use. Most engineering and finance teams either under-commit to avoid waste or over-commit and absorb losses, leaving significant savings on the table either way.
Key Characteristics
- Commitments are scoped to a region and apply automatically to eligible Compute Engine, GKE, and Cloud Run usage without requiring instance-type locks.
- Two CUD types exist: resource-based CUDs commit to a specific amount of vCPU and memory, while spend-based CUDs commit to a minimum hourly dollar amount.
- Discounts apply as long as eligible usage exists, but unused commitment still incurs charges with no automatic rebate from Google.
- CUD terms are available in 1-year increments, with longer terms offering steeper discounts.
How Usage AI Handles This
Usage AI manages GCP Compute Engine CUDs on your behalf through its CoPilot and Autopilot products, purchasing and adjusting commitments daily based on your actual usage patterns, with cashback plus credits guaranteed on any underutilization. Usage AI owns the commitment, so your team carries zero financial risk if usage shifts.
See how Usage AI saves 30 to 50% on AWS, GCP, and Azure.
Common Questions
1. What is the difference between a resource-based and a spend-based Compute Engine CUD?
A resource-based CUD commits to a specific quantity of vCPUs and memory in a region and offers the steeper discount. A spend-based CUD commits to a minimum hourly dollar amount across eligible services and provides more flexibility across machine families, though typically at a lower discount rate.
2. What happens if my GCP usage drops below my committed amount?
Google still charges you for the full committed amount. Unlike AWS Savings Plans, which simply stop applying once usage falls short, a GCP CUD generates a charge for unused capacity with no automatic credit or rebate from Google. This is why commitment sizing accuracy matters and why underutilization protection, like the cashback guarantee Usage AI provides, is valuable.
3. Do GCP Compute Engine CUDs cover GKE and Cloud Run automatically?
Yes. Resource-based Compute Engine CUDs apply automatically to eligible vCPU and memory usage across Compute Engine VMs, GKE Standard clusters, and Cloud Run, as long as the usage occurs in the committed region. GKE Autopilot clusters have their own separate CUD type.