Table of Contents
- GCP's Discount Landscape: Three Mechanisms
- Standard CUDs: Resource-Based Commitments
- Flex CUDs: Spend-Based Commitments
- Side-by-Side Comparison
- When to Choose Standard CUDs
- When to Choose Flex CUDs
- Combining Flex and Standard CUDs
- Sustained Use Discounts: The Interaction
- Enterprise Agreements and CUDs
- GCP CUD Optimization Strategy
GCP's Discount Landscape: Three Mechanisms
Google Cloud offers three distinct discount mechanisms that can apply simultaneously to your workloads, each requiring different decisions and providing different economics:
Sustained Use Discounts (SUDs) apply automatically to Compute Engine VMs that run for more than 25% of a month. No commitment required. Discounts scale from 0% to approximately 30% based on monthly usage fraction. They apply to N1, N2, N2D, C2, C2D, M1, and M2 machine families.
Standard Committed Use Discounts (Standard CUDs) provide 28-57% discounts in exchange for committing to specific vCPU and memory quantities in a specific region for 1 or 3 years. Higher discount, more specific commitment.
Flexible Committed Use Discounts (Flex CUDs) provide 20-30% discounts in exchange for committing to a minimum dollar-per-hour spend across eligible managed services. Lower discount, broader applicability, more flexibility.
Understanding how these three mechanisms interact — and which to prioritize for each workload — is the foundation of GCP cost optimization.
Standard CUDs: Resource-Based Commitments
Standard CUDs commit you to a specific amount of compute resources (vCPU and memory) in a specific GCP region for 1 or 3 years. In return, GCP provides its deepest available discounts:
| Machine Family | 1-Year Discount | 3-Year Discount | Best For |
|---|---|---|---|
| General Purpose (N1/N2) | ~28-37% | ~46-57% | Web, APIs, microservices |
| Compute Optimized (C2/C2D) | ~28-37% | ~46-57% | High-performance computing |
| Memory Optimized (M1/M2) | ~35-41% | ~55-60% | In-memory databases, SAP |
| GPU (A100, T4, V100) | ~20-35% | ~40-55% | ML training, rendering |
How Standard CUDs apply: You commit to, for example, 100 vCPUs and 400 GB memory in us-central1 for 1 year. GCP applies the CUD discount automatically to any compatible VM in that region — regardless of instance type — as long as the resource quantities are covered. You pay for the committed resources whether you use them or not.
Key flexibility feature: Standard CUDs apply at the resource level (vCPU + memory), not at the instance type level. A commitment for 100 vCPUs can cover ten n2-standard-8 instances (8 vCPU each) or twenty n2-standard-4 instances (4 vCPU each) — flexibility within resource families that makes Standard CUDs less rigid than AWS Reserved Instances.
Scope: Standard CUDs are project-scoped by default. To share CUDs across projects in a billing account, enable commitment sharing at the billing account level. This is important for organizations with separate projects for production, staging, and development — properly configured, CUDs can cover all environments efficiently.
Flex CUDs: Spend-Based Commitments
Flex CUDs (Flexible Committed Use Discounts) are GCP's spend-based commitment model — similar in concept to AWS Compute Savings Plans. Instead of committing to specific resource quantities, you commit to a minimum hourly spend level and receive a discount applied automatically to eligible services.
Eligible services for Flex CUDs include:
- Cloud SQL (MySQL, PostgreSQL, SQL Server)
- Cloud Spanner
- Cloud Run
- VMware Engine
- Compute Engine (some instance types not covered by Standard CUDs)
- Bare Metal Solution
Discount rate: Typically 20-30% off on-demand pricing, depending on the service and commitment term. Lower than Standard CUDs (28-57%) but applicable to services where Standard CUDs don't apply — particularly the managed database services that represent significant spend for enterprise workloads.
Commitment structure: You commit to a minimum spend in $/hour (e.g., $5.00/hour). GCP applies the Flex CUD discount to your eligible consumption up to that hourly commitment. If your actual hourly spend on eligible services exceeds the commitment, the excess is charged at on-demand rates (or other applicable discounts). If your actual spend is below the commitment, you still pay the committed amount.
Standard CUDs vs Flex CUDs: Side-by-Side Comparison
| Feature | Standard CUDs | Flex CUDs |
|---|---|---|
| Discount Rate | 28-57% | 20-30% |
| Commitment Type | Resource quantity (vCPU + memory) | Spend level ($/hour) |
| Term Options | 1 year or 3 years | 1 year or 3 years |
| Primary Service Coverage | Compute Engine VMs | Cloud SQL, Spanner, Cloud Run, VMware Engine |
| Flexibility | Instance type flexible (within resource family) | Service flexible (within eligible service list) |
| Region Restriction | Single region | Single region |
| SUD Interaction | CUD takes priority over SUD | CUD takes priority over SUD |
| Cancellation | Non-cancellable | Non-cancellable |
| Over-commitment Risk | High (pay for unused resources) | Moderate (pay for unused spend commitment) |
When to Choose Standard CUDs
Standard CUDs are the right choice when you have stable, predictable Compute Engine workloads with a high degree of confidence about resource requirements over the commitment term.
Ideal Standard CUD scenarios:
- Core application infrastructure: Web servers, application servers, and microservices running on known instance types with consistent utilization. If you're running 50+ n2-standard-8 instances continuously, a Standard CUD for the corresponding vCPU and memory is straightforward.
- Database VMs (Compute Engine): Self-managed MySQL, PostgreSQL, or MongoDB running on GCE instances. Standard CUDs cover the underlying compute; you manage the database software.
- Long-term ML training infrastructure: GPU instances dedicated to ongoing ML training workloads. The high per-unit cost of GPUs means the 40-55% CUD discount represents very large absolute savings.
- SAP workloads on M1/M2: Memory-optimized instances for SAP HANA or SAP applications have some of the highest CUD discounts (55-60% for 3-year). Organizations running SAP on GCP should almost always evaluate 3-year Standard CUDs for this tier.
Avoid Standard CUDs when: Your compute footprint changes significantly quarter-over-quarter, you're mid-migration between instance families, or workloads are project-based with defined end dates.
When to Choose Flex CUDs
Flex CUDs are the right choice when your highest-spend GCP services include managed databases, serverless compute, or other services not covered by Standard CUDs.
Ideal Flex CUD scenarios:
- Heavy Cloud SQL usage: The clearest Flex CUD use case. If Cloud SQL represents $50K+/month in spend, a Flex CUD commitment at 70-80% of that baseline generates immediate and significant savings with lower commitment risk than Standard CUDs.
- Cloud Run workloads: Serverless container workloads with predictable baseline traffic are excellent Flex CUD candidates. Commit to your baseline hourly spend; variable peaks above that pay on-demand.
- Cloud Spanner: Globally distributed database with high per-unit costs. Flex CUDs provide meaningful discounts for stable Spanner workloads without requiring resource-level commitment.
- Mixed managed services portfolio: If your GCP spend is spread across Cloud SQL, Cloud Run, and Spanner, a Flex CUD covering the combined baseline is more efficient than attempting separate Standard CUDs for compute components of each.
Avoid Flex CUDs when: All of your eligible spend is already covered by Standard CUDs or SUDs, or when your managed service usage is too variable to predict a consistent hourly baseline.
Combining Flex and Standard CUDs: The Enterprise Approach
For most enterprise GCP customers, the optimal strategy is to use both — Standard CUDs for core compute and Flex CUDs for managed services. The two mechanisms don't conflict; they complement.
Example layered approach for a $500K/month GCP customer:
- $250K/month in Compute Engine VMs (N2, C2): 3-year Standard CUDs on 70% of baseline vCPU/memory (~$131K/month at ~57% discount savings $87K/month)
- $100K/month in Cloud SQL: 1-year Flex CUD at $70K/month commitment (25% discount saves $17.5K/month)
- $80K/month in Cloud Run: 1-year Flex CUD at $55K/month commitment (25% discount saves $13.75K/month)
- $70K/month remaining (variable services, dev/test): On-demand or Sustained Use Discounts
Total committed spend: approximately $256K/month. Total monthly savings: approximately $118K/month ($1.4M annually). This layered approach generates 47% savings on committed workloads while maintaining flexibility on variable spend.
Note: this example excludes GCP Enterprise Agreement discounts, which would stack on top of these CUD savings for customers with formal EA relationships.
Sustained Use Discounts: The Interaction
Sustained Use Discounts don't stack with CUDs — CUDs take priority. But this creates an important strategic consideration for commitment sizing.
If you commit 100 vCPUs via Standard CUD and run 80 vCPUs consistently, you're paying for 20 vCPUs of unused commitment. Those 20 vCPUs don't generate any additional SUD — you've already paid for them at CUD pricing.
If instead you commit 70 vCPUs and run 80-100 vCPUs dynamically, the committed 70 receive CUD pricing and the remaining 10-30 vCPUs receive SUDs (up to 30% at full month utilization). This is often a better economic outcome than over-committing to achieve CUD coverage on all resources.
The 70-80% rule for Standard CUDs: Commit to 70-80% of your average monthly utilization. The top 20-30% of your usage footprint will accumulate substantial SUDs automatically — at no commitment risk. The combined effective discount is often within 5-10% of what full CUD coverage would provide, with significantly less over-commitment exposure.
Enterprise Agreements and CUDs: The Stack
GCP Enterprise Agreements provide an additional discount layer that stacks with CUDs. Enterprise Agreement discounts (typically 10-25% for customers committing $1M+ annually) apply to eligible spend including spend already covered by CUDs.
This makes the GCP EA particularly valuable: CUD discounts reduce unit costs, and then the EA discount reduces costs further. For an enterprise customer with a $5M annual GCP commitment:
- 3-year Standard CUDs on 60% of compute: ~50% discount saves $1.5M
- Flex CUDs on managed services: ~25% discount saves $250K
- Enterprise Agreement 15% discount on remaining consumption: saves additional $125K
- Total annual savings vs on-demand: approximately $1.875M (37.5% of $5M baseline)
The EA negotiation should happen alongside CUD strategy — not independently. Sizing your CUD commitments correctly is part of demonstrating to GCP that you're a committed, high-value customer deserving of enterprise commercial terms.
If you're managing GCP spend above $1M annually and don't have an active Enterprise Agreement, the EA negotiation should be an immediate priority. Our team has negotiated GCP enterprise agreements for customers at multiple spend tiers and can advise on what's achievable at your level. Request a consultation.
GCP CUD Optimization Strategy: Practical Steps
Implementing an effective CUD strategy requires data before decisions. Here's the practical approach:
Step 1: Analyze 6-12 months of service-level spend. Use GCP's Cost Table report or BigQuery billing export to understand your spend distribution across Compute Engine, Cloud SQL, Cloud Run, Spanner, and other services. This determines which CUD types are relevant.
Step 2: Identify your compute baseline. For Compute Engine, pull hourly vCPU and memory utilization data by region. Identify your consistent minimum — the resource level you run 24/7. This is your Standard CUD commitment target (at 70-80% of minimum).
Step 3: Identify your managed services baseline. For Cloud SQL and Cloud Run specifically, calculate average hourly spend. Your Flex CUD commitment should be 65-75% of this average — conservative enough to avoid over-commitment while capturing the bulk of the discount opportunity.
Step 4: Model the combined discount rate. Compare on-demand cost vs CUD + SUD combination. The blended effective discount should be significantly better than either CUDs or SUDs alone for a well-structured commitment.
Step 5: Evaluate EA alongside CUDs. If total GCP spend exceeds $1M/year, initiate Enterprise Agreement discussions with your GCP account team. Present your CUD commitment plans as evidence of long-term platform commitment — this directly improves EA negotiating position.
Step 6: Review quarterly. CUD utilization should be reviewed every 90 days. Identify over-committed or under-utilized CUDs and adjust the next commitment cycle accordingly. Don't wait until annual renewal to identify commitment imbalances.
Final Thoughts
Flex CUDs and Standard CUDs are complementary tools, not competing choices. Standard CUDs deliver the highest discounts on predictable compute workloads; Flex CUDs unlock discounts on the managed services (Cloud SQL, Spanner, Cloud Run) that Standard CUDs can't cover. Enterprise GCP customers managing $500K+ in annual spend should almost always deploy both — and should evaluate Enterprise Agreement discounts as the additional layer that makes the combined savings compelling.
Related reading: Complete Guide to Enterprise Cloud Contract Negotiation | GCP CUD Optimization Guide | Cloud Cost Optimization: Committed Use Strategy | Google Cloud Vendor Intelligence