What is the discount difference between Flex CUDs and Standard CUDs?

Standard CUDs provide 28-57% discount off on-demand pricing for 1-year or 3-year commitments on specific resource types (vCPU and memory). Flex CUDs (spend-based commitments) provide 20-30% discount off on-demand pricing but apply to a broader range of services including Cloud SQL, Cloud Spanner, Cloud Run, and VMware Engine. The trade-off: Standard CUDs offer higher discounts but require committing to specific resources in specific regions. Flex CUDs offer lower discounts but apply across services and are more flexible if your workload mix shifts.

Can you combine Flex CUDs and Standard CUDs on Google Cloud?

Yes. Standard CUDs and Flex CUDs can coexist in the same Google Cloud project and billing account. Standard CUDs are applied first to eligible resources; Flex CUDs then cover remaining eligible spend. The recommended enterprise approach: use Standard CUDs for your stable, highest-confidence compute workloads (core VMs, databases on Compute Engine) and layer Flex CUDs on top for managed services (Cloud SQL, Cloud Run, Cloud Spanner) where resource-level commitment isn't practical.

What happens if you don't fully utilize a Google Cloud CUD?

CUD commitments are charged regardless of utilization. If you purchase a 1-year Standard CUD for 100 vCPUs and only consume 80 vCPUs on average, you're paying for 20 vCPUs of unused commitment at the committed price. Unlike AWS Reserved Instances, there is no secondary marketplace to sell unused GCP CUD capacity. The mitigation: size commitments at 70-80% of baseline utilization, leave 20-30% on Sustained Use Discounts (SUDs), and conduct quarterly commitment utilization reviews.

How do Sustained Use Discounts interact with CUDs on GCP?

Sustained Use Discounts (SUDs) apply automatically to any Compute Engine VM running more than 25% of a month, scaling to ~30% discount at 100% monthly uptime. SUDs and CUDs don't fully stack — CUDs take priority. If a resource is covered by a CUD, SUD doesn't additionally apply to that resource. The strategic implication: commit your most consistently running workloads with CUDs (higher discount), let variable workloads fall to SUDs (automatic, lower discount). Don't commit resources that run intermittently — let them accumulate SUDs naturally.

Google Cloud Flex CUDs vs Standard CUDs: Which Committed Use Discount Is Right for You?

Table of Contents

GCP's Discount Landscape: Three Mechanisms
Standard CUDs: Resource-Based Commitments
Flex CUDs: Spend-Based Commitments
Side-by-Side Comparison
When to Choose Standard CUDs
When to Choose Flex CUDs
Combining Flex and Standard CUDs
Sustained Use Discounts: The Interaction
Enterprise Agreements and CUDs
GCP CUD Optimization Strategy

GCP's Discount Landscape: Three Mechanisms

Google Cloud offers three distinct discount mechanisms that can apply simultaneously to your workloads, each requiring different decisions and providing different economics:

Sustained Use Discounts (SUDs) apply automatically to Compute Engine VMs that run for more than 25% of a month. No commitment required. Discounts scale from 0% to approximately 30% based on monthly usage fraction. They apply to N1, N2, N2D, C2, C2D, M1, and M2 machine families.

Standard Committed Use Discounts (Standard CUDs) provide 28-57% discounts in exchange for committing to specific vCPU and memory quantities in a specific region for 1 or 3 years. Higher discount, more specific commitment.

Flexible Committed Use Discounts (Flex CUDs) provide 20-30% discounts in exchange for committing to a minimum dollar-per-hour spend across eligible managed services. Lower discount, broader applicability, more flexibility.

Understanding how these three mechanisms interact — and which to prioritize for each workload — is the foundation of GCP cost optimization.

Standard CUDs: Resource-Based Commitments

Standard CUDs commit you to a specific amount of compute resources (vCPU and memory) in a specific GCP region for 1 or 3 years. In return, GCP provides its deepest available discounts:

Machine Family	1-Year Discount	3-Year Discount	Best For
General Purpose (N1/N2)	~28-37%	~46-57%	Web, APIs, microservices
Compute Optimized (C2/C2D)	~28-37%	~46-57%	High-performance computing
Memory Optimized (M1/M2)	~35-41%	~55-60%	In-memory databases, SAP
GPU (A100, T4, V100)	~20-35%	~40-55%	ML training, rendering

How Standard CUDs apply: You commit to, for example, 100 vCPUs and 400 GB memory in us-central1 for 1 year. GCP applies the CUD discount automatically to any compatible VM in that region — regardless of instance type — as long as the resource quantities are covered. You pay for the committed resources whether you use them or not.

Key flexibility feature: Standard CUDs apply at the resource level (vCPU + memory), not at the instance type level. A commitment for 100 vCPUs can cover ten n2-standard-8 instances (8 vCPU each) or twenty n2-standard-4 instances (4 vCPU each) — flexibility within resource families that makes Standard CUDs less rigid than AWS Reserved Instances.

Scope: Standard CUDs are project-scoped by default. To share CUDs across projects in a billing account, enable commitment sharing at the billing account level. This is important for organizations with separate projects for production, staging, and development — properly configured, CUDs can cover all environments efficiently.

Flex CUDs: Spend-Based Commitments

Flex CUDs (Flexible Committed Use Discounts) are GCP's spend-based commitment model — similar in concept to AWS Compute Savings Plans. Instead of committing to specific resource quantities, you commit to a minimum hourly spend level and receive a discount applied automatically to eligible services.

Eligible services for Flex CUDs include:

Cloud SQL (MySQL, PostgreSQL, SQL Server)
Cloud Spanner
Cloud Run
VMware Engine
Compute Engine (some instance types not covered by Standard CUDs)
Bare Metal Solution

Discount rate: Typically 20-30% off on-demand pricing, depending on the service and commitment term. Lower than Standard CUDs (28-57%) but applicable to services where Standard CUDs don't apply — particularly the managed database services that represent significant spend for enterprise workloads.

Commitment structure: You commit to a minimum spend in $/hour (e.g., $5.00/hour). GCP applies the Flex CUD discount to your eligible consumption up to that hourly commitment. If your actual hourly spend on eligible services exceeds the commitment, the excess is charged at on-demand rates (or other applicable discounts). If your actual spend is below the commitment, you still pay the committed amount.

Key Flex CUD Use Case: Cloud SQL Cloud SQL is one of the most common high-spend managed services for enterprise GCP customers — and it's not covered by Standard CUDs or SUDs. Flex CUDs are the primary discount mechanism for Cloud SQL. An enterprise spending $100K monthly on Cloud SQL can achieve $20-30K in monthly savings through appropriately sized Flex CUD commitments. This is one of the clearest value cases for Flex CUDs specifically.

Standard CUDs vs Flex CUDs: Side-by-Side Comparison

Feature	Standard CUDs	Flex CUDs
Discount Rate	28-57%	20-30%
Commitment Type	Resource quantity (vCPU + memory)	Spend level ($/hour)
Term Options	1 year or 3 years	1 year or 3 years
Primary Service Coverage	Compute Engine VMs	Cloud SQL, Spanner, Cloud Run, VMware Engine
Flexibility	Instance type flexible (within resource family)	Service flexible (within eligible service list)
Region Restriction	Single region	Single region
SUD Interaction	CUD takes priority over SUD	CUD takes priority over SUD
Cancellation	Non-cancellable	Non-cancellable
Over-commitment Risk	High (pay for unused resources)	Moderate (pay for unused spend commitment)

When to Choose Standard CUDs

Standard CUDs are the right choice when you have stable, predictable Compute Engine workloads with a high degree of confidence about resource requirements over the commitment term.

Ideal Standard CUD scenarios:

Core application infrastructure: Web servers, application servers, and microservices running on known instance types with consistent utilization. If you're running 50+ n2-standard-8 instances continuously, a Standard CUD for the corresponding vCPU and memory is straightforward.
Database VMs (Compute Engine): Self-managed MySQL, PostgreSQL, or MongoDB running on GCE instances. Standard CUDs cover the underlying compute; you manage the database software.
Long-term ML training infrastructure: GPU instances dedicated to ongoing ML training workloads. The high per-unit cost of GPUs means the 40-55% CUD discount represents very large absolute savings.
SAP workloads on M1/M2: Memory-optimized instances for SAP HANA or SAP applications have some of the highest CUD discounts (55-60% for 3-year). Organizations running SAP on GCP should almost always evaluate 3-year Standard CUDs for this tier.

Avoid Standard CUDs when: Your compute footprint changes significantly quarter-over-quarter, you're mid-migration between instance families, or workloads are project-based with defined end dates.

When to Choose Flex CUDs

Flex CUDs are the right choice when your highest-spend GCP services include managed databases, serverless compute, or other services not covered by Standard CUDs.

Ideal Flex CUD scenarios:

Heavy Cloud SQL usage: The clearest Flex CUD use case. If Cloud SQL represents $50K+/month in spend, a Flex CUD commitment at 70-80% of that baseline generates immediate and significant savings with lower commitment risk than Standard CUDs.
Cloud Run workloads: Serverless container workloads with predictable baseline traffic are excellent Flex CUD candidates. Commit to your baseline hourly spend; variable peaks above that pay on-demand.
Cloud Spanner: Globally distributed database with high per-unit costs. Flex CUDs provide meaningful discounts for stable Spanner workloads without requiring resource-level commitment.
Mixed managed services portfolio: If your GCP spend is spread across Cloud SQL, Cloud Run, and Spanner, a Flex CUD covering the combined baseline is more efficient than attempting separate Standard CUDs for compute components of each.

Avoid Flex CUDs when: All of your eligible spend is already covered by Standard CUDs or SUDs, or when your managed service usage is too variable to predict a consistent hourly baseline.

Combining Flex and Standard CUDs: The Enterprise Approach

For most enterprise GCP customers, the optimal strategy is to use both — Standard CUDs for core compute and Flex CUDs for managed services. The two mechanisms don't conflict; they complement.

Example layered approach for a $500K/month GCP customer:

$250K/month in Compute Engine VMs (N2, C2): 3-year Standard CUDs on 70% of baseline vCPU/memory (~$131K/month at ~57% discount savings $87K/month)
$100K/month in Cloud SQL: 1-year Flex CUD at $70K/month commitment (25% discount saves $17.5K/month)
$80K/month in Cloud Run: 1-year Flex CUD at $55K/month commitment (25% discount saves $13.75K/month)
$70K/month remaining (variable services, dev/test): On-demand or Sustained Use Discounts

Total committed spend: approximately $256K/month. Total monthly savings: approximately $118K/month ($1.4M annually). This layered approach generates 47% savings on committed workloads while maintaining flexibility on variable spend.

Note: this example excludes GCP Enterprise Agreement discounts, which would stack on top of these CUD savings for customers with formal EA relationships.

Sustained Use Discounts: The Interaction

Sustained Use Discounts don't stack with CUDs — CUDs take priority. But this creates an important strategic consideration for commitment sizing.

If you commit 100 vCPUs via Standard CUD and run 80 vCPUs consistently, you're paying for 20 vCPUs of unused commitment. Those 20 vCPUs don't generate any additional SUD — you've already paid for them at CUD pricing.

If instead you commit 70 vCPUs and run 80-100 vCPUs dynamically, the committed 70 receive CUD pricing and the remaining 10-30 vCPUs receive SUDs (up to 30% at full month utilization). This is often a better economic outcome than over-committing to achieve CUD coverage on all resources.

The 70-80% rule for Standard CUDs: Commit to 70-80% of your average monthly utilization. The top 20-30% of your usage footprint will accumulate substantial SUDs automatically — at no commitment risk. The combined effective discount is often within 5-10% of what full CUD coverage would provide, with significantly less over-commitment exposure.

Enterprise Agreements and CUDs: The Stack

GCP Enterprise Agreements provide an additional discount layer that stacks with CUDs. Enterprise Agreement discounts (typically 10-25% for customers committing $1M+ annually) apply to eligible spend including spend already covered by CUDs.

This makes the GCP EA particularly valuable: CUD discounts reduce unit costs, and then the EA discount reduces costs further. For an enterprise customer with a $5M annual GCP commitment:

3-year Standard CUDs on 60% of compute: ~50% discount saves $1.5M
Flex CUDs on managed services: ~25% discount saves $250K
Enterprise Agreement 15% discount on remaining consumption: saves additional $125K
Total annual savings vs on-demand: approximately $1.875M (37.5% of $5M baseline)

The EA negotiation should happen alongside CUD strategy — not independently. Sizing your CUD commitments correctly is part of demonstrating to GCP that you're a committed, high-value customer deserving of enterprise commercial terms.

If you're managing GCP spend above $1M annually and don't have an active Enterprise Agreement, the EA negotiation should be an immediate priority. Our team has negotiated GCP enterprise agreements for customers at multiple spend tiers and can advise on what's achievable at your level. Request a consultation.

GCP CUD Optimization Strategy: Practical Steps

Implementing an effective CUD strategy requires data before decisions. Here's the practical approach:

Step 1: Analyze 6-12 months of service-level spend. Use GCP's Cost Table report or BigQuery billing export to understand your spend distribution across Compute Engine, Cloud SQL, Cloud Run, Spanner, and other services. This determines which CUD types are relevant.

Step 2: Identify your compute baseline. For Compute Engine, pull hourly vCPU and memory utilization data by region. Identify your consistent minimum — the resource level you run 24/7. This is your Standard CUD commitment target (at 70-80% of minimum).

Step 3: Identify your managed services baseline. For Cloud SQL and Cloud Run specifically, calculate average hourly spend. Your Flex CUD commitment should be 65-75% of this average — conservative enough to avoid over-commitment while capturing the bulk of the discount opportunity.

Step 4: Model the combined discount rate. Compare on-demand cost vs CUD + SUD combination. The blended effective discount should be significantly better than either CUDs or SUDs alone for a well-structured commitment.

Step 5: Evaluate EA alongside CUDs. If total GCP spend exceeds $1M/year, initiate Enterprise Agreement discussions with your GCP account team. Present your CUD commitment plans as evidence of long-term platform commitment — this directly improves EA negotiating position.

Step 6: Review quarterly. CUD utilization should be reviewed every 90 days. Identify over-committed or under-utilized CUDs and adjust the next commitment cycle accordingly. Don't wait until annual renewal to identify commitment imbalances.

Final Thoughts

Flex CUDs and Standard CUDs are complementary tools, not competing choices. Standard CUDs deliver the highest discounts on predictable compute workloads; Flex CUDs unlock discounts on the managed services (Cloud SQL, Spanner, Cloud Run) that Standard CUDs can't cover. Enterprise GCP customers managing $500K+ in annual spend should almost always deploy both — and should evaluate Enterprise Agreement discounts as the additional layer that makes the combined savings compelling.

Google Cloud Flex CUDs vs Standard CUDs: Which Is Right for Your Enterprise?

GCP's Discount Landscape: Three Mechanisms

Standard CUDs: Resource-Based Commitments

Flex CUDs: Spend-Based Commitments

Standard CUDs vs Flex CUDs: Side-by-Side Comparison

When to Choose Standard CUDs

When to Choose Flex CUDs

Combining Flex and Standard CUDs: The Enterprise Approach

Sustained Use Discounts: The Interaction

Enterprise Agreements and CUDs: The Stack

GCP CUD Optimization Strategy: Practical Steps

Final Thoughts

Frequently Asked Questions

Ready to Optimize Your GCP Commitment Strategy?