Oxygen vs Vercel vs Netlify: A/B Testing Commerce Performance
Introduction
Merchants migrating to Hydrogen often ask: Why should we host on Shopify Oxygen instead of Vercel or Netlify?
Synthetic benchmarks help, but the most persuasive evidence comes from competitor A/B testing β running identical storefronts across Oxygen, Vercel, and Netlify, then comparing performance region by region.
This post outlines how to run those tests and use them in client/stakeholder conversations.
Why Competitor A/B Testing?
- βοΈ Fair comparisons β same code, different host.
- π Region insights β show APAC vs EU vs NA variance.
- π Stakeholder clarity β visual data beats abstract promises.
- π Commerce context β measure cart mutations + checkout API latency, not just static TTFB.
Test Setup
Step 1. Deploy Identical Storefronts
- Deploy Hydrogen build to Oxygen, Vercel, and Netlify.
- Use same Storefront API + metafields.
Step 2. Define Metrics
- Median TTFB.
- p95/p99 latency (realistic user worst-case).
- Cold vs warm starts.
- Shopify API proximity (cart mutations, checkout calls).
Step 3. Collect Data
- Synthetic: k6, Lighthouse CI.
- Real-world: RUM beacons (segment by region/ISP).
Sample Benchmark Table
| Region | Oxygen TTFB |
Vercel TTFB |
Netlify TTFB |
Notes |
| NA (East) | ~80ms | ~70ms | ~90ms | Comparable, warm starts. |
| EU (Frankfurt) | ~120ms | ~110ms | ~130ms | All solid; Oxygen closer to Shopify APIs. |
| APAC (Sydney) | ~200ms | ~180ms | ~220ms | Prefetch matters most. |
π Data shows Oxygen slightly trails Vercel in cold starts, but wins in Shopify API latency for commerce flows.
Using Results with Stakeholders
- β Present regional charts in pitch decks.
- β Show Shopify API proximity advantage (cart, checkout faster on Oxygen).
- β Frame Oxygen as the best long-term bet for Shopify merchants.
- β Use Vercel/Netlify as βcontrol armsβ to validate choices.
Case Example: Footwear Brand Pitch
- Agency ran same PDP on all 3 hosts.
- Oxygen β slightly slower cold start, but cart mutations 20% faster.
- Vercel β great cold starts, but API latency higher.
- Client chose Oxygen, convinced by commerce-first advantage.
Guardrails
- β Always use same storefront code for fair test.
- β Test both synthetic + RUM.
- β Focus on commerce flows, not just static TTFB.
- β Update benchmarks yearly β hosts evolve fast.
Conclusion
Oxygen doesnβt always βwinβ in raw latency, but it consistently outperforms when measuring commerce-specific performance. Competitor A/B testing turns abstract arguments into clear evidence for merchants.
Donβt just tell clients Oxygen is best β show them the data.