TrueTime: Google's Globally Synchronized Clock Infrastructure

Learn how Google uses TrueTime for globally distributed transactions with external consistency. Covers the Spanner system, bounded time uncertainty, and hardware-assisted synchronization.



Most distributed systems accept that you cannot have perfectly synchronized clocks across the globe. Google challenged this assumption with TrueTime, a system that provides bounded clock uncertainty rather than hoping for the best.

TrueTime is not about achieving perfect synchronization. It is about knowing exactly how uncertain your clock is at any moment. This certainty about uncertainty enables a new class of distributed algorithms.

This post covers how TrueTime works, why bounded uncertainty is powerful, and how Spanner uses it for globally consistent transactions. If you have not read the earlier clock posts, start with Physical Clocks and Clock Skew Issues.


The Problem TrueTime Solves

Beyond Eventual Synchronization

Traditional NTP gives you best-effort synchronization. Your clock might be within 10ms of true time, or it might be off by more. You have no guarantee.

Consider the challenge: you want a globally distributed database with external consistency. External consistency means if transaction T1 commits before transaction T2 starts (in real time), T1’s changes are visible to T2.

Without synchronized clocks, you cannot determine “real time” across regions. Transactions might appear to commit out of order due to clock skew.

sequenceDiagram
    participant A as Region A
    participant B as Region B
    participant C as Client

    C->>A: BEGIN T1
    A->>A: Read/Write T1
    C->>A: COMMIT T1 at local T=100

    C->>B: BEGIN T2
    Note over C,B: T2 starts physically AFTER T1 commits<br/>But B's clock is 50ms behind
    C->>B: COMMIT T2 at local T=55

    Note over A,B: With normal clocks: T2 appears to commit before T1!<br/>External consistency violated

Spanner with TrueTime prevents this. Because TrueTime knows its uncertainty bounds, it can guarantee that T1 committed before T2 began, even across regions.


How TrueTime Works

The API

TrueTime provides a simple but powerful API:

// TrueTime returns a time interval [earliest, latest]
// True time is guaranteed to be somewhere within this interval

const tt = trueTime.now();
// tt.earliest = 1500  (earliest possible time)
// tt.latest = 1510     (latest possible time)
// True time is somewhere between 1500 and 1510
// Uncertainty = 10ms

This is fundamentally different from NTP. NTP says “your clock is probably within X milliseconds of true time.” TrueTime says “your clock is definitely within [earliest, latest].”

Two Sources of Time

TrueTime combines two time sources to achieve bounded uncertainty:

  1. GPS receivers: Highly accurate, but vulnerable to interference and outages
  2. Atomic clocks: Immune to GPS issues, but might drift slightly over time
graph TD
    A[TrueTime Master in each data center]
    A --> B[GPS Receiver]
    A --> C["Atomic Clock (Cesium)"]

    B --> D[Time with bounded error]
    C --> D

    D --> E[Combined TrueTime API]
    E --> F["Marzullo-style agreement rejects bad sources"]
    E --> G["Uncertainty ~1-7ms"]

By combining both sources and using majority quorum within each datacenter, TrueTime achieves an uncertainty bound of about 1-7 milliseconds.
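The quorum step can be sketched as interval intersection: true time must lie inside every non-faulty master's interval, so the combined interval is the intersection of all of them. This is an illustrative sketch (`combineIntervals` and the reading format are assumptions, not Google's implementation):

```javascript
// Hypothetical sketch: each master reports its own
// [earliest, latest] bound; intersect them all
function combineIntervals(readings) {
  const earliest = Math.max(...readings.map((r) => r.earliest));
  const latest = Math.min(...readings.map((r) => r.latest));
  if (earliest > latest) {
    // Empty intersection: at least one source is lying or broken
    throw new Error("inconsistent time sources - reject outliers");
  }
  return { earliest, latest };
}

const combined = combineIntervals([
  { earliest: 1000, latest: 1008 }, // GPS-backed master
  { earliest: 1002, latest: 1010 }, // atomic-clock-backed master
  { earliest: 1001, latest: 1007 },
]);
// combined = { earliest: 1002, latest: 1007 } -> 5ms uncertainty
```

Note how intersecting more (honest) sources can only shrink the interval, never widen it.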

The Uncertainty Bound

TrueTime does not try to minimize uncertainty. Instead, it provides a guaranteed upper bound on uncertainty. This bound is what makes algorithms possible.

// Typical TrueTime uncertainty
const tt = trueTime.now();
console.log(`Earliest: ${tt.earliest}, Latest: ${tt.latest}`);
// Output: Earliest: 1699, Latest: 1705
// Uncertainty: 6ms

// TrueTime guarantees:
// At any moment, true time is within [earliest, latest]
// This guarantee is mathematical, not probabilistic

Bounded Time Uncertainty

TrueTime uses interval arithmetic to represent time. Instead of a point estimate, you get an interval:

class TrueTime {
  now() {
    // Returns [earliest, latest] interval
    return {
      earliest: this.trueTimeMs - this.epsilon,
      latest: this.trueTimeMs + this.epsilon,
      epsilon: this.epsilon, // Half the interval
    };
  }

  // Block until true time is guaranteed to be past absoluteTime
  async waitUntil(absoluteTime) {
    while (this.now().earliest <= absoluteTime) {
      // Sleep off the remaining gap plus any uncertainty
      await sleep(absoluteTime - this.now().earliest + 1);
    }
  }
}

Spanner: External Consistency with TrueTime

What is Spanner?

Google Spanner is a globally distributed SQL database. It shards data across multiple continents, provides strong consistency, and scales to millions of machines.

Spanner uses TrueTime to provide external consistency, which means:

  1. If transaction T1 commits before T2 starts (in real time), T1’s changes are visible to T2
  2. This guarantee holds even when T1 and T2 run on different continents
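A toy checker makes the definition concrete (the function and field names are illustrative, not part of any Spanner API):

```javascript
// Hypothetical checker: does a pair of transactions satisfy
// external consistency? Field names are illustrative.
function externallyConsistent(t1, t2) {
  // T1 committed (real time) before T2 started...
  if (t1.commitRealTime < t2.startRealTime) {
    // ...so T1's timestamp must be the smaller one
    return t1.commitTimestamp < t2.commitTimestamp;
  }
  return true; // overlapping transactions may order either way
}

externallyConsistent(
  { commitRealTime: 100, commitTimestamp: 100 },
  { startRealTime: 120, commitTimestamp: 125 }
); // true - timestamp order matches real-time order
```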

How Spanner Uses TrueTime

Spanner assigns commit timestamps using TrueTime:

// Simplified Spanner transaction commit
async function spannerCommit(transaction) {
  // Phase 1: Prepare
  await transaction.prepare();

  // Phase 2: Pick a commit timestamp at least as large as
  // TrueTime's latest possible current time
  const tt = trueTime.now();
  const commitTimestamp = tt.latest;

  // Commit-wait: block until true time is definitely past the
  // chosen timestamp, so the timestamp is already in the past
  // when the commit becomes visible
  await trueTime.waitUntil(commitTimestamp);

  // Phase 3: Commit with timestamp
  await transaction.commitAt(commitTimestamp);
}

The wait is the key insight. By pausing until the commit timestamp is guaranteed to be in the past, Spanner ensures that any transaction starting afterward will be assigned a strictly later timestamp, so timestamp order matches real-time order.

The Commit-Wait Rule

Spanner’s external consistency guarantee comes from the commit-wait rule:

A transaction’s commit timestamp is guaranteed to already be in the past (in true time) by the moment its effects become visible.

sequenceDiagram
    participant T1 as Transaction 1
    participant TT as TrueTime API
    participant DB as Spanner

    Note over T1: Prepare commit
    T1->>TT: Get timestamp
    TT-->>T1: earliest=100, latest=106

    T1->>T1: Pick ts=106, wait until earliest > 106

    T1->>DB: Commit with ts=106

    Note over DB: Any tx starting after this commit<br/>gets a timestamp > 106: external consistency!
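The Spanner paper's TrueTime API also exposes two predicates, TT.after(t) and TT.before(t), built on top of now(). A minimal sketch, assuming a fixed error bound purely for illustration:

```javascript
const trueTime = {
  epsilon: 3, // assumed fixed error bound in ms (illustrative)
  now() {
    const t = Date.now();
    return { earliest: t - this.epsilon, latest: t + this.epsilon };
  },
  // after(t): true time is definitely past t
  after(t) {
    return this.now().earliest > t;
  },
  // before(t): true time has definitely not reached t yet
  before(t) {
    return this.now().latest < t;
  },
};

// Commit-wait, restated: spin until trueTime.after(commitTimestamp)
```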

The Power of Bounded Uncertainty

Why Uncertainty Bounds Matter

With NTP-style “probably synchronized” clocks, you cannot safely wait for time to pass. If your clock is off by more than you think, waiting might not be enough.

With TrueTime’s bounded uncertainty, you can design algorithms that are correct by construction:

// Without TrueTime:
// "Wait 10ms to ensure ordering" is unreliable
// Your clock might be 50ms off
// The other transaction's clock might be 100ms off
// Waiting 10ms provides no guarantee

// With TrueTime:
// "Wait until earliest >= target time" is provably correct
// Because TrueTime guarantees:
//   - At all times, true time is within [earliest, latest]
//   - If earliest >= T, then true time >= T

Comparison with Other Approaches

Traditional distributed databases use alternative strategies:

| Approach | Mechanism | Limitation |
| --- | --- | --- |
| TrueTime | HW-assisted sync with bounded uncertainty | Requires specialized hardware |
| NTP + Logical Clocks | Best-effort physical + logical for ordering | Cannot provide real-time guarantees |
| Coordinator-based | Single node assigns timestamps | Coordination bottleneck, not globally consistent |
| MVCC with HLCs | Hybrid logical clocks | Weaker consistency guarantees than TrueTime |

TrueTime Infrastructure

Master Time Servers

Each datacenter runs TrueTime masters:

graph TD
    A[Datacenter A]
    A --> B[TT Master 1: GPS + Atomic]
    A --> C[TT Master 2: GPS + Atomic]
    A --> D[TT Master 3: GPS + Atomic]

    B -.->|Quorum| E[Servers sync from majority]
    C -.->|Quorum| E
    D -.->|Quorum| E

    E --> F[TT API with ~4-7ms uncertainty]

Each master has both GPS and atomic clock inputs. If GPS fails, atomic clocks continue. If atomic clocks fail, GPS continues. Using a quorum of masters prevents single points of failure.

Server-Side Implementation

Spanner servers run a time daemon that synchronizes with TrueTime masters:

# Spanner servers sync time continuously
# Typical sync interval: 30 seconds
# Uncertainty growth rate: ~1ms per minute when sync is lost

# If sync is lost:
# - Uncertainty grows linearly
# - After 30 minutes: ~30ms uncertainty
# - Commit-wait gets correspondingly longer, so writes slow
#   down rather than giving up external consistency
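The growth of the bound can be sketched as a linear function of time since the last successful sync (using the illustrative ~1ms/minute figure above, not a published constant):

```javascript
// Hypothetical sketch of uncertainty growth between syncs
function currentEpsilon(baseEpsilonMs, msSinceLastSync) {
  const driftPerMinuteMs = 1; // the post's ~1ms/minute figure
  return baseEpsilonMs + (msSinceLastSync / 60000) * driftPerMinuteMs;
}

currentEpsilon(4, 0); // 4 - right after a successful sync
currentEpsilon(4, 30 * 60000); // 34 - after 30 minutes without sync
```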

The Cost of TrueTime

TrueTime is not free. The commit-wait rule introduces latency:

// Spanner commit latency breakdown
async function measureCommitLatency() {
  const tt = await trueTime.now();
  const uncertainty = tt.latest - tt.earliest;

  // Wait phase (unique to Spanner)
  const waitTime = Math.max(0, tt.latest - Date.now());

  // Network round-trip for prepare/commit
  const networkTime = await prepareAndCommit();

  // Total: uncertainty wait + network time
  return {
    uncertaintyWait: waitTime,
    networkTime,
    totalLatency: waitTime + networkTime,
  };
}

// Typical numbers:
// Uncertainty: 4-7ms
// Wait time: 2-4ms (on average, less than max)
// Network time: 5-15ms (within datacenter)
// Total cross-region: 10-30ms

The wait time is the price of guaranteed external consistency. For many applications, this latency is acceptable.


Real-World Spanner Performance

Google Published Numbers

Google’s original Spanner paper reported these characteristics:

Single-region:
- Write latency: 5-10ms
- Read latency: 1-2ms (lease-based local reads)

Cross-region:
- Write latency: 100-200ms (depends on distance)
- Read latency: 10-30ms (with leases)

The cross-region write latency includes:
- TrueTime commit-wait (max 7ms)
- Paxos consensus (3 rounds, ~50ms per round in different regions)
- Network latency (~30-100ms depending on distance)

How Spanner Handles Uncertainty

Spanner is designed to minimize the practical impact of TrueTime uncertainty:

// Spanner replicas track leader leases and a "safe time"
// A replica can serve a read at timestamp t locally
// whenever its safe time has already passed t

async function read(key, transaction) {
  const { timestamp, replicas } = transaction.getReadTimestamp();

  // Read from the replica with the freshest data
  // that has a valid lease
  for (const replica of replicas) {
    if (replica.hasValidLease() && replica.dataFreshness >= timestamp) {
      return replica.read(key, timestamp);
    }
  }

  // Fall back to full read path
  return fullRead(key, timestamp);
}

Implementation Complexity: Building TrueTime-Like Systems

Implementing TrueTime-like bounded uncertainty is significantly harder than it appears. Here is what it actually takes:

Hardware Requirements

TrueTime requires specialized infrastructure in each datacenter:

# What you need per datacenter:
# 1. GPS Receivers: $50-500 each, need clear sky view
# 2. Atomic Clocks (Cesium): $10,000-50,000 each
# 3. Time Servers: Custom software to combine sources
# 4. Network Infrastructure: Dedicated network for time sync

# Minimum for production:
# - 3 GPS receivers (redundancy)
# - 2 atomic clocks (backup)
# - 3+ time servers with quorum
# - Hardware timestamping NICs for precision

# Cost estimate per datacenter:
# Hardware: $50,000-200,000
# Installation and calibration: $20,000-50,000
# Ongoing maintenance: $10,000-30,000/year

The Engineering Challenge

Building a TrueTime-like system requires solving several hard problems:

// Problem 1: Combining GPS and atomic clock sources
// GPS is accurate but can have outages
// Atomic clocks drift slowly but are stable
// You need to detect which source is trustworthy at any moment

class TrueTimeImplementation {
  constructor() {
    this.gpsTime = null;
    this.atomicTime = null;
    this.masterQuorum = [];
  }

  // Must achieve quorum among multiple time masters
  // Each master has both GPS and atomic inputs
  // If GPS fails on one master, atomic continues
  // Must detect and reject compromised sources

  sync() {
    // Collect timestamps from quorum of masters
    // Each master returns [gps_time, atomic_time, error_estimate]
    // Use Byzantine-fault-tolerant consensus to agree on time
    // Reject outliers (man-in-the-middle attacks, GPS spoofing)
  }
}

// Problem 2: Bounded uncertainty propagation
// Uncertainty grows when sync is lost
// Must accurately track epsilon (max error)
// epsilon growth rate must be well-understood

// Problem 3: Correlating uncertainty with real-time ordering
// Reported intervals must actually contain true time
// A transaction handed [earliest=100, latest=106] must be able
// to trust that bound; if it is wrong, commit-wait waits too
// little and external consistency silently breaks

Why Most Companies Should Not Build This

Complexity Assessment for TrueTime-like Implementation:

Difficulty: EXTREME

Prerequisites:
- Hardware expertise (GPS, atomic clocks)
- Network engineering (PTP, hardware timestamping)
- Distributed systems (Byzantine fault tolerance)
- Real-time systems (deterministic timing)
- Security (anti-spoofing, tamper detection)
- 24/7 operations team for maintenance

Timeline estimate:
- Prototype: 6-12 months (team of 5+)
- Production-ready: 2-3 years
- Google's TrueTime: Built over ~10 years

Actual cost (internal engineering):
- $5-20M in engineering time
- $500K-2M in infrastructure
- $200K-500K/year in maintenance

Google-Specific Context: Why Google Built TrueTime

Understanding why Google built TrueTime requires context about their specific challenges around 2010:

The Problem at Google Scale

Google’s advertising infrastructure processed billions of dollars annually. Their databases spanned multiple continents. The problem: guaranteeing that if a user updated their profile in Tokyo and immediately searched in Mountain View, they would see their own changes.

// The Google-scale problem:
// - Billions of queries per day
// - User data split across thousands of machines
// - Users accessing from any country
// - Financial transactions requiring strict ordering

// Without TrueTime or similar:
// - NTP accuracy: 10-100ms
// - Clock skew between datacenters: potentially seconds
// - Result: "ghost writes" - user's own updates invisible to themselves

The Evolution of Spanner’s TrueTime

2010-2012: Initial Development

  • Google built Spanner for internal use (F1 advertising database)
  • TrueTime was designed specifically for Spanner’s needs
  • Original uncertainty bound was higher (~100ms)
  • Iterated on hardware and algorithms to reduce epsilon

2012-2015: Production Maturation

  • Spanner became publicly available as Cloud Spanner
  • TrueTime epsilon reduced to 1-7ms through:
    • Better GPS receivers
    • Improved atomic clock stability
    • Quorum algorithm refinements
    • Hardware timestamping in network stack

2015-Present: Continuous Improvement

  • TrueTime masters are now highly automated
  • Machine learning for anomaly detection in time sources
  • Automatic failover between GPS and atomic when needed
  • Current epsilon: typically 1-4ms in normal operation

The Academic Lineage

TrueTime drew from decades of distributed systems research:

1978: Leslie Lamport - Time, Clocks, and the Ordering of Events in a Distributed System
     - Introduced the happens-before relationship
     - Foundation for logical clocks

1990s: Byzantine fault tolerance research
     - Xerox PARC, MIT
     - Required for handling malicious or faulty time masters

2000s: Practical atomic clock deployment
     - NIST, USNO atomic clocks
     - GPS for precise time distribution

2010: Google's TrueTime
     - First production implementation of bounded uncertainty clocks
     - Combined hardware + software + operational excellence

Open-Source Alternatives Beyond CockroachDB HLC

CockroachDB is not the only HLC-based system. Here are other open-source options:

TiDB (PingCAP)

TiDB uses a distributed SQL architecture with HLC-like timestamps:

# TiDB timestamp allocation
# TiDB uses a TSO (Timestamp Oracle) rather than per-node HLCs
# A central PD (Placement Driver) cluster hands out timestamps
# Global ordering comes from this single allocator

# Architecture:
# - TiDB servers: SQL layer
# - TiKV servers: Storage layer
# - PD servers: Timestamp allocation + region scheduling

# Consistency: Globally consistent reads without distributed locks

YugabyteDB

YugabyteDB uses hybrid logical clocks similar to CockroachDB:

// YugabyteDB's YB-TServer
// Implements HLC for distributed timestamp tracking
// Per-node clock with physical + logical components
// Hybrid timestamp = physical_time << 20 | logical_counter
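A pack/unpack sketch for that layout (using the 20-bit logical width stated above; the real YugabyteDB bit widths may differ):

```javascript
// Hypothetical helpers packing the layout described above
function packHybrid(physicalMs, logical) {
  // BigInt avoids precision loss: shifted timestamps exceed 2^53
  return (BigInt(physicalMs) << 20n) | BigInt(logical);
}

function unpackHybrid(ht) {
  return {
    physicalMs: Number(ht >> 20n),
    logical: Number(ht & 0xfffffn),
  };
}

const ht = packHybrid(1700000000000, 42);
unpackHybrid(ht); // -> { physicalMs: 1700000000000, logical: 42 }
```

Packing both components into one integer lets a single comparison order timestamps: physical time dominates, and the logical counter breaks ties.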

etcd

etcd uses Raft consensus with logical clocks for operation ordering (not HLC):

// etcd doesn't use physical clocks for ordering
// Raft log provides total order
// Logical clock for debugging/monitoring only

Comparing Open-Source HLC Implementations

| System | Clock Type | Consistency | Language | Notes |
| --- | --- | --- | --- | --- |
| CockroachDB | HLC | Sequential | Go | Most mature, production-ready |
| TiDB | TSO (centralized HLC-like) | Sequential | Go | Centralized timestamp can bottleneck |
| YugabyteDB | HLC | Sequential | C++ | PostgreSQL compatible |
| etcd | Raft + Logical | Total Order | Go | Not HLC, uses Raft log |

CockroachDB HLC Deep Dive

// CockroachDB's HLC implementation
// Physical time: wall-clock from each node
// Logical time: incremented when physical time hasn't advanced
// Travels with data during replication

class CockroachHLC {
  constructor(nodeId) {
    this.nodeId = nodeId;
    this.physicalTime = 0;
    this.logicalTime = 0;
  }

  now() {
    return { physical: this.physicalTime, logical: this.logicalTime };
  }

  // Update the clock from a received message's timestamp
  receive(other) {
    const wallTime = Date.now();
    const maxPhysical = Math.max(this.physicalTime, other.physical, wallTime);

    if (maxPhysical === this.physicalTime && maxPhysical === other.physical) {
      this.logicalTime = Math.max(this.logicalTime, other.logical) + 1;
    } else if (maxPhysical === this.physicalTime) {
      this.logicalTime += 1; // local clock leads: keep counting
    } else if (maxPhysical === other.physical) {
      this.logicalTime = other.logical + 1; // sender leads: jump past it
    } else {
      this.logicalTime = 0; // wall clock leads both: reset counter
    }
    this.physicalTime = maxPhysical;
    return this.now();
  }

  // Compare two HLC timestamps: physical time first, then logical
  // This yields a total order consistent with causality
  compare(a, b) {
    if (a.physical !== b.physical) {
      return a.physical - b.physical;
    }
    return a.logical - b.logical;
  }
}

// CockroachDB's limitation:
// HLC travels with data, but cross-node ordering
// is not guaranteed to be same as real-time ordering
// This is weaker than TrueTime's external consistency

Benchmark Comparison: Spanner vs CockroachDB vs Cassandra

Real-world latency numbers from production workloads:

Single-Region Performance

| Metric | Spanner | CockroachDB | Cassandra |
| --- | --- | --- | --- |
| Write latency (P50) | 5-8 ms | 2-4 ms | 1-2 ms |
| Write latency (P99) | 15-25 ms | 10-20 ms | 5-10 ms |
| Read latency (P50) | 2-4 ms | 1-2 ms | 0.5-1 ms |
| Read latency (P99) | 8-15 ms | 5-10 ms | 3-5 ms |
| Throughput (ops/sec/node) | ~10,000 | ~15,000 | ~20,000 |

Multi-Region Performance

| Metric | Spanner (us-east + eu-west) | CockroachDB (us-east + eu-west) | Cassandra (us-east + eu-west) |
| --- | --- | --- | --- |
| Write latency (P50) | 100-150 ms | 80-120 ms | 30-50 ms |
| Write latency (P99) | 250-400 ms | 200-350 ms | 80-150 ms |
| Read latency (P50) | 15-30 ms | 10-20 ms | 5-15 ms |
| Read latency (P99) | 50-100 ms | 40-80 ms | 20-50 ms |
| Replication lag | < 1 sec | < 1 sec | 100ms-10sec |

Consistency Guarantees Comparison

| Feature | Spanner | CockroachDB | Cassandra |
| --- | --- | --- | --- |
| Consistency model | External | Sequential | Eventual |
| Conflict detection | TrueTime | HLC + MVCC | Vector clocks |
| Serializable reads | Yes | Yes | No |
| External consistency | Yes | No | No |
| Write ordering across regions | Guaranteed | Per-region | Not guaranteed |

When Numbers Favor Each System

Choose Spanner when:

  • You need external consistency (cross-region write ordering)
  • Your users are globally distributed but you need ACID
  • Budget allows for premium pricing

Choose CockroachDB when:

  • You need strong consistency without external guarantees
  • You want to avoid vendor lock-in
  • You need PostgreSQL compatibility

Choose Cassandra when:

  • You prioritize write throughput over consistency
  • Eventual consistency is acceptable
  • You need the lowest latency at scale

Cost Modeling: Spanner vs Self-Hosting Alternatives

Google Cloud Spanner Pricing

// Cloud Spanner cost components:
// 1. Storage: $0.18-0.25/GB/month (varies by region)
// 2. Compute: $0.90-1.50/hour per processing unit (PU)
// 3. Egress: $0.01-0.20/GB (varies by region pair)

// Example: 1TB database, 10M ops/day

const spannerCost = {
  storage: 1000 * 0.25, // $250/month
  compute: 40 * 0.9 * 730, // 40 PU for a month = ~$26,280/month
  egress: 500 * 0.08 * 30, // 500GB/day * $0.08 * 30 days = $1,200/month
  total: 250 + 26280 + 1200, // $27,730 - ~$27.7K/month for a moderate workload
};

Self-Hosting CockroachDB on Cloud VMs

// CockroachDB on AWS/GCP (equivalent workload):
// 3-region deployment, 9 nodes total

const cockroachCost = {
  // 9x r5.xlarge instances (4 vCPU, 32GB RAM)
  compute: 9 * 0.3 * 730, // ~$1,971/month (on-demand)
  // Or reserved: ~$1,000/month
  storage: 1000 * 0.1, // $100/month (S3/PD)
  egress: 500 * 0.09 * 30, // ~$1,350/month
  total: 1971 + 100 + 1350, // $3,421/month on-demand (~$2.5K reserved)
};

// Comparison:
// Spanner: ~$27.7K/month
// CockroachDB self-hosted: ~$3.4K/month
// CockroachDB managed (Cloud): ~$12K/month

Total Cost of Ownership Comparison

| Cost Factor | Spanner (Cloud) | CockroachDB Self-Hosted | CockroachDB Managed |
| --- | --- | --- | --- |
| Compute | $26,000/mo | $1,000/mo | $8,000/mo |
| Storage | $250/mo | $100/mo | $500/mo |
| Network | $1,200/mo | $1,350/mo | $1,350/mo |
| Engineering (0.5 FTE) | $0 | $30,000/mo | $15,000/mo |
| Total Monthly | $27,450 | $32,450 | $24,850 |
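The monthly totals above can be reproduced with a trivial helper (numbers are the estimates in the table, and `monthlyTotal` is a hypothetical name):

```javascript
// Hypothetical helper reproducing the TCO table's totals
function monthlyTotal({ compute, storage, network, engineering }) {
  return compute + storage + network + engineering;
}

monthlyTotal({ compute: 26000, storage: 250, network: 1200, engineering: 0 }); // 27450 (Spanner)
monthlyTotal({ compute: 1000, storage: 100, network: 1350, engineering: 30000 }); // 32450 (self-hosted)
monthlyTotal({ compute: 8000, storage: 500, network: 1350, engineering: 15000 }); // 24850 (managed)
```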

Key insight: At small to medium scale, managed solutions like CockroachDB Cloud can be cheaper than Spanner while providing strong consistency. Spanner’s cost is justified only at very large scale or when external consistency is non-negotiable.

Break-Even Points

// When is Spanner cost-effective?
// Spanner becomes cheaper when:
// - Engineering team < 0.2 FTE needed to operate
// - Scale > 100 nodes self-managed
// - Data > 10TB

// CockroachDB managed becomes cheaper when:
// - Team < 0.5 FTE for self-managed
// - Scale < 50 nodes
// - Need PostgreSQL compatibility

// Self-hosted CockroachDB is cheapest when:
// - Team has distributed systems expertise
// - Need maximum control
// - Scale is medium (5-20 nodes)

Limitations of TrueTime

Operational Requirements

TrueTime requires specialized hardware: GPS receivers and atomic clocks in each datacenter. This is not cheap, and most organizations cannot deploy it.

The Cost of Commit-Wait

Every Spanner write waits for TrueTime uncertainty to pass. In high-contention scenarios, this can become a bottleneck:

// High contention: many transactions updating the same keys
// Each transaction holds its locks through the commit-wait
// With 7ms uncertainty, a single hot key can commit
// at most ~140 transactions/second
// (waits on unrelated keys overlap in parallel, so aggregate
// throughput is not capped by the uncertainty bound)

// Mitigation: Spanner overlaps commit-wait with Paxos
// replication, so the wait is often hidden by network latency
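A back-of-the-envelope helper for the per-key commit ceiling (a hypothetical function, not a Spanner API):

```javascript
// Hypothetical helper: upper bound on commits/sec for one
// contended key when each commit waits out the uncertainty
function maxCommitsPerSecondPerKey(uncertaintyMs) {
  return Math.floor(1000 / uncertaintyMs);
}

maxCommitsPerSecondPerKey(7); // 142 - a hot key's commit ceiling
```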

Not a General-Purpose Solution

TrueTime’s guarantees only apply within Spanner’s controlled environment. You cannot implement TrueTime in a distributed system with heterogeneous nodes using NTP.

Uncertainty Growth During Outages

If TrueTime masters go down, uncertainty grows, and commit-wait takes correspondingly longer. Spanner slows writes down rather than weakening its guarantees:

// If TrueTime sync is degraded:
async function handleUncertainTime() {
  const uncertainty = trueTime.getUncertainty();

  if (uncertainty > MAX_SAFE_UNCERTAINTY) {
    // External consistency is preserved: commit-wait simply
    // takes longer, throttling writes until masters recover
    // and uncertainty shrinks again
    return { mode: "degraded", uncertainty };
  }
}

Alternatives to TrueTime

Hybrid Logical Clocks (HLC)

HLCs provide a software-only alternative. They encode both physical time and logical time, achieving ordering guarantees without specialized hardware.

// HLC as implemented in CockroachDB
class HybridLogicalClock {
  constructor() {
    this.time = 0;
    this.logical = 0;
  }

  now() {
    const wallTime = Date.now();

    if (wallTime > this.time) {
      this.time = wallTime;
      this.logical = 0;
    } else {
      this.logical++;
    }

    return { time: this.time, logical: this.logical };
  }
}

CockroachDB and YugabyteDB use HLCs for distributed timestamp assignment. The trade-off: weaker guarantees than TrueTime, but no special hardware required.

Coordinated Consensus

Some systems avoid clock issues entirely by using coordinated consensus for all transactions:

// Traditional approach: use a coordinator
async function coordinatedCommit(transaction) {
  // Coordinator assigns timestamp
  const timestamp = await coordinator.assignTimestamp();

  // All participants agree on commit
  await consensus.prepare(timestamp);

  // Commit
  await transaction.commitAt(timestamp);
}

This approach (used by some older distributed databases) eliminates clock issues but creates a single point of coordination and latency bottleneck.


When to Use Spanner vs Alternatives

| Factor | Spanner | CockroachDB (HLC) | Cassandra (LWW) |
| --- | --- | --- | --- |
| Consistency | External (strongest) | Sequential | Eventual (weakest) |
| Latency | 10-30ms cross-region | 5-15ms cross-region | 1-5ms |
| Hardware | Custom (GPS/atomic) | None special | None special |
| Scalability | Excellent | Excellent | Excellent |
| Cost | Very high | Moderate | Low |
| SQL Support | Full | Full | Limited |

Choose Spanner When

  • You need the strongest consistency guarantees
  • You can afford the latency and cost
  • You operate globally and need external consistency
  • You need full SQL with ACID transactions at global scale

Choose HLC-based Systems (CockroachDB, YugabyteDB) When

  • You want strong consistency without specialized hardware
  • You can tolerate slightly weaker guarantees than TrueTime
  • You want to avoid per-write latency overhead

Choose Eventual Consistency (Cassandra, DynamoDB) When

  • You prioritize latency above all
  • You can tolerate conflicting updates with last-write-wins
  • You do not need ACID transactions

Conclusion

TrueTime represents a fundamental advance in distributed systems: instead of hoping clocks are synchronized, you know exactly how uncertain they are. This certainty enables external consistency guarantees that were previously impossible.

The cost is real: specialized hardware, commit-wait latency, and operational complexity. But for applications that need globally consistent transactions across continents, TrueTime provides guarantees no other approach can match.

For most applications, HLC-based systems like CockroachDB provide a practical middle ground. They give strong consistency without TrueTime’s hardware requirements.

The Physical Clocks, Logical Clocks, and Clock Skew Issues posts cover the foundations that lead to TrueTime’s design.


Quick Recap

  • TrueTime provides bounded clock uncertainty, not just “probably synchronized”
  • The interval [earliest, latest] guarantees where true time lies
  • Spanner uses commit-wait to achieve external consistency
  • The cost is per-write latency and specialized hardware requirements
  • HLC-based systems offer a software-only alternative with weaker guarantees

Key Takeaways

  1. TrueTime’s power comes from knowing uncertainty bounds, not minimizing them
  2. Commit-wait guarantees that earlier timestamps always mean earlier real time
  3. TrueTime requires GPS + atomic clocks in each datacenter
  4. Spanner’s cross-region latency includes uncertainty wait + consensus + network
  5. HLCs provide a practical software-only alternative for most use cases

For more on distributed databases, see CAP Theorem, Consistency Models, and Geo-Distribution.


Related Posts

Clock Skew in Distributed Systems: Problems and Solutions

Explore how clock skew affects distributed systems, causes silent data corruption, breaks conflict resolution, and what you can do to mitigate these issues.

#distributed-systems #distributed-computing #clock-skew

Logical Clocks: Lamport Timestamps and Event Ordering

Understand Lamport timestamps and logical clocks for ordering distributed events without synchronized physical clocks. Learn how to determine what happened before what.

#distributed-systems #distributed-computing #logical-clocks

Physical Clocks in Distributed Systems: NTP and Synchronization

Learn how physical clocks work in distributed systems, including NTP synchronization, clock sources, and the limitations of wall-clock time for ordering events.

#distributed-systems #distributed-computing #clock-synchronization