Transactions Across Datacenters
(and other weekend projects)



Special Lecture Series in Computer Science
University of San Francisco
Feb. 12, 2009

Of three properties of distributed data systems - consistency, availability, partition tolerance - choose two.

  -Eric Brewer, CAP theorem, PODC 2000

Scaling is hard.

  -Various

What's ahead

GMail

Why transactions?

Cliched examples

What's ahead

Consistency

Weak consistency

Eventual consistency

Strong consistency

What's ahead

Why across datacenters?

Why not across datacenters?

What's ahead

Multihoming

Option 1: Don't.

Option 2: Primary with hot failover(s)

Option 3: True multihoming

What's ahead

Interested in...

Backups

Locking

Optimistic concurrency

Master/slave replication

Multi-master replication

Two Phase Commit

Paxos

What's ahead

Tradeoffs (very approximate)


Paxos ... 2PC ... MMR ... MSR ... Backups
Consistency Strong Eventual Weak
Latency High Low
Throughput Medium Low High
Data loss None Some Lots
Failover N/A Minimal impact Read only

Tradeoffs (very approximate)


Paxos ... 2PC ... MMR ... MSR ... Backups
Consistency Strong Eventual Weak
Latency High Low
Throughput Medium Low High
Data loss None Some Lots
Failover N/A Minimal impact Read only
GMail

In conclusion...

:'(

What's behind (phew!)

Questions?


(slides will be on snarfed.org)