If you work on distributed systems, you probably try to design your system
to keep running if any single machine fails. If you’re ambitious, you
might extend this to entire racks, or even more inconvenient sets of
machines. However, what if your entire datacenter falls off the face of
the earth?
This talk will examine how current large scale storage systems handle
fault tolerance and consistency, with a particular focus on popular cloud
computing platforms. We’ll cover techniques such as replication, sharding,
two phase commit, and consensus protocols (e.g. Paxos), then explore how
they can be applied across datacenters.
Transactions Across Datacenters Slides
I’ve posted the slides from a talk I gave recently at USF: Transactions Across Datacenters. Benjamin Wells invited me to speak as part of the Special Lecture Series in Computer Science.
Here’s the abstract:
Feedback is welcome!