Achieving Distributed Point-in-Time Consistency for Geo-Replication

Red Hat Computer Science, 2014-15

Liaison(s): Ian Colle, Greg Farnum ’09, Sam Just ’10, Sage Weil ’00
Advisor(s): Beth Trushkowsky
Students(s): Michael Saffron (PM), Nick Carter, Matt Cook, Philip Davis

Red Hat’s Ceph is an open source, hyper-scalable, distributed, strongly consistent file system. Red Hat is interested in supporting geo-replication of data in a Ceph instance, which requires the ability to take consistent, point-in-time, online snapshots of the complete distributed system. We propose and analyze an algorithm that uses time synchronization and transient write delays to implement this snapshotting feature while maintaining Ceph’s consistency and performance guarantees.