Publication Data
Mesa: Geo-Replicated, Near Real-Time, Scalable Data Warehousing
Abstract: Mesa is a highly scalable analytic data warehousing system
that stores critical measurement data related to Google's Internet advertising
business. Mesa is designed to satisfy a complex and challenging set of user and systems
requirements, including near real-time data ingestion and queryability, as well as high
availability, reliability, fault tolerance, and scalability for large data and query
volumes. Specifically, Mesa handles petabytes of data, processes millions of row
updates per second, and serves billions of queries that fetch trillions of rows per
day. Mesa is geo-replicated across multiple datacenters and provides consistent and
repeatable query answers at low latency, even when an entire datacenter fails. This
paper presents the Mesa system and reports the performance and scale that it achieves.