Abstract
Over the past decade, we've witnessed a fundamental shift in how infrastructure is built, deployed, and run. The rise of reliability engineering is a response to systems' increasing complexity and scale. Without its tools and methods, managing and monitoring the environments of hundreds or thousands of hosts and services is an unimaginable, impossible task.