Runbooks¶
Step-by-step operational procedures for incidents and common DevOps tasks. Each runbook should be copy-paste ready -- a teammate at 2 AM should be able to follow it without additional context.
Registry¶
| Runbook | Date | Author | Related |
|---|---|---|---|
| StatefulSet incident resize | 2026-02-26 | John Adedigba | ADR-0001 |
Creating a new runbook¶
File naming: YYYY-MM-DD_short-description.md
Example: 2026-03-01_kafka-cluster-recovery.md
Template¶
# Runbook: <Title>
**Last updated:** YYYY-MM-DD
**Author:** Name
**JIRA:** [DEVOPSBLN-XXXX](https://tickets.digitalturbine.com/browse/DEVOPSBLN-XXXX)
## Overview
Brief description of what this runbook covers and when to use it.
## Prerequisites
- Access requirements
- Tools needed
- Environment context
## Procedure
1. Step one
2. Step two
3. ...
## Verification
How to confirm the procedure was successful.
## Rollback
Steps to undo the changes if something goes wrong.
## References
Links to related ADRs, test scenarios, dashboards, or tickets.
Warning
Runbooks are living documents. If you follow a runbook and a step is wrong or missing, fix it in the same MR as your change.