Practices and Tactics for Surviving Oncall

Audience:

Topic:

As a Production Engineer on the Operating Systems team at Facebook, I'm a member of one of the more active oncalls for one of the largest fleets of servers in the world. In this talk I'm going to cover the mindset of our oncall, how the values of Production Engineering manifest in the practices of our oncalls, and how I ensure that the workload stays managable. Ignoring internal tooling, this will focus more on best practices and mindset.

Presentation:

surviving_oncall_scale18x.pdf

Room:

Ballroom F

Time:

Sunday, March 8, 2020 - 13:30 to 14:30

Audio/Video:

https://youtu.be/yOsgRMhbis4?t=389

Learn about the steps we’re taking to mitigate the risk against Coronavirus at SCALE 18x.

Practices and Tactics for Surviving Oncall