Shaun is a Production Engineer at Meta, where they have been shaping integrations with the public cloud infrastructure landscape. They specialize in creating intuitive platforms for container creation and Kubernetes management, with a particular focus on building infrastructure-level software to support both generic compute and large-scale Artificial Intelligence HPC training clusters. Shaun's work is dedicated to enhancing accessibility and efficiency for developers, driving innovation, and simplifying cloud computing at Meta.

Presentations

21x

FBInstance to MetaInstance: Our Journey from Long-Running Mutable to Immutable Instances

Accepted

We'll discuss the challenges we faced when running Chef in the cloud: juggling build-time vs. runtime dependencies, attempting to test continuously moving updates, sharing cookbooks across different deployment environments, and being constrained to CentOS.

Later, we’ll reveal how we turned these challenges around by adopting an immutable instance philosophy: managing instance lifecycle via CI/CD pipelines, relying on build-time testing, reducing Chef's runtime footprint, and making critical host-level infrastructure portable by moving it to Docker containers.

20x

Scaling AMI and Container Image Lifecycle @ Meta

Pending Review

Join this talk to learn how Meta's Cloud Foundation team supports the registration, building, runtime, deployment, and deletion of image artifacts (AMIs, OCI containers) for the company’s dev teams, which have diverse public cloud access requirements.


22x

Evolving AI Research Infrastructure with Kubernetes at Meta: Overcoming Challenges and Lessons Learned

Accepted

We migrated our Slurm-based AI research infrastructure to Kubernetes, focusing on a seamless user experience. This required innovative solutions for managing complex systems, including provisioning, authentication, access control, and host management.

We leveraged Kubernetes features to build a scalable and flexible infrastructure, and developed new approaches to configuration management. Join us to learn about our journey and the lessons we learned along the way.

23x

Containers All the Way Down: What we learned running containers-in-containers @ Meta for AI & More

Accepted

Over the past few years, OCI containers and Kubernetes have become the backbone of Meta’s open-source cloud infrastructure. This talk explores the complexities of safely running containers inside containers (“nested containers”) without root privileges. It demystifies OCI container internals, highlights the latest open-source advancements enabling rootless deployments, and addresses the unique challenges posed by nested environments. Through production case studies, it shares lessons for secure, efficient container-in-container deployments.
