Tony is a Software Engineer at LinkedIn, specializing in analytical databases. He has made significant contributions to LinkedIn's data infrastructure by leading a zero-downtime Kubernetes migration for the Apache Pinot production fleet, implementing fault-tolerant shard placement strategies, and developing a database node maintenance coordinator to optimize reliability and efficiency.

Presentations

23x

Zero-downtime Kubernetes migration of 14K Apache Pinot database fleet at LinkedIn

LinkedIn recently migrated its production Apache Pinot fleet from on-premises bare-metal hardware to Kubernetes with zero downtime. This tech talk will explore the technical journey, focusing on design choices, the challenges and trade-offs faced, and a balance of building custom tools versus leveraging existing solutions.

Key highlights include availability zone-aware data shard placement, automated OLAP table migrations with Airflow and Temporal, performance testing, pre- and post-migration validations, and disruption management. Lessons learned and valuable strategies for ensuring uninterrupted service-level objectives (SLOs) will also be shared.

See Presentation