Loading…
Wednesday February 19, 2025 2:00pm - 2:25pm PST
Ben Ryves, GetYourGuide, Staff Site Reliability Engineer
Maggie Slukova, GetYourGuide, Staff Site Reliability Engineer


How difficult is it to minimize the toil of cluster upgrades while increasing confidence in deploying the changes?

Even when managed, cluster upgrades are an unavoidable part of cluster maintenance, yet they can also be a source of toil, stress and outages; either when upgrading Kubernetes itself or additional cluster components. Manually testing changes in staging clusters, deploying with a high sense of uncertainty - these time consuming tasks can be mitigated with tests. Add nifty automation and cluster management efforts are reduced to a bare minimum.

Is investing in cluster tests and automation worth it? Yes! This talk shows how a team of three engineers keeps multiple clusters continuously up-to-date with minimal time investment, effort and disruptions. By leveraging the e2e-framework and Helm, everything from Istio to cluster autoscaler is tested and seamlessly managed.
Speakers
avatar for Ben Ryves

Ben Ryves

Staff Site Reliability Engineer, GetYourGuide
Ben works as an SRE at GetYourGuide, building automation and testing tooling to provision production Kubernetes clusters. His main focus is networking, security, and resource optimisation.
avatar for Maggie Slukova

Maggie Slukova

Staff Site Reliability Engineer, GetYourGuide
Maggie is a backend engineer turned SRE with a background in Mathematics. Her areas of focus are Kubernetes, Istio, cluster optimisation, autoscaling and automation. She loves building things, be it software, infrastructure, furniture or games.
Wednesday February 19, 2025 2:00pm - 2:25pm PST
OpsWorld
  OpsWorld

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Share Modal

Share this link via

Or copy link