Dealing with a Google Kubernetes Engine Cluster Outage

This post was originally posted to the heurekadevs.cz blog. Intro As the infrastructure team at Heureka, we are responsible for providing certain infrastructure components as a service (Infrastructure as a Service, or IaaS) to our developers. One such service is Kubernetes, for which we use the Google Kubernetes Engine, which serves as the core for many of our services, both in our “legacy” on-prem and in the cloud. Everything was running smoothly for a few months, until one day (luckily not while in full production yet); our pre-production (staging) cluster went down....

31 March, 2022 · Me