anignoramuss

automation

3 posts

▸ Scaling the World's Largest Deployment System: From Bare Metal to Cloud Native

October 18, 2025 · 27 min read · deployment, infrastructure, automation, cloud, scaling, autoscaling

I'm building what might be the world's largest internal deployment system. In 2025, we expanded it from internal bare-metal infrastructure to native cloud resources. Here's what happens when you need to deploy to dynamic fleets defined by tags and integrate with autoscaling.

▸ How 'Production' Are You Really? Building a Resource Classification System

June 15, 2021 · 10 min read · safety, classification, infrastructure, automation, risk-management

When automation systems need to know whether they can touch a resource, 'is this production?' becomes a surprisingly complex question. I built a service to classify the productionness of infrastructure, and it was harder than you might think.

▸ Patching 25,000 Servers Without Breaking the Internet

November 08, 2020 · 9 min read · infrastructure, automation, scaling, deployment, distributed-systems

How do you deploy security patches across thousands of compute instances spanning multiple regions without causing an outage? Turns out, infrastructure-as-code starts to break at scale. Here's what I learned building a global patching system.
