Kubernetes Debugging Recipe: Practical Steps to Diagnose Pods Like a Pro
(Fri, 24 Oct 2025)
Automation isn’t optional at enterprise scale. It’s resilient by design. Kubernetes provides remarkable scalability and resilience , but when pods crash, even seasoned engineers struggle to
translate complex and cryptic logs and events.
This guide walks you through the spectrum of AI-powered root cause analysis and manual debugging, combining command-line reproducibility and predictive observability approaches.
>> Read More
From Distributed Monolith to Composable Architecture on AWS
(Fri, 24 Oct 2025)
You adopted microservices for independence and agility. Instead, every deployment requires coordinating multiple teams and testing the entire system. What you built is a distributed monolith,
complexity spread across systems, but still bound by monolithic coupling. The shift from technical boundaries to business-driven boundaries is the only path to true agility. Many organizations
discover too late that microservices alone do not guarantee independence. Domain-Driven Composable Architecture (DDCA) provides a methodology to escape this rigidity.
This article is a practical playbook for decomposing services into Packaged Business Capabilities (PBCs) aligned with business domains and mapped to AWS patterns such as EventBridge, Step
Functions, and DynamoDB Streams. It explains when DDCA fits and when it does not, and covers security, anti-patterns, and operational realities, so you can adopt composability with a clear view
of the investment required.
>> Read More
Unhandled Promise Rejections: The Tiny Mistake That Crashed Our Node.js App
(Fri, 24 Oct 2025)
Imagine deploying a Node.js backend service that works flawlessly in development, only to have it mysteriously crash in production. Everything ran fine on your laptop, but on the live server, the
process keeps shutting down unexpectedly.
In our case, the culprit was a single unhandled promise rejection — one missing .catch() in our code caused Node to exit abruptly whenever an error
occurred. That one “tiny” mistake made the difference between a stable service and frequent downtime. In this article, we’ll explore how a misconfigured error handling in a Node/Express API can
bring down an application, and how to diagnose and fix it to prevent future crashes.
>> Read More
Performance Testing 101: A Beginner's Guide to Building Robust Applications
(Fri, 24 Oct 2025)
Welcome! This guide is for anyone who has built an application and wants to ensure it doesn't fall over when real people start using it. We'll walk through the essentials of performance testing
without the complicated jargon, focusing on practical steps you can take to make your app robust and reliable.
You may find this article too abstract, but be sure the next time we will go over the real example of making a performance test with Java, Gatling, and Docker Compose.
>> Read More
DevOps Cafe Ep 79 - Guests: Joseph Jacks and Ben Kehoe
(Mon, 13 Aug 2018)
Triggered by Google Next 2018, John and Damon chat with Joseph Jacks (stealth startup) and Ben Kehoe (iRobot) about their public disagreements — and agreements — about Kubernetes and
Serverless.
>> Read More
DevOps Cafe Ep 78 - Guest: J. Paul Reed
(Mon, 23 Jul 2018)
John and Damon chat with J.Paul Reed (Release Engineering Approaches) about the field of Systems Safety and Human Factors that studies why accidents happen and how to minimize the occurrence and
impact.
Show notes at http://devopscafe.org
>> Read More
DevOps Cafe Ep. 77 - Damon interviews John
(Wed, 20 Jun 2018)
A new season of DevOps Cafe is here. The topic of this episode is "DevSecOps." Damon interviews John about what this term means, why it matters now, and the overall state of security.
Show notes at http://devopscafe.org
>> Read More