RAG at Scale: The Data Engineering Challenges
(Fri, 16 Jan 2026)
Retrieval-augmented generation (RAG) has emerged as a powerful technique for building AI systems that can access and reason over external knowledge bases. RAG enabled us to build accurate and
up-to-date systems by combining the content-generative capabilities of LLMs with user-context-specific, precise information retrieval.
However, deploying RAG systems at scale in production reveals a different reality that most blog posts
and conference talks gloss over. While the core RAG concept is straightforward, the engineering challenges required to make it work reliably, efficiently, and cost-effectively at production scale
are substantial and often underestimated.
>> Read More
IT Asset, Vulnerability, and Patch Management Best Practices
(Fri, 16 Jan 2026)
The vulnerability management lifecycle is a continuous process for discovering, addressing, and prioritizing vulnerabilities in an organization's IT assets
A normal round of the lifecycle has five phases:
>> Read More
Speeding Up BigQuery Reads in Apache Beam/Dataflow
(Fri, 16 Jan 2026)
Real‑time and overnight data pipelines often succeed or fail on one thing: Can you move enough data through BigQuery and Dataflow within your SLA window?
In a production Apache Beam/Dataflow environment, several large jobs started to miss their daily deadlines after a Beam
upgrade. All of them shared a pattern:
>> Read More
From RAG to RAG + RAV: A Practical Pipeline for Factual LLM Responses
(Fri, 16 Jan 2026)
Recently, I've been working on a project where getting the factual data right was absolutely critical. I’ll be honest, when I first wired up a retrieval-augmented generation (RAG) system, I thought I was mostly done with hallucinations. I had:
A vector DB full of documents
A decent embedding model
A prompt that said "answer only using the context above."
And yet I still got answers that looked grounded but contained subtle factual errors: wrong years, swapped names, invented details that weren't in any
source.
>> Read More
DevOps Cafe Ep 79 - Guests: Joseph Jacks and Ben Kehoe
(Mon, 13 Aug 2018)
Triggered by Google Next 2018, John and Damon chat with Joseph Jacks (stealth startup) and Ben Kehoe (iRobot) about their public disagreements — and agreements — about Kubernetes and
Serverless.
>> Read More
DevOps Cafe Ep 78 - Guest: J. Paul Reed
(Mon, 23 Jul 2018)
John and Damon chat with J.Paul Reed (Release Engineering Approaches) about the field of Systems Safety and Human Factors that studies why accidents happen and how to minimize the occurrence and
impact.
Show notes at http://devopscafe.org
>> Read More
DevOps Cafe Ep. 77 - Damon interviews John
(Wed, 20 Jun 2018)
A new season of DevOps Cafe is here. The topic of this episode is "DevSecOps." Damon interviews John about what this term means, why it matters now, and the overall state of security.
Show notes at http://devopscafe.org
>> Read More