LLM Pipelines - Ivan França - AI Engineering Notes


RAG Movie Plots: Designing a Modular RAG System

By Ivan França | 14/02/2026 | 11/03/2026

Introduction

Retrieval-Augmented Generation (RAG) systems are often described as linear pipelines: load documents, split text, generate embeddings, store vectors, retrieve context and invoke a language model. This description is conceptually accurate, but architecturally incomplete. When presented as a sequence of steps, the structural decisions that shape system behavior remain implicit. Choices about segmentation, persistence, filtering…