Research Article

Retrieval-Augmented Generation (RAG) Systems: Architectures, Strategies, and Evaluation

Authors

  • Sunil Karthik Kota Engineering Leader, Software Architect, AI & Automation Expert at CISCO, USA

Abstract

Retrieval-Augmented Generation (RAG) systems have revolutionized the way large language models (LLMs) synthesize responses by coupling generative capabilities with dynamic retrieval from external knowledge bases. This integration not only enhances the factual accuracy and contextual relevance of responses but also reduces the potential for hallucination in generated content. In this paper, we present an extensive survey of RAG systems, covering theoretical underpinnings, various retrieval strategies, agentic architectures, and the technical developer stack necessary for system integration. Additionally, we detail advanced techniques for fine-tuning embedding and reranker models and establish comprehensive evaluation metrics applicable to both retrieval and generation components. This document synthesizes methods and best practices described in recent research articles [1]–[10], offering a roadmap for researchers and practitioners to design robust, efficient, and scalable RAG systems.

Article information

Journal

Journal of Computer Science and Technology Studies

Volume (Issue)

6 (5)

Pages

350-358

Published

2024-12-25

How to Cite

Sunil Karthik Kota. (2024). Retrieval-Augmented Generation (RAG) Systems: Architectures, Strategies, and Evaluation. Journal of Computer Science and Technology Studies, 6(5), 350-358. https://doi.org/10.32996/jcsts.2024.6.5.30

Downloads

Views

30

Downloads

15

Keywords:

Retrieval-Augmented Generation, RAG, Embedding Models, Reranker Fine-Tuning, Semantic Retrieval, Agentic Architectures, Evaluation Metrics, Vector Databases, Query Routing, Dynamic Context