Retrieval-Augmented Generation (RAG) Systems: Architectures, Strategies, and Evaluation

Sunil Karthik Kota

doi:10.32996/jcsts.2024.6.5.30

Research Article

Retrieval-Augmented Generation (RAG) Systems: Architectures, Strategies, and Evaluation

Authors

Sunil Karthik Kota Engineering Leader, Software Architect, AI & Automation Expert at CISCO, USA

Abstract

Retrieval-Augmented Generation (RAG) systems have revolutionized the way large language models (LLMs) synthesize responses by coupling generative capabilities with dynamic retrieval from external knowledge bases. This integration not only enhances the factual accuracy and contextual relevance of responses but also reduces the potential for hallucination in generated content. In this paper, we present an extensive survey of RAG systems, covering theoretical underpinnings, various retrieval strategies, agentic architectures, and the technical developer stack necessary for system integration. Additionally, we detail advanced techniques for fine-tuning embedding and reranker models and establish comprehensive evaluation metrics applicable to both retrieval and generation components. This document synthesizes methods and best practices described in recent research articles [1]–[10], offering a roadmap for researchers and practitioners to design robust, efficient, and scalable RAG systems.

Article information

Journal

Journal of Computer Science and Technology Studies

Volume (Issue)

6 (5)

DOI

https://doi.org/10.32996/jcsts.2024.6.5.30

Pages

350-358

Published

2024-12-25

Copyright

Open access

This work is licensed under a Creative Commons Attribution 4.0 International License.

How to Cite

Sunil Karthik Kota. (2024). Retrieval-Augmented Generation (RAG) Systems: Architectures, Strategies, and Evaluation. Journal of Computer Science and Technology Studies, 6(5), 350-358. https://doi.org/10.32996/jcsts.2024.6.5.30

Journal of Computer Science and Technology Studies

Retrieval-Augmented Generation (RAG) Systems: Architectures, Strategies, and Evaluation

Authors

Abstract

Article information

Journal

Journal of Computer Science and Technology Studies

Volume (Issue)

6 (5)

DOI

https://doi.org/10.32996/jcsts.2024.6.5.30

Pages

350-358

Published

Copyright

Open access

How to Cite

Downloads

42

30

Keywords:

rightbar

submission

menus