Research Article

Generative AI–Driven Observability for Automated Root Cause Analysis in Modern IT Systems: Architecture and Vision

Authors

  • Kamal Singh Bisht Senior Member, IEEE Individual Researcher, Frisco, Texas, 75036, USA

Abstract

Contemporary IT environments are increasingly complex, driven by distributed microservices, ephemeral infras- tructure, and exponential telemetry growth. Traditional observ- ability methods struggle to deliver timely and accurate root cause analysis (RCA) in such settings. This paper presents a conceptual framework that integrates Generative Artificial Intel- ligence (GenAI) with observability pipelines through multimodal telemetry fusion, retrieval-augmented generation (RAG), and agentic AI principles. The proposed four-layer reference ar- chitecture—comprising telemetry ingestion, data normalization, multimodal fusion, and generative RCA engines—illustrates how large language models (LLMs) and agentic modules can enable contextual reasoning and incident triage. While an illustrative proof-of-concept simulation demonstrates feasibility, the primary contribution of this work lies in its architecture and research vision rather than definitive empirical validation. Benchmark comparisons against rule-based, ML, and commercial AIOps solutions demonstrate improved RCA accuracy (89.7%), reduced MTTR (26.4 minutes), and lower false positives, highlighting both feasibility and performance advantages. The paper further outlines open challenges, including scalability, hallucination risks, and integration with heterogeneous monitoring systems, thereby providing a roadmap for future research at the intersection of GenAI, observability, and IT operations.

Article information

Journal

Journal of Computer Science and Technology Studies

Volume (Issue)

7 (9)

Pages

549--560

Published

2025-09-15

How to Cite

Bisht, K. S. (2025). Generative AI–Driven Observability for Automated Root Cause Analysis in Modern IT Systems: Architecture and Vision. Journal of Computer Science and Technology Studies, 7(9), 549-560. https://doi.org/10.32996/jcsts.2025.7.9.63

Downloads

Views

5

Downloads

5

Keywords:

Agentic AI, AI-OPS, Artificial Intelligence, Generative AI, Generative Artificial Intelligence, Incident Automation, Intelligent Operation, LLM, Machine Learning, Modern IT Environments, Monitoring, Multimodal Data Fusion, Observability, RAG, RCA