UniGPT Revisited: From a Simple Chatbot to an API-First AI Platform — Two Years of On-Premises LLM Operations

12 pages • Published: June 18, 2026

Abstract

In 2024, we introduced uniGPT, an on-premises, Kubernetes-based LLM platform at a major German university, designed for GDPR compliance, digital sovereignty, and avoiding vendor lock-in. This paper evaluates nearly two years of operation (May 2024 to February 2026), tracing its evolution from a simple chatbot into a multi-modal, API-first AI infrastructure. Using the TOE framework, we analyze this progression as an iterative design cycle triggered by technological, organisational, or environmental factors. We detail eight key iterations, including frontend and inference engine swaps, the addition of an OpenAI-compatible API layer, multi-modal services, and RAG pipelines. Notably, we find that more than 99% of usage now occurs via the API rather than the chat frontend. Finally, we offer generalizable lessons for institutions building sustainable on-premises AI infrastructure in higher education.

Keyphrases: design science research, experience report, higher education, kubernetes, large language models, on premises ai, research

In: Laurence Desnos, Carmen Diaz, Janina Mincer-Daszkiewicz, Lazaros Merakos, Raimund Vogl, Stuart McLellan and Ulrike Lucke (editors). Proceedings of EUNIS 2026 Annual Congress, vol 109, pages 96-107.

