Creation of RAG Systems for Managing Massive Data in Vector Databases

dc.contributor.authorJoho, Luca
dc.contributor.mentorMartin, Andreas
dc.date.accessioned2025-07-09T12:38:28Z
dc.date.issued2025
dc.description.abstractThis master’s thesis explores the development and optimization of a Retrieval Augmented Generation (RAG) pipeline designed to extract contextually rich, accurate, and detail-oriented responses from extensive, multilingual technical documents stored in a vector database. Grounded in a design science research methodology, the study employs an iterative, artifact-centric approach that not only builds and refines the RAG pipeline but also systematically evaluates its effectiveness. A comprehensive literature review provided the theoretical basis for the choice of embedding models, evaluation metrics, and prompt templates. Based on these theoretical insights, a first conceptual design was created prior to coding to ensure that the practical implementation was closely aligned with the best practices, new techniques, and recognized knowledge gaps identified in the literature.
dc.identifier.urihttps://irf.fhnw.ch/handle/11654/52019
dc.language.isoen
dc.publisherHochschule für Wirtschaft FHNW
dc.spatialOlten
dc.subject.ddc330 - Wirtschaft
dc.titleCreation of RAG Systems for Managing Massive Data in Vector Databases
dc.type11 - Studentische Arbeit
dspace.entity.typePublication
fhnw.InventedHereYes
fhnw.StudentsWorkTypeMaster
fhnw.affiliation.hochschuleHochschule für Wirtschaft FHNWde_CH
fhnw.affiliation.institutMaster of Sciencede_CH
relation.isMentorOfPublication6a3865e7-85dc-41b5-afe3-c834c56fab4e
relation.isMentorOfPublication.latestForDiscovery6a3865e7-85dc-41b5-afe3-c834c56fab4e
Dateien