Creation of RAG Systems for Managing Massive Data in Vector Databases
dc.contributor.author | Joho, Luca | |
dc.contributor.mentor | Martin, Andreas | |
dc.date.accessioned | 2025-07-09T12:38:28Z | |
dc.date.issued | 2025 | |
dc.description.abstract | This master’s thesis explores the development and optimization of a Retrieval Augmented Generation (RAG) pipeline designed to extract contextually rich, accurate, and detail-oriented responses from extensive, multilingual technical documents stored in a vector database. Grounded in a design science research methodology, the study employs an iterative, artifact-centric approach that not only builds and refines the RAG pipeline but also systematically evaluates its effectiveness. A comprehensive literature review provided the theoretical basis for the choice of embedding models, evaluation metrics, and prompt templates. Based on these theoretical insights, a first conceptual design was created prior to coding to ensure that the practical implementation was closely aligned with the best practices, new techniques, and recognized knowledge gaps identified in the literature. | |
dc.identifier.uri | https://irf.fhnw.ch/handle/11654/52019 | |
dc.language.iso | en | |
dc.publisher | Hochschule für Wirtschaft FHNW | |
dc.spatial | Olten | |
dc.subject.ddc | 330 - Wirtschaft | |
dc.title | Creation of RAG Systems for Managing Massive Data in Vector Databases | |
dc.type | 11 - Studentische Arbeit | |
dspace.entity.type | Publication | |
fhnw.InventedHere | Yes | |
fhnw.StudentsWorkType | Master | |
fhnw.affiliation.hochschule | Hochschule für Wirtschaft FHNW | de_CH |
fhnw.affiliation.institut | Master of Science | de_CH |
relation.isMentorOfPublication | 6a3865e7-85dc-41b5-afe3-c834c56fab4e | |
relation.isMentorOfPublication.latestForDiscovery | 6a3865e7-85dc-41b5-afe3-c834c56fab4e |