End-to-End Table Extraction from Annual Reports using DL and NLP
dc.contributor.author | Mushkolaj, Rijon | |
dc.contributor.mentor | Hanne, Thomas | |
dc.date.accessioned | 2025-04-30T07:27:25Z | |
dc.date.issued | 2024 | |
dc.description.abstract | Annual reports contain many important data and information – some of this data and information is included in tables. The extraction of these table data is associated with various challenges, including the unstructured nature of PDF documents and the wide variability of table representations. The aim of this master's thesis is to explore an innovative end-to-end solution that enables a user to interface with tabular data within annual reports in PDF format through natural language inputs. The thesis addresses two main challenges: the automated extraction of table data from unstructured PDF documents, and interfacing this data through user inputs in the form of natural language questioning – for example, allowing the user to ask a question about the table content in the annual report like: "What was the profit in 2023?". This aims to make the process of information retrieval easier and more efficient. | |
dc.identifier.uri | https://irf.fhnw.ch/handle/11654/51126 | |
dc.language.iso | en | |
dc.publisher | Hochschule für Wirtschaft FHNW | |
dc.spatial | Olten | |
dc.subject.ddc | 330 - Wirtschaft | |
dc.title | End-to-End Table Extraction from Annual Reports using DL and NLP | |
dc.type | 11 - Studentische Arbeit | |
dspace.entity.type | Publication | |
fhnw.InventedHere | Yes | |
fhnw.StudentsWorkType | Master | |
fhnw.affiliation.hochschule | Hochschule für Wirtschaft FHNW | de_CH |
fhnw.affiliation.institut | Master of Science | de_CH |
relation.isMentorOfPublication | 35d8348b-4dae-448a-af2a-4c5a4504da04 | |
relation.isMentorOfPublication.latestForDiscovery | 35d8348b-4dae-448a-af2a-4c5a4504da04 |