End-to-End Table Extraction from Annual Reports using DL and NLP

dc.contributor.authorMushkolaj, Rijon
dc.contributor.mentorHanne, Thomas
dc.date.accessioned2025-04-30T07:27:25Z
dc.date.issued2024
dc.description.abstractAnnual reports contain many important data and information – some of this data and information is included in tables. The extraction of these table data is associated with various challenges, including the unstructured nature of PDF documents and the wide variability of table representations. The aim of this master's thesis is to explore an innovative end-to-end solution that enables a user to interface with tabular data within annual reports in PDF format through natural language inputs. The thesis addresses two main challenges: the automated extraction of table data from unstructured PDF documents, and interfacing this data through user inputs in the form of natural language questioning – for example, allowing the user to ask a question about the table content in the annual report like: "What was the profit in 2023?". This aims to make the process of information retrieval easier and more efficient.
dc.identifier.urihttps://irf.fhnw.ch/handle/11654/51126
dc.language.isoen
dc.publisherHochschule für Wirtschaft FHNW
dc.spatialOlten
dc.subject.ddc330 - Wirtschaft
dc.titleEnd-to-End Table Extraction from Annual Reports using DL and NLP
dc.type11 - Studentische Arbeit
dspace.entity.typePublication
fhnw.InventedHereYes
fhnw.StudentsWorkTypeMaster
fhnw.affiliation.hochschuleHochschule für Wirtschaft FHNWde_CH
fhnw.affiliation.institutMaster of Sciencede_CH
relation.isMentorOfPublication35d8348b-4dae-448a-af2a-4c5a4504da04
relation.isMentorOfPublication.latestForDiscovery35d8348b-4dae-448a-af2a-4c5a4504da04
Files