Information Extraction from Building Insurance Policies

dc.contributor.authorSchmid, Celia Lorena
dc.contributor.mentorPustulka, Elzbieta
dc.date.accessioned2023-12-22T16:03:39Z
dc.date.available2023-12-22T16:03:39Z
dc.date.issued2020
dc.description.abstractInformation extraction from pdf documents such as Swiss insurance policies is a challenge to which no well-established solution yet exists. FHNW has a project running where they made first attempts in extracting information from insurance policies by applying a machine-learning based approach. However, the solution is not ripe for the market yet because the accuracy is not sufficient. The goal of this master thesis was to manually explore what kind of information a building insurance policy contains and to develop a rule-based approach to annotate building insurance policies. We received a data-base containing bounding-boxes of about 3’000 scanned documents containing the term “Gebäudeversicherung”. We programmed an algorithm to automatically annotate OCR-processed building insurance policies of the company Mobiliar. The algorithm returns useful output, most of the text boxes labelled correctly. These results can be of use to further develop the machine-learning-based approach of the FHNW project.
dc.identifier.urihttps://irf.fhnw.ch/handle/11654/40412
dc.language.isoen
dc.publisherHochschule für Wirtschaft FHNW
dc.spatialOlten
dc.subject.ddc330 - Wirtschaft
dc.titleInformation Extraction from Building Insurance Policies
dc.type11 - Studentische Arbeit
dspace.entity.typePublication
fhnw.InventedHereYes
fhnw.PublishedSwitzerlandYes
fhnw.StudentsWorkTypeMaster
fhnw.affiliation.hochschuleHochschule für Wirtschaft FHNWde_CH
fhnw.affiliation.institutMaster of Science
relation.isMentorOfPublication3e7f2a0a-692e-4652-b305-7a7e19e011de
relation.isMentorOfPublication.latestForDiscovery3e7f2a0a-692e-4652-b305-7a7e19e011de
Dateien