Automated error detection through specialized task implementation

dc.contributor.authorMasanti, Corina
dc.contributor.authorWitschel, Hans Friedrich
dc.contributor.authorRiesen, Kaspar
dc.contributor.editorWallraven, Christian
dc.contributor.editorLiu, Cheng-Lin
dc.contributor.editorRoss, Arun
dc.date.accessioned2026-05-20T12:02:53Z
dc.date.issued2025
dc.description.abstractThe present paper introduces a multilingual data set of erroneous and correct text sentences. The novel data set marks a significant advancement from an existing corpus by incorporating additional samples and refining its overall structure. The primary purpose of this data set is to support the research and development of automated error detection systems, especially in the multilingual setting where high-quality data sets are scarce. A distinctive feature of our data set is that it incorporates only incorrect sentences and their corresponding correct versions. These sentences are sourced from a variety of texts written by native speakers from different industries, such as pharmaceuticals, banking, insurance, retail, communications, and more. Each sentence in the data set has been annotated by professional proofreaders. The paper includes a comprehensive error analysis, where we classify and scrutinize the different types of errors within the data set. By categorizing and analysing the errors in the data set, we aim to identify patterns and common issues. Additionally, we conduct a thorough experimental evaluation using a well-established language model. Our analysis assesses the classification accuracy measured over all errors and the accuracy of each specific error type. Interestingly, our results show that while some error types can be detected with an accuracy exceeding 80%, it turns out that the recognition of other error types is very difficult to solve.
dc.event4th International Conference, ICPRAI 2024
dc.event.end2024-07-06
dc.event.start2024-07-03
dc.identifier.doi10.1007/978-981-97-8705-0_12
dc.identifier.isbn978-981-97-8704-3
dc.identifier.isbn978-981-97-8705-0
dc.identifier.urihttps://irf.fhnw.ch/handle/11654/56302
dc.language.isoen
dc.publisherSpringer
dc.relation.ispartofPattern Recognition and Artificial Intelligence
dc.relation.ispartofseriesLecture Notes in Computer Science
dc.spatialSingapore
dc.subject.ddc330 - Wirtschaft
dc.titleAutomated error detection through specialized task implementation
dc.type04B - Beitrag Konferenzschrift
dspace.entity.typePublication
fhnw.InventedHereYes
fhnw.ReviewTypepeer-reviewed
fhnw.affiliation.hochschuleHochschule für Wirtschaft FHNWde_CH
fhnw.affiliation.institutInstitut für Wirtschaftsinformatikde_CH
fhnw.openAccessCategoryClosed
fhnw.pagination182-195
fhnw.publicationStatePublished
fhnw.seriesNumber14893
relation.isAuthorOfPublication4f94a17c-9d05-433c-882f-68f062e0e6ae
relation.isAuthorOfPublicationd761e073-1612-4d22-8521-65c01c19f97a
relation.isAuthorOfPublication.latestForDiscovery4f94a17c-9d05-433c-882f-68f062e0e6ae
Dateien

Lizenzbündel

Gerade angezeigt 1 - 1 von 1
Lade...
Vorschaubild
Name:
license.txt
Größe:
2.66 KB
Format:
Item-specific license agreed upon to submission
Beschreibung: