Automated error detection through specialized task implementation
| dc.contributor.author | Masanti, Corina | |
| dc.contributor.author | Witschel, Hans Friedrich | |
| dc.contributor.author | Riesen, Kaspar | |
| dc.contributor.editor | Wallraven, Christian | |
| dc.contributor.editor | Liu, Cheng-Lin | |
| dc.contributor.editor | Ross, Arun | |
| dc.date.accessioned | 2026-05-20T12:02:53Z | |
| dc.date.issued | 2025 | |
| dc.description.abstract | The present paper introduces a multilingual data set of erroneous and correct text sentences. The novel data set marks a significant advancement from an existing corpus by incorporating additional samples and refining its overall structure. The primary purpose of this data set is to support the research and development of automated error detection systems, especially in the multilingual setting where high-quality data sets are scarce. A distinctive feature of our data set is that it incorporates only incorrect sentences and their corresponding correct versions. These sentences are sourced from a variety of texts written by native speakers from different industries, such as pharmaceuticals, banking, insurance, retail, communications, and more. Each sentence in the data set has been annotated by professional proofreaders. The paper includes a comprehensive error analysis, where we classify and scrutinize the different types of errors within the data set. By categorizing and analysing the errors in the data set, we aim to identify patterns and common issues. Additionally, we conduct a thorough experimental evaluation using a well-established language model. Our analysis assesses the classification accuracy measured over all errors and the accuracy of each specific error type. Interestingly, our results show that while some error types can be detected with an accuracy exceeding 80%, it turns out that the recognition of other error types is very difficult to solve. | |
| dc.event | 4th International Conference, ICPRAI 2024 | |
| dc.event.end | 2024-07-06 | |
| dc.event.start | 2024-07-03 | |
| dc.identifier.doi | 10.1007/978-981-97-8705-0_12 | |
| dc.identifier.isbn | 978-981-97-8704-3 | |
| dc.identifier.isbn | 978-981-97-8705-0 | |
| dc.identifier.uri | https://irf.fhnw.ch/handle/11654/56302 | |
| dc.language.iso | en | |
| dc.publisher | Springer | |
| dc.relation.ispartof | Pattern Recognition and Artificial Intelligence | |
| dc.relation.ispartofseries | Lecture Notes in Computer Science | |
| dc.spatial | Singapore | |
| dc.subject.ddc | 330 - Wirtschaft | |
| dc.title | Automated error detection through specialized task implementation | |
| dc.type | 04B - Beitrag Konferenzschrift | |
| dspace.entity.type | Publication | |
| fhnw.InventedHere | Yes | |
| fhnw.ReviewType | peer-reviewed | |
| fhnw.affiliation.hochschule | Hochschule für Wirtschaft FHNW | de_CH |
| fhnw.affiliation.institut | Institut für Wirtschaftsinformatik | de_CH |
| fhnw.openAccessCategory | Closed | |
| fhnw.pagination | 182-195 | |
| fhnw.publicationState | Published | |
| fhnw.seriesNumber | 14893 | |
| relation.isAuthorOfPublication | 4f94a17c-9d05-433c-882f-68f062e0e6ae | |
| relation.isAuthorOfPublication | d761e073-1612-4d22-8521-65c01c19f97a | |
| relation.isAuthorOfPublication.latestForDiscovery | 4f94a17c-9d05-433c-882f-68f062e0e6ae |
Dateien
Lizenzbündel
1 - 1 von 1
Lade...
- Name:
- license.txt
- Größe:
- 2.66 KB
- Format:
- Item-specific license agreed upon to submission
- Beschreibung: