Enhancing language models with boosting and targeted fine-tuning for real-word error detection

dc.contributor.authorMasanti, Corina
dc.contributor.authorWitschel, Hans Friedrich
dc.contributor.authorRiesen, Kaspar
dc.date.accessioned2026-06-16T06:52:44Z
dc.date.issued2026
dc.description.abstractWe propose a boosting-based approach to enhance language models of diverse architectures with the goal of detecting real-word errors in documents. • We thoroughly evaluate the benefits and limitations of our novel framework through experiments on a large real-world data set. Based on a thorough error analysis, we generate additional targeted training data to address identified weaknesses and apply targeted fine-tuning to further improve model performance. Over the past years, extensive research has led to significant advancements in tools for the automatic detection and correction of errors in documents. Despite this progress, several challenges remain unresolved. In particular, the identification of real-word errors – errors involving words that are grammatically valid but contextually inappropriate within a given sentence – continues to pose a considerable difficulty. Addressing such errors requires models with a sophisticated understanding of linguistic context. Transformer-based language models are particularly well-suited for this task due to their contextual modeling capabilities. To further enhance their performance, we propose a boosting-based training approach in conjunction with a synthetically generated data set created via pattern-based noise injection. We evaluate this method across three transformer-based architectures, viz. mBERT, LLaMA 3, and Mistral 7B. Our experimental results show that the boosting-based strategy consistently improves real-word error detection across all models. A subsequent in-depth error analysis reveals limitations in the synthetic training data, prompting the development of a targeted fine-tuning procedure designed to address these shortcomings and further optimize model performance. A comparison with prompt-based inference using a large language model demonstrates that specialized, fine-tuned models yield more reliable performance for this task. Finally, an evaluation under realistic class imbalance highlights practical trade-offs between ranking quality and threshold-based detection, particularly for rare error types.
dc.identifier.doi10.1016/j.nlp.2026.100202
dc.identifier.issn2949-7191
dc.identifier.urihttps://irf.fhnw.ch/handle/11645/57047
dc.identifier.urihttps://doi.org/10.26041/fhnw-16512
dc.language.isoen
dc.publisherElsevier
dc.relation.ispartofNatural Language Processing Journal
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subject.ddc004 - Computer Wissenschaften, Internet
dc.titleEnhancing language models with boosting and targeted fine-tuning for real-word error detection
dc.type01A - Beitrag in wissenschaftlicher Zeitschrift
dc.volume14
dspace.entity.typePublication
fhnw.InventedHereYes
fhnw.ReviewTypepeer-reviewed
fhnw.oastatus.auroraVersion: Published *** Embargo: None *** Licence: CC BY *** URL: https://v2.sherpa.ac.uk/id/publication/46861
fhnw.openAccessCategoryGold
fhnw.pagination100202-100202
fhnw.publicationStatePublished
fhnw.targetcollectiond40e4c67-dd87-4d14-8518-b2f0a855e750
relation.isAuthorOfPublication4f94a17c-9d05-433c-882f-68f062e0e6ae
relation.isAuthorOfPublication.latestForDiscovery4f94a17c-9d05-433c-882f-68f062e0e6ae
Dateien

Originalbündel

Gerade angezeigt 1 - 1 von 1
Lade...
Vorschaubild
Name:
1-s2.0-S2949719126000063-main.pdf
Größe:
4.24 MB
Format:
Adobe Portable Document Format

Lizenzbündel

Gerade angezeigt 1 - 1 von 1
Lade...
Vorschaubild
Name:
license.txt
Größe:
2.66 KB
Format:
Item-specific license agreed upon to submission
Beschreibung: