Naïve Bayes and named entity recognition for requirements mining in job postings

No Thumbnail Available
Author (Corporation)
Publication date
2021
Typ of student thesis
Course of study
Type
04B - Conference paper
Editors
Editor (Corporation)
Supervisor
Parent work
2021 3rd International Conference on Natural Language Processing. Proceedings
Special issue
DOI of the original publication
Link
Series
Series number
Volume
Issue / Number
Pages / Duration
155-161
Patent number
Publisher / Publishing institution
Place of publication / Event location
Bejing
Edition
Version
Programming language
Assignee
Practice partner / Client
Abstract
This paper analyses how the required skills in a job post can be extracted. With an automated extraction of skills from unstructured text, applicants could be more accurately matched and search engines could provide better recommendations. The problem is optimized by classifying the relevant parts of the description with a multinomial naïve Bayes model. The model identifies the section of the unstructured text in which the requirements are stated. Subsequently, a named entity recognition (NER) model extracts the required skills from the classified text. This approach minimizes the false positives since the data which is analyzed is already filtered. The results show that the naïve Bayes model classifies up to 99% of the sections correctly, and the NER model extracts 65% of the skills required for a position. The accuracy of the NER model is not sufficient to be used in production. On the validation set, the performance was insufficient. A more consistent labelling guideline would be needed and more data should be annotated to increase the performance.
Keywords
Subject (DDC)
330 - Wirtschaft
Project
Event
2021 3rd International Conference on Natural Language Processing (ICNLP 2021)
Exhibition start date
Exhibition end date
Conference start date
26.03.2021
Conference end date
28.03.2021
Date of the last check
ISBN
978-1-6654-1411-1
ISSN
Language
English
Created during FHNW affiliation
Yes
Strategic action fields FHNW
Publication status
Published
Review
Peer review of the complete publication
Open access category
Closed
License
Citation
WILD, Simon, Soyhan PARLAR, Thomas HANNE und Rolf DORNBERGER, 2021. Naïve Bayes and named entity recognition for requirements mining in job postings. In: 2021 3rd International Conference on Natural Language Processing. Proceedings. Bejing. 2021. S. 155–161. ISBN 978-1-6654-1411-1. Verfügbar unter: https://irf.fhnw.ch/handle/11654/42936