Naïve Bayes and named entity recognition for requirements mining in job postings
dc.contributor.author | Wild, Simon | |
dc.contributor.author | Parlar, Soyhan | |
dc.contributor.author | Hanne, Thomas | |
dc.contributor.author | Dornberger, Rolf | |
dc.date.accessioned | 2024-04-17T10:08:45Z | |
dc.date.available | 2024-04-17T10:08:45Z | |
dc.date.issued | 2021 | |
dc.description.abstract | This paper analyses how the skills required in a job posting can be extracted. With automated extraction of skills from unstructured text, applicants could be matched more accurately and search engines could provide better recommendations. The problem is approached by first classifying the relevant parts of the description with a multinomial naïve Bayes model, which identifies the section of the unstructured text in which the requirements are stated. Subsequently, a named entity recognition (NER) model extracts the required skills from the classified text. This approach minimizes false positives because the analyzed data has already been filtered. The results show that the naïve Bayes model classifies up to 99% of the sections correctly, and the NER model extracts 65% of the skills required for a position. The accuracy of the NER model is not sufficient for use in production, and its performance on the validation set was likewise insufficient. A more consistent labelling guideline and more annotated data would be needed to increase performance. | |
dc.event | 2021 3rd International Conference on Natural Language Processing (ICNLP 2021) | |
dc.event.end | 2021-03-28 | |
dc.event.start | 2021-03-26 | |
dc.identifier.doi | 10.1109/ICNLP52887.2021.00032 | |
dc.identifier.isbn | 978-1-6654-1411-1 | |
dc.identifier.uri | https://irf.fhnw.ch/handle/11654/42936 | |
dc.language.iso | en | |
dc.relation.ispartof | 2021 3rd International Conference on Natural Language Processing. Proceedings | |
dc.spatial | Beijing | |
dc.subject.ddc | 330 - Economics | |
dc.title | Naïve Bayes and named entity recognition for requirements mining in job postings | |
dc.type | 04B - Conference paper | |
dspace.entity.type | Publication | |
fhnw.InventedHere | Yes | |
fhnw.ReviewType | Anonymous ex ante peer review of a complete publication | |
fhnw.affiliation.hochschule | Hochschule für Wirtschaft FHNW | de_CH |
fhnw.affiliation.institut | Institut für Wirtschaftsinformatik | de_CH |
fhnw.openAccessCategory | Closed | |
fhnw.pagination | 155-161 | |
fhnw.publicationState | Published | |
relation.isAuthorOfPublication | 4c2e16b0-225a-4087-862a-b18369380bd4 | |
relation.isAuthorOfPublication | 2b600b71-1924-46e6-93a5-cdc21f52f455 | |
relation.isAuthorOfPublication | 35d8348b-4dae-448a-af2a-4c5a4504da04 | |
relation.isAuthorOfPublication | 64196f63-c326-4e10-935d-6776cc91354c | |
relation.isAuthorOfPublication.latestForDiscovery | 35d8348b-4dae-448a-af2a-4c5a4504da04 |
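
The two-stage approach described in the abstract (classify sections first, then extract skills only from sections classified as requirements) might be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the training sentences, the labels `requirements` and `other`, and the `SKILLS` lookup table are all invented for the example, and a simple dictionary lookup stands in for the trained NER model.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z+#]+", text.lower())


class MultinomialNB:
    """Multinomial naive Bayes over word counts with Laplace smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = {w for c in self.word_counts.values() for w in c}
        return self

    def predict(self, text):
        # Words never seen in training are ignored.
        tokens = [t for t in tokenize(text) if t in self.vocab]
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + self.alpha * len(self.vocab)
            score = math.log(n_docs / total_docs)  # log prior
            for t in tokens:
                score += math.log((counts[t] + self.alpha) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical toy training data; a real system needs annotated job posts.
train = [
    ("You have experience with Python and SQL", "requirements"),
    ("Required skills: Java, communication, teamwork", "requirements"),
    ("We expect a degree in computer science and knowledge of Docker", "requirements"),
    ("We are a growing company based in Basel", "other"),
    ("We offer flexible working hours and a modern office", "other"),
    ("Our team organizes regular events and free coffee", "other"),
]

# Assumption: a fixed lookup table replaces the trained NER model here.
SKILLS = {"python", "sql", "java", "docker"}


def extract_skills(section):
    """Stand-in for NER: return known skill terms found in the section."""
    return sorted({t for t in tokenize(section) if t in SKILLS})


def mine_requirements(sections, clf):
    """Extract skills only from sections classified as requirements."""
    skills = set()
    for section in sections:
        if clf.predict(section) == "requirements":
            skills.update(extract_skills(section))
    return sorted(skills)


clf = MultinomialNB().fit([t for t, _ in train], [l for _, l in train])
posting = [
    "We are a company with a modern office and a Java coffee bar",
    "Required: experience with Python and Docker",
]
found = mine_requirements(posting, clf)
print(found)  # ['docker', 'python']
```

In this toy posting, "Java" appears in a section classified as `other`, so it is never passed to the extractor. This mirrors the abstract's point that filtering sections before extraction limits false positives.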