Building a technology recommender system using web crawling and natural language processing technology
Loading...
Author (Corporation)
Publication date
2022
Typ of student thesis
Course of study
Collections
Type
01A - Journal article
Editors
Editor (Corporation)
Supervisor
Parent work
Algorithms
Special issue
DOI of the original publication
Link
Series
Series number
Volume
15
Issue / Number
8
Pages / Duration
Patent number
Publisher / Publishing institution
MDPI
Place of publication / Event location
Basel
Edition
Version
Programming language
Assignee
Practice partner / Client
Abstract
Finding, retrieving, and processing information on technology from the Internet can be a tedious task. This article investigates if technological concepts such as web crawling and natural language processing are suitable means for knowledge discovery from unstructured information and the development of a technology recommender system by developing a prototype of such a system. It also analyzes how well the resulting prototype performs in regard to effectivity and efficiency. The research strategy based on design science research consists of four stages: (1) Awareness generation; (2) suggestion of a solution considering the information retrieval process; (3) development of an artefact in the form of a Python computer program; and (4) evaluation of the prototype within the scope of a comparative experiment. The evaluation yields that the prototype is highly efficient in retrieving basic and rather random extractive text summaries from websites that include the desired search terms. However, the effectivity, measured by the quality of results is unsatisfactory due to the aforementioned random arrangement of extracted sentences within the resulting summaries. It is found that natural language processing and web crawling are indeed suitable technologies for such a program whilst the use of additional technology/concepts would add significant value for a potential user. Several areas for incremental improvement of the prototype are identified.
Keywords
Subject (DDC)
330 - Wirtschaft
Event
Exhibition start date
Exhibition end date
Conference start date
Conference end date
Date of the last check
ISBN
ISSN
1999-4893
Language
German
Created during FHNW affiliation
Yes
Strategic action fields FHNW
Publication status
Published
Review
Peer review of the complete publication
Open access category
Gold
Citation
CAMPOS MACIAS-HAMMEL, Nathalie, Wilhelm DÜGGELIN, Yesim RUF und Thomas HANNE, 2022. Building a technology recommender system using web crawling and natural language processing technology. Algorithms. 2022. Bd. 15, Nr. 8. DOI 10.3390/a15080272. Verfügbar unter: https://doi.org/10.26041/fhnw-7284