Discussion on techniques of data cleaning, user identification, and session identification phases of web usage mining from 2000 to 2022

Authors

  • Mohammed Ali Mohammed University of Information Technology and Communications
  • Hala Abdulsalam jasim University of Baghdad
  • Ahmed Oday University of Information Technology and Communications

DOI:

https://doi.org/10.25195/ijci.v51i1.549

Keywords:

Web Usage Mining, Access Log File, Data Pre-processing

Abstract

The data preprocessing step is an important step in web usage mining because of the nature of log data, which are heterogeneous, unstructured, and noisy. Given the scalability and efficiency of algorithms in pattern discovery, a preprocessing step must be applied. In this study, the sequential methodologies utilized in the preprocessing of data from web server logs, with an emphasis on sub-phases, such as session identification, user identification, and data cleansing, are comprehensively evaluated and meticulously examined.

Downloads

Download data is not yet available.

Author Biographies

Hala Abdulsalam jasim, University of Baghdad

College of Science / Department of
Remote Sensing and GIS

Ahmed Oday, University of Information Technology and Communications

College of Biomedical Informatics

Downloads

Published

2025-05-31