Skip to main navigation Skip to search Skip to main content

Impact of Data Pre-Processing in Information Retrieval for Data Analytics

  • Huma Naz*
  • , Sachin Ahuja
  • , Rahul Nijhawan
  • , Neelu Jyothi Ahuja
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

1 Scopus citations

Abstract

In recent years, data-driven decision making has emerged as the main focus of research due to the extensive use and availability of data-driven approaches. The accuracy of such research studies is completely dependent on the quality of data available for the research. To enhance the performance of the model, diverse “data pre-processing” techniques are adopted by the researchers. This chapter attempts to provide an insight into the application of data pre-processing techniques and their effects on information retrieval. That further takes into consideration a few chosen problems involving huge amounts of data. This chapter covers the major issues that need to be dealt with before the beginning of any data analysis process. The chapter consists of two sections that highlight the need for data pre-processing. To establish the need for data pre-processing and study its effects on the achieved results, three machine learning algorithms named decision tree, Naive Bayes, and artificial neural network were applied to four diverse datasets. The result shows that high accuracy, as well as better data quality, is attained after the application of data pre-processing methods. The solution can be used to solve the problem of data discrepancies, noise, and outliers in different datasets for improved results.

Original languageEnglish
Title of host publicationMachine Intelligence, Big Data Analytics, and IoT in Image Processing
Subtitle of host publicationPractical Applications
Publisherwiley
Pages199-224
Number of pages26
ISBN (Electronic)9781119865513
ISBN (Print)9781119865049
DOIs
StatePublished - 1 Jan 2023

Bibliographical note

Publisher Copyright:
© 2023 Scrivener Publishing LLC.

Keywords

  • data analytics
  • Data pre-processing techniques
  • decision tree
  • impact of pre-processing
  • information retrieval
  • neural network

ASJC Scopus subject areas

  • General Engineering
  • General Materials Science

Fingerprint

Dive into the research topics of 'Impact of Data Pre-Processing in Information Retrieval for Data Analytics'. Together they form a unique fingerprint.

Cite this