Abstract
In recent years, data-driven decision making has become a major focus of research, owing to the wide availability and use of data-driven approaches. The accuracy of such studies depends directly on the quality of the data available to them. To improve model performance, researchers adopt a range of data pre-processing techniques. This chapter provides insight into the application of data pre-processing techniques and their effects on information retrieval, considering a few selected problems that involve large amounts of data. It covers the major issues that must be addressed before any data analysis process begins, and it consists of two sections that highlight the need for data pre-processing. To establish that need and to study its effect on the results achieved, three machine learning algorithms, namely decision tree, Naive Bayes, and artificial neural network, were applied to four diverse datasets. The results show that high accuracy, as well as better data quality, is attained after data pre-processing methods are applied. The approach can be used to address data discrepancies, noise, and outliers in different datasets for improved results.
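The abstract does not say which pre-processing steps the chapter applies, so the following is only a minimal sketch of three common ones that target the problems it names (discrepancies such as missing values, noise, and outliers). The function names, the median imputation, the 1.5 × IQR clipping rule, and the sample data are all assumptions made for illustration, not the authors' method.

```python
from statistics import median, quantiles


def impute_missing(values):
    """Replace None entries with the median of the observed values (assumed strategy)."""
    observed = [v for v in values if v is not None]
    fill = median(observed)
    return [fill if v is None else v for v in values]


def clip_outliers(values, k=1.5):
    """Clip values lying outside [Q1 - k*IQR, Q3 + k*IQR] back to the nearest fence."""
    q1, _, q3 = quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [min(max(v, low), high) for v in values]


def min_max_scale(values):
    """Rescale values linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant column carries no spread; map it to zeros.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def preprocess(values):
    """Chain the three steps: impute, clip outliers, then scale."""
    return min_max_scale(clip_outliers(impute_missing(values)))


# Hypothetical feature column with one missing value and one extreme reading.
raw = [12.0, 15.0, None, 14.0, 13.0, 400.0, 16.0]
clean = preprocess(raw)
print(clean)
```

In a comparison like the one the abstract describes, a classifier would be trained once on the raw column and once on the cleaned one, and the two accuracies compared.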
| Original language | English |
|---|---|
| Title of host publication | Machine Intelligence, Big Data Analytics, and IoT in Image Processing |
| Subtitle of host publication | Practical Applications |
| Publisher | Wiley |
| Pages | 199-224 |
| Number of pages | 26 |
| ISBN (Electronic) | 9781119865513 |
| ISBN (Print) | 9781119865049 |
| DOIs | |
| State | Published - 1 Jan 2023 |
Bibliographical note
Publisher Copyright: © 2023 Scrivener Publishing LLC.
Keywords
- data analytics
- data pre-processing techniques
- decision tree
- impact of pre-processing
- information retrieval
- neural network
ASJC Scopus subject areas
- General Engineering
- General Materials Science
Fingerprint
Dive into the research topics of 'Impact of Data Pre-Processing in Information Retrieval for Data Analytics'. Together they form a unique fingerprint.