Impact of convolutional neural network and FastText embedding on text classification

  • Muhammad Umer
  • , Zainab Imtiaz
  • , Muhammad Ahmad
  • , Michele Nappi
  • , Carlo Medaglia
  • , Gyu Sang Choi*
  • , Arif Mehmood*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

122 Scopus citations

Abstract

Efficient word representation techniques (word embeddings) with modern machine learning models have shown reasonable improvement on automatic text classification tasks. However, the effectiveness of such techniques has not been evaluated yet in terms of insufficient word vector representation for training. Convolutional Neural Network has achieved significant results in pattern recognition, image analysis, and text classification. This study investigates the application of the CNN model on text classification problems by experimentation and analysis. We trained our classification model with a prominent word embedding generation model, Fast Text on publically available datasets, six benchmark datasets including Ag News, Amazon Full and Polarity, Yahoo Question Answer, Yelp Full, and Polarity. Furthermore, the proposed model has been tested on the Twitter US airlines non-benchmark dataset as well. The analysis indicates that using Fast Text as word embedding is a very promising approach.

Original languageEnglish
Pages (from-to)5569-5585
Number of pages17
JournalMultimedia Tools and Applications
Volume82
Issue number4
DOIs
StatePublished - Feb 2023
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2022, The Author(s).

Keywords

  • Convolutional Neural Network (CNN)
  • Deep learning
  • FastText
  • Natural language processing
  • Text mining

ASJC Scopus subject areas

  • Software
  • Media Technology
  • Hardware and Architecture
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Impact of convolutional neural network and FastText embedding on text classification'. Together they form a unique fingerprint.

Cite this