Aop-lse: Antioxidant proteins classification using deep latent space encoding of sequence features

  • Muhammad Usman
  • , Shujaat Khan
  • , Seongyong Park
  • , Jeong A. Lee*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

14 Scopus citations

Abstract

It is of utmost importance to develop a computational method for accurate prediction of antioxidants, as they play a vital role in the prevention of several diseases caused by oxidative stress. In this correspondence, we present an effective computational methodology based on the notion of deep latent space encoding. A deep neural network classifier fused with an auto-encoder learns class labels in a pruned latent space. This strategy has eliminated the need to separately develop classifier and the feature selection model, allowing the standalone model to effectively harness discriminating feature space and perform improved predictions. A thorough analytical study has been presented alongwith the PCA/tSNE visualization and PCA-GCNR scores to show the discriminating power of the proposed method. The proposed method showed a high MCC value of 0.43 and a balanced accuracy of 76.2%, which is superior to the existing models. The model has been evaluated on an independent dataset during which it outperformed the contemporary methods by correctly identifying the novel proteins with an accuracy of 95%.

Original languageEnglish
Article number105
Pages (from-to)1489-1501
Number of pages13
JournalCurrent Issues in Molecular Biology
Volume43
Issue number3
DOIs
StatePublished - Dec 2021
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2021 by the authors.

Keywords

  • Antioxidation
  • Classification
  • Composition of k-spaced amino acid pair (CKSAAP)
  • Deep auto-encoder
  • Latent space learning
  • Neural network

ASJC Scopus subject areas

  • Microbiology
  • Molecular Biology
  • Microbiology (medical)

Fingerprint

Dive into the research topics of 'Aop-lse: Antioxidant proteins classification using deep latent space encoding of sequence features'. Together they form a unique fingerprint.

Cite this