RAFP-Pred: Robust Prediction of Antifreeze Proteins Using Localized Analysis of n-Peptide Compositions

Shujaat Khan, Imran Naseem, Roberto Togneri, Mohammed Bennamoun

Research output: Contribution to journalArticlepeer-review

34 Scopus citations

Abstract

In extreme cold weather, living organisms produce Antifreeze Proteins (AFPs) to counter the otherwise lethal intracellular formation of ice. Structures and sequences of various AFPs exhibit a high degree of heterogeneity, consequently the prediction of the AFPs is considered to be a challenging task. In this research, we propose to handle this arduous manifold learning task using the notion of localized processing. In particular, an AFP sequence is segmented into two sub-segments each of which is analyzed for amino acid and di-peptide compositions. We propose to use only the most significant features using the concept of information gain (IG) followed by a random forest classification approach. The proposed RAFP-Pred achieved an excellent performance on a number of standard datasets. We report a high Youden's index (sensitivity1) value of 0.75 on the standard independent test data set outperforming the AFP-PseAAC, AFP-PSSM, AFP-Pred, and iAFP by a margin of 0.05, 0.06, 0.14, and 0.68, respectively. The verification rate on the UniProKB dataset is found to be 83.19 percent which is substantially superior to the 57.18 percent reported for the iAFP method.

Original languageEnglish
Article number7590067
Pages (from-to)244-250
Number of pages7
JournalIEEE/ACM Transactions on Computational Biology and Bioinformatics
Volume15
Issue number1
DOIs
StatePublished - 1 Jan 2018
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2018 IEEE.

Keywords

  • Antifreeze protein
  • amino acid compositions
  • dipeptide compositions
  • localized analysis

ASJC Scopus subject areas

  • Biotechnology
  • Genetics
  • Applied Mathematics

Fingerprint

Dive into the research topics of 'RAFP-Pred: Robust Prediction of Antifreeze Proteins Using Localized Analysis of n-Peptide Compositions'. Together they form a unique fingerprint.

Cite this