Abstract
Extracelluar matrix (ECM) proteins create complex networks of macromolecules which fill-in the extracellular spaces of living tissues. They provide structural support and play an important role in maintaining cellular functions. Identification of ECM proteins can play a vital role in studying various types of diseases. Conventional wet lab–based methods are reliable; however, they are expensive and time consuming and are, therefore, not scalable. In this research, we propose a sequence-based novel machine learning approach for the prediction of ECM proteins. In the proposed method, composition of k-spaced amino acid pair (CKSAAP) features are encoded into a classifiable latent space (LS) with the help of deep latent space encoding (LSE). A comprehensive ablation analysis is conducted for performance evaluation of the proposed method. Results are compared with other state-of-the-art methods on the benchmark dataset, and the proposed ECM-LSE approach has shown to comprehensively outperform the contemporary methods.
| Original language | English |
|---|---|
| Article number | 752658 |
| Journal | Frontiers in Bioengineering and Biotechnology |
| Volume | 9 |
| DOIs | |
| State | Published - 14 Oct 2021 |
| Externally published | Yes |
Bibliographical note
Publisher Copyright:© Copyright © 2021 Al-Saggaf, Usman, Naseem, Moinuddin, Jiman, Alsaggaf, Alshoubaki and Khan.
Keywords
- amino acid composition (AAC)
- auto-encoder
- classification
- composition of k-spaced amino acid pair (CKSAAP)
- extracellular matrix (ECM)
- latent space learning
- neural network
ASJC Scopus subject areas
- Biotechnology
- Bioengineering
- Histology
- Biomedical Engineering