Rule mining techniques to predict prokaryotic metabolic pathways

Rabie Saidi*, Imane Boudellioua, Maria J. Martin, Victor Solovyev

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

3 Scopus citations

Abstract

It is becoming more evident that computational methods are needed for the identification and the mapping of pathways in new genomes. We introduce an automatic annotation system (ARBA4Path Association Rule-Based Annotator for Pathways) that utilizes rule mining techniques to predict metabolic pathways across wide range of prokaryotes. It was demonstrated that specific combinations of protein domains (recorded in our rules) strongly determine pathways in which proteins are involved and thus provide information that let us very accurately assign pathway membership (with precision of 0.999 and recall of 0.966) to proteins of a given prokaryotic taxon. Our system can be used to enhance the quality of automatically generated annotations as well as annotating proteins with unknown function. The prediction models are represented in the form of human-readable rules, and they can be used effectively to add absent pathway information to many proteins in UniProtKB/TrEMBL database.

Original languageEnglish
Title of host publicationMethods in Molecular Biology
PublisherHumana Press Inc.
Pages311-331
Number of pages21
DOIs
StatePublished - 2017
Externally publishedYes

Publication series

NameMethods in Molecular Biology
Volume1613
ISSN (Print)1064-3745

Bibliographical note

Publisher Copyright:
© Springer Science+Business Media LLC 2017.

Keywords

  • Automatic annotation
  • Functional genomics
  • Machine learning
  • Pathway prediction
  • Proteomics
  • Rule mining

ASJC Scopus subject areas

  • Molecular Biology
  • Genetics

Fingerprint

Dive into the research topics of 'Rule mining techniques to predict prokaryotic metabolic pathways'. Together they form a unique fingerprint.

Cite this