Bayesian optimization-guided machine learning models for bio-oil yield prediction from lignocellulosic biomass pyrolysis

  • Ricardo Ervilha*
  • , Deivid Campos
  • , Bruno da Silva Macêdo
  • , Daniel A. Bertuol
  • , Eduardo H. Tanabe
  • , Zaher Mundher Yaseen
  • , Matteo Bodini
  • , Camila M. Saporetti
  • , Leonardo Goliatt
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

The transition to renewable energy needs efficient biomass conversion technologies. Pyrolysis converts lignocellulosic biomass into bio-oil, but optimizing yield remains challenging due to complex nonlinear relationships. The present study introduces a novel hybrid framework integrating Bayesian optimization with five machine learning algorithms (CatBoost, Random Forest, K-Nearest Neighbors, Artificial Neural Networks, and ElasticNet) to enhance bio-oil yield prediction. Relying on a dataset of 329 data points encompassing biomass composition and pyrolysis conditions, Bayesian optimization efficiently tuned hyperparameters, overcoming limitations of traditional methods. The CatBoost model outperformed others, achieving a Mean Absolute Error (MAE) of 2.251±0.354 and R2=0.861±0.061. Feature importance and partial dependence analysis identified Hemicellulose (10–15 wt%) and Heating Rate (700–800 °C/min) as critical variables for yield maximization. The developed work demonstrates the transformative potential of Bayesian-optimized ML for sustainable bioenergy production.

Original languageEnglish
Article number108280
JournalBiomass and Bioenergy
Volume203
DOIs
StatePublished - Dec 2025

Bibliographical note

Publisher Copyright:
© 2025 Elsevier Ltd

Keywords

  • Energy security
  • Machine learning
  • Optimization
  • Renewable energy
  • Sustainability

ASJC Scopus subject areas

  • Forestry
  • Renewable Energy, Sustainability and the Environment
  • Agronomy and Crop Science
  • Waste Management and Disposal

Fingerprint

Dive into the research topics of 'Bayesian optimization-guided machine learning models for bio-oil yield prediction from lignocellulosic biomass pyrolysis'. Together they form a unique fingerprint.

Cite this