Abstract
While weakly supervised multi-view face reconstruction (MVR) is garnering increased attention, one critical issue still remains open: how to effectively interact and fuse multiple image information to reconstruct high-precision 3D models. In this regard, we propose a novel pipeline called Deep Fusion MVR (DF-MVR) to explore the feature correspondences between multi-view images and reconstruct high-precision 3D faces. Specifically, we present a novel multi-view feature fusion backbone that utilizes face masks to align features from multiple encoders and integrates one multi-layer attention mechanism to enhance feature interaction and fusion, resulting in one unified facial representation. Additionally, we develop one concise face mask mechanism that facilitates multi-view feature fusion and facial reconstruction by identifying common areas and guiding the network's focus on critical facial features (e.g., eyes, brows, nose, and mouth). Experiments on Pixel-Face and Bosphorus datasets indicate the superiority of the proposed method. Without the 3D annotation, DF-MVR achieves relative 5.2% and 3.0% RMSE improvement over the existing weakly supervised MVRs, respectively, on Pixel-Face and Bosphorus datasets. Our code is available at https://github.com/weiguangzhao/DF-MVR.
| Original language | English |
|---|---|
| Title of host publication | 2025 IEEE International Conference on Multimedia and Expo |
| Subtitle of host publication | Journey to the Center of Machine Imagination, ICME 2025 - Conference Proceedings |
| Publisher | IEEE Computer Society |
| ISBN (Electronic) | 9798331594954 |
| DOIs | |
| State | Published - 2025 |
| Externally published | Yes |
| Event | 2025 IEEE International Conference on Multimedia and Expo, ICME 2025 - Nantes, France Duration: 30 Jun 2025 → 4 Jul 2025 |
Publication series
| Name | Proceedings - IEEE International Conference on Multimedia and Expo |
|---|---|
| ISSN (Print) | 1945-7871 |
| ISSN (Electronic) | 1945-788X |
Conference
| Conference | 2025 IEEE International Conference on Multimedia and Expo, ICME 2025 |
|---|---|
| Country/Territory | France |
| City | Nantes |
| Period | 30/06/25 → 4/07/25 |
Bibliographical note
Publisher Copyright:© 2025 IEEE.
Keywords
- Attention
- Face mask
- Face reconstruction
- Feature fusion
- Multi-view
ASJC Scopus subject areas
- Computer Science Applications
- Computer Networks and Communications
Fingerprint
Dive into the research topics of 'From 2D Images to 3D Model: Weakly Supervised Multi-View Face Reconstruction with Deep Fusion'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver