Skip to main navigation Skip to search Skip to main content

Learning Monocular Face Reconstruction using Multi-View Supervision

  • Zhixin Shu
  • , Duygu Ceylan
  • , Kalyan Sunkavalli
  • , Eli Shechtman
  • , Sunil Hadap
  • , Dimitris Samaras
  • Stony Brook University
  • Adobe Systems Incorporated

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

We present a method to reconstruct faces from a single portrait image. While traditional face reconstruction methods fit low-dimensional 3D morphable models to images, we train a deep network to regress depth from a single image directly. We do so by combining supervised losses on synthetic data with indirect supervision on real data using a novel multi-view photo-consistency loss. Furthermore, we regularize the depth estimation using a 3D morphable model (3DMM). We demonstrate that this leads to results that preserve facial features, capture facial geometry that goes beyond 3DMMs, and is also robust to viewpoint conditions. We evaluate our method on various datasets and via ablation studies, and demonstrate that it outperforms previous work significantly.

Original languageEnglish
Title of host publicationProceedings - 2020 15th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2020
EditorsVitomir Struc, Francisco Gomez-Fernandez
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages241-248
Number of pages8
ISBN (Electronic)9781728130798
DOIs
StatePublished - Nov 2020
Event15th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2020 - Buenos Aires, Argentina
Duration: Nov 16 2020Nov 20 2020

Publication series

NameProceedings - 2020 15th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2020

Conference

Conference15th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2020
Country/TerritoryArgentina
CityBuenos Aires
Period11/16/2011/20/20

Fingerprint

Dive into the research topics of 'Learning Monocular Face Reconstruction using Multi-View Supervision'. Together they form a unique fingerprint.

Cite this