Uni MS-PS: A multi-scale encoder-decoder transformer for universal photometric stereo

Photometric Stereo (PS) addresses the challenge of reconstructing a three-dimensional (3D) representation of an object by estimating the 3D normals at all points on the object’s surface. This is achieved through the analysis of at least three photographs, all taken from the same viewpoint but with d...

Full description

Saved in:
Bibliographic Details
Published in:Computer vision and image understanding Vol. 248; p. 104093
Main Authors: Hardy, Clément, Quéau, Yvain, Tschumperlé, David
Format: Journal Article
Language:English
Published: Elsevier Inc 01.11.2024
Elsevier
Subjects:
ISSN:1077-3142, 1090-235X
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Photometric Stereo (PS) addresses the challenge of reconstructing a three-dimensional (3D) representation of an object by estimating the 3D normals at all points on the object’s surface. This is achieved through the analysis of at least three photographs, all taken from the same viewpoint but with distinct lighting conditions. This paper introduces a novel approach for Universal PS, i.e., when both the active lighting conditions and the ambient illumination are unknown. Our method employs a multi-scale encoder–decoder architecture based on Transformers that allows to accommodates images of any resolutions as well as varying number of input images. We are able to scale up to very high resolution images like 6000 pixels by 8000 pixels without losing performance and maintaining a decent memory footprint. Moreover, experiments on publicly available datasets establish that our proposed architecture improves the accuracy of the estimated normal field by a significant factor compared to state-of-the-art methods. Code and dataset available at: https://clement-hardy.github.io/Uni-MS-PS/index.html. •Universal photometric stereo for all lighting and environments.•3D reconstruction method that scales up to very high-resolution images.•New diverse training synthetic dataset for photometric stereo.
ISSN:1077-3142
1090-235X
DOI:10.1016/j.cviu.2024.104093