ScanTalk: 3D Talking Heads from Unregistered Scans

Speech-driven 3D talking head generation has emerged as a significant area of research and presents numerous challenges. Existing methods can only animate faces with a fixed topology, in which point-wise correspondence is established and the number and order of points remain consistent across all identities the model can animate. In this work, we present ScanTalk, a novel framework capable of animating 3D faces in arbitrary topologies, including scanned data. Our approach relies on the DiffusionNet architecture to overcome the fixed-topology constraint, offering promising avenues for more flexible and realistic 3D animations. By leveraging DiffusionNet, ScanTalk not only adapts to diverse facial structures but also maintains fidelity when dealing with scanned data, thereby enhancing the authenticity and versatility of the generated 3D talking heads. Through comprehensive comparisons with state-of-the-art methods, we validate the efficacy of our approach, demonstrating its capacity to generate realistic talking heads comparable to those of existing techniques. Our primary objective is to develop a generic method free from topological constraints, whereas all state-of-the-art methods are bound by such limitations. Code for reproducing our results and the pre-trained model are available at https://github.com/miccunifi/ScanTalk.
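The fixed-topology constraint described in the abstract can be illustrated with a toy sketch. This is not ScanTalk's architecture and it does not implement DiffusionNet; the function names, feature sizes, and random weights below are all hypothetical. It only contrasts a decoder whose output size is tied to one vertex count with a decoder that applies the same weights to every vertex and therefore accepts meshes of any size and vertex order.

```python
import numpy as np

AUDIO_DIM = 4  # hypothetical size of a per-frame audio feature
DESC_DIM = 3   # hypothetical per-vertex descriptor (here simply the xyz position)


def fixed_topology_decoder(audio_feat, weights):
    """Predict displacements for exactly N vertices in a fixed order.

    N is baked into the weight matrix, so the decoder cannot animate a mesh
    with a different vertex count or ordering.
    """
    n_vertices = weights.shape[1] // 3
    return (audio_feat @ weights).reshape(n_vertices, 3)


def per_vertex_decoder(vertices, audio_feat, weights):
    """Apply the same linear map to every vertex (toy stand-in, not DiffusionNet).

    Each vertex descriptor is concatenated with the audio feature, so the
    output has one displacement per input vertex, whatever the vertex count.
    """
    tiled = np.tile(audio_feat, (vertices.shape[0], 1))
    inputs = np.concatenate([vertices, tiled], axis=1)
    return inputs @ weights


rng = np.random.default_rng(0)
audio = rng.normal(size=AUDIO_DIM)

# Untrained random weights, for shape illustration only.
w_fixed = rng.normal(size=(AUDIO_DIM, 5 * 3))          # tied to 5 vertices
w_shared = rng.normal(size=(DESC_DIM + AUDIO_DIM, 3))  # independent of vertex count

mesh_a = rng.normal(size=(5, 3))  # stand-in for a registered 5-vertex mesh
mesh_b = rng.normal(size=(8, 3))  # stand-in for a scan with 8 vertices

disp_fixed = fixed_topology_decoder(audio, w_fixed)
disp_a = per_vertex_decoder(mesh_a, audio, w_shared)
disp_b = per_vertex_decoder(mesh_b, audio, w_shared)

print(disp_fixed.shape, disp_a.shape, disp_b.shape)
```

The shared-weight decoder in this sketch treats vertices independently and ignores the surface entirely. The abstract credits DiffusionNet with overcoming the fixed-topology constraint, and the sketch makes no attempt to reproduce how it does so.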

Keywords

3D Talking Heads, 3D Scans, Animation, DiffusionNet

Domains

Computer Vision and Pattern Recognition [cs.CV]

Main file

ECCV_2024_ScanTalk___Camera_Ready_Version-2.pdf (5.41 MB)

Origin: Files produced by the author(s)

Contributor: Mohamed DAOUDI

https://hal.science/hal-04732491

Submitted on: Friday, October 11, 2024, 14:14:01

Last modified on: Saturday, October 12, 2024, 08:42:27

Dates and versions

hal-04732491, version 1 (11-10-2024)

Identifiers

HAL Id: hal-04732491, version 1

Cite

Federico Nocentini, Thomas Besnier, Claudio Ferrari, Sylvain Arguillere, Stefano Berretti, et al. ScanTalk: 3D Talking Heads from Unregistered Scans. European Conference on Computer Vision (ECCV), Sep 2024, Milan, Italy. ⟨hal-04732491⟩

Collections

CNRS INSMI CRISTAL GENCI CRISTAL-3D-SAM UNIV-LILLE ANR IMT-NORD-EUROPE LPP-MATH INSTITUT-MINES-TELECOM
