Description
🚀 Feature
I would like to propose adding an essential evaluation metric for 3D talking heads to the TorchMetrics library: Upper-Face Dynamics Deviation (FDD).
Motivation
TorchMetrics currently offers no dedicated metrics for evaluating 3D talking heads other than Lip Vertex Error (LVE). I think this metric would also fit in the multimodal folder of the library.
Pitch
This metric is widely used in speech-driven facial animation research. It measures how the variation of facial dynamics in a generated motion sequence compares with that of the ground truth. Concretely, it indicates how close the standard deviation of upper-face motion in sequences generated from test-set audio is to the variation observed in the ground truth.
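To make the computation concrete, here is a minimal pure-Python sketch of how I understand FDD from the paper's description: for each upper-face vertex, take the L2 norm of its displacement from the neutral template in every frame, compute the standard deviation of those norms over time, and average the ground-truth-minus-prediction differences over the upper-face vertices. The function names, the argument layout (a sequence of frames, each a list of `(x, y, z)` vertices), the use of the population standard deviation, and the sign convention are my assumptions for illustration, not an existing TorchMetrics API.

```python
import math
from statistics import pstdev


def vertex_dynamics(frames, template, vertex):
    """Std over time of the L2 norm of one vertex's displacement from the template.

    Assumption: population standard deviation (divide by N).
    """
    norms = [math.dist(frame[vertex], template[vertex]) for frame in frames]
    return pstdev(norms)


def upper_face_dynamics_deviation(pred, target, template, upper_face_ids):
    """Hypothetical FDD: mean over upper-face vertices of
    dyn(ground truth) - dyn(prediction).

    pred, target: list of frames, each frame a list of (x, y, z) vertices.
    template: neutral face, a list of (x, y, z) vertices.
    upper_face_ids: indices of the upper-face vertices (assumed to be given).
    """
    if not upper_face_ids:
        raise ValueError("upper_face_ids must not be empty")
    diffs = [
        vertex_dynamics(target, template, v) - vertex_dynamics(pred, template, v)
        for v in upper_face_ids
    ]
    return sum(diffs) / len(diffs)


# Toy example: two vertices, two frames.
template = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
target = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(0.0, 2.0, 0.0), (1.0, 0.0, 0.0)],  # vertex 0 moves, vertex 1 is static
]
pred = [list(template), list(template)]  # prediction never moves
fdd = upper_face_dynamics_deviation(pred, target, template, [0, 1])
```

A value near zero means the generated upper-face motion varies about as much as the ground truth. In an actual metric class the same steps would be vectorized over tensors, and the upper-face vertex indices would need to be supplied by the user since they depend on the mesh topology.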
Reference
- Paper: CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior
Additional context
If this is agreed, I would like to open a PR for it.
bhimrazy
Labels: enhancement (New feature or request)
