Microsoft AI creates talking deepfakes from single photo

Microsoft Research Asia has released an AI model that can generate realistic, talking deepfake videos from a single still image and an audio track.

The model was trained on footage of approximately 6,000 talking faces from the VoxCeleb2 dataset. Given a still image, it can animate the face to lip-sync with a supplied voice track, producing realistic facial expressions and natural head movements.

The technology, called VASA-1, can reportedly generate lip-synced video at a resolution of 512x512 pixels and up to 40 frames per second with negligible starting latency.
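
To make the input/output shape concrete, the sketch below frames that pipeline as a function that takes a single face photo plus an audio waveform and returns 512x512 frames at 40 fps. Microsoft has not published VASA-1 code or a public API, so every name and signature here is hypothetical and the model call is a stand-in placeholder, not the actual method.

# Illustrative sketch only: Microsoft has not released VASA-1 code or an API,
# so the generator below is a placeholder. All names are hypothetical.
import numpy as np

RESOLUTION = 512   # VASA-1 reportedly outputs 512x512 video
FPS = 40           # reported generation rate, up to 40 frames per second

def generate_talking_head(face_image: np.ndarray,
                          audio_waveform: np.ndarray,
                          sample_rate: int) -> np.ndarray:
    """Placeholder for a single-image talking-head generator.

    A real model would condition on the audio to drive lip motion,
    facial expression, and head pose; here we simply repeat the input
    frame for the duration of the audio to show the expected shapes.
    """
    duration_s = len(audio_waveform) / sample_rate
    n_frames = int(round(duration_s * FPS))
    # Output shape: (frames, height, width, channels)
    return np.repeat(face_image[np.newaxis, ...], n_frames, axis=0)

if __name__ == "__main__":
    face = np.zeros((RESOLUTION, RESOLUTION, 3), dtype=np.uint8)  # stand-in photo
    audio = np.zeros(16000 * 5, dtype=np.float32)                 # 5 s of silence at 16 kHz
    video = generate_talking_head(face, audio, sample_rate=16000)
    print(video.shape)  # (200, 512, 512, 3): 5 s at 40 fps

The point of the sketch is only the interface: one photo and one audio track in, a fixed-resolution frame sequence out at the reported frame rate.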

Photo credit: Microsoft Research Asia