Google researchers find novel way of turning a single photo of a human into AI-generated video good enough to make you think 'this might go badly'

A screenshot showing a source photograph of a person talking and an AI generated version of them speaking.
(Image credit: Enric Corona et al)

Google researchers have found a way to generate video of a human from just a single still image. This makes it possible to do things like generate a video of someone speaking from input text, or change a person's mouth movements to match an audio track in a different language to the one originally spoken. It also feels like a slippery slope into identity theft and misinformation, but what's AI without a hint of frightening consequences?

The tech itself is rather interesting: it's called Vlogger by the Google researchers who published the paper. In it, the authors (Enric Corona et al) offer up various examples of how the AI takes a single input image of a human (in this case, I believe, mostly AI-generated humans) and, with an audio file, produces both facial and bodily movements for them to match.

That's just one of a few potential use cases for the tech. Another is editing video, specifically a video subject's facial expressions. In an example, the researchers show various versions of the same clip: one has a presenter speaking to camera, another with the presenter's mouth closed in an eerie fashion, another with their eyes closed. My favourite is the video of the presenter with their eyes artificially held open by the AI, unblinking. Huge serial killer vibes. Thanks, AI.

The most useful feature, in my opinion, is the ability to swap a video's audio track for a dubbed foreign-language version and have the AI lip-sync the person's facial movements to the new audio.

It works through the use of two stages: "1) a stochastic human-to-3d-motion diffusion model, and 2) a novel diffusion based architecture that augments text-to-image models with both temporal and spatial controls. This approach enables the generation of high quality videos of variable length, that are easily controllable through high-level representations of human faces and bodies," the GitHub page says.

Admittedly, the tech isn't perfect. In the examples given, the mouth movements have certain qualities common across AI-generated video content. It's also pretty creepy at times, as noted by users responding to a thread about the technology by EyeingAI on X. But Vlogger doesn't need to fool everyone, or even fool anyone at all, to have some use. Then again, if it were a more perfect technology, it'd be even more worrying to think about how it could be used to create deepfakes, spread misinformation, or steal identities. We'll get there one day, and I for one hope we have a better handle on how to deal with this stuff by then.

Jacob Ridley
Managing Editor, Hardware

Jacob earned his first byline writing for his own tech blog. From there, he graduated to professionally breaking things as hardware writer at PCGamesN, and would go on to run the team as hardware editor. He joined PC Gamer's top staff as senior hardware editor before becoming managing editor of the hardware team, and you'll now find him reporting on the latest developments in the technology and gaming industries and testing the newest PC components.
