Google’s new AI generates audio soundtracks from pixels

An AI-generated wolf howling
Google DeepMind

DeepMind showed off the latest results from its generative AI video-to-audio research on Tuesday. The novel system combines what it sees on-screen with the user’s written prompt to create synced audio soundscapes for a given video clip.

The V2A AI can be paired with video-generation models like Veo, DeepMind’s generative audio team wrote in a blog post, and can create soundtracks, sound effects, and even dialogue for the on-screen action. What’s more, DeepMind claims that its new system can generate “an unlimited number of soundtracks for any video input” by tuning the model with positive and negative prompts that encourage or discourage the use of a particular sound, respectively.
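DeepMind hasn’t published an interface for any of this, so the sketch below is only a loose illustration of what that prompt-steered workflow might look like; `generate_soundtrack` and its parameters are hypothetical stand-ins, not a real API.

```python
def generate_soundtrack(video_path: str,
                        positive_prompt: str = "",
                        negative_prompt: str = "") -> str:
    """Hypothetical stand-in for the unreleased V2A model: the positive
    prompt encourages certain sounds, the negative prompt suppresses them."""
    return (f"soundtrack for {video_path} "
            f"(more: {positive_prompt!r}, less: {negative_prompt!r})")

# Varying the prompt pair over the same clip yields different soundtracks,
# which is where "an unlimited number of soundtracks" per video comes from.
print(generate_soundtrack("wolf_clip.mp4",
                          positive_prompt="wolf howl, wind in trees",
                          negative_prompt="music, human speech"))
```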

The system works by first encoding and compressing the video input. A diffusion model then iteratively refines the audio from random noise, guided by that visual encoding and by the user’s optional text prompt. The audio output is finally decoded into a waveform that can be recombined with the video input.
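Since the model itself isn’t public, the following toy sketch only mirrors the shape of that pipeline; every function here (`encode_video`, `denoise_step`, and so on) is an assumed stand-in, not DeepMind’s implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

def encode_video(frames):
    """Stand-in encoder: compress video frames into a small conditioning vector."""
    return frames.mean(axis=(0, 1))

def encode_prompt(prompt):
    """Stand-in text encoder: fold the optional prompt into the same space."""
    emb = np.zeros(3)
    for i, word in enumerate(prompt.split()):
        emb[i % 3] += len(word)
    return emb / max(len(prompt.split()), 1)

def denoise_step(audio, cond, t):
    """Toy diffusion step: nudge the noisy audio toward the conditioning."""
    return audio + 0.1 * (cond.mean() - audio) * (1 - t)

def decode_audio(latent):
    """Stand-in decoder: map the refined latent back to a waveform."""
    return np.tanh(latent)

# Toy inputs: 8 frames of dummy "video" plus an optional text prompt.
frames = rng.random((8, 16, 3))
cond = encode_video(frames) + encode_prompt("wolf howling at the moon")

# Start from pure random noise and iteratively refine it, conditioned on
# the video encoding and the prompt, as the diffusion model would.
audio = rng.standard_normal(16_000)          # 1 second at 16 kHz
for step in range(50):
    audio = denoise_step(audio, cond, step / 50)

waveform = decode_audio(audio)               # final audio, ready to recombine
print(waveform.shape, waveform.min(), waveform.max())
```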

The best part is that the user doesn’t have to go in and manually (read: tediously) sync the audio and video tracks, as the V2A system does it automatically. “By training on video, audio and the additional annotations, our technology learns to associate specific audio events with various visual scenes, while responding to the information provided in the annotations or transcripts,” the DeepMind team wrote.

The system is not yet perfected, however. For one, the output audio quality depends on the fidelity of the video input: the system gets tripped up when video artifacts or other distortions are present. And according to the DeepMind team, lip-syncing generated dialogue remains an ongoing challenge.

“V2A attempts to generate speech from the input transcripts and synchronize it with characters’ lip movements,” the team explained. “But the paired video-generation model may not be conditioned on transcripts. This creates a mismatch, often resulting in uncanny lip-syncing, as the video model doesn’t generate mouth movements that match the transcript.”

The system still needs to undergo “rigorous safety assessments and testing” before the team will consider releasing it to the public. Every video and soundtrack generated by the system will be affixed with DeepMind’s SynthID watermarks. V2A is far from the only audio-generating AI on the market, either: Stability AI dropped a similar product just last week, while ElevenLabs released its sound effects tool last month.