LTX Audio-to-Video
Start with audio, end with video. Upload your audio track — music, voiceover, sound — and generate video that is driven by and responds to the audio you provide.
What is LTX Audio-to-Video?
LTX Audio-to-Video inverts the usual AI video workflow. Rather than generating video first and adding audio, you start with an audio file and the model generates video that responds to it. The video output is driven by the audio's rhythm, energy, texture, and character.
This is particularly powerful for music visualization — take a track or stem and generate a visual companion that moves and develops in response to the sound. It's equally effective for podcast and voiceover content, where the audio is primary and you want a visual layer that matches the tone and pacing of the speech or narration.
For marketers and content creators who start the creative process with a music brief or voiceover script, LTX Audio-to-Video aligns with that audio-first workflow — rather than treating audio as an afterthought, it becomes the driver of the entire generation.
Audio input
Start with your audio file
Audio-driven video
Video responds to sound
Music visualisation
Visual companion for tracks
Full audio types
Music, voice, ambient, FX
How to use LTX Audio-to-Video on project.video
Open the composer
Go to your project.video dashboard and open the generation composer. LTX Audio-to-Video is available in the Lightricks model section.
Select Audio-to-Video
Choose LTX Audio-to-Video. The composer will prompt you to upload an audio file and show you the available generation options.
Upload your audio
Upload your audio file — music track, voiceover, ambient sound, or any audio you want to drive the video generation. The model analyses the audio to guide the visual output.
Add a prompt and generate
Write a text prompt describing the visual direction — style, subject matter, color palette. The model combines your audio input with the prompt to generate the video.
Technical specs
Best use cases
Music visualisation
Generate a visual companion for a music track — abstract, representational, or conceptual imagery that moves and develops in response to the music's rhythm and energy. Perfect for artist releases and streaming platform visuals.
Podcast and voiceover video
Add a compelling visual layer to podcast clips, voiceover narration, or explainer audio. The video responds to the pacing and energy of the speech, creating more engaging content for social sharing.
Ad creative from music briefs
When your ad brief starts with a music track, generate the visual from the audio rather than trying to find music that fits a pre-existing visual. Audio-first creation ensures the two elements are designed to work together.
Brand audio-to-visual identity
Generate visual content that responds consistently to your brand's audio signature — jingle, sonic logo, or brand track. The video output becomes an extension of your brand's audio identity.
Example prompts
Upload your audio and pair it with one of these prompts in the project.video composer.
"Electronic music track → Abstract geometric shapes morph and pulse in deep blue and cyan space, responding to the beat's energy and frequency, minimal and futuristic"
"Nature ambient audio with birdsong and water → A forest clearing at dawn, light filtering through canopy, gentle movement in leaves and water surface, calm and meditative"
"Voiceover narration on innovation → Close-up abstract macro shots of circuitry, light patterns, and data visualization concepts that shift with the narration pacing"