Google introduces Veo 3.1
inside Flow
, and yes, AI video editing just leveled up. Can you imagine generating a scene and then fine‑tuning it down to the last detail, including audio that sounds natural? That's exactly what this update aims for.
What Veo 3.1
brings to Flow
The update emphasizes creative control and better audiovisual quality. Now Flow includes generated audio in features that were previously visual-only, and the model promises more realism and closer adherence to your instructions. These improvements come because Google says it learned from massive Flow usage and community feedback. (blog.google)
Highlighted features
-
Ingredients to Video: you can use several reference images to control characters, objects and style, and Flow blends those “ingredients” into a coherent scene. (blog.google)
-
Frames to Video: give a start frame and an end frame, and the system generates the visual bridge between them. Great for artistic transitions or epic scenes. (blog.google)
-
Extend: lengthen an existing clip to create longer shots (even a minute or more), connecting the new section to the original clip's end. Very useful for establishing shots or continuous sequences. (blog.google)
More precise editing: insert, remove and adjust
Flow now lets you edit within the creative process: Insert elements that respect lighting and shadows so they integrate naturally, and soon you'll be able to remove objects or characters and reconstruct the background as if they were never there. If you've ever worked with complex frames, you know how valuable that can be. (blog.google)
These capabilities are experimental and evolving; Google encourages users to try them and provide feedback to keep improving. (blog.google)
Where and for whom it's available
Veo 3.1
ships integrated in Flow
for creators, and the model is also available to developers and businesses via the Gemini API, Vertex AI and the Gemini app. So whether you're an individual creator or part of a product team or agency, you can access these capabilities. (blog.google)
What does this mean in practice?
-
For independent creators: faster prototyping and results with integrated sound, without relying so much on external audio resources.
-
For teams and agencies: the ability to iterate scenes and ads more quickly, keeping visual and sonic consistency.
-
For developers: the model is available via API, making it easier to integrate into your workflows and products. (blog.google)
Think of this as moving from a visual mockup to a scene that already sounds good and behaves believably. It doesn't replace a full human team, but it speeds up creative stages and opens possibilities for people who have great ideas but few resources.
Final reflection
The Veo 3.1
update in Flow
shows how AI is moving from being just a visual trick to becoming an integral tool for audiovisual storytelling. The big question? How will you use these capabilities responsibly and creatively to tell stories that connect.