Gemini Omni Flash is now available in Envato

Gemini Omni Flash is now part of Envato's AI video generator, bringing multimodal creation, conversational editing, and richer video workflows.

Ryan Cheng 4min read
A skateboarder in mid-air, framed by a hand, against a colorful wall and building, with text 'Google Gemini Omni Flash is on Envato'.

The next evolution of AI-powered video creation has arrived.

We’re excited to announce that Gemini Omni Flash is now part of the AI models powering Envato’s AI video generator.

While you won’t need to choose between individual models, every advancement helps improve what’s possible with AI-powered video creation. We handle the technology behind the scenes so you can focus on creating, while benefiting from the latest innovations as they become available.

Gemini Omni Flash introduces powerful multimodal capabilities, more intuitive editing workflows, and new ways to work with references, context, and creative direction.

Why this matters for you

Video creation is becoming more flexible, iterative, and connected across different media.

With Gemini Omni Flash now helping power AI video generation on Envato, you can work with text, images, and video together, refine content through natural language instructions, and build on existing ideas without constantly starting over. The result is a more intuitive creative process that gives you greater control over how videos are created and refined.

Meet Gemini Omni Flash

Gemini Omni Flash is the first model in Google’s Gemini Omni family, combining Gemini’s reasoning capabilities with advanced multimodal creation and editing.

Designed to work across text, images, and video, Gemini Omni Flash supports creative workflows that go beyond simple generation. It helps creatives build, edit, refine, and evolve content using a wider range of inputs and more natural forms of interaction.

What’s enhanced with Gemini Omni Flash

World knowledge and context

Gemini Omni Flash brings a deeper understanding of locations, environments, cultural references, and historical settings, helping generate content that feels more grounded and believable. It can also render clearer, more realistic text within scenes while supporting more natural interactions between characters, objects, and environments.

Whether you’re creating a modern city street, a historical setting, or a location inspired by a specific culture, Gemini Omni Flash is designed to better understand the context behind the scene and reflect it more accurately in the final output.

Multimodal creation

    Gemini Omni Flash is built to work with text, images, and video, making it easier to bring different creative inputs together in a single workflow. Creatives can use references to guide generation, maintain character consistency across scenes, transfer styles and motion between assets, and even transform sketches or rough concepts into video sequences.

    By combining multiple forms of input, Gemini Omni Flash opens up new ways to turn inspiration, references, and existing creative assets into richer video outputs.

    Conversational editing

    Gemini Omni Flash introduces more advanced editing workflows powered by natural language instructions. Instead of starting over, creatives can refine existing videos by changing actions, replacing objects, updating scenes, adjusting camera perspectives, modifying characters, and evolving creative direction through an ongoing editing process.

    This conversational approach makes it easier to experiment, iterate, and develop ideas while maintaining continuity across edits and revisions.

    What this means for you

    More control over video creation

    Create and refine videos with greater flexibility by making changes through natural language instructions instead of repeatedly generating new versions from scratch. Small adjustments, creative experimentation, and ongoing refinement become faster and more intuitive.

    Better use of creative references

    Guide video generation using combinations of images, videos,, and text prompts within a single workflow. Multiple creative inputs can work together to shape a more cohesive, intentional final result.

    Stronger consistency across outputs

    Maintain continuity across scenes, edits, and creative iterations with support for reference-based workflows, character consistency, and more context-aware generation. This makes it easier to build projects that feel connected from beginning to end.

    Richer, more context-aware results

    Gemini Omni Flash demonstrates a stronger understanding of locations, historical settings, cultural references, real-world environments, motion, and physics interactions. The result is content that can feel more realistic, relevant, and aligned with your creative intent.

    Start creating with Gemini Omni Flash

    Gemini Omni Flash is now helping power video generation on Envato, giving creatives new ways to generate, edit, and refine video content through multimodal creation and more intuitive editing workflows.

    Whether you’re building from a prompt, working from references, refining an existing concept, or exploring new creative directions, Gemini Omni Flash expands what’s possible with AI-powered video creation.

    Ready to see what you can create? 

    Start generating videos today with Envato’s AI video generator.

    Gemini Omni Flash FAQs

    Related Posts