AI Models Directory

Explore 62+ cutting-edge AI models for video creation, image generation, audio, and enhancement. Find the perfect tool for your creative projects.

Video Generation Models

Transform text descriptions and images into dynamic video content with our cutting-edge AI video models.

xAI Grok Imagine Video

Video generation with Grok Imagine

Seedance 2.0

ByteDance's most advanced video generation model, with native audio, physics, and camera control

Wan 2.7

Advanced open-source model with video generation, editing, and reference-based generation with audio support

Pixverse V6

Pixverse's latest V6 model. Up to 15s, resolution options, audio generation, and transitions.

Google Veo 3.1

The most advanced AI video generation model in the world, with integrated audio!

Google Veo 3.1 Fast

Faster and more cost-effective Veo 3.1 with integrated audio

Kling O3

Kling O3 Standard unified model: text-to-video, image-to-video, reference-to-video, edit video, and reference V2V.

Kling O3 Pro

Very high-quality but expensive multi-modal model

Kling O3 4K

Native 4K Kling O3 — cinema-grade clarity without upscaling

Kling 3.0 Standard

Kling 3.0 Standard with audio.

Kling 3.0 Motion Control

Cost-effective motion transfer from reference video to image for dance, gestures, and animation.

Kling 3.0 Motion Control Pro

Higher-quality motion transfer from reference video to image for dance, gestures, and animation.

Kling 3.0 Pro

Kling 3.0 Pro with audio.

Kling 3.0 4K

Native 4K Kling 3.0 with audio.

Kling 2.6 Pro

Kling 2.6 Pro. Great price for high-quality generation, with native audio.

Kling 2.6 Motion Control

Transfer character actions from a reference video to a reference image. Great for dance moves, gestures, and animations.

Kling 2.5 Turbo Pro

Kling 2.5 Turbo Pro. Great price for high-quality generation.

Kling 2.5 Turbo Standard

Best quality for the price.

OpenAI Sora 2

Fast video generation at 720p

OpenAI Sora 2 Pro

Professional-grade video at 720p

OpenAI Sora 2 Pro HD

Highest quality cinematic video at 1080p resolution

LTX 2.3

Fast open-source video with native audio. Sharp details, smooth motion.

LTX 2.3 Fast

Speed-optimized LTX with native audio. Up to 20 seconds, lower cost.

Minimax Hailuo 2.3 Pro

Balanced model, great for effects and camera motion

Seedance 1.0 Pro

Great model from ByteDance

Seedance 2.0 Fast

Faster, cheaper Seedance 2.0 for quick iterations

Wan 2.6 Flash

Faster and cheaper Wan 2.6 for quick iterations

VEED Background Removal

Remove backgrounds from videos with high-quality edge refinement.

Image Generation Models

Create stunning images from text descriptions with our state-of-the-art AI image generation models.

Nano Banana Pro

Google's latest version of Nano Banana, with the best reference and editing capabilities.

Nano Banana 2

Google's fast image generation and editing model built on Gemini 3.1 Flash.

Nano Banana

Google's groundbreaking model with great reference and editing capabilities.

xAI Grok Imagine Image

Text-to-image with Grok Imagine

Wan 2.7 Pro

High-quality image generation and editing from Wan 2.7 Pro

Wan 2.7

Image generation and editing from Wan 2.7

GPT Image 2

OpenAI's latest image model with strong text rendering and flexible editing.

GPT Image 1.5

An impressively advanced multi-modal image generator

Seedream 4.5

An updated image model from ByteDance

Seedream 5.0 Lite

ByteDance's latest lightweight image model with editing capabilities

Imagen 4

State-of-the-art image model

Flux 2 Pro

Great for high-quality image generation

Flux 2

Great all-around model

Flux 2 Klein

Ultra-fast image generation with enhanced realism and crisp text rendering.

Flux Schnell

Super fast, super cheap!

Recraft V3

State of the art for text rendering at release; we now recommend Nano Banana or GPT Image for text.

Birefnet V2

A fantastic model that can erase the background of any image!

Voice, Lip Sync & Audio-Video Sync

Synchronize audio with video content using advanced AI lip sync and voice technology.

ElevenLabs

Generate voiceovers with ElevenLabs

Voice Changer

Replicates audio in any voice, matching the timing and voice fluctuations of the original reference audio

Image Upscalers

Enhance your images and videos with powerful AI upscalers that improve quality, resolution, and detail.

Bria Video Upscaler

Upscale videos up to 8K

SeedVR Image Upscale

Powerful open-source image upscaler

SeedVR Video Upscaler

State-of-the-art video upscaler

Topaz Video Upscaler

Upscales your videos to a higher resolution

Wan Video Upscaler/Enhancer

Fantastic creative upscaler

Avatar & Talking Head Models

Create realistic talking-head videos and digital avatars powered by advanced AI.

HeyGen Avatar 4 Audio

Lip-sync any face photo to your audio

Pixverse Lipsync

Makes a person in your video lip-sync to your audio. Music works too!

Sync

Makes a person in your video say your written script!

Sync Audio

Makes a person in your video lip-sync to your audio. Music works too!

Sync Audio V3

Latest version of audio lip-syncing. Best quality.

Sync Audio Pro

Audio lip-syncing. Higher quality at a higher cost.

Music Generation Models

Generate original music and soundtracks with AI-powered composition tools.

ElevenLabs Music

ElevenLabs Music is available on AIVideo.com for music generation.

Lyria 3 Pro

Google Lyria 3 Pro — full-length songs (up to ~3 min) with structural awareness, vocals, and lyrics.

Lyria 3 Clip

Google Lyria 3 Clip — 30-second high-fidelity audio clips from text or image prompts.

Sound Effects Models

Create realistic sound effects and ambient audio with AI generation.

ElevenLabs SFX

ElevenLabs SFX is available on AIVideo.com for sound effects generation.

Ready to Create with AI?

Choose from 62+ state-of-the-art AI models to bring your creative vision to life. Whether you're generating videos, creating images, or enhancing existing content, we have the tools you need.