Skip to main content

AI Models

Hyperclip integrates 15+ AI models across text, image, video, audio, and music generation. Each model has different strengths in quality, speed, and cost.

Model categories

CategoryModelsUsed By
Image Models6 enginesImage Generation, Multi-Angle
Video Models10 enginesText-to-Video, Image-to-Video
Text Models2 enginesScript Generation, AI Transform, Mixed Media, Clip Finder
Audio Models2 enginesVoiceover, Auto Captions
Music Models5 enginesAI Music Generation
Motion Transfer2 enginesMotion Transfer

Choosing the right model

Consider these tradeoffs:

Quality vs. cost

  • Budget-friendly: LTX 2.3 (video), GPT Image 1.5 (image), Gemini Flash (text), Stable Audio Open (music)
  • Premium quality: Kling O3/Veo 3.1 (video), Nano Banana (image), Gemini Pro (text), ElevenLabs (music)

Speed vs. quality

Many models offer tier options:
  • Standard/Pro — Higher quality, slower, costs more
  • Flash/Fast — Lower quality, faster, costs less

Use case matching

  • Talking heads / character animation → Motion Transfer (DreamActor v2)
  • Cinematic scenes → Veo 3.1 or Kling O3
  • Fast iteration → Wan 2.6 Flash or LTX 2.3 Fast
  • Photo-realistic images → Nano Banana Pro or FLUX.1 Pro
  • Budget-conscious → LTX 2.3 + GPT Image 1.5 + Gemini Flash

Quick cost comparison

TaskCheapestMost Expensive
Generate 1 image2 credits (GPT Image 1.5)12 credits (Nano Banana Pro)
Generate 5s video2 credits (LTX 2.3)10 credits (Kling O3 Pro)
Generate script1 credit (Gemini Flash)2 credits (Gemini Pro)
Add voiceover4 credits (ElevenLabs)4 credits (only option)
Generate music3 credits (Stable Audio)8 credits (Minimax/ElevenLabs)