Auto Captions
The Auto Captions node uses OpenAI’s Whisper model to transcribe audio from a video and overlay styled captions. Customize fonts, colors, animations, and positioning.Inputs
| Handle ID | Data Type | Label |
|---|---|---|
video-in | Video | Video |
Outputs
| Handle ID | Data Type | Label |
|---|---|---|
video-out | Video | Video |
Available engines
| Engine ID | Label | Cost |
|---|---|---|
fal-whisper | Whisper | 1 credit |
How it works
- Receives a video with audio from the input handle
- Whisper transcribes the audio into timestamped words
- You configure the caption style (font, colors, animation, position)
- Captions are overlaid on the video
- The captioned video is output
Caption style options
| Parameter | Description | Options |
|---|---|---|
| Font Family | Caption font | Any supported font |
| Font Size | Text size | Numeric |
| Color | Main text color | Any color |
| Highlight Color | Active word highlight | Any color |
| Stroke Color | Text outline color | Any color |
| Stroke Width | Outline thickness | Numeric |
| Background Color | Optional text background | Any color |
| Background Radius | Rounded corners for background | Numeric |
| Animation | Word appearance style | pop, fade, slide, bounce, shake, zoom, none |
| Animation Scope | Animate per word or per group | word or group |
| Word Group Size | Words shown at once | Numeric |
| Position | Vertical position (0–100%) | Percent from top |
| Uppercase | Force uppercase | On/Off |
| Shadow Color | Text shadow | Any color |
| Shadow Blur | Shadow spread | Numeric |
Credit cost
1 credit per transcription (Whisper).Tips
- This node is interactive — you configure caption styles during flow execution
- The
popandbounceanimations work well for short-form vertical content - Use
wordanimation scope for a karaoke-style effect - Use
groupanimation scope for a more natural reading experience - Position captions in the lower third (position: 70–80) for standard placement
Example use cases
- Adding subtitles to voiceover narration
- Creating TikTok/Shorts-style word-by-word captions
- Adding accessibility captions to any video with audio