Auto Captions

The Auto Captions node uses OpenAI’s Whisper model to transcribe audio from a video and overlay styled captions. Customize fonts, colors, animations, and positioning.

Inputs

| Handle ID | Data Type | Label |
| --- | --- | --- |
| video-in | Video | Video |

Outputs

| Handle ID | Data Type | Label |
| --- | --- | --- |
| video-out | Video | Video |

Available engines

| Engine ID | Label | Cost |
| --- | --- | --- |
| fal-whisper | Whisper | 1 credit |

How it works

  1. Receives a video with audio from the input handle
  2. Whisper transcribes the audio into timestamped words
  3. You configure the caption style (font, colors, animation, position)
  4. Captions are overlaid on the video
  5. The captioned video is output
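The timing logic behind steps 2–4 can be sketched in plain Python. This is an illustrative model, not the node's implementation: it groups Whisper-style word timestamps into caption groups and finds which group and highlighted word are visible at a given playback time. The `Word` type and function names are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class Word:
    text: str
    start: float  # seconds
    end: float    # seconds

def group_words(words, group_size):
    """Split a timestamped transcript into caption groups of `group_size` words."""
    return [words[i:i + group_size] for i in range(0, len(words), group_size)]

def active_caption(groups, t):
    """Return (visible group, highlighted word) at time t, or (None, None)."""
    for group in groups:
        if group[0].start <= t <= group[-1].end:
            # The highlighted word is whichever word is being spoken at t, if any.
            word = next((w for w in group if w.start <= t <= w.end), None)
            return group, word
    return None, None

# Example: four transcribed words shown two at a time.
words = [
    Word("Hello", 0.0, 0.4),
    Word("world", 0.5, 0.9),
    Word("from", 1.0, 1.2),
    Word("Whisper", 1.3, 1.8),
]
groups = group_words(words, group_size=2)
group, word = active_caption(groups, 0.6)  # "world" is being spoken at 0.6s
```

With a word animation scope, only `word` gets the highlight color; with a group scope, the whole `group` animates in at once.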

Caption style options

| Parameter | Description | Options |
| --- | --- | --- |
| Font Family | Caption font | Any supported font |
| Font Size | Text size | Numeric |
| Color | Main text color | Any color |
| Highlight Color | Active word highlight | Any color |
| Stroke Color | Text outline color | Any color |
| Stroke Width | Outline thickness | Numeric |
| Background Color | Optional text background | Any color |
| Background Radius | Rounded corners for background | Numeric |
| Animation | Word appearance style | pop, fade, slide, bounce, shake, zoom, none |
| Animation Scope | Animate per word or per group | word or group |
| Word Group Size | Words shown at once | Numeric |
| Position | Vertical position (0–100%) | Percent from top |
| Uppercase | Force uppercase | On/Off |
| Shadow Color | Text shadow | Any color |
| Shadow Blur | Shadow spread | Numeric |
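Taken together, the parameters above amount to a style configuration like the following sketch. The key names and sample values are illustrative only; they are not the node's actual schema.

```python
# Hypothetical caption style configuration mirroring the table above.
caption_style = {
    "font_family": "Montserrat",
    "font_size": 48,
    "color": "#FFFFFF",
    "highlight_color": "#FFD500",      # active-word highlight
    "stroke_color": "#000000",
    "stroke_width": 2,
    "background_color": "#00000080",   # optional semi-transparent background
    "background_radius": 8,
    "animation": "pop",                # pop | fade | slide | bounce | shake | zoom | none
    "animation_scope": "word",         # word | group
    "word_group_size": 3,
    "position": 75,                    # percent from the top of the frame
    "uppercase": True,
    "shadow_color": "#000000",
    "shadow_blur": 4,
}
```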

Credit cost

1 credit per transcription (Whisper).

Tips

  • This node is interactive — you configure caption styles during flow execution
  • The pop and bounce animations work well for short-form vertical content
  • Use word animation scope for a karaoke-style effect
  • Use group animation scope for a more natural reading experience
  • Position captions in the lower third (position: 70–80) for standard placement
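The position parameter maps a 0–100% value to a vertical offset from the top of the frame. A minimal sketch of that mapping (the function name is hypothetical):

```python
def caption_y(position_percent, frame_height):
    """Map the node's 0-100% position parameter to a pixel offset from the top."""
    return round(frame_height * position_percent / 100)

# Lower-third placement (position 75) on a 1920px-tall vertical frame:
caption_y(75, 1920)  # -> 1440
```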

Example use cases

  • Adding subtitles to voiceover narration
  • Creating TikTok/Shorts-style word-by-word captions
  • Adding accessibility captions to any video with audio