Generate subtitles from any video with AI — clean, properly timed captions downloadable as SRT or VTT.
This creates caption files, not a captioned video.
You'll get downloadable SRT and VTT files to load into your editor or player. Just need the words? Use Video Transcription. Working from audio instead of video? Try Audio Transcription.
Upload a video and the generator extracts its audio, runs it through an AI speech-to-text model, and gets back every spoken word with a precise start and end time. Those word timings are the raw material for captions — but a wall of timed words is not the same as readable subtitles. The real work is segmentation: deciding where one caption ends and the next begins so viewers can actually read along.
This tool shapes every cue to the conventions professional captioners follow, including a maximum line length and no more than two lines on screen at once.
The result is captions that read the way a human would have timed them, not naive ten-word chunks.
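To make the segmentation step concrete, here is a minimal Python sketch of how timed words might be grouped into cues. It is not the tool's actual algorithm: the `Word` and `Cue` types, the `segment_words` function, and the thresholds (`max_chars`, `max_gap`) are all hypothetical names and values chosen for illustration. It starts a new cue at a sentence end, after a long pause, or when the text would grow too long.

```python
from dataclasses import dataclass


@dataclass
class Word:
    text: str
    start: float  # seconds
    end: float    # seconds


@dataclass
class Cue:
    start: float
    end: float
    text: str


def _flush(words):
    """Build one cue spanning the first to the last word of the group."""
    return Cue(words[0].start, words[-1].end, " ".join(w.text for w in words))


def segment_words(words, max_chars=84, max_gap=0.8):
    """Group timed words into cues (illustrative thresholds, not the tool's).

    A new cue starts when the previous word ended a sentence, when the
    pause before the next word exceeds max_gap seconds, or when adding
    the word would push the cue text past max_chars characters.
    """
    cues = []
    current = []
    for word in words:
        if current:
            candidate = " ".join(w.text for w in current) + " " + word.text
            pause = word.start - current[-1].end
            sentence_end = current[-1].text.endswith((".", "?", "!"))
            if len(candidate) > max_chars or pause > max_gap or sentence_end:
                cues.append(_flush(current))
                current = []
        current.append(word)
    if current:
        cues.append(_flush(current))
    return cues
```

A production segmenter would likely also weigh reading speed and minimum cue duration; this sketch only shows why a break rule matters more than a fixed word count.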
You get both SRT and VTT files. SRT (SubRip) is the universal subtitle format: import it into Premiere Pro, DaVinci Resolve, Final Cut or CapCut, and upload it to YouTube, Facebook and most social platforms. VTT (WebVTT) is the web-native format used by HTML5 <track> elements and many streaming players, and it supports cue styling and positioning. When in doubt, use SRT for editing and uploads, and VTT for embedding captions on a website.
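The two formats are structurally close, which a short Python sketch can show. The function names here (`format_timestamp`, `to_srt`, `to_vtt`) are hypothetical and cues are assumed to be plain `(start_seconds, end_seconds, text)` tuples. The visible differences are that SRT numbers each cue and uses a comma before the milliseconds, while WebVTT begins with a `WEBVTT` header line and uses a period.

```python
def format_timestamp(seconds, separator):
    """Format seconds as HH:MM:SS<sep>mmm (',' for SRT, '.' for WebVTT)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}{separator}{ms:03d}"


def to_srt(cues):
    """Serialize (start, end, text) tuples as SubRip text."""
    blocks = []
    for index, (start, end, text) in enumerate(cues, start=1):
        timing = f"{format_timestamp(start, ',')} --> {format_timestamp(end, ',')}"
        blocks.append(f"{index}\n{timing}\n{text}\n")
    return "\n".join(blocks)


def to_vtt(cues):
    """Serialize (start, end, text) tuples as WebVTT text."""
    blocks = ["WEBVTT\n"]
    for start, end, text in cues:
        timing = f"{format_timestamp(start, '.')} --> {format_timestamp(end, '.')}"
        blocks.append(f"{timing}\n{text}\n")
    return "\n".join(blocks)
```

Calling `to_srt([(0.0, 1.5, "Hello there.")])` yields a numbered block with `00:00:00,000 --> 00:00:01,500`, while `to_vtt` on the same input yields the header followed by `00:00:00.000 --> 00:00:01.500`.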
The Standard (42) preset follows Netflix-style timed-text guidelines and suits most online video. The Broadcast (32) preset matches the tighter CEA-608 broadcast caption limit — pick it when captions must fit a narrower safe area or meet broadcast specs. Switching presets re-segments the captions so no line exceeds your chosen limit.
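As a rough illustration of what a preset switch involves, the Python sketch below wraps caption text to a per-line limit and groups the lines into cues of at most two lines each. The `PRESETS` mapping and `wrap_cue` function are hypothetical names, and the sketch uses the standard library's `textwrap.wrap` rather than whatever the tool does internally. It also ignores timing, which a real re-segmenter would need to redistribute across the resulting cues.

```python
import textwrap

# Per-line character limits named after the presets described above.
PRESETS = {"standard": 42, "broadcast": 32}


def wrap_cue(text, preset="standard", max_lines=2):
    """Wrap text to the preset's line limit and group lines into cues.

    Returns a list of cues, each a list of at most max_lines strings.
    textwrap.wrap breaks on whitespace and, by default, splits any single
    word longer than the limit, so no returned line exceeds it.
    """
    limit = PRESETS[preset]
    lines = textwrap.wrap(text, width=limit)
    return [lines[i:i + max_lines] for i in range(0, len(lines), max_lines)]
```

With a 91-character sentence, both presets produce more than one cue, and the broadcast preset never yields fewer lines than the standard one because each line holds less text.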
Without an account you can caption videos up to 5 minutes long. A free account raises that to 30 minutes per video — enough for full interviews, lectures and long-form social content.
Upload your MP4, MOV or M4V video, choose a line-length preset, and click generate. The tool transcribes the speech with AI, segments it into properly timed captions, and gives you SRT and VTT files to download — no manual transcribing or timing required.
Use SRT for video editors (Premiere, Resolve, Final Cut, CapCut) and for uploading captions to YouTube and social platforms. Use VTT for embedding captions in HTML5 video on a website. You get both formats, so you are covered either way.
AI speech-to-text is highly accurate on clear, well-recorded speech, but it is not perfect — expect to proofread names, technical terms and overlapping speech. Because the captions are clean, properly timed SRT/VTT files, any quick edits are easy to make in your editor or a subtitle tool before you publish.
A common guideline is up to 42 characters per line with no more than two lines on screen at once (the Netflix-style standard). Broadcast captions use a tighter 32-character limit. This tool offers both presets and guarantees no line exceeds the limit you choose.
No, this tool does not burn captions into the video. It produces caption files (SRT and VTT) that you load into a player or editor. Burning captions permanently into the video picture is a separate job, and a dedicated burn-in tool for that is coming soon.
Yes. Without an account you can generate subtitles for videos up to 5 minutes long. Sign up for a free account to caption videos up to 30 minutes each.
Just need the words, not timed captions? Get a plain-text transcript of your video.
QA your caption file against Netflix/BBC limits — characters per line and reading speed.
Validate and auto-fix subtitle numbering, timing and overlaps in any SRT file.
VoiceDeck adds AI-powered audio & video review and delivery for your whole team — so every file ships in spec, automatically.