Extract clean plain text from an SRT or VTT subtitle file — strip timestamps and numbers, merge into paragraphs, and download a .txt.
AI-powered audio review & delivery for voice production teams
Paste or load an SRT/VTT file to see the plain-text transcript here.
An SRT (SubRip) subtitle file is built from numbered cues. Each cue has a sequential index, a HH:MM:SS,mmm --> HH:MM:SS,mmm timing line and one or two lines of caption text, with a blank line between cues. That structure is perfect for a video player and useless when you actually want the words. This converter reads the file, throws away the indices and timestamps, and hands you back just the text — the fast way to get a transcript out of an srt to txt file you already have.
There are two ways to lay out the text, and the right one depends on what you are doing next:
Timestamps and cue numbers are always stripped. Two optional clean-ups are on by default: remove speaker tags drops Name: prefixes and WebVTT <v Speaker> voice tags, and remove duplicate consecutive lines deletes the repeated lines that automatic and rolling captions tend to leave behind. Turn either off if you would rather keep that information.
A clean transcript is the starting point for a lot of repurposing work: paste it into your podcast or video show notes, expand it into a blog post or article, drop it into a translation tool to localize your content, or feed it to a summarizer. Because the export is UTF-8, accented characters, non-Latin scripts and emoji all survive the round trip.
| Step | What happens |
|---|---|
| Paste or upload | Your .srt or .vtt is read entirely in the browser |
| Parse | Cues are split out; headers, NOTE blocks and cue settings are ignored |
| Clean | Indices and timestamps are removed; speaker tags and duplicate lines optional |
| Export | Copy the text or download a plain transcript.txt |
Paste the contents of your .srt file into the box above, or load the file with the picker. The converter reads every cue and shows just the spoken text on the right — no indices, no timestamps. Then copy the result or download it as a plain .txt. Nothing is uploaded; the whole srt to text conversion happens in your browser.
Timestamps are removed automatically. A subtitle cue puts the HH:MM:SS,mmm --> HH:MM:SS,mmm timing line and a cue number around the text; this tool detects that structure and keeps only the words, so you never have to delete the timecodes by hand.
Yes. Switch the layout to Merged paragraphs and the cues are joined into flowing text instead of one line each. Each caption's internal line breaks are collapsed to a single space, and a new paragraph begins whenever there is a gap of two seconds or more between cues — a good approximation of where one thought ends and the next begins.
An SRT is a timed subtitle file: every line of dialogue is wrapped in a cue number and a start/end timestamp so a player can show it at the right moment. A transcript is just the words, with no timing. This tool converts the first into the second by stripping the indices and timecodes and, optionally, the speaker labels.
Yes. WebVTT (.vtt) files are accepted transparently — the converter understands the WEBVTT header, NOTE blocks, dot-millisecond timestamps and cue settings, and skips all of them so you get the same clean text you would from an .srt. If instead you need to convert between the two subtitle formats, use the dedicated SRT to VTT converter.
Need the reverse? Turn a plain text script back into a timed .srt subtitle file.
Convert between subtitle formats — make a WebVTT file from your SRT for HTML5 video.
No subtitle file yet? Transcribe your audio to text first, then clean it up here.
VoiceDeck adds AI-powered audio & video review and delivery for your whole team — so every file ships in spec, automatically.