Question 1

How does the subtitle generator work?

Accepted Answer

We use OpenAI Whisper (Base model) compiled to WebAssembly, running entirely in your browser. Your audio is transcribed locally — nothing is uploaded to any server.

Question 2

What audio/video formats are supported?

Accepted Answer

MP3, MP4, WAV, WebM, OGG, M4A, FLAC, and most other common audio/video formats. For video files, the tool extracts the audio track automatically.

Question 3

How many languages does it support?

Accepted Answer

Whisper supports 99 languages including English, Spanish, French, German, Chinese, Japanese, Korean, Arabic, Hindi, and more. Select your language or use auto-detect.

Question 4

What subtitle formats can I download?

Accepted Answer

SRT (most common, works everywhere) and VTT (WebVTT, for web video players). Both include timestamps and segmented text.
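
To make the difference between the two formats concrete, here is a minimal sketch of how timed segments could be serialized to SRT and VTT. The `Cue` shape and the function names are assumptions made for illustration, not the tool's actual code. The format details are standard: SRT numbers each cue and uses a comma before the milliseconds, while WebVTT starts with a `WEBVTT` header line and uses a period.

```typescript
// Hypothetical cue shape; the tool's internal representation is not documented.
interface Cue {
  startMs: number; // cue start, in milliseconds
  endMs: number;   // cue end, in milliseconds
  text: string;
}

// Left-pad a non-negative integer with zeros to the given width.
function pad(value: number, width: number): string {
  let s = String(value);
  while (s.length < width) {
    s = "0" + s;
  }
  return s;
}

// Format milliseconds as HH:MM:SS<separator>mmm.
// SRT uses "," as the separator and WebVTT uses ".".
function formatTimestamp(ms: number, separator: string): string {
  const total = Math.max(0, Math.round(ms));
  const hours = Math.floor(total / 3600000);
  const minutes = Math.floor((total % 3600000) / 60000);
  const seconds = Math.floor((total % 60000) / 1000);
  const millis = total % 1000;
  return pad(hours, 2) + ":" + pad(minutes, 2) + ":" + pad(seconds, 2) + separator + pad(millis, 3);
}

// Serialize cues as SRT: a 1-based index, a timing line, then the text,
// with a blank line between cues.
function toSrt(cues: Cue[]): string {
  const blocks: string[] = [];
  cues.forEach((cue: Cue, index: number): void => {
    const timing = formatTimestamp(cue.startMs, ",") + " --> " + formatTimestamp(cue.endMs, ",");
    blocks.push(String(index + 1) + "\n" + timing + "\n" + cue.text);
  });
  return blocks.join("\n\n") + "\n";
}

// Serialize cues as WebVTT: a "WEBVTT" header, then one block per cue.
// Cue identifiers are optional in WebVTT and are omitted here.
function toVtt(cues: Cue[]): string {
  const blocks: string[] = ["WEBVTT"];
  cues.forEach((cue: Cue): void => {
    const timing = formatTimestamp(cue.startMs, ".") + " --> " + formatTimestamp(cue.endMs, ".");
    blocks.push(timing + "\n" + cue.text);
  });
  return blocks.join("\n\n") + "\n";
}
```

Calling `toSrt` and `toVtt` on the same cue list gives files that differ only in the header, the cue numbering, and the millisecond separator. That is why the two downloads carry the same timestamps and text.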

Question 5

How accurate are the subtitles?

Accepted Answer

Whisper Base provides good accuracy for clear speech in supported languages. For best results, use audio with minimal background noise. Professional-grade accuracy requires the Large model (not available in-browser).

Question 6

Why does the first transcription take longer?

Accepted Answer

The Whisper Base model (~57 MB) downloads on first use. After that it is cached in your browser, so subsequent transcriptions start immediately.

Question 7

Is my audio uploaded to a server?

Accepted Answer

No. Whisper runs entirely in your browser via WebAssembly. Your audio never leaves your device. This is our key differentiator — 100% private transcription.

Question 8

How long does transcription take?

Accepted Answer

Roughly 1 to 3 times the length of the audio on modern devices, so a 5-minute clip takes about 5 to 15 minutes. Desktop browsers are significantly faster than mobile browsers.
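
The rule of thumb above is simple multiplication, sketched below. The function and constant names are hypothetical, and the factors come from the estimate in this answer rather than from measurements on your device.

```typescript
// Estimated processing-time range for a clip, in seconds.
interface TimeEstimate {
  minSeconds: number;
  maxSeconds: number;
}

// Assumed factors, taken from the "1 to 3 times the audio length" rule of thumb.
const FASTEST_FACTOR = 1;
const SLOWEST_FACTOR = 3;

// Multiply the clip length by the assumed best-case and worst-case factors.
function estimateTranscriptionTime(audioSeconds: number): TimeEstimate {
  if (!isFinite(audioSeconds) || audioSeconds < 0) {
    throw new Error("audioSeconds must be a non-negative, finite number");
  }
  return {
    minSeconds: audioSeconds * FASTEST_FACTOR,
    maxSeconds: audioSeconds * SLOWEST_FACTOR,
  };
}

// A 5-minute clip: between 5 and 15 minutes of processing.
const fiveMinutes: TimeEstimate = estimateTranscriptionTime(5 * 60);
console.log(fiveMinutes.minSeconds / 60, fiveMinutes.maxSeconds / 60); // prints "5 15"
```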

Question 9

Can I edit the subtitles before downloading?

Accepted Answer

Yes. The transcribed text appears in an editable area. Fix any errors, adjust timing, then download.
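
A common timing fix is shifting every subtitle by the same offset, for example when the captions run consistently late. The sketch below shows that adjustment. The `TimedCue` shape and `shiftCues` function are illustrative assumptions and do not describe how the editor is implemented.

```typescript
// Hypothetical cue shape used only for this illustration.
interface TimedCue {
  startMs: number; // cue start, in milliseconds
  endMs: number;   // cue end, in milliseconds
  text: string;
}

// Shift every cue by offsetMs (negative values move cues earlier).
// Start times are clamped at zero, end times never fall before their start,
// and the input array is left unmodified.
function shiftCues(cues: TimedCue[], offsetMs: number): TimedCue[] {
  return cues.map((cue: TimedCue): TimedCue => {
    const start = Math.max(0, cue.startMs + offsetMs);
    const end = Math.max(start, cue.endMs + offsetMs);
    return { startMs: start, endMs: end, text: cue.text };
  });
}
```

For example, `shiftCues(cues, -500)` would move every subtitle half a second earlier.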

Question 10

Does it work on mobile?

Accepted Answer

Yes, but transcription is CPU-intensive. Short clips (under 2 minutes) work well on phones. For longer audio, use a desktop browser.

Question 11

What is the file size limit?

Accepted Answer

It depends on your device's memory. Audio is processed in chunks, and most devices handle files up to about 100 MB.
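
As a rough illustration of chunked processing, the sketch below splits decoded audio into fixed-length ranges. Whisper models take 16 kHz audio in 30-second windows; that the tool cuts its chunks at exactly those boundaries is an assumption, and `planChunks` is a hypothetical helper rather than part of the tool.

```typescript
const SAMPLE_RATE_HZ = 16000; // Whisper models expect 16 kHz mono audio
const CHUNK_SECONDS = 30;     // Whisper's window length; assumed here to be the chunk size

// A half-open range of sample indices: [startSample, endSample).
interface ChunkRange {
  startSample: number;
  endSample: number;
}

// Split totalSamples into consecutive fixed-size ranges.
// The last range may be shorter than the others.
function planChunks(
  totalSamples: number,
  chunkSeconds: number = CHUNK_SECONDS,
  sampleRate: number = SAMPLE_RATE_HZ
): ChunkRange[] {
  const chunkSize = Math.floor(chunkSeconds * sampleRate);
  if (chunkSize <= 0 || totalSamples < 0) {
    throw new Error("chunk size must be positive and totalSamples non-negative");
  }
  const ranges: ChunkRange[] = [];
  for (let start = 0; start < totalSamples; start += chunkSize) {
    ranges.push({
      startSample: start,
      endSample: Math.min(start + chunkSize, totalSamples),
    });
  }
  return ranges;
}
```

Under these assumptions, 70 seconds of audio yields three ranges: two full 30-second chunks and a final 10-second chunk. One full chunk is 480,000 samples, or about 1.9 MB as 32-bit floats.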

Question 12

Can I transcribe a YouTube video?

Accepted Answer

Not directly. Download the video first, then load the file into the tool. Alternatively, use our YouTube Summarizer for text summaries.

Question 13

How does this compare to Otter.ai or Rev?

Accepted Answer

Otter.ai and Rev use cloud-based models, which gives them higher accuracy. Our advantages: 100% free, 100% private (no upload), and unlimited use. Accuracy is good for clear speech but not broadcast-grade.

Question 14

Is it really free?

Accepted Answer

Yes. Completely free, no limits, no signup. Runs locally in your browser.

Question 15

Can I pay with cryptocurrency?

Accepted Answer

Video tools are free, so no payment is needed. For our AI writing tools, we accept USDT, USDC, BTC, and ETH. Plans start at $9.99/month.

Subtitle Generator

How It Works

Upload audio or video

Whisper transcribes locally

Download SRT or VTT
