Question 1

How do I transcribe a podcast?

Accepted Answer

Paste the podcast's RSS feed URL and pick an episode, or paste a direct audio link (.mp3, .m4a, and similar). The tool streams the audio through a proxy to get past CORS, then transcribes it in your browser with an on-device Whisper AI model. You get plain text, timestamps, and subtitles (SRT/VTT).

Question 2

Where do I find a podcast's RSS feed URL?

Accepted Answer

Most podcast hosts publish an RSS feed link on the show's page, and many directories expose it too. Paste that feed URL and the tool lists every episode with a downloadable audio file. If you already have a direct link to the episode's MP3, you can paste that instead and skip the episode list.

Question 3

Is my audio uploaded to a server?

Accepted Answer

The transcription itself runs entirely in your browser — the audio is never stored on our servers. The only server involvement is a thin proxy that streams the episode file to your browser, which is required because podcast CDNs don't allow cross-origin browser downloads. Nothing is persisted.

Question 4

Can it transcribe Spotify or Apple Podcasts links?

Accepted Answer

Not directly. Spotify does not expose a downloadable audio file, so Spotify links can't be transcribed. Apple Podcasts page links aren't RSS feeds either. Use the show's RSS feed URL or a direct audio URL instead — those work for almost every public podcast.

Question 5

Which model should I pick?

Accepted Answer

Balanced (Whisper base, multilingual) is the default and handles Japanese and many other languages. Accurate (Whisper large-v3-turbo) gives the best quality including Japanese, at the cost of a larger first download — use it on a WebGPU browser. Fast (Whisper tiny.en) is the smallest and quickest but English only.

Question 6

How long does a full episode take?

Accepted Answer

Long episodes are transcribed in ~2-minute parts so the transcript streams in as it goes. Total time depends on episode length, the model, and your hardware — a WebGPU browser (recent Chrome or Edge) is much faster than the CPU fallback. Very long episodes are memory-heavy, so a desktop browser is recommended over mobile.

Question 7

Can it output subtitles (SRT / VTT)?

Accepted Answer

Yes. After transcribing you can switch between plain text, timestamped lines, SRT, and WebVTT, and copy or download any of them. SRT and VTT load directly into video editors and players as subtitle tracks.

Question 8

Is it free, and can I use the results commercially?

Accepted Answer

Yes. The tool is free and the transcription runs locally. Whisper is released by OpenAI under the MIT license and Transformers.js under Apache-2.0, both of which permit commercial use. Make sure you have the right to transcribe the audio you use.

Input	What happens	Best for
RSS feed URL	Lists every episode with downloadable audio; you pick one	Browsing a show and choosing an episode
Direct audio URL (`.mp3`, `.m4a`, …)	Skips the list and transcribes that file immediately	You already have the episode's audio link

Model	Languages	First download	Best for
Balanced (Whisper base)	Multilingual, incl. Japanese	~200 MB	The default for most shows
Accurate (Whisper large-v3-turbo)	Multilingual, incl. Japanese	~760 MB	Highest quality; use a WebGPU browser
Fast (Whisper tiny.en)	English only	~120 MB	Quick drafts of English shows

Transcribe a podcast to text

Transcribe a podcast to text

How it works

Steps

RSS feed vs. direct audio URL

Which model should I choose?

Example

Privacy

FAQ

Get in touch

Thanks for reaching out

What we can help with

Talk to us online