Convert calls and voice recordings to text for instant meeting notes and searchable records
Cloud Transcription uses OpenAI's Whisper API to convert audio from meetings, calls, and voice memos into accurate text transcripts. Whether you're recording a Zoom call or dictating notes, this skill delivers high-quality transcriptions in over 50 languages.
Record and transcribe team meetings for instant searchable notes
Transcribe customer interviews and research calls
Dictate ideas and get them transcribed on the spot
Create show notes and transcripts for audio content
Rough incident notes → clean technical handoff in minutes
Server acting weird at 2am. Sent a rough description. Clean technical handoff in minutes.
Say it while driving, task structured and running by arrival
Say it while driving. It structures the task, confirms back, starts executing.
Log your meals by voice, get nutrition fixes for tomorrow
Voice message about what I ate. Back comes 1-3 things to fix tomorrow. Tracks patterns.
Forward raw chaos, get a prioritized action list
Voice notes, screenshots, texts. Forward raw. Back comes a prioritized action list.
Dump voice notes at night, wake up to a clean morning brief
Dump voice notes about yesterday. Wake up to a clean brief: what matters, what's first.
Voice recap after the call: tasks with owners and deadlines
Quick voice recap after the call. Back comes: who does what, by when, success metrics.
Send an audio file or start a live recording through your preferred channel.
Supported formats: MP3, M4A, WAV, WEBM, MP4
The assistant sends audio to Whisper API and retrieves the transcript automatically.
Get clean text output with timestamps and optional speaker labels.
Save to Notion, send via email, or export to your preferred format.
Transcribe this meeting recording [audio file]
[00:00] John: Let's start with Q4 goals. [00:15] Sarah: We need to focus on retention...
Convert my voice memo to text
Transcribed: "Remember to follow up with the client about the proposal. Send revised pricing by Thursday..."
Transcribe the last 10 minutes of this call
Transcript ready (10:34 duration): Discussed action items, assigned owners, set next meeting for Dec 15.
| Parameter | Limit | Notes |
|---|---|---|
| File size | 25 MB | compress large files before upload |
| Duration | 3 hours | split longer recordings |
| Daily requests | 500 files | unlimited with subscription |
| Concurrent jobs | 5 at once | queues additional files |
| Model | Speed | Accuracy | Best For |
|---|---|---|---|
| Whisper Large | Medium | Highest | critical transcripts, multi-speaker |
| Whisper Medium | Fast | High | general meetings and calls |
| Whisper Small | Very Fast | Good | quick voice memos |
A 10-minute meeting costs ~$0.07 in transcription. Longer recordings cost proportionally more.
LLM processing cost is additional and depends on conversation complexity. BYOK users pay LLM costs directly to their provider.
* Prices include platform service fee. Actual costs may vary.
| Error | Meaning | Action |
|---|---|---|
| FILE_TOO_LARGE | Exceeds 25MB limit | Compress audio or split file |
| UNSUPPORTED_FORMAT | Audio format not recognized | Convert to MP3, M4A, or WAV |
| API_TIMEOUT | Processing took too long | Retry with shorter clip |
| LOW_QUALITY | Poor transcription result | Re-record with better audio quality |