Transcription
After recording, open the Meeting Page panel and click Transcribe.
How it works
Audio is converted to a structured transcript with speaker labels and timestamps, saved as <recording>.transcript.json alongside the audio in your vault.
The route depends on your plan and your Transcription model choice in Settings → AI:
- Pro All-in-One / Trial — audio routes through Pensum’s worker, which streams it directly to Deepgram and returns the transcript. Nothing is persisted on Pensum’s side beyond a billing metadata row (account, duration, timestamp).
- Pro BYO — your plugin sends audio directly to the provider that owns the selected model. Pensum’s servers are not involved.
  - Deepgram Nova-3 (default, recommended): native speaker diarization, ~$0.26/hour
  - OpenAI Whisper / gpt-4o-transcribe: ~$0.36/hour
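
The routing rule and the per-hour prices above can be sketched in a few lines of TypeScript. This is an illustration only: the type and function names (`Plan`, `Model`, `pickRoute`, `estimateProviderCostUsd`) are hypothetical and not part of Pensum's API, and it assumes managed plans always transcribe with Deepgram, as the first bullet suggests.

```typescript
// Sketch only: names are hypothetical, not Pensum's API.
// Plans, providers, and approximate prices are taken from the list above.
type Plan = "all-in-one" | "trial" | "byo";
type Model = "deepgram-nova-3" | "openai-whisper" | "gpt-4o-transcribe";

type Route =
  | { kind: "managed"; provider: "deepgram" } // audio passes through Pensum's worker
  | { kind: "direct"; provider: "deepgram" | "openai" }; // plugin calls the provider itself

function pickRoute(plan: Plan, model: Model): Route {
  if (plan === "byo") {
    // The provider that owns the selected model receives the audio directly.
    const provider = model === "deepgram-nova-3" ? "deepgram" : "openai";
    return { kind: "direct", provider };
  }
  // Assumption: All-in-One and Trial always use Deepgram, whatever model is selected.
  return { kind: "managed", provider: "deepgram" };
}

// Approximate provider prices quoted above, in USD per hour of audio.
const APPROX_USD_PER_HOUR: Record<Model, number> = {
  "deepgram-nova-3": 0.26,
  "openai-whisper": 0.36,
  "gpt-4o-transcribe": 0.36,
};

function estimateProviderCostUsd(model: Model, durationSeconds: number): number {
  return (durationSeconds / 3600) * APPROX_USD_PER_HOUR[model];
}
```

Under these figures, a 45-minute meeting transcribed with Deepgram Nova-3 comes to roughly $0.20 in provider charges (0.75 hours at ~$0.26/hour).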
Privacy and data handling
| Path | Where audio goes | Retention |
|---|---|---|
| Managed (All-in-One) under 20 min | Synchronous through Pensum’s worker → Deepgram | Held in worker memory for the request only. No disk, no R2 bucket, no D1 row referencing content. |
| Managed (All-in-One) 20+ min | Async via R2 staging → Deepgram | Audio + transcript deleted as soon as the plugin confirms receipt; 1-hour R2 TTL backstop covers disconnects. |
| BYO (Deepgram / OpenAI) | Direct from your device to the provider’s API | Subject to that provider’s retention policy; Pensum has no role. |
| Local storage | The recording (.webm) and transcript (.json) sit in your vault | You control the Audio retention setting (indefinite, 90 days, or 30 days) plus your own backups. |
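
As a rough illustration of the two managed rows in the table, the split between the synchronous and asynchronous paths comes down to a duration check. The names below (`managedPathFor`, `ManagedPath`) are hypothetical; only the 20-minute threshold and the 1-hour R2 TTL come from the table, and the sketch assumes a recording of exactly 20 minutes takes the async path, as "20+ min" implies.

```typescript
// Sketch only: names are hypothetical; the thresholds come from the table above.
const ASYNC_THRESHOLD_SECONDS = 20 * 60; // recordings of 20+ minutes go async
const R2_TTL_BACKSTOP_SECONDS = 60 * 60; // staged objects expire after 1 hour at most

type ManagedPath =
  | { mode: "sync"; staged: false } // held in worker memory for the request only
  | { mode: "async"; staged: true; ttlBackstopSeconds: number }; // staged in R2 until receipt is confirmed

function managedPathFor(durationSeconds: number): ManagedPath {
  if (durationSeconds < ASYNC_THRESHOLD_SECONDS) {
    return { mode: "sync", staged: false };
  }
  return { mode: "async", staged: true, ttlBackstopSeconds: R2_TTL_BACKSTOP_SECONDS };
}
```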
Public claim: we do not retain your audio, we do not retain your transcript, we log only billing metadata. The full data-flow diagrams are on the architecture page.
Local transcription (e.g. whisper.cpp) is a future direction, not in v1.0 — there’s no “Local transcription URL” setting yet.
Speaker identification
After transcription, click Identify Speakers. Pensum sends the first 5 minutes of transcript plus your attendees list to the AI, which maps speaker labels to names based on context clues (introductions, who addresses whom by name).
For this to work, add attendees to the meeting frontmatter:
    attendees:
      - Jamie
      - Sam
      - Taylor

High- and medium-confidence mappings are applied automatically. Low-confidence mappings are skipped.
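
The two steps described above, trimming the transcript to its first 5 minutes and applying only the confident mappings, might look roughly like this. The `Segment` and `SpeakerMapping` shapes are assumptions made for the example; the actual transcript schema and the AI response format are not documented here.

```typescript
// Sketch only: these shapes are assumptions, not the real transcript schema.
interface Segment {
  speaker: string; // diarization label, e.g. "Speaker 0"
  start: number; // seconds from the start of the recording
  text: string;
}

type Confidence = "high" | "medium" | "low";

interface SpeakerMapping {
  label: string; // diarization label to replace
  name: string; // attendee name proposed by the AI
  confidence: Confidence;
}

const CONTEXT_WINDOW_SECONDS = 5 * 60;

// Keep only segments that begin within the first five minutes.
function contextWindow(segments: Segment[]): Segment[] {
  return segments.filter((s) => s.start < CONTEXT_WINDOW_SECONDS);
}

// Rename speakers for high- and medium-confidence mappings; skip low-confidence ones.
function applyMappings(segments: Segment[], mappings: SpeakerMapping[]): Segment[] {
  const names = new Map<string, string>();
  for (const m of mappings) {
    if (m.confidence !== "low") names.set(m.label, m.name);
  }
  return segments.map((s) => ({ ...s, speaker: names.get(s.speaker) ?? s.speaker }));
}
```

Note that only the context sent to the AI is trimmed; in this sketch the resulting mappings are applied to every segment in the transcript, including those after the 5-minute mark.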
Trial cap
The 14-day trial includes 3 hours of transcription total. Text AI features stay unbounded. The cap exists because transcription is the only AI feature with meaningful per-use cost; 3 hours covers serious evaluation (3–6 meetings) without making trial-rotation worth anyone’s time.
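
For a sense of how the cap plays out, here is a small sketch of the allowance arithmetic. The function names are hypothetical, and the sketch assumes a recording longer than the remaining allowance is refused outright rather than partially transcribed; the text above does not say which happens.

```typescript
// Sketch only: names are hypothetical; the 3-hour cap comes from the text above.
const TRIAL_CAP_SECONDS = 3 * 60 * 60;

// Transcription time left on the trial, never negative.
function trialSecondsRemaining(usedSeconds: number): number {
  return Math.max(0, TRIAL_CAP_SECONDS - usedSeconds);
}

// Assumption: a recording must fit entirely within the remaining allowance.
function canTranscribeOnTrial(usedSeconds: number, recordingSeconds: number): boolean {
  return recordingSeconds <= trialSecondsRemaining(usedSeconds);
}
```

The "3–6 meetings" estimate corresponds to meetings of 30 to 60 minutes each: 180 minutes divided by 6 and by 3 respectively.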
Requirements
For BYO mode, a key for your chosen transcription provider (Deepgram or OpenAI) configured in Settings → AI. For managed mode, an active Pro All-in-One subscription or trial.