Skip to content

Transcription

After recording, open the Meeting Page panel and click Transcribe.

Audio is converted to a structured transcript with speaker labels and timestamps, saved as <recording>.transcript.json alongside the audio in your vault.

The route depends on your plan and your Transcription model choice in Settings → AI:

  • Pro All-in-One / Trial — audio routes through Pensum’s worker, which streams it directly to Deepgram and returns the transcript. Nothing is persisted on Pensum’s side beyond a billing metadata row (account, duration, timestamp).
  • Pro BYO — your plugin sends audio directly to the provider that owns the selected model. Pensum’s servers are not involved.
    • Deepgram Nova-3 (default, recommended): speaker diarization native, ~$0.26/hour
    • OpenAI Whisper / gpt-4o-transcribe: ~$0.36/hour
PathWhere audio goesRetention
Managed (All-in-One) under 20 minSynchronous through Pensum’s worker → DeepgramHeld in worker memory for the request only. No disk, no R2 bucket, no D1 row referencing content.
Managed (All-in-One) 20+ minAsync via R2 staging → DeepgramAudio + transcript deleted as soon as the plugin confirms receipt; 1-hour R2 TTL backstop covers disconnects.
BYO (Deepgram / OpenAI)Direct from your device to the provider’s APISubject to that provider’s retention policy; Pensum has no role.
Local storageThe recording (.webm) and transcript (.json) sit in your vaultYou control the Audio retention setting (indefinite, 90 days, or 30 days) plus your own backups.

Public claim: we do not retain your audio, we do not retain your transcript, we log only billing metadata. The full data-flow diagrams are on the architecture page.

Local transcription (e.g. whisper.cpp) is a future direction, not in v1.0 — there’s no “Local transcription URL” setting yet.

After transcription, click Identify Speakers. Pensum sends the first 5 minutes of transcript plus your attendees list to the AI, which maps speaker labels to names based on context clues (introductions, who addresses whom by name).

For this to work, add attendees to the meeting frontmatter:

attendees:
- Jamie
- Sam
- Taylor

High and medium confidence mappings are applied automatically. Low confidence mappings are skipped.

The 14-day trial includes 3 hours of transcription total. Text AI features stay unbounded. The cap exists because transcription is the only AI feature with meaningful per-use cost; 3 hours covers serious evaluation (3–6 meetings) without making trial-rotation worth anyone’s time.

For BYO mode, a key for your chosen transcription provider (Deepgram or OpenAI) configured in Settings → AI. For managed mode, an active Pro All-in-One subscription or trial.