Docs for the cute little AI listener

Lumi Documentation

Lumi is an AI listening companion for macOS. It captures audio, transcribes in real time, and gives you AI-powered answers using your uploaded documents.

Download Lumi Back to landing page

One-time purchase: $9.99 — no subscription. You bring your own OpenAI API key.

Installation

1 Download

Download Lumi from our store. You'll receive a .zip file.

2 Unzip and move to Applications

Double-click the zip to extract, then drag Lumi.app to your Applications folder.

3 Install

Double-click Install Lumi.command — it copies the app to Applications, removes the macOS quarantine flag, and launches Lumi automatically. No right-click tricks needed.

If macOS asks "Are you sure you want to open this?" for the installer script, click Open.

4 Grant permissions

Lumi needs Screen Recording permission to capture system audio. macOS will prompt you. After granting, restart the app (required by macOS on first grant).

First Launch

The onboarding wizard guides you through setup
Enter your OpenAI API key
Grant Screen Recording permission
Press Cmd+Shift+O to show the overlay — you're ready!

API Key Setup

Lumi uses OpenAI for transcription and AI responses. You need your own key.

Go to platform.openai.com/api-keys
Click "Create new secret key"
Copy the key and paste it in Lumi Settings → AI tab

Typical cost: $0.10–$0.50 per session. Much cheaper than a $96/month subscription.

Start Listening

Click Start Listening or press Cmd+Shift+Space to begin capturing audio from your video call, YouTube, or any app playing sound.

Transcript appears in real-time in the left panel
Click Ask AI anytime to get an answer
Click End Session when done — see your performance metrics

The audio pipeline stays active between questions. Ask AI multiple times — each answer builds on the full conversation.

Ask AI

Two ways to get answers:

Click Ask AI — uses the selected prompt + current transcript + your playbook
Type a question — "summarize what was discussed", "explain that concept", "what are the action items?"

Lumi remembers all Q&As in the session. Later answers reference earlier context automatically.

Knowledge Base (Playbooks)

Upload your documents so the AI answers with your specific details — not generic advice.

Setting up a playbook:

Settings → Playbooks tab → New Playbook
Click Add File — select multiple files at once (PDF, TXT, MD, RTF). Image-based/scanned PDFs are supported via OCR.
Click Set Active

What to upload:

Document	Why
Resume	AI introduces you with your actual experience
Job Description	AI tailors answers to the role
Past Q&As	AI reuses your prepared answers
Technical Notes	AI references your specific tech choices

Lumi intelligently finds the most relevant parts of your documents for each question — even if you upload 50+ pages of notes.

Custom Prompts

The prompt bar at the top lets you switch between different AI behaviors instantly.

Built-in prompts:

Interview — auto-detects question type (behavioral, coding, system design)
Meeting — summaries with action items
Coding — code review and solutions
General — adaptive format
Plus: STAR format, Code solution, System design, Explain, Summarize

Create your own:

Settings → Prompts tab → Add
Name it and write the instructions
Click the pin icon to show it in the top bar

Screenshot Queue

Capture up to 5 screenshots that are sent to AI together in one call.

Press Cmd+Shift+S (or click Capture) — screenshot is added to the queue
Thumbnails appear in the capture panel with a count (e.g., "Screenshots 3/5")
Capture more screenshots to build context (e.g., multiple slides, code files, whiteboard)
Press Cmd+Shift+Return (or click Ask AI) — all images are sent to GPT-4o vision

Managing screenshots:

Remove one — click the X on any thumbnail
Undo/redo — arrow buttons in the screenshot bar
Clear all — click the X button on the right
Max 5 — oldest screenshot is dropped when you exceed the limit

Works for coding questions, whiteboard problems, slides, or any visual content.

Session Management

Lumi automatically saves your work so you never lose data — even if the app crashes.

How it works:

Auto-save — sessions flush to disk every 30 seconds, or when memory exceeds 500KB
Session picker on launch — see your recent sessions and resume any one, or start fresh
Resume — pick up where you left off: transcript, AI conversation history, and playbook context are fully restored

Managing sessions:

Go to Settings → Sessions to:

Resume — click to restore a previous session
Rename — click the pencil icon to give sessions meaningful names
Export — save as JSON for backup or sharing
Import — load JSON files (supports multi-select)
Delete — remove sessions you no longer need

No data loss. Write-behind caching means your session is saved even if you forget to click End or the app quits unexpectedly.

Session Export

Press Cmd+Shift+E to save the entire session as a Markdown file:

Full transcript with speaker labels
All AI responses
Screenshot analyses
Session metadata (duration, mode, playbook)

Performance Metrics

Click End Session to see how you did:

Talk ratio — your speaking time vs others
Filler words — "um", "uh", "you know" count
Response pace — how quickly you answered
Questions answered — how many you responded to

Track your improvement across multiple sessions.

Stealth Mode

By default, Lumi is completely invisible to screen sharing — Teams, Meet, Zoom cannot see it.

The eye icon in the control bar toggles visibility:

Eye-slash (grey) — invisible to screen sharing (default)
Eye (red) — visible, for recording demos

Stealth toggle applies to all windows — the overlay, region selector, and performance view are all hidden from screen share simultaneously.

Updates & Licensing

Lumi checks for updates automatically on launch. When a new version is available, a banner appears at the top of the overlay.

License key — enter your Gumroad license key in Settings → AI to unlock
Signed updates — update files are cryptographically signed (Ed25519) to prevent tampering
Auto update check — compares your version against the latest release

How We Protect Your Data

Your documents and credentials are protected with industry-standard encryption. Files stored on disk are unreadable without your machine.

API keys — encrypted at rest, tied to your machine's hardware
Playbook documents — encrypted at rest, stored as binary files
No cloud storage — everything stays in ~/Library/Application Support/Lumi/
Delete anytime — remove the folder and all data is gone

Your resume and notes are never readable as plain text on disk. Even if someone copies your Lumi folder to another machine, they cannot decrypt the files.

Privacy

No analytics, no telemetry, no tracking
No account required — no email, no signup
Your documents never leave your machine
Only AI queries are sent to OpenAI — review OpenAI's data policy
Audio is processed by OpenAI Whisper API, then discarded — not stored by OpenAI for API users

Testing & Verification

Lumi is backed by 328 automated tests across 38 test suites, covering 69% of source files.

What we test:

AI uses your documents — we upload a test resume and verify the AI response contains exact details like company names, project metrics, and tech choices
Conversation context — a 7-round simulated interview verifies the AI references earlier answers when asked follow-up questions
Transcription quality — real audio files are transcribed and checked for keyword accuracy
Anti-hallucination (7 layers) — silence produces zero output; garbage text in wrong languages is blocked; music lyrics misinterpreted as speech are suppressed via cross-chunk dedup and loop detection
Encryption — saved files are verified to not contain plaintext; decryption roundtrips are confirmed
Session cache — write-behind cache verified: files written to disk, data survives simulated crashes, save/load benchmarks under 50ms per flush
Image PDF OCR — scanned PDFs are extracted via Apple Vision and indexed for search
Restart reliability — the app can be stopped and restarted multiple times without losing functionality

Verified claims:

Claim	How we verify
AI uses your resume to answer	Upload resume with unique details → ask "introduce yourself" → verify response contains those exact details
Conversation context works	7-round interview → each round verified that AI references previous answers
Documents are encrypted	Save file → read raw bytes → confirm plaintext not visible → decrypt → confirm content matches
No hallucinations on silence	Feed 3 seconds of silence → verify zero text output
Music doesn't generate garbage	Repeated phrases across chunks are deduplicated; 3+ identical outputs suppressed as music loop
Sessions survive crashes	Save session → create new store instance → load and verify all data intact
Write-behind cache works	100 flush cycles benchmarked at < 50ms avg; 500KB threshold triggers auto-flush

Use Case: Interview Preparation

Before the interview

Create a playbook with your resume, job description, and prepared Q&As
Set it as active
Select the Interview prompt

During the interview

Click Start Listening before the call
The interviewer's voice is transcribed automatically
After each question, click Ask AI — the response draws from your resume and previous answers
If the topic changes completely, click Clear to start fresh
For different question types, switch prompts (e.g., STAR format, Code solution)

After the interview

Click End Session to see performance metrics
Export the session for review

Tip: Upload past interview Q&As as documents. When similar questions come up, the AI reuses your prepared answers with your specific details.

Use Case: Meeting Notes

Select the Meeting prompt
Click Start Listening when the meeting begins
Click Ask AI at any point for a running summary with action items
Type: "What did the team decide about the deadline?"
Export the session → share structured notes with your team

Use Case: Lecture Companion

Watching a YouTube tutorial, online course, or recorded lecture?

Select Explain or General prompt
Click Start Listening
Type questions as the lecture plays:
- "Explain that in simpler terms"
- "Summarize the last 5 minutes"
- "Give me a code example of that concept"
Export as study notes when done

Use Case: Podcast Summarizer

Select the Summarize prompt
Click Start Listening while the podcast plays
Click Ask AI periodically for running summaries
Type: "What are the 3 main takeaways so far?"
Export as Markdown → structured podcast notes

Works with any audio source — YouTube, Spotify, Apple Podcasts, Zoom, or any app that plays through your system audio.