Lumi Documentation
Lumi is an AI listening companion for macOS. It captures audio, transcribes in real time, and gives you AI-powered answers using your uploaded documents.
Installation
1 Download
Download Lumi from our store. You'll receive a .zip file.
2 Unzip and move to Applications
Double-click the zip to extract, then drag Lumi.app to your Applications folder.
3 Install
Double-click Install Lumi.command — it copies the app to Applications, removes the macOS quarantine flag, and launches Lumi automatically. No right-click tricks needed.
If macOS asks "Are you sure you want to open this?" for the installer script, click Open.
4 Grant permissions
Lumi needs Screen Recording permission to capture system audio. macOS will prompt you. After granting, restart the app (required by macOS on first grant).
First Launch
- The onboarding wizard guides you through setup
- Enter your OpenAI API key
- Grant Screen Recording permission
- Press
Cmd+Shift+Oto show the overlay — you're ready!
API Key Setup
Lumi uses OpenAI for transcription and AI responses. You need your own key.
- Go to platform.openai.com/api-keys
- Click "Create new secret key"
- Copy the key and paste it in Lumi Settings → AI tab
Start Listening
Click Start Listening or press Cmd+Shift+Space to begin capturing audio from your video call, YouTube, or any app playing sound.
- Transcript appears in real-time in the left panel
- Click Ask AI anytime to get an answer
- Click End Session when done — see your performance metrics
Ask AI
Two ways to get answers:
- Click Ask AI — uses the selected prompt + current transcript + your playbook
- Type a question — "summarize what was discussed", "explain that concept", "what are the action items?"
Lumi remembers all Q&As in the session. Later answers reference earlier context automatically.
Knowledge Base (Playbooks)
Upload your documents so the AI answers with your specific details — not generic advice.
Setting up a playbook:
- Settings → Playbooks tab → New Playbook
- Click Add File — select multiple files at once (PDF, TXT, MD, RTF). Image-based/scanned PDFs are supported via OCR.
- Click Set Active
What to upload:
| Document | Why |
|---|---|
| Resume | AI introduces you with your actual experience |
| Job Description | AI tailors answers to the role |
| Past Q&As | AI reuses your prepared answers |
| Technical Notes | AI references your specific tech choices |
Lumi intelligently finds the most relevant parts of your documents for each question — even if you upload 50+ pages of notes.
Custom Prompts
The prompt bar at the top lets you switch between different AI behaviors instantly.
Built-in prompts:
- Interview — auto-detects question type (behavioral, coding, system design)
- Meeting — summaries with action items
- Coding — code review and solutions
- General — adaptive format
- Plus: STAR format, Code solution, System design, Explain, Summarize
Create your own:
- Settings → Prompts tab → Add
- Name it and write the instructions
- Click the pin icon to show it in the top bar
Screenshot Queue
Capture up to 5 screenshots that are sent to AI together in one call.
- Press
Cmd+Shift+S(or click Capture) — screenshot is added to the queue - Thumbnails appear in the capture panel with a count (e.g., "Screenshots 3/5")
- Capture more screenshots to build context (e.g., multiple slides, code files, whiteboard)
- Press
Cmd+Shift+Return(or click Ask AI) — all images are sent to GPT-4o vision
Managing screenshots:
- Remove one — click the X on any thumbnail
- Undo/redo — arrow buttons in the screenshot bar
- Clear all — click the X button on the right
- Max 5 — oldest screenshot is dropped when you exceed the limit
Works for coding questions, whiteboard problems, slides, or any visual content.
Session Management
Lumi automatically saves your work so you never lose data — even if the app crashes.
How it works:
- Auto-save — sessions flush to disk every 30 seconds, or when memory exceeds 500KB
- Session picker on launch — see your recent sessions and resume any one, or start fresh
- Resume — pick up where you left off: transcript, AI conversation history, and playbook context are fully restored
Managing sessions:
Go to Settings → Sessions to:
- Resume — click to restore a previous session
- Rename — click the pencil icon to give sessions meaningful names
- Export — save as JSON for backup or sharing
- Import — load JSON files (supports multi-select)
- Delete — remove sessions you no longer need
Session Export
Press Cmd+Shift+E to save the entire session as a Markdown file:
- Full transcript with speaker labels
- All AI responses
- Screenshot analyses
- Session metadata (duration, mode, playbook)
Performance Metrics
Click End Session to see how you did:
- Talk ratio — your speaking time vs others
- Filler words — "um", "uh", "you know" count
- Response pace — how quickly you answered
- Questions answered — how many you responded to
Track your improvement across multiple sessions.
Stealth Mode
By default, Lumi is completely invisible to screen sharing — Teams, Meet, Zoom cannot see it.
The eye icon in the control bar toggles visibility:
- Eye-slash (grey) — invisible to screen sharing (default)
- Eye (red) — visible, for recording demos
Stealth toggle applies to all windows — the overlay, region selector, and performance view are all hidden from screen share simultaneously.
Updates & Licensing
Lumi checks for updates automatically on launch. When a new version is available, a banner appears at the top of the overlay.
- License key — enter your Gumroad license key in Settings → AI to unlock
- Signed updates — update files are cryptographically signed (Ed25519) to prevent tampering
- Auto update check — compares your version against the latest release
How We Protect Your Data
Your documents and credentials are protected with industry-standard encryption. Files stored on disk are unreadable without your machine.
- API keys — encrypted at rest, tied to your machine's hardware
- Playbook documents — encrypted at rest, stored as binary files
- No cloud storage — everything stays in
~/Library/Application Support/Lumi/ - Delete anytime — remove the folder and all data is gone
Privacy
- No analytics, no telemetry, no tracking
- No account required — no email, no signup
- Your documents never leave your machine
- Only AI queries are sent to OpenAI — review OpenAI's data policy
- Audio is processed by OpenAI Whisper API, then discarded — not stored by OpenAI for API users
Testing & Verification
Lumi is backed by 328 automated tests across 38 test suites, covering 69% of source files.
What we test:
- AI uses your documents — we upload a test resume and verify the AI response contains exact details like company names, project metrics, and tech choices
- Conversation context — a 7-round simulated interview verifies the AI references earlier answers when asked follow-up questions
- Transcription quality — real audio files are transcribed and checked for keyword accuracy
- Anti-hallucination (7 layers) — silence produces zero output; garbage text in wrong languages is blocked; music lyrics misinterpreted as speech are suppressed via cross-chunk dedup and loop detection
- Encryption — saved files are verified to not contain plaintext; decryption roundtrips are confirmed
- Session cache — write-behind cache verified: files written to disk, data survives simulated crashes, save/load benchmarks under 50ms per flush
- Image PDF OCR — scanned PDFs are extracted via Apple Vision and indexed for search
- Restart reliability — the app can be stopped and restarted multiple times without losing functionality
Verified claims:
| Claim | How we verify |
|---|---|
| AI uses your resume to answer | Upload resume with unique details → ask "introduce yourself" → verify response contains those exact details |
| Conversation context works | 7-round interview → each round verified that AI references previous answers |
| Documents are encrypted | Save file → read raw bytes → confirm plaintext not visible → decrypt → confirm content matches |
| No hallucinations on silence | Feed 3 seconds of silence → verify zero text output |
| Music doesn't generate garbage | Repeated phrases across chunks are deduplicated; 3+ identical outputs suppressed as music loop |
| Sessions survive crashes | Save session → create new store instance → load and verify all data intact |
| Write-behind cache works | 100 flush cycles benchmarked at < 50ms avg; 500KB threshold triggers auto-flush |
Use Case: Interview Preparation
Before the interview
- Create a playbook with your resume, job description, and prepared Q&As
- Set it as active
- Select the Interview prompt
During the interview
- Click Start Listening before the call
- The interviewer's voice is transcribed automatically
- After each question, click Ask AI — the response draws from your resume and previous answers
- If the topic changes completely, click Clear to start fresh
- For different question types, switch prompts (e.g., STAR format, Code solution)
After the interview
- Click End Session to see performance metrics
- Export the session for review
Use Case: Meeting Notes
- Select the Meeting prompt
- Click Start Listening when the meeting begins
- Click Ask AI at any point for a running summary with action items
- Type: "What did the team decide about the deadline?"
- Export the session → share structured notes with your team
Use Case: Lecture Companion
Watching a YouTube tutorial, online course, or recorded lecture?
- Select Explain or General prompt
- Click Start Listening
- Type questions as the lecture plays:
- "Explain that in simpler terms"
- "Summarize the last 5 minutes"
- "Give me a code example of that concept"
- Export as study notes when done
Use Case: Podcast Summarizer
- Select the Summarize prompt
- Click Start Listening while the podcast plays
- Click Ask AI periodically for running summaries
- Type: "What are the 3 main takeaways so far?"
- Export as Markdown → structured podcast notes