The Prompting & Context OS

Dictate. Capture. Compress. Inject.
Send context-rich prompts to any AI — without leaving your workflow.

macOS 14+ · Apple Silicon & Intel · MIT License

Vapor — floating prompt window with context tray and screenshot shelf

Everything between you and the AI

Vapor sits between your brain and any LLM — capturing context, compressing prompts, and injecting them where you need them.

Voice Dictation

Hold Fn to speak — on-device transcription via Apple Speech. No cloud, no network latency, no privacy trade-offs.

Voice dictation with live transcript in editor

Prompt Compression

40–60% token reduction. Strip filler, fuse concepts, preserve meaning. Choose Local LLM (free, on-device) or OpenRouter (cloud).

Prompt compression with token counts
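Conceptually, the effect is easy to sketch. The snippet below is a toy illustration, assuming a fixed filler-word list; Vapor's real compressor uses an LLM (local or via OpenRouter) rather than anything this naive:

```python
# Naive, illustrative compression. The real pipeline uses an LLM,
# not a word list -- this only shows what "strip filler" means.
FILLER = {"please", "basically", "actually", "really", "just",
          "very", "quite", "simply"}

def strip_filler(prompt: str) -> str:
    """Drop common filler words and collapse whitespace."""
    kept = [w for w in prompt.split() if w.lower().strip(",.!?") not in FILLER]
    return " ".join(kept)

before = "Please could you actually just summarize this really very long article simply"
print(strip_filler(before))  # → could you summarize this long article
```

Twelve words in, six out: a 50% reduction on this (contrived) input, in line with the 40–60% range above.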

Browser Injection

Send compressed prompts directly into ChatGPT, Claude, Gemini, Grok, Perplexity — no copy-paste needed.

Compressed prompt injected into AI chat

Screenshot Shelf

Auto-detects screenshots on your Desktop. Add them to context with one keypress. Vapor sees what you see.

Screenshot shelf with detected screenshots
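The idea behind detection can be sketched as a simple filename scan (an assumption for illustration; Vapor's actual watcher is internal to the app):

```python
from pathlib import Path

def find_screenshots(desktop: Path) -> list[Path]:
    """Return screenshot files on the Desktop, newest first.
    macOS names them 'Screenshot <date> at <time>.png' by default;
    this glob-based scan is a sketch, not Vapor's implementation."""
    shots = desktop.glob("Screenshot*.png")
    return sorted(shots, key=lambda p: p.stat().st_mtime, reverse=True)
```

Anything matching the pattern surfaces on the shelf; everything else on the Desktop is ignored.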

Research Interrogation

Scan live browser tabs for structured data — tables, JSON, XHR feeds, articles. Capture it all into context.

Research interrogation showing discovered browser sources
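As an illustration of the "structured data" part, here is a sketch that keeps only captures whose body parses as JSON. The `url`/`body` capture shape is an assumption; the real scanner also recognizes tables, articles, and XHR feeds:

```python
import json

def structured_sources(captures: list[dict]) -> list[dict]:
    """Filter captured responses down to parseable JSON.
    Illustrative only -- field names are assumed, and Vapor's
    scanner handles more formats than JSON."""
    out = []
    for cap in captures:
        try:
            out.append({**cap, "data": json.loads(cap["body"])})
        except (json.JSONDecodeError, TypeError):
            pass  # not structured JSON; skip
    return out

captures = [
    {"url": "https://api.example.com/feed", "body": '{"items": [1, 2, 3]}'},
    {"url": "https://example.com/page", "body": "<html>...</html>"},
]
print([c["url"] for c in structured_sources(captures)])  # → ['https://api.example.com/feed']
```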

Context Tray

Captured pages, articles, and research — all in one sidebar. Search, filter, and insert context directly into your prompts.

Context tray sidebar with captured pages

Three steps. Zero friction.

1. Dictate or type

Hold the Fn key to speak, or type your prompt. Vapor captures your intent — voice or keyboard, your choice.

Voice dictation in editor
2. Compress

Hit ⌘↩. Vapor strips filler words and fuses concepts into dense, token-efficient form. 40–60% reduction, meaning preserved.

Compression with stats
3. Send

Hit ⌘⇧P. Vapor injects the compressed prompt directly into your AI chat tab. Auto-submit optional.

Prompt injected into AI chat

Along the way, context flows in automatically:

Screenshots → Browser captures → Entity extraction → Summaries & citations → Context Tray

Two ways to compress. Both best-in-class.

Local LLM

Free. Private. On-device.

  • Cost: Free
  • Privacy: 100% on-device
  • Latency: <1 second
  • Models: Phi-4 Mini (2.3 GB), Qwen 3 4B (2.4 GB), Qwen 2.5 7B (4.7 GB)
  • Setup: One-click download in Settings
Local LLM settings

OpenRouter

Cloud. Powerful. Configurable.

  • Cost: ~$0.01/1M tokens
  • Privacy: Cloud-based
  • Latency: ~1–2 seconds
  • Models: GLM-5 default, any OpenRouter model
  • Setup: Paste API key in Settings
OpenRouter settings

Your AI tabs. Connected.

The Chrome extension links Vapor directly to your AI chat interfaces. Three steps to set up.

1. Load the extension — included in the DMG. Open chrome://extensions, enable Developer mode, click "Load unpacked", and select the extension folder.

2. Copy the auth token — open Vapor Settings → Browser → Authentication → Copy.

3. Paste into the extension — click the Vapor icon → Settings → Paste → Save. Connected!

Chrome extension popup

Supported AI sites:

ChatGPT · Claude · Gemini · Grok · Perplexity, plus any site via the DOM picker

Built in the open

Vapor is MIT-licensed. No telemetry, no tracking, no lock-in. Fork it, modify it, ship it.

MIT License · No Telemetry · On-Device First

Get Vapor

Download DMG

Includes the app and Chrome extension. Drag to Applications, load the extension, and go.

Download Latest Release

macOS 14+ (Sonoma) · Apple Silicon & Intel

Build from Source

Clone the repo, open in Xcode, build and run.

git clone https://github.com/memetic-research-labs/vapor.git
cd vapor/Vapor
open Vapor.xcodeproj
Build Instructions