Pretype

System-wide AI autocomplete for macOS.
Copilot-style suggestions in every text field — completely offline, private, and on-device.

Website · Quick Start · Why Pretype · Features · How it Works · Roadmap · Requirements

Type anywhere → gray ghost text appears at the caret → press Tab to accept a word, or ⇧Tab for the rest.

Note

Pretype runs entirely on your Mac. It uses Apple's MLX framework on Apple Silicon with Gemma 4, or the Apple Intelligence system model. Your keystrokes never leave your machine: no subscriptions, no cloud, no tracking.

⚡ Why Pretype?

Most autocomplete solutions live inside a single code editor and ship your text to remote servers. Pretype runs globally across macOS and processes everything locally.

| Feature | Pretype | Cloud Autocomplete |
| --- | --- | --- |
| 🛡️ Privacy | 100% On-Device (keystrokes never leave your Mac) | Sent to a remote server |
| 🌐 Scope | System-Wide (works in Mail, Slack, Notes, Safari, etc.) | Usually locked to a single IDE / editor |
| 💸 Cost | Free & Open-Source (MIT License) | Subscription fees or API token costs |
| ⚡ Offline | Fully Functional without internet | Requires active internet connection |
| ⚙️ Setup | One-click local model download | Account registration + API key setup |

Pretype is an open-source reimplementation of the concept behind Cotypist (closed-source, freemium). Not affiliated with Cotypist.

📸 Screenshots

🔤 Inline Typo Fix

Misspelled words automatically show corrections in a pill above the caret. Press Tab to apply, or Esc to dismiss.

🪄 Fix Selection (`⌥Tab`)

Select any typo-ridden line or phrase and press ⌥Tab. The local LLM rewrites the selection in place while preserving your original tone.

🎨 Presentation Modes

Choose between seamless inline ghost text (pixel-accurate even in Electron) or a clean floating capsule panel above the caret.

✨ Features

🌐 Works Everywhere: Integrates with macOS text fields that are exposed through the Accessibility API, including native AppKit/SwiftUI apps, Electron apps (VS Code, Slack, Claude Desktop), and web views.
✨ Pixel-Perfect Ghost Text: Completion text is baseline-matched and sized dynamically to match your editor's font.
⌨️ Smart Keystrokes: Press Tab to accept the next word, ⇧Tab to accept the rest of the suggestion, or simply type over it to reject.
🔤 Inline Typo Fixes: Misspelled words show an instant correction pill above the word; press Tab to apply. Uses the native system spell-checker and supports English and Russian.
🪄 Smart Rewrites (⌥Tab): Select any text and let the local LLM fix grammar, typos, and phrasing in place while preserving your original tone.
🤖 Local Inference Engines: Standardized on Gemma 4 via Apple's MLX framework, or the Apple Intelligence system model on macOS 26+.
⚡ Low-Latency Completion: Reuses the key-value (KV) cache to deliver warm completions in 0.05–0.2 seconds.
👀 Context Aware & OCR: Adjusts behavior per app (for example, disabled in terminals). Optional on-screen OCR reads surrounding window context.
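
The Tab / ⇧Tab behavior listed under Smart Keystrokes can be sketched as a small string-splitting helper. This is only an illustration: the names `AcceptKey`, `AcceptResult`, and `accept(_:with:)` are hypothetical, and the choice to keep leading whitespace attached to the accepted word is an assumption, not something this README specifies.

```swift
/// Which accept key was pressed (hypothetical type, not Pretype's API).
enum AcceptKey { case tab, shiftTab }

/// What to type into the app, and what stays on screen as ghost text.
struct AcceptResult: Equatable {
    let accepted: String
    let remaining: String
}

/// Splits a pending suggestion for the pressed key.
/// Tab takes the next word (with any whitespace in front of it);
/// Shift-Tab takes everything that is left.
func accept(_ suggestion: String, with key: AcceptKey) -> AcceptResult {
    switch key {
    case .shiftTab:
        return AcceptResult(accepted: suggestion, remaining: "")
    case .tab:
        var index = suggestion.startIndex
        // Skip leading whitespace so it travels with the word after it.
        while index < suggestion.endIndex, suggestion[index].isWhitespace {
            index = suggestion.index(after: index)
        }
        // Consume the word itself, stopping at the next whitespace.
        while index < suggestion.endIndex, !suggestion[index].isWhitespace {
            index = suggestion.index(after: index)
        }
        return AcceptResult(accepted: String(suggestion[..<index]),
                            remaining: String(suggestion[index...]))
    }
}
```

Under these assumptions, pressing Tab on the suggestion " quick brown fox" accepts " quick" and leaves " brown fox" as ghost text, while ⇧Tab accepts the whole string.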

🚦 Quick Start

Important

Requires macOS 14+ on Apple Silicon with full Xcode (the MLX engine needs the Metal compiler).

```bash
# Xcode 26+ only: install the Metal toolchain once
xcodebuild -downloadComponent MetalToolchain

# Clone the repository
git clone https://github.com/nikiomori/Pretype.git && cd Pretype

# Build and run the app bundle
./Scripts/make-app.sh        # builds build/Pretype.app
open build/Pretype.app
```

Tip

Grant Accessibility when prompted. This permission is how Pretype reads active text fields, catches the Tab key, and types suggestions back. If you grant it after launching, please restart the app.

🛠️ Dev Loop, Headless Testing & SwiftPM Caveats

For a fast dev loop:

```bash
./Scripts/dev.sh   # swift build + auto-copies the metallib next to the binary
```

This requires one prior run of ./Scripts/make-app.sh to produce the initial metallib.

Warning

Swift build caveat: Plain swift build / swift run compiles, but MLX will not work out of the box because SwiftPM cannot compile Metal shaders directly; the engine fails with "Failed to load the default metallib". The build scripts handle this by compiling the shaders through xcodebuild. If the shaders are missing, Pretype disables the MLX engine gracefully rather than crashing.

When running the raw binary from a terminal, macOS attributes the Accessibility permission to your terminal app. Make sure to grant it to the terminal, or run the .app bundle instead.

🧩 How It Works

Pretype hooks into macOS accessibility APIs to provide a system-wide overlay:

```mermaid
flowchart LR
    App[Focused App\nAny text field] -->|AX API Text| FocusTracker
    FocusTracker -->|Prompt| MLX[MLX Engine\nGemma 4]
    MLX -->|Suggestion| Window[Suggestion Overlay]
    Window -->|Ghost Text| App
    App -->|Keystrokes| EventTap
    EventTap -->|Tab Caught| Injector[Text Injector]
    Injector -->|Simulated Keys| App
```

FocusTracker tracks the focused text element via AXObserver and reads the text surrounding the caret on each keystroke.
The CompletionEngine (MLX / Gemma 4, debounced and cancellable) evaluates the context and returns a short continuation.
SuggestionWindow renders the gray ghost text size- and baseline-matched to the caret.
A CGEventTap catches completion keys (Tab / ⇧Tab). If accepted, the text is typed back into the active application as synthetic key events.
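
The "debounced and cancellable" behavior of the CompletionEngine step can be illustrated with a small state machine that takes timestamps as plain numbers. This is a sketch under assumptions: the `Debouncer` type, its method names, and the 0.15 s quiet interval are all hypothetical and are not taken from Pretype's source.

```swift
/// Decides when a completion request may start and whether a finished
/// result is still relevant. Time is passed in explicitly (seconds) so the
/// logic can be exercised without real timers.
struct Debouncer {
    let quietInterval: Double            // required pause after the last keystroke
    private(set) var lastKeystroke: Double? = nil
    private(set) var generation = 0      // bumped on every keystroke

    init(quietInterval: Double) {
        self.quietInterval = quietInterval
    }

    /// Record a keystroke and return a ticket identifying the new request.
    mutating func keystroke(at now: Double) -> Int {
        lastKeystroke = now
        generation += 1
        return generation
    }

    /// Debounce: a request may start only after the user has paused long enough.
    func isReady(at now: Double) -> Bool {
        guard let last = lastKeystroke else { return false }
        return now - last >= quietInterval
    }

    /// Cancellation: a result is shown only if no newer keystroke superseded it.
    func isCurrent(_ ticket: Int) -> Bool {
        ticket == generation
    }
}
```

In this model a suggestion computed for an older keystroke is simply dropped when its ticket no longer matches the current generation, which is one common way to make slow inference results harmless.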

🔧 Under the Hood

🤖 Engines & Models

Inference Engines

Two backends implement the CompletionEngine protocol:

In-Process MLX Inference (Default): Runs Gemma 4 locally using Apple's mlx-swift-lm framework. Models are downloaded directly from Hugging Face on first launch.
Apple Intelligence (macOS 26+): Runs the OS-provided system model on the Neural Engine via Apple's Foundation Models framework. Pretype loads no model weights of its own in this mode, so it adds essentially no RAM footprint.
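
To make the "two backends, one protocol" arrangement concrete, here is a minimal sketch. The README names the CompletionEngine protocol but does not show its requirements, so every member below (`name`, `isAvailable`, `complete(context:)`), the `StubEngine` type, and the fallback order are assumptions for illustration only.

```swift
/// Hypothetical shape of the protocol; the real requirements are not shown in this README.
protocol CompletionEngine {
    var name: String { get }
    var isAvailable: Bool { get }
    /// Returns a short continuation, or nil when the engine has nothing to suggest.
    func complete(context: String) -> String?
}

/// Stand-in backend that returns a canned reply, used here in place of real inference.
struct StubEngine: CompletionEngine {
    let name: String
    let isAvailable: Bool
    let cannedReply: String?

    func complete(context: String) -> String? {
        isAvailable ? cannedReply : nil
    }
}

/// Picks the first usable engine in preference order (assumed policy).
func firstAvailable(_ engines: [CompletionEngine]) -> CompletionEngine? {
    engines.first(where: { $0.isAvailable })
}
```

With this shape, a missing metallib could be modeled as the MLX backend reporting `isAvailable == false`, letting the caller skip it without crashing. Whether Pretype actually falls back to the other backend in that case is not stated here.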

MLX Model Catalog

A variant is selected automatically on startup based on your Mac's installed RAM:

Gemma 4 E4B 8-bit (≈8.6 GB): Best quality; default for Macs with ≥32 GB RAM.
Gemma 4 E4B 6-bit (≈6.8 GB): Near-best quality; default for Macs with 16 GB to under 32 GB RAM.
Gemma 4 E2B 8-bit (≈5.7 GB): Small and precise.
Gemma 4 E4B 4-bit (≈5 GB): Compact.
Gemma 4 E2B 4-bit (≈3.4 GB): Light footprint; default for Macs with under 16 GB RAM.
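
The RAM-based defaults in the catalog above amount to a three-way threshold check. The sketch below assumes that a Mac with exactly 32 GB gets the 8-bit E4B variant; the enum and function names are hypothetical, and only `ProcessInfo.processInfo.physicalMemory` is a real Foundation API.

```swift
import Foundation

/// The three variants the catalog lists as RAM-based defaults.
enum GemmaVariant: String {
    case e4b8bit = "Gemma 4 E4B 8-bit"
    case e4b6bit = "Gemma 4 E4B 6-bit"
    case e2b4bit = "Gemma 4 E2B 4-bit"
}

/// Maps installed RAM to the default variant.
/// Assumption: the 32 GB boundary belongs to the 8-bit tier.
func defaultVariant(forRAMGiB ram: Double) -> GemmaVariant {
    if ram >= 32 { return .e4b8bit }
    if ram >= 16 { return .e4b6bit }
    return .e2b4bit
}

/// Installed physical memory in GiB, as reported by Foundation.
func installedRAMGiB() -> Double {
    Double(ProcessInfo.processInfo.physicalMemory) / 1_073_741_824
}
```

Calling `defaultVariant(forRAMGiB: installedRAMGiB())` would then yield the default for the current machine under these assumptions. The E2B 8-bit and E4B 4-bit variants are never chosen automatically in this sketch, which matches the catalog listing no RAM default for them.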

🪄 Typo Corrections & Selection Rewrites

Inline Typo Fix: Instantly displays the correction in a pill above the misspelled word as you type. Uses the macOS system spell-checker (English + Russian).
Fix Selection (⌥Tab): Highlight any text and press ⌥Tab. The local LLM rewrites the selection in place, preserving tone and punctuation.

⚡ Latency & Cache Optimization

KV-Cache Reuse: Prefills only the newly typed tokens and reuses the existing Key-Value cache.
Performance: Prefill speed of 400–750 tokens/sec and decode speed of ~90–105 tokens/sec on M-series chips, delivering hot completion latency of 0.05–0.2s.
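
KV-cache reuse comes down to finding how many leading tokens the new prompt shares with the previously processed one, then prefilling only the remainder. The sketch below shows that bookkeeping plus a rough latency estimate. Token IDs are plain integers, all names are hypothetical, and modeling latency as prefill time plus decode time is a simplifying assumption rather than how Pretype measures it.

```swift
/// Number of leading tokens shared by the cached prompt and the new prompt.
func sharedPrefixLength(_ cached: [Int], _ incoming: [Int]) -> Int {
    var n = 0
    while n < cached.count, n < incoming.count, cached[n] == incoming[n] {
        n += 1
    }
    return n
}

struct PrefillPlan: Equatable {
    let reusedTokens: Int      // KV entries kept from the previous request
    let tokensToPrefill: Int   // tokens the model still has to process
}

/// Keeps the shared prefix and prefills only what comes after it.
func planPrefill(cached: [Int], incoming: [Int]) -> PrefillPlan {
    let reused = sharedPrefixLength(cached, incoming)
    return PrefillPlan(reusedTokens: reused,
                       tokensToPrefill: incoming.count - reused)
}

/// Rough latency model in seconds: prefill time plus decode time.
func estimatedLatency(prefillTokens: Int, decodeTokens: Int,
                      prefillRate: Double, decodeRate: Double) -> Double {
    Double(prefillTokens) / prefillRate + Double(decodeTokens) / decodeRate
}
```

Using the low end of the figures quoted above, prefilling 20 new tokens at 400 tokens/sec and decoding 8 tokens at 90 tokens/sec comes to roughly 0.05 s + 0.09 s ≈ 0.14 s, which falls inside the stated 0.05–0.2 s range. The token counts are made up for illustration.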

👀 Context & Vision OCR

App Awareness: Adapts prompt style based on the active application (e.g., short completions in chats, disabled in terminal emulators).
Screen Context: Runs Apple's local Vision OCR framework on the focused application's window to pull nearby text (like reading the email thread you are replying to). Requires Screen Recording permission; disabled by default.
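
App awareness can be pictured as a lookup from the frontmost app's bundle identifier to a completion style. The bundle identifiers below are examples chosen for illustration, and the `CompletionStyle` type and function name are hypothetical; this README does not list which apps Pretype actually special-cases.

```swift
enum CompletionStyle: Equatable {
    case disabled   // no suggestions at all
    case short      // brief completions, e.g. in chat apps
    case standard   // default behavior everywhere else
}

// Illustrative bundle identifiers only; the real lists are not given in this README.
let terminalApps: Set<String> = ["com.apple.Terminal", "com.googlecode.iterm2"]
let chatApps: Set<String> = ["com.tinyspeck.slackmacgap", "com.apple.MobileSMS"]

/// Maps the active app to a completion style (assumed policy).
func completionStyle(forBundleID bundleID: String) -> CompletionStyle {
    if terminalApps.contains(bundleID) { return .disabled }
    if chatApps.contains(bundleID) { return .short }
    return .standard
}
```

Keeping the policy in plain sets makes it straightforward to extend later, for instance with the per-app allow and block lists mentioned in the roadmap.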

🩺 Troubleshooting

If you don't see any autocomplete suggestions, open the Diagnostics panel from the menu bar icon.

Common Issues & Fixes

Accessibility: NOT granted ✗: If you are running from a terminal, verify that the terminal app has the Accessibility permission. If you built the app locally, a changed binary signature can invalidate the existing grant. Reset it by running tccutil reset Accessibility app.pretype.Pretype, then grant the permission again.
Text element: none: The active app does not expose its text boxes via the macOS Accessibility API.
Last: engine returned no suggestion: The model chose to remain silent to avoid suggesting irrelevant text. Keep typing.

🗺️ Roadmap

Emoji completion (:shrug: → 🤷)
Multi-language support overrides
Per-app compatibility DB and blacklist/whitelist settings
Sparkle auto-updates, notarized Homebrew builds

💻 Requirements

OS: macOS 14+ (macOS 26+ for Apple Intelligence system model)
Hardware: Apple Silicon Mac (M1/M2/M3/M4 or newer)
Software: Full Xcode installation (Metal toolchain)
Storage: 3.4–8.6 GB for the local MLX model

📄 License

Pretype is licensed under the MIT License.

Built with Swift, MLX, and Gemma, entirely on-device. ⌨️
