Syrinx

Open voice orchestration and media-transport layer for Kuralle.

Syrinx is the self-hostable voice engine behind Kuralle, the open alternative to closed "voice agent API" platforms. It keeps provider and client quirks at the transport edge and hands the agent runtime a clean stream of mono PCM16 audio.

What it provides

Resumable WebSocket audio protocol (mono PCM16, turn and session management, sequence and sample-rate locks, reconnect within a retention window).
Telephony adapters: SIP, Twilio, LiveKit.
A provider-testing suite for realtime audio backends.
Runs on Node and Cloudflare Workers — one hibernatable Durable Object per conversation (WebSocketPair inbound, timers→DO alarms, SQLite session store, optional R2 call recording). See docs/serverless-edge-port-implementation-notes.md.

Edge deployment (Cloudflare Workers)

The @kuralle-syrinx/server-workers package is the deployable template: it runs the full engine — live Deepgram STT + OpenAI + Deepgram Aura TTS — on withVoice(Agent) (the @kuralle-syrinx/cf-agents mixin over the Cloudflare agents SDK), one hibernatable Durable Object per session. The Agent provides hibernation, the keepAlive() lease, and SQLite natively — no hand-rolled schedulers or session stores.

pnpm --filter @kuralle-syrinx/server-workers exec wrangler deploy
# set DEEPGRAM_API_KEY / OPENAI_API_KEY via `wrangler secret put` (see .dev.vars.example)

Endpoints: wss://<worker>/ws?sessionId=<id> (browser/edge voice), wss://<worker>/twilio?sessionId=<callSid> (Twilio Media Streams phone leg), POST /incoming-call (Twilio Voice webhook → <Connect><Stream> TwiML), GET /health, GET /recordings?sessionId=<id> (lists R2 recordings). Bind an R2 bucket as RECORDINGS to capture, per call, a stereo conversation.wav (user left / assistant right, time-aligned) plus user.wav / assistant.wav stems and a manifest.json.

Full walkthrough — bindings, secrets, browser + phone, local verify: Deploy Syrinx on Cloudflare.

Guides

Building a voice agent — end-to-end guide: kuralle-agents (brain) + Syrinx (voice), all bridges, Node and Cloudflare deploy.

Playground

Live browser demo — Syrinx Studio (apps/studio, a Cloudflare static-assets Worker): mic capture (server owns turns — no client VAD), a Web-Audio visualizer, and a live transcript over the WebSocket audio protocol. Use the ?ws= switcher to point it at your own hosted voice worker (wss://<your-worker>/ws?sessionId=<id>) — two reference shapes ship in @kuralle-syrinx/server-workers: a cascade path (Deepgram STT → reasoner → TTS) and a realtime bi-model path (gpt-realtime front → reasoner back). A bundled "Play sample" / sample.wav no-mic path gives a deterministic demo turn.

Deploy your own voice worker (wrangler deploy) and add auth before exposing its /ws — voice endpoints are unauthenticated by default and incur provider cost per connection.

Configuration

Syrinx reads provider credentials from the environment. Copy your keys into a local .env (which is gitignored and never committed):

OPENAI_API_KEY=
GEMINI_API_KEY=
GOOGLE_GENERATIVE_AI_API_KEY=
DEEPGRAM_API_KEY=
ELEVENLABS_API_KEY=
ELEVENLABS_VOICE_ID=
CARTESIA_API_KEY=
CARTESIA_VOICE_ID=

See docs/websocket-audio-protocol.md for the wire protocol and PROVIDER-TESTING.md for the provider test matrix.

Contributing

New here? Start with CONTRIBUTING.md — it's the orientation guide: what to read in what order, the package map, how to run the engine locally, and the bar a change clears before it ships.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 303 Commits
apps		apps
docs		docs
examples		examples
packages		packages
research		research
scripts		scripts
.dockerignore		.dockerignore
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile.studio-spike		Dockerfile.studio-spike
Dockerfile.telephony-spike		Dockerfile.telephony-spike
LICENSE		LICENSE
PROVIDER-TESTING.md		PROVIDER-TESTING.md
README.md		README.md
baseline-v2.json		baseline-v2.json
fly.bot-telephony-spike.toml		fly.bot-telephony-spike.toml
fly.studio-spike.toml		fly.studio-spike.toml
fly.synthetic-carrier-spike.toml		fly.synthetic-carrier-spike.toml
fly.telephony-spike.toml		fly.telephony-spike.toml
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
pnpm-workspace.yaml		pnpm-workspace.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Syrinx

What it provides

Edge deployment (Cloudflare Workers)

Guides

Playground

Configuration

Contributing

License

About

Uh oh!

Releases 6

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Syrinx

What it provides

Edge deployment (Cloudflare Workers)

Guides

Playground

Configuration

Contributing

License

About

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 6

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages