| CI | Package | |
|---|---|---|
| Python | ||
| TypeScript |
A lightweight wiretap for LLM SDKs: capture all requests and responses with a single line of code.
Shuntly wraps LLM SDKs to record every request and response as JSON. Calling shunt() wraps and returns a client with its original interface and types preserved, permitting consistent IDE autocomplete and type checking. Shuntly provides a collection of configurable "sinks" to write records to stderr, files, named pipes, or any combination.
While debugging LLM tooling, maybe you want to see exactly what is being sent and returned. When launching an agent, maybe you want to record every call to the LLM. Shuntly can capture it all without TLS interception, a proxy or web-based platform, or complicated logging infrastructure.
pip install shuntly
Given an LLM SDK (e.g. anthropic, openai, google-genai, etc.), simply call shunt() with the instantiated SDK class. The returned object has the same type and interface.
from anthropic import Anthropic
from shuntly import shunt
# Without providing a sink Shuntly output goes to stderr
client = shunt(Anthropic(api_key=API_KEY))
# Now use the client as before
message = client.messages.create(
model="claude-sonnet-4-20250514",
max_tokens=1024,
messages=[{"role": "user", "content": "Hello"}],
)Each call to messages.create() writes a complete JSON record:
{
"timestamp": "2025-01-15T12:00:00+00:00",
"hostname": "dev1",
"user": "alice",
"pid": 42,
"client": "anthropic.Anthropic",
"method": "messages.create",
"request": {"model": "claude-sonnet-4-20250514", "max_tokens": 1024, "messages": [{"role": "user", "content": "Hello"}]},
"response": {"id": "msg_...", "content": [{"type": "text", "text": "Hi!"}]},
"duration_ms": 823.4,
"error": null
}Shuntly presently supports the following SDKs and clients:
| Client | Package | Methods |
|---|---|---|
anthropic.Anthropic |
PyPI |
messages.create, messages.stream |
openai.OpenAI |
PyPI |
chat.completions.create |
google.genai.Client |
PyPI |
models.generate_content |
litellm |
PyPI |
completion |
any-llm |
PyPI |
completion |
ollama, ollama.Client |
PyPI |
chat, generate |
For anything else, method paths can be explicitly provided:
client = shunt(my_client, methods=["chat.send", "embeddings.create"])Shuntly JSON output can be streamed or read with a JSON viewer like fx. These tools provide JSON syntax highlighting and collapsible sections.
Shuntly output, by default, goes to stderr; this is equivalent to providing a SinkStream to shunt():
from shuntly import shunt, SinkStream
client = shunt(Anthropic(api_key=API_KEY), SinkStream())Given a command, you can view Shuntly stderr output in fx with the following:
$ command 2>&1 >/dev/null | fxTo view Shuntly output via a named pipe in another terminal, the SinkPipe sink can be used. First, name the pipe when providing SinkPipe to shunt():
from shuntly import shunt, SinkPipe
client = shunt(Anthropic(api_key=API_KEY), SinkPipe('/tmp/shuntly.fifo'))Then, in a terminal to view Shuntly output, create the named pipe and provide it to fx
$ mkfifo /tmp/shuntly.fifo; fx < /tmp/shuntly.fifoThen, in another terminal, launch your command.
To store Shuntly output in a file, the SinkFile sink can be used. Name the file when providing SinkFile to shunt():
from shuntly import shunt, SinkFile
client = shunt(Anthropic(api_key=API_KEY), SinkFile('/tmp/shuntly.jsonl'))Then, after your command is complete, view the file:
$ fx /tmp/shuntly.jsonlFor long-running applications, SinkRotating writes JSONL records to a directory with automatic file rotation and cleanup. Files are named with UTC timestamps (e.g. 2025-02-15T210530Z.jsonl).
from shuntly import shunt, SinkRotating
client = shunt(Anthropic(api_key=API_KEY), SinkRotating('/tmp/shuntly'))When a file exceeds max_bytes_file (default 10 MB), a new file is created. When the directory exceeds max_bytes_dir (default 100 MB), the oldest files are pruned. Set max_bytes_dir=0 to disable pruning and retain all files. Both limits are configurable:
client = shunt(Anthropic(api_key=API_KEY), SinkRotating(
'/tmp/shuntly',
max_bytes_file=50 * 1024 * 1024, # 50 MB per file
max_bytes_dir=500 * 1024 * 1024, # 500 MB total
))Using SinkMany, multiple sinks can be written to simultaneously.
from shuntly import shunt, SinkStream, SinkFile, SinkMany
client = shunt(Anthropic(), SinkMany([
SinkStream(),
SinkFile('/tmp/shuntly.jsonl'),
]))Custom sinks can be implemented by subclassing Sink and implementing write():
from shuntly import Sink, ShuntlyRecord
class SinkPrint(Sink):
def write(self, record: ShuntlyRecord) -> None:
print(record.client, record.method, record.duration_ms)Added support for Mozilla any_llm.completion()
Added support for Ollama interfaces.
Added new SinkRotating for rotating log handling.
Added support for the LiteLLM completion interface.
Corrected interleaved writes in SinkPipe.
Renamed Record to ShuntlyRecord.
Export shunt() without Shuntly class.
Fully tested and integrated support for OpenAI and Google SDKs.
SinkPipe is now interruptible.
Initial release.