A MAKER-based autonomous feature engineering system
Ultracode is a production-grade autonomous software engineering system that plans, implements, validates, and commits complete software features using a structured multi-agent architecture.
Ultracode is inspired by the MAKER architecture and implements a general-purpose agent harness for reliable, auditable, and cost-aware AI-driven development.
Ultracode is not a chatbot.
It is an engineering system.
Ultracode is actively being developed and tested. While core functionality works reliably, some features may change. Use in production at your own discretion.
Intuitive UI with project management, feature tracking, and real-time execution monitoring. See screenshots below.
Based on MAKER (Cognizant AI Lab), Ultracode eliminates the "hallucination problem" that plagues modern LLMs:
- Separation of concerns: Planning, execution, and voting are independent
- Error decorrelation: Multiple agents generate diverse solutions
- Structured voting: Consensus replaces blind generation
- Near-zero failure rates on long-running projects
- Use expensive models only for planning (Example: gpt-5.2 or opus-4.5)
- Cheaper models for coding: GPT-4o Mini, Gemini 2, or your local models
- Local model support: Run open models via LM Studio (GLM, Mistral, etc.) β zero API costs
- Full cost transparency: Real-time token breakdown per project
- Every decision is logged (event sourcing)
- Human-in-the-loop for risky operations
- Filesystem sandbox prevents accidental damage
- Git commits with full feature history
- Organize work as features with dependencies
- Automatic task decomposition and prioritization
- Resume paused work without losing context
Ultracode is directly inspired by:
MAKER: Multi-Agent Architecture for Knowledge-Driven Reasoning
Cognizant AI Lab (2024)
https://www.cognizant.com/us/en/ai-lab/blog/maker
From the MAKER paper, Ultracode adopts:
- Separation of planning, execution, and validation
- Explicit intermediate representations
- Error decorrelation via multiple samples
- Structured voting instead of blind generation
- Human-in-the-loop safety boundaries
Ultracode extends MAKER from task execution to a complete feature-based software engineering pipeline.
Ultracode turns large language models into reliable engineering agents by embedding them inside a deterministic system with memory, validation, and control.
Project β Features (prioritized, dependency-aware) β Subtasks (atomic, executable) β Voting-based execution β Validation β Git commits
- Work is organized as features, not prompts
- Priorities: A / B / C
- Explicit dependencies and blocking rules
- Planner Agent β decomposes features into subtasks
- Executor Agent β generates concrete code and actions
- Voting / Verification Agent β selects safe, correct outputs
- Multi-sample generation with adaptive temperature
- First-to-lead-by-k voting strategy
- Prompt paraphrasing for error decorrelation
- Red-flag detection before execution
- SQLite (WAL mode) as single source of truth
- Projects, features, subtasks, events
- Full audit trail (event sourcing)
- Filesystem sandboxing
- Command risk classification
- Human approval for risky actions
- No arbitrary code execution
- Auto-start dev servers (Node, PHP, static)
- Screenshot-based UI testing (Puppeteer)
- LLM-based verification against Definition of Done
- Token estimation and pricing per model
- Real-time project cost breakdown
- Zero cost for local models
ββββββββββββββ β Web UI β (live SSE updates) βββββββ¬βββββββ βΌ ββββββββββββββ β HTTP API β βββββββ¬βββββββ βΌ ββββββββββββββ β Feature β β Manager β βββββββ¬βββββββ βΌ ββββββββββββββ β Orchestratorβ β (MAKER) β βββββββ¬βββββββ βΌ ββββββββββββββ β Voting β β Engine β β + RedFlags β βββββββ¬βββββββ βΌ ββββββββββββββ β LLM β β Registry β βββββββ¬βββββββ βΌ βββββββββββββββββββββββββββββββ β OpenAI Β· Claude Β· Gemini β β Local Models (LM Studio) β βββββββββββββββββββββββββββββββ
Ultracode applies targeted voting at deterministic decision gates, following the MAKER philosophy.
LLMs fail in correlated ways.
Voting combined with prompt paraphrasing breaks that correlation and improves reliability.
Executor Outputs β Red-Flag Filtering β Vote Tally β [ Approve | Retry | Fail | Human Review ]
Voting is applied to:
- Subtask execution
- Structured JSON outputs
- Ambiguous or risky operations
- FeatureStore β SQLite persistence & event sourcing
- FeatureManager β dependency resolution & execution queue
- FeaturePlanner β LLM-based feature decomposition
- ContextBuilder β intelligent prompt assembly
- Orchestrator β MAKER-style execution engine
- VotingEngine β consensus selection & error filtering
- ProjectGuard β filesystem sandbox
- ExecutionGuard β command safety
- ResourceMonitor β token & cost tracking
Ultracode includes a structured 3-step project wizard:
-
Project Basics
- Name and description
- Folder creation & git init
-
AI-Guided Clarification
- Architecture & stack
- Data models & authentication
- Non-functional requirements
- Optional web research
-
Model Assignment
- Planner model
- Executor model
- Voting model
Outputs:
project.md(engineering specification)- Feature list with Definition of Done
init.shfor automated bootstrapping
- Node.js β₯ 18
- At least one LLM provider (or LM Studio for local models)
git clone https://github.com/tomgoeck/Ultracode.git
cd Ultracode
npm installcp config.json.example data/config.json
# or
Run the application below and click on the settings button in the top rightAdd API keys (the data/ directory is gitignored).
node src/server.js
# or
PORT=4173 npm startOpen: http://localhost:4173
βΈ»
π§ͺ Feature Lifecycle
pending β running β completed β verified β β paused failed β blocked (dependency)
Ultracode always knows what can run next β and why.
βΈ»
π§ Why Ultracode Exists
Ultracode demonstrates that: β’ LLMs become reliable inside systems β’ Autonomous coding requires memory, structure, and validation β’ Voting beats prompt cleverness β’ MAKER-style architectures scale beyond research prototypes
This repository is both: β’ a usable engineering tool β’ a reference implementation of modern agent research
βΈ»
π License
MIT License
βΈ»
π Acknowledgements β’ Cognizant AI Lab β MAKER architecture β’ OpenAI, Anthropic, Google β’ The autonomous agents research community
βΈ»
Ultracode β Autonomous engineering, grounded in systems, not prompts.





