Custom Resilient RAG Engine

notion link: text

Custom Resilient RAG Engine

A production-minded, full-stack Retrieval-Augmented Generation (RAG) platform built to handle heavy document ingestion, layout-preserving text parsing, and verifiable semantic retrieval.

Instead of treating the data pipeline like a black box, this system uses a custom Engine Inspector UI layer that breaks down vector similarity distances and uncovers the literal context strings injected into the LLM.

Technical Stack

Framework: Next.js (App Router) running the Turbopack compiler
Language: TypeScript (moduleResolution: "bundler")
Database & Vectors: PostgreSQL + pgvector
Database Layer: Drizzle ORM
Binary Extraction: Native pdfjs-dist (Legacy ESM build configuration)

System Engineering Log & Critical Breakthroughs

Building this application without using bulk external wrappers (like LangChain or LlamaIndex) meant hitting several low-level compilation and runtime bugs. Here is what broke and how we engineered around it:

1. Next.js Bundler Path Clashes

The Problem: Configuring the workspace with strict TypeScript bundler resolution parameters resulted in fatal route compilation crashes (Module not found: Can't resolve './schema.js').
The Fix: Under the hood, Next.js expects clean, extensionless absolute mapping inside App Router chunks. We stripped all rigid .js or .ts file suffixes across our active route pathways (route.ts, ingest.ts, and our Drizzle configuration files), leaving native asset lookup completely to the framework compiler.

2. PDF.js Isolated Worker Thread Collapse

The Problem: Forcing server-side string extraction directly through pdfjs-dist threw execution breaks because Next.js sandboxes server logic into memory buffers:
```
Setting up fake worker failed: "Cannot find module '.../pdf.worker.mjs'"
```


* **The Fix:** Standard dynamic imports fail when bundled via Turbopack since the relative path maps out of runtime context. We bypassed implicit module loaders completely. We combined native Node.js filesystem primitives to read the `.mjs` asset directly from the local node node module tree and cast it to an absolute static string path using `pathToFileURL`:
```typescript
const workerPath = path.join(process.cwd(), "node_modules", "pdfjs-dist", "legacy", "build", "pdf.worker.mjs");
pdfjs.GlobalWorkerOptions.workerSrc = pathToFileURL(workerPath).toString();

3. Connection Socket Exhaustion (`ECONNREFUSED`)

The Problem: Heavy document streams and concurrent ingestion transactions caused transient pipeline dropouts due to localized connection pool exhaustion.
The Fix: Verified socket bindings locally (netstat -ano | findstr 5432) and transitioned the application client from short-lived singular database client definitions to an explicit, persistent connection pool configuration under Drizzle to safely absorb simultaneous text transactions.

Core Retrieval Safeguard: Similarity Thresholding

During testing, asking an out-of-bounds query (e.g., "Tell me who is Elon Musk?") when the database only holds project management or business workflows exposed a massive flaw in standard, naive top-$K$ vector setups. The vector space always yields matches, forcing entirely irrelevant chunks into the prompt context. This wastes tokens and causes model hallucinations.

We corrected this by implementing a hard distance constraint directly in our SQL query layout:

$$\text{Similarity Score } (S) = 1 - \text{CosineDistance}(\vec{u}, \vec{v}) \ge 0.70$$

If the closest vector chunks fail to cross this $0.70$ similarity floor, the retrieval layer instantly blocks prompt modification. The engine routes the execution clean away from the context generator and sets a visible warning state in the interface: ⚠️ Direct Model Knowledge (No Grounded Context Found).

UI/UX Framework

The interface avoids cluttered data loops in favor of a responsive, two-column layout (bg-zinc-950):

Left Column (35%): Document Control Center handling drop zone tracking, real-time extraction latency metrics, and ingestion load volumes.
Right Column (65%): Unified chat feed utilizing clear semantic contrast. Every message payload contains an expandable Inspector Accordion exposing the exact database matching metrics (e.g., Match: 84.2%) and the final compiled context layout for maximum system accountability.

Current Status & Next Steps

🔧 Status: Active Improvement / Work-In-Progress

The core pipeline from document stream to semantic vector validation is stable. Current efforts are focused on:

Hardening the thresholding boundaries to test how different vector distance metrics handle varying text structures.
Integrating a session-based metadata layer to ensure multiple parallel users can query distinct private datasets simultaneously.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
knowledge-base		knowledge-base
lib		lib
src		src
.gitignore		.gitignore
Readme.md		Readme.md
docker-compose.yml		docker-compose.yml
drizzle.config.ts		drizzle.config.ts
image.png		image.png
init.sql		init.sql
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Custom Resilient RAG Engine

Technical Stack

System Engineering Log & Critical Breakthroughs

1. Next.js Bundler Path Clashes

2. PDF.js Isolated Worker Thread Collapse

3. Connection Socket Exhaustion (`ECONNREFUSED`)

Core Retrieval Safeguard: Similarity Thresholding

UI/UX Framework

Current Status & Next Steps

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Custom Resilient RAG Engine

Technical Stack

System Engineering Log & Critical Breakthroughs

1. Next.js Bundler Path Clashes

2. PDF.js Isolated Worker Thread Collapse

3. Connection Socket Exhaustion (ECONNREFUSED)

Core Retrieval Safeguard: Similarity Thresholding

UI/UX Framework

Current Status & Next Steps

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

3. Connection Socket Exhaustion (`ECONNREFUSED`)

Packages