text2sql

Natural language to SQL over a competition-platform database.
A full-stack research project: schema design → data generation → retrieval-augmented generation → TUI tooling.

Architecture

User question
      │
      ▼
┌─────────────────────────────────────────────────┐
│                 LLM Service (FastAPI)            │
│                                                  │
│  ┌─────────────────────────────────────────┐    │
│  │            RAG Pipeline                  │    │
│  │  BM25 ──┐                               │    │
│  │         ├──► HybridRetriever            │    │
│  │  FAISS ─┘        │                      │    │
│  │             CrossEncoder reranker        │    │
│  │                  │                       │    │
│  │         tables + examples context        │    │
│  └──────────────────┼──────────────────────┘    │
│                     │                            │
│  ┌──────────────────▼──────────────────────┐    │
│  │         Knowledge Graph                  │    │
│  │   FK-path expansion (Dijkstra)           │    │
│  │   DDL enrichment → JOIN hints            │    │
│  └──────────────────┼──────────────────────┘    │
│                     │                            │
│              LLM (OpenRouter / Ollama)           │
│                     │                            │
└─────────────────────┼───────────────────────────┘
                      │ SQL
                      ▼
               PostgreSQL (Docker)
                      │
                      ▼
                  JSON result

Stack

Layer	Tech
Database	PostgreSQL 16 (Docker)
Embeddings	`intfloat/multilingual-e5-base` (sentence-transformers, MPS-accelerated)
Dense retrieval	FAISS `IndexFlatIP`
Sparse retrieval	BM25 (custom implementation)
Reranking	`cross-encoder/ms-marco-MiniLM-L-6-v2`
Graph traversal	NetworkX (Dijkstra)
LLM	OpenRouter API / Ollama (local)
Service	FastAPI + uvicorn
TUI	Textual
Data generation	asyncpg + Faker

Project structure

text2sql/
├── infra/                  # Docker Compose + DB config
├── initdb/                 # SQL schema (DDL, constraints, indexes)
├── seed/                   # Async data generator
│   ├── inserter.py         # Generic batch inserter with dependency resolution
│   ├── seed_base/core/sub  # Seeding layers (users → teams → submissions)
│   └── seed_runner.py      # Entry point
├── llm/
│   ├── src/
│   │   ├── rag/            # Hybrid retriever pipeline
│   │   │   ├── faiss_retriever.py
│   │   │   ├── bm25_retriever.py
│   │   │   ├── hybrid_retriever.py
│   │   │   ├── cross_encoder_scorer.py
│   │   │   └── ddl_enricher.py
│   │   ├── graph.py        # FK knowledge graph + Dijkstra expansion
│   │   ├── llm.py          # LLM client (OpenRouter)
│   │   ├── llm_service.py  # FastAPI service
│   │   ├── benchmark.py    # Automated evaluation runner
│   │   └── judje.py        # LLM-based SQL judge
│   ├── docs/
│   │   ├── rag.yaml        # Table descriptions + retrieval examples
│   │   └── graph.yaml      # FK graph definition + algorithm config
│   └── benchmark_cases.json
├── cli/                    # Textual TUI (seeder + LLM query)
└── main.py                 # TUI entry point

Setup

1. Environment

Copy and fill .env:

POSTGRES_USER=competition_user
POSTGRES_PASSWORD=competition_pass
POSTGRES_DB=competition_db
DB_HOST=127.0.0.1
DB_PORT=5436

OPENROUTER_API_KEY=sk-or-...
OPENROUTER_MODEL=qwen/qwen-2.5-coder-32b-instruct

2. Database

make up      # start PostgreSQL in Docker
make seed    # populate with synthetic data (~3 000 rows across all tables)

To reset:

make reset

3. LLM service

Requires ollama running locally or a valid OPENROUTER_API_KEY.

cd llm
make serve-api   # uvicorn on :8000

With a local Ollama model:

make serve-llama   # start ollama daemon
make serve-api

4. TUI

make cli           # or: python3 main.py

The TUI provides:

layered data seeding with configurable counts
table row inspection
LLM query interface (sends to http://localhost:8000/generate)

5. CLI client (lightweight alternative to TUI)

cd llm
python3 -m src.cli

Retrieval pipeline

The RAG pipeline operates over two slot types — tables and examples — loaded from llm/docs/rag.yaml.

Each query goes through:

BM25 — keyword overlap over tokenized docs (Unicode-aware, custom IDF)
FAISS — cosine similarity over E5 embeddings
Cross-encoder reranking — ms-marco-MiniLM-L-6-v2 scores all candidates jointly
Minimum enforcement — guarantees at least 4 table docs and 2 example docs in the final context
Knowledge Graph expansion — adds FK-adjacent tables and injects Dijkstra-computed JOIN path as a hint

The final context passed to the LLM contains:

DDL with inline column annotations
FK-path hint (Tables: A → B → C + JOIN clauses)
Up to 3 similar example queries

Benchmark

Evaluation uses an LLM-based judge that scores generated SQL against the user's intent (not exact string match). Scoring rubric: 0.95–1.0 = semantically equivalent, 0.70–0.84 = partial, <0.40 = wrong.

Model: qwen2.5-coder:14b (local, Ollama)
Cases: 12
──────────────────────────────
Passed:    8 / 12  (66.7 %)
Avg score: 0.918
Avg time:  80 s / query

To run:

cd llm
python3 -m src.benchmark --input benchmark_cases.json --output results.json

Reports

reports/report.pdf — academic paper: problem formulation, RAG/CoT/LLM-as-Judge methodology, implementation, benchmark analysis
reports/summary.pdf — technical overview with architecture diagram, component rationale, and development directions

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
docs		docs
logs		logs
reports		reports
src		src
.DS_Store		.DS_Store
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
benchmark_cases.json		benchmark_cases.json
results.json		results.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

text2sql

Architecture

Stack

Project structure

Setup

1. Environment

2. Database

3. LLM service

4. TUI

5. CLI client (lightweight alternative to TUI)

Retrieval pipeline

Benchmark

Reports

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

text2sql

Architecture

Stack

Project structure

Setup

1. Environment

2. Database

3. LLM service

4. TUI

5. CLI client (lightweight alternative to TUI)

Retrieval pipeline

Benchmark

Reports

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages