Pinned

  1. agentic-med-diag Public

    Agentic Graph RAG for medical diagnosis and clinical QA. Combines LangGraph-orchestrated multi-hop reasoning with parallel semantic/relational retrieval, Neo4j knowledge graphs, and Milvus vector s…

    3

  2. biothink Public

    Self-Reflective Question Answering for Biomedical Reasoning. GRPO fine-tuning via QLoRA & Unsloth with rewards for correctness, relevance, groundness, utility & XML structure. Structured think → an…

    Python 5 1

  3. rag-pipelines Public

    Advanced RAG pipelines for medical (HealthBench, MedCaseReasoning, MetaMedQA, PubMedQA) and financial (FinanceBench, Earnings Calls) QA. LangGraph orchestration + BAML structured generation, Milv…

    Python 13 1

  4. dspy-opt Public

    Advanced RAG pipeline optimization framework using DSPy. Implements modular RAG pipelines with Query-Rewriting, Sub-Query Decomposition, and Hybrid Search via Weaviate. Automates prompt tuning and …

    Python 10 1

  5. llm-finetuning Public

    Advanced LLM fine-tuning techniques: SFT (LoRA, QLoRA, DoRA, P-/Prefix-Tuning), GRPO, DPO, ORPO, KTO & PPO; composable correctness/format rewards + LLM-as-a-Judge evals (DeepEval, Evidently AI) acr…

    Python 7 3

  6. rag-model-training Public

    Training code for advanced RAG techniques - Adaptive-RAG, Corrective RAG, RQ-RAG, Self-RAG, Agentic RAG, and ReZero. Reproduces paper methodologies to fine-tune LLMs via SFT and GRPO for adaptive r…

    Python 8 2

Repositories

Showing 10 of 25 repositories
  • grpo Public

    Group Relative Policy Optimization (GRPO) implementations - NanoAhaMoment, GRPO:Zero, Simple GRPO, and GRPO from Scratch - spanning vLLM + DeepSpeed, custom Transformer stack, Bottle HTTP reference server, and pure PyTorch. Compares generation backends, reference policy strategies, reward designs, and loss functions on GSM8K and Countdown tasks.

    Python 4 MIT 0 0 2 Updated Jun 1, 2026
  • vectordb Public

    Production-ready Haystack/LangChain pipelines for Hybrid & Parent-Child Retrieval, Diversity Filtering, MMR, Metadata Filtering, Reranking, Query Enhancement, Multi-Tenancy, Agentic RAG across Pinecone, Weaviate, Milvus, Qdrant, Chroma with dataset loaders, evaluation tooling, and benchmarks for TriviaQA, ARC, PopQA, FactScore and Earnings Calls.

    Python 4 MIT 1 0 2 Updated Jun 1, 2026
  • rankers Public

    Modular LLM ranking library for Information Retrieval and RAG. Implements state-of-the-art Pairwise, Setwise, and Listwise ranking with structured generation and specialized models (RankZephyr, RankLlama). Features efficient sorting algorithms, sliding windows, and zero-shot capabilities.

    Python 5 MIT 1 0 1 Updated Jun 1, 2026
  • llm-finetuning Public

    Advanced LLM fine-tuning techniques: SFT (LoRA, QLoRA, DoRA, P-/Prefix-Tuning), GRPO, DPO, ORPO, KTO & PPO; composable correctness/format rewards + LLM-as-a-Judge evals (DeepEval, Evidently AI) across math, multi-hop, medical & general QA on Llama 3, Mistral, Phi-4, Gemma & Qwen3. Built on TRL, PEFT & Unsloth.

    Python 7 MIT 3 0 1 Updated Jun 1, 2026
  • rag-model-training Public

    Training code for advanced RAG techniques - Adaptive-RAG, Corrective RAG, RQ-RAG, Self-RAG, Agentic RAG, and ReZero. Reproduces paper methodologies to fine-tune LLMs via SFT and GRPO for adaptive retrieval, corrective evaluation, query refinement, self-reflection, and agentic search behaviors.

    Python 8 MIT 2 0 1 Updated Jun 1, 2026
  • rag-pipelines Public

    Advanced RAG pipelines for medical (HealthBench, MedCaseReasoning, MetaMedQA, PubMedQA) and financial (FinanceBench, Earnings Calls) QA. LangGraph orchestration + BAML structured generation, Milvus Hybrid search (Dense + BM25 + RRF), three-layer Metadata Enrichment, Contextual AI instruction-following reranker, and DeepEval evaluation.

    Python 13 MIT 1 0 1 Updated Jun 1, 2026
  • prp Public

    Pairwise Ranking Prompting (PRP): Zero-shot LLM reranking library implementing efficient pairwise strategies (Heapsort, Sliding Window, All-Pairs). Mitigates position bias via bidirectional comparison and ensures reliability with structured Pydantic validation. Built for Haystack pipelines.

    Python 4 MIT 0 0 1 Updated Jun 1, 2026
  • med-reason-evals Public

    Medical Reasoning Evaluation

    Python 2 MIT 0 0 1 Updated Jun 1, 2026
  • biothink Public

    Self-Reflective Question Answering for Biomedical Reasoning. GRPO fine-tuning via QLoRA & Unsloth with rewards for correctness, relevance, groundness, utility & XML structure. Structured think → answer → self-reflection with context grading, relevance assessment & groundness evaluation. DeepEval LLM-as-a-Judge (GEval, Faithfulness, Relevancy).

    Python 5 MIT 1 0 1 Updated Jun 1, 2026
  • dspy-opt Public

    Advanced RAG pipeline optimization framework using DSPy. Implements modular RAG pipelines with Query-Rewriting, Sub-Query Decomposition, and Hybrid Search via Weaviate. Automates prompt tuning and few-shot selection using GEPA, SIMBA, MIPRO, COPRO, and BootstrapFewShot optimizers on datasets like FreshQA, HotpotQA, TriviaQA, Wikipedia and PubMedQA.

    Python 10 MIT 1 0 1 Updated Jun 1, 2026
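
The rag-pipelines entry above mentions hybrid search that combines Dense and BM25 results with RRF (Reciprocal Rank Fusion). As a rough illustration of that fusion step only, here is a minimal pure-Python sketch. It is not taken from the repository and does not use the Milvus API; the function name `reciprocal_rank_fusion`, the sample document ids, and the constant `k=60` (a commonly used default) are assumptions made for this example.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document ids into a single ranking.

    Each document receives sum(1 / (k + rank)) over the lists it appears in,
    where rank starts at 1. The value of k is an assumed default.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by document id.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical result lists from a dense retriever and a BM25 retriever.
dense_hits = ["d3", "d1", "d2"]
bm25_hits = ["d1", "d4", "d3"]

fused = reciprocal_rank_fusion([dense_hits, bm25_hits])
print(fused)
```

Documents that rank well in both lists (here `d1` and `d3`) rise to the top, while documents found by only one retriever still appear further down. Because only ranks are used, the dense and BM25 scores do not need to be on comparable scales.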
