#

gguf-quantization

Here are 16 public repositories matching this topic...

qskousen / ggufy

CLI/GUI tool for efficient and easy safetensors and gguf model conversion

zig diffusion-models safetensors comfyui ggml gguf gguf-quantization

Updated Jun 5, 2026
Zig

magiccodingman / MagicQuant-Wiki

Evolution process to find the best quant tensor weights to build the most optimal GGUF options for an AI model.

ai quantization quantization-algorithms gguf gguf-quantization gguf-hybrid

Updated May 13, 2026

xphot / app

Análise Avançada de Dados com Causalidade e Aprendizado por Reforço

reinforcement-learning bug-tracker data-preprocessing experimental-psychology causal-machine-learning shap-analysis hypergraph-neural-network llms-reasoning llm-fine-tuning explainability-metric unsloth gguf-quantization

Updated Feb 27, 2025
Jupyter Notebook

Mainframework / Quanta

Convert and quantize llm models

Updated Dec 30, 2025
Python

lm-webui

lm-webui / lm-webui

Unified Local AI Interface & LLM Runtime (Support GGUF, Ollama, OpenAI, Gemini, etc.). Insearch of building sovereign AI system ✨

ai webui hardware-acceleration rag ai-assistant llm llm-inference ollama gguf llm-webui gemini-sdk openai-compatible gguf-quantization llm-runtime lm-webui

Updated Feb 26, 2026
Python

arcxteam / gguf-convert-model

Auto GGUF Converter for HuggingFace Hub Models with Multiple Quantizations (GGUF Format)

machine-learning cmake ai tensorflow transformers huggingface ai-models huggingface-models bf16 safetensors llama-cpp gguf gguf-models gguf-quantization gguf-editor convert-gguf

Updated Feb 11, 2026
Python

LukasDrews97 / Gemma-4-IT-SFT-RLVR-Medical

Gemma-4-It fine-tuned on PubMedQA using SFT & RLVR

quantization fine-tuning sft llm low-rank-adaptation qlora parameter-efficient-fine-tuning grpo gguf-quantization rlvr pubmedqa gemma-4

Updated Apr 27, 2026
Python

dAppCore / go-mlx

go-mlx native inference, training and research tooling.

golang ai lora mlx ai-training ai-serving ai-tools ai-runtime apple-mlx ai-research-and-development gguf-quantization ai-lora-trainer apple-ml

Updated Jun 7, 2026
Go

waterblower / purai

AI Toolchain in Pure Zig, No Python, No C++

ai zig inference gguf-quantization

Updated Mar 1, 2026
Zig

merterbak / HFtoGGUF

Convert Hugging Face models to GGUF with xet support.

huggingface huggingface-models gguf gguf-quantization

Updated Apr 26, 2025
Jupyter Notebook

TAO71-AI / AutoQuantizer

Quantize LLMs automatically.

python cli python3 quantization llm llms llamacpp llama-cpp gguf gguf-quantization

Updated Mar 31, 2026
Python

daniau23 / LoRAfrica_CPU

Deploying LoRAfrica on consumer CPU devices

huggingface llm llms llamacpp ollama gguf-models gguf-quantization

Updated Apr 7, 2026
Jupyter Notebook

GrandFuzard / glm4-7flash-opus-colab

Ready-to-run Colab notebook to run GLM-4.7-Flash Finetuned on Claude Opus 4.5 xHigh-Reasoning (GGUF) with llama.cpp, featuring GPU/CPU split loading, streaming chat, multi-chat manager, and a Gradio web UI — optimized for free T4 environments.

glm llm llama-cpp gguf gguf-quantization

Updated Mar 29, 2026
Jupyter Notebook

AdityaKarippadathUdai / Rag-chat-bot

Rag chat bot

python fast nextjs gemini-api chromadb retrieval-augmented-generation ollama rag-chatbot gguf-quantization

Updated May 29, 2026
TypeScript

spicyneuron / gguf-clone

Create optimized GGUF quantizations by cloning from any GGUF of the same architecture.

huggingface llamacpp gguf unsloth gguf-quantization

Updated Feb 28, 2026
Python

asystemoffields / PMRA

Production Mixed-Rate Allocation helps anyone make and run highly performant ggufs!

quantization gguf gguf-models gguf-quantization

Updated Jun 3, 2026
Python

Improve this page

Add a description, image, and links to the gguf-quantization topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the gguf-quantization topic, visit your repo's landing page and select "manage topics."