Popular repositories Loading
-
vllm-pro6000-nvfp4-hybrid
vllm-pro6000-nvfp4-hybrid PublicDrop-in hybrid patch for vLLM on RTX Pro 6000 Blackwell NVFP4. Marlin for decode + CUTLASS prefill shadow. Fixes W4A16 mislabel, 1.73× prefill gain
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.