Pull requests: NousResearch/atropos
Pull requests list
fix: correct over-escaped regexes in instruction-following reward validators
#489
opened Jul 1, 2026 by
golldyck
9 of 10 tasks
fix: count other envs' queued data in /status-env when group size is 1
#488
opened Jul 1, 2026 by
golldyck
10 tasks done
fix: remove double retry on chat_completion (9 attempts instead of 3)
#487
opened Jul 1, 2026 by
golldyck
10 tasks done
fix: avoid duplicate request for n=1 when the API ignores n
#486
opened Jul 1, 2026 by
golldyck
10 tasks done
fix: apply sub-reward weights in CombinedReward
#485
opened Jul 1, 2026 by
golldyck
10 tasks done
fix: don't let response-body read mask the original HTTP error
#484
opened Jul 1, 2026 by
golldyck
10 tasks done
fix: don't truncate trajectory alternatives already at the preserve-minimum
#483
opened Jul 1, 2026 by
golldyck
10 tasks done
fix: resolve correct reward class for modules defining several rewards
#482
opened Jul 1, 2026 by
golldyck
10 tasks done
fix: clamp variance to keep GRPO std real for constant float rewards
#481
opened Jul 1, 2026 by
golldyck
10 tasks done
Stop parse_tool_call from corrupting JSON tool calls that contain apostrophes
#479
opened Jun 21, 2026 by
MaxFreedomPollard
fix(docs): correct broken GitHub Flow link in CONTRIBUTING.md
#477
opened May 28, 2026 by
abhicris
fix(submodule): mark bleuberi as update=none so recursive init doesn't break installs
#476
opened May 17, 2026 by
abhicris
Add shared structured-output parsing helpers and migrate multimodal answer extraction
#475
opened May 6, 2026 by
dlkakbs
7 of 17 tasks
feat(infra): Unified SSoT Reasoning Wrapper and Server Stability Hardening
#474
opened Apr 28, 2026 by
RUFFY-369
3 tasks done
docs: add WSL troubleshooting for venv setup and pip install memory e…
#470
opened Apr 25, 2026 by
PhantomTee
8 of 12 tasks
fix: Make student/teacher model and tokenizer configurable
#469
opened Apr 24, 2026 by
RUFFY-369
9 tasks done
fix: Harden process termination with PID verification
#468
opened Apr 24, 2026 by
RUFFY-369
8 tasks done
fix: Add rollout queue limit and backpressure to prevent OOM
#467
opened Apr 24, 2026 by
RUFFY-369
8 tasks done
fix: Add bridge config cleanup to prevent stale CUDA IPC leaks
#466
opened Apr 24, 2026 by
RUFFY-369
9 tasks done
fix: Implement robust advantage normalization for low-variance groups
#465
opened Apr 24, 2026 by
RUFFY-369
8 tasks done
fix: Implement on-policy teacher distillation with KL loss
#464
opened Apr 24, 2026 by
RUFFY-369
9 tasks done
fix: Enhance RoPE theta detection and fix meta tensor traversal
#463
opened Apr 24, 2026 by
RUFFY-369
9 tasks done
fix: Implement strict dtype validation for shared vLLM tensors
#462
opened Apr 24, 2026 by
RUFFY-369
9 tasks done
fix(api): use query param for GET /status-env instead of JSON body
#452
opened Apr 23, 2026 by
skyc1e