Skip to content

Pull requests: AI-Hypercomputer/maxtext

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Feature: add standalone DPO training and data hooks
#3883 opened May 12, 2026 by igorts-git Collaborator Loading…
4 tasks done
Plumbing and core MoE logic for router replay
#3881 opened May 12, 2026 by xuefgu Collaborator Draft
4 tasks done
Use env credentials instead of passing them through
#3880 opened May 12, 2026 by bvandermoon Collaborator Loading…
4 tasks done
Improve error message when tokenize_data config doesn't match dataset
#3879 opened May 12, 2026 by aireenmei Collaborator Loading…
4 tasks done
Secure deserialization and model checkpoint loading pipelines
#3877 opened May 11, 2026 by bvandermoon Collaborator Loading…
4 tasks done
enable aot identification test
#3876 opened May 11, 2026 by NuojCheng Collaborator Draft
4 tasks
Improve performance of gemma4 MoE inference.
#3875 opened May 11, 2026 by NicoGrande Collaborator Loading…
4 tasks done
WIP DO NOT REVIEW Enable sft_trainer_correctness_test
#3874 opened May 11, 2026 by igorts-git Collaborator Draft
4 tasks
write dequantization scripts for DeepSeek V4 FP4/FP8 weights
#3873 opened May 11, 2026 by snehalv2002 Collaborator Loading…
4 tasks done
Add zero1 aot support in train compile
#3872 opened May 11, 2026 by NuojCheng Collaborator Draft
4 tasks done
Implement custom MoE HashRouter, TopKRouter, and sqrtsoftplus
#3871 opened May 11, 2026 by parambole Collaborator Draft
4 tasks
Conditionally branch tokamax.ragged_dot calls based on use_manual_quantization
#3869 opened May 11, 2026 by zxhe-sean Collaborator Loading…
4 tasks done
DeepSeek V4 Integration
#3867 opened May 11, 2026 by parambole Collaborator Draft
4 tasks
Implement DeepSeek-V4 Compressed Attention Layers
#3866 opened May 11, 2026 by parambole Collaborator Draft
4 tasks
DeepSeek-V4 Core Primitives
#3865 opened May 11, 2026 by parambole Collaborator Draft
4 tasks
[DeepSeek v3] Add grad mask and update MLA init gemini-review
#3864 opened May 10, 2026 by gagika Collaborator Loading…
4 tasks done
Enable Qwen3-Omni SFT on ChartQA
#3863 opened May 10, 2026 by hengtaoguo Collaborator Draft
4 tasks
Optimize MaxText unit and integration test suite runtime
#3860 opened May 9, 2026 by shralex Collaborator Loading…
4 tasks done
Update optimization docs and add TPU v7x guide
#3857 opened May 8, 2026 by jacoguzo Collaborator Loading…
4 tasks done
Update docker image guide
#3855 opened May 8, 2026 by melissawm Collaborator Loading…
1 task done
Update JAX to 0.10.0 for pre-training
#3854 opened May 8, 2026 by SurbhiJainUSC Collaborator Draft
4 tasks done
Update First Run tutorial
#3853 opened May 8, 2026 by melissawm Collaborator Loading…
1 task done
Trigger tests using PR comments
#3850 opened May 8, 2026 by shralex Collaborator Loading…
4 tasks done
ProTip! Type g i on any issue or pull request to go back to the issue listing page.