-
๐ I like training interesting transformers.
-
๐ The maximum model size I have trained from scratch is 3B, using 128 A100 GPUs. Looking forward to the opportunity to use more GPUs to train larger models in the future!
-
๐ซ How to reach me: [email protected]
| Tool | All-time | 30d | 7d | Top models | Last seen |
|---|---|---|---|---|---|
| Codex | 11.88B |
5.65B |
1.50B |
gpt-5.5 gpt-5.4 gpt-5.3-codex |
2026-05-29 |
| Claude Code | 404.1M |
0 |
0 |
claude-opus-4-5 claude-opus-4-6 gemini-3-pro-high |
2026-04-23 |




