🎯
Focusing
Intern in Beijing
[email protected]
- Peking University
- Beijing
- https://tongruiliu.github.io/
Pinned
- Guided-GRPO (Public): A guided reinforcement learning framework that enhances MLLM reasoning via process-level verification and collaborative rollout strategies.
Python 48
- OpenDCAI/DataFlow-MM (Public): Dataflow-MM, multimedia operators for Dataflow. We aim to prepare data for Multimodal Large Language Models.
- OpenDCAI/DataFlex (Public): Data-centric LLM training with dynamic sample selection, domain mixture optimization, and example reweighting inside the LLaMA-Factory training loop.


