tongruiliu

🎯 Focusing

Intern in Beijing [email protected]

16 followers · 14 following

Peking University
Beijing
https://tongruiliu.github.io/

Pinned

Guided-GRPO Public

A Guided Reinforcement Learning framework enhancing MLLM reasoning via process-level verification and collaborative rollout strategies.

Python · 48 stars

GMT Public

GMT: Graph-as-Memory Tuning for deep KG–LLM fusion via cross-attention.

Python · 11 stars · 1 fork

OpenDCAI/DataFlow-MM Public

DataFlow-MM: multimedia operators for DataFlow, aimed at preparing data for multimodal large language models.

Python · 44 stars · 17 forks

tongruiliu.github.io Public

my page

HTML · 1 star

OpenDCAI/DataFlex Public

Data-centric LLM training with dynamic sample selection, domain mixture optimization, and example reweighting inside the LLaMA-Factory training loop.

Python · 1.3k stars · 162 forks

canvas-rl Public

Python · 3 stars