Building robot intelligence across VLA, VLM, simulation data, and embodied AI systems.
Website · Hugging Face · X
I work on the stack that makes robots understand, plan, and act in the real world: multimodal perception, embodied policy, robot brain architecture, simulation data engines, and runtime infrastructure.
Perception VLMs, multimodal grounding, scene understanding
Policy VLA models, robot action heads, task-level planning
Embodiment robot "big brain / small brain" architecture
Simulation Data synthetic data, real2sim, sim2real validation loops
Infrastructure data engines, evaluation, deployment, robot runtime toolingflowchart LR
A[World Data] --> B[Simulation Engine]
B --> C[Embodied Dataset]
C --> D[VLM World Understanding]
D --> E[VLA Policy]
E --> F[Robot Brain]
F --> G[Motion / Control]
G --> H[Real Robot Feedback]
H --> A
style A fill:#101828,stroke:#00e5ff,color:#ffffff
style B fill:#101828,stroke:#7c3aed,color:#ffffff
style C fill:#101828,stroke:#22c55e,color:#ffffff
style D fill:#101828,stroke:#f97316,color:#ffffff
style E fill:#101828,stroke:#f43f5e,color:#ffffff
style F fill:#101828,stroke:#38bdf8,color:#ffffff
style G fill:#101828,stroke:#a3e635,color:#ffffff
style H fill:#101828,stroke:#facc15,color:#ffffff
- Vision-language-action models for robot skills and generalist policies
- VLM-based scene parsing, affordance reasoning, and instruction grounding
- Robot brain architecture: high-level cognition plus low-level control/runtime
- Large-scale simulation data generation, curation, replay, and evaluation
- Real2sim asset pipelines, digital twins, domain randomization, and sim2real tests
- Tooling that turns research prototypes into inspectable, repeatable systems
My public repositories also include AI agent infrastructure, developer tools, model routing, code review automation, and knowledge workflows. Those systems are useful building blocks for robotics work because embodied AI needs the same discipline: traceable data, reproducible evaluation, observable runtime behavior, and reliable deployment.
| Layer | Direction |
|---|---|
| Robot big brain | multimodal planning, world understanding, instruction grounding |
| Robot small brain | motion/runtime orchestration, control adapters, execution feedback |
| VLA / VLM | policy learning, scene semantics, action grounding, evaluation |
| Simulation data | synthetic scenes, real2sim assets, domain randomization, dataset QA |
| AI infrastructure | agents, code automation, model routing, workflow verification |
I am especially interested in systems where:
- simulation produces data that is good enough to train or evaluate robot policies
- VLMs provide semantic world models instead of just image captions
- VLA policies connect language goals to executable robot actions
- runtime infrastructure makes robot behavior observable, debuggable, and reproducible
VLA / VLM / Robotics / Simulation Data / Embodied AI




