longtao eust-w

Longtao Wu

Building robot intelligence across VLA, VLM, simulation data, and embodied AI systems.

Focus

I work on the stack that makes robots understand, plan, and act in the real world: multimodal perception, embodied policy, robot brain architecture, simulation data engines, and runtime infrastructure.

Perception          VLMs, multimodal grounding, scene understanding
Policy             VLA models, robot action heads, task-level planning
Embodiment         robot "big brain / small brain" architecture
Simulation Data    synthetic data, real2sim, sim2real validation loops
Infrastructure     data engines, evaluation, deployment, robot runtime tooling

What I Am Building Toward

flowchart LR
  A[World Data] --> B[Simulation Engine]
  B --> C[Embodied Dataset]
  C --> D[VLM World Understanding]
  D --> E[VLA Policy]
  E --> F[Robot Brain]
  F --> G[Motion / Control]
  G --> H[Real Robot Feedback]
  H --> A

  style A fill:#101828,stroke:#00e5ff,color:#ffffff
  style B fill:#101828,stroke:#7c3aed,color:#ffffff
  style C fill:#101828,stroke:#22c55e,color:#ffffff
  style D fill:#101828,stroke:#f97316,color:#ffffff
  style E fill:#101828,stroke:#f43f5e,color:#ffffff
  style F fill:#101828,stroke:#38bdf8,color:#ffffff
  style G fill:#101828,stroke:#a3e635,color:#ffffff
  style H fill:#101828,stroke:#facc15,color:#ffffff

Engineering Interests

Vision-language-action models for robot skills and generalist policies
VLM-based scene parsing, affordance reasoning, and instruction grounding
Robot brain architecture: high-level cognition plus low-level control/runtime
Large-scale simulation data generation, curation, replay, and evaluation
Real2sim asset pipelines, digital twins, domain randomization, and sim2real tests
Tooling that turns research prototypes into inspectable, repeatable systems

Selected Work Surface

My public repositories also include AI agent infrastructure, developer tools, model routing, code review automation, and knowledge workflows. Those systems are useful building blocks for robotics work because embodied AI needs the same discipline: traceable data, reproducible evaluation, observable runtime behavior, and reliable deployment.

Layer	Direction
Robot big brain	multimodal planning, world understanding, instruction grounding
Robot small brain	motion/runtime orchestration, control adapters, execution feedback
VLA / VLM	policy learning, scene semantics, action grounding, evaluation
Simulation data	synthetic scenes, real2sim assets, domain randomization, dataset QA
AI infrastructure	agents, code automation, model routing, workflow verification

Current Direction

I am especially interested in systems where:

simulation produces data that is good enough to train or evaluate robot policies
VLMs provide semantic world models instead of just image captions
VLA policies connect language goals to executable robot actions
runtime infrastructure makes robot behavior observable, debuggable, and reproducible

Tech Surface

GitHub Signal

VLA / VLM / Robotics / Simulation Data / Embodied AI

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

longtao eust-w

Achievements

Achievements

Highlights

Organizations

Block or report eust-w