Skip to content
View eust-w's full-sized avatar
:shipit:
:shipit:

Organizations

@doocs @OSTGO

Block or report eust-w

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
eust-w/README.md

VLA VLM Robotics banner

Longtao Wu

Building robot intelligence across VLA, VLM, simulation data, and embodied AI systems.

Website · Hugging Face · X

Repositories Research and engineering Simulation data


Focus

I work on the stack that makes robots understand, plan, and act in the real world: multimodal perception, embodied policy, robot brain architecture, simulation data engines, and runtime infrastructure.

Embodied AI stack summary

Perception          VLMs, multimodal grounding, scene understanding
Policy             VLA models, robot action heads, task-level planning
Embodiment         robot "big brain / small brain" architecture
Simulation Data    synthetic data, real2sim, sim2real validation loops
Infrastructure     data engines, evaluation, deployment, robot runtime tooling

What I Am Building Toward

flowchart LR
  A[World Data] --> B[Simulation Engine]
  B --> C[Embodied Dataset]
  C --> D[VLM World Understanding]
  D --> E[VLA Policy]
  E --> F[Robot Brain]
  F --> G[Motion / Control]
  G --> H[Real Robot Feedback]
  H --> A

  style A fill:#101828,stroke:#00e5ff,color:#ffffff
  style B fill:#101828,stroke:#7c3aed,color:#ffffff
  style C fill:#101828,stroke:#22c55e,color:#ffffff
  style D fill:#101828,stroke:#f97316,color:#ffffff
  style E fill:#101828,stroke:#f43f5e,color:#ffffff
  style F fill:#101828,stroke:#38bdf8,color:#ffffff
  style G fill:#101828,stroke:#a3e635,color:#ffffff
  style H fill:#101828,stroke:#facc15,color:#ffffff
Loading

Engineering Interests

  • Vision-language-action models for robot skills and generalist policies
  • VLM-based scene parsing, affordance reasoning, and instruction grounding
  • Robot brain architecture: high-level cognition plus low-level control/runtime
  • Large-scale simulation data generation, curation, replay, and evaluation
  • Real2sim asset pipelines, digital twins, domain randomization, and sim2real tests
  • Tooling that turns research prototypes into inspectable, repeatable systems

Selected Work Surface

My public repositories also include AI agent infrastructure, developer tools, model routing, code review automation, and knowledge workflows. Those systems are useful building blocks for robotics work because embodied AI needs the same discipline: traceable data, reproducible evaluation, observable runtime behavior, and reliable deployment.

Layer Direction
Robot big brain multimodal planning, world understanding, instruction grounding
Robot small brain motion/runtime orchestration, control adapters, execution feedback
VLA / VLM policy learning, scene semantics, action grounding, evaluation
Simulation data synthetic scenes, real2sim assets, domain randomization, dataset QA
AI infrastructure agents, code automation, model routing, workflow verification

Longtao Wu personal robotics style card

Current Direction

I am especially interested in systems where:

  • simulation produces data that is good enough to train or evaluate robot policies
  • VLMs provide semantic world models instead of just image captions
  • VLA policies connect language goals to executable robot actions
  • runtime infrastructure makes robot behavior observable, debuggable, and reproducible

Tech Surface

Python PyTorch ROS NVIDIA Docker Kubernetes TypeScript Three.js

GitHub Signal

GitHub stats Top languages

Contribution streak


VLA / VLM / Robotics / Simulation Data / Embodied AI

Pinned Loading

  1. weeklyCodingTime weeklyCodingTime
    1
    Python     12 hrs 23 mins ██████▋░░░░░░░░░░░░░░  31.7%
    2
    Markdown   10 hrs 36 mins █████▋░░░░░░░░░░░░░░░  27.1%
    3
    Other      6 hrs 20 mins  ███▍░░░░░░░░░░░░░░░░░  16.2%
    4
    Go         3 hrs 40 mins  █▉░░░░░░░░░░░░░░░░░░░   9.4%
    5
    JavaScript 3 hrs 2 mins   █▋░░░░░░░░░░░░░░░░░░░   7.8%