Edge AI & Computer Vision engineer. Math-stat background, self-taught into ML systems.
Currently building SMBoost — a reliability harness for small LLMs at the edge. Came out of a failure mode I observed running a 3B VLM on Jetson Orin during the Machina AI pilot.
Previously founded Machina AI — zero-cloud visual inspection for high-mix manufacturing. Built EdgeRunner, signed a pilot LOI with a Shenzhen powder metallurgy manufacturer, learned that ToB sales on a single-founder timeline is a different problem than the engineering.
→ edge-runner — Real-time vision system on NVIDIA Jetson. YOLO + VLM dual-thread architecture. <12ms P99 latency.
→ smboost — Reliability harness for 2B–7B local LLMs. State machine + GBNF constraints + adaptive scoring.
Python · NVIDIA Jetson · TensorRT · YOLO · VLMs · LangGraph · llama.cpp · Computer Vision · ML Systems · Edge Deployment
[email protected] · haiyang5535.github.io · linkedin.com/in/h-wu



