Skip to content
View sarsanaee's full-sized avatar
🏁
Some don't even get to see the checkered flag!
🏁
Some don't even get to see the checkered flag!

Highlights

  • Pro

Block or report sarsanaee

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
sarsanaee/README.md

Hi, I'm Ryan 👋

I build high-performance systems for modern datacenters, focused on making AI/ML workloads efficient at scale — across networking, memory, and runtime layers.

  • 🔬 Visiting researcher at the Cambridge Computer Laboratory (with Andrew Moore)
  • 🎓 PhD in Computer Systems, Queen Mary University of London (adv. Gianni Antichi & Brent Stephens)
  • ⚡ Deep experience with Compute Express Link (CXL) across QEMU and the Linux kernel
  • 🧩 Comfortable across the stack — kernel, networking datapaths, emulation, and runtimes

🔭 What I'm building

  • qemu-cxl — CXL device & memory experiments in QEMU (correct + performant non-interleaved path, MHSLD / dynamic capacity)
  • qemu-lab — QEMU sandbox tracking upstream, for fast systems experiments
  • httpd-ab — ApacheBench extended with nanosecond timing + per-request tracing
  • Cook-RDMA — curated, tested RDMA examples

🏆 Selected work

BackdraftLossless virtual switch · USENIX NSDI 2022 First author. A software virtual switch that eliminates the slow-receiver problem in data planes via per-flow queues, dynamic buffering, and backpressure-aware scheduling. I led the design, the DPDK implementation, and the evaluation.

MorpheusRun-time data-plane optimization · ASPLOS 2022 Co-author. A framework that specializes software data planes to their actual workload, applying domain-specific optimizations at run time for large throughput gains. I contributed to the system design and experimental evaluation.

machnetLow-latency cloud messaging (Microsoft) Contributor. DPDK-based messaging for public-cloud VMs (~750K RPS, 61µs P99.9 on Azure). I worked on the SACK-based reliable-transport path and hardened the packet-processing fast path for correctness and maintainability.

qemu-cxlCXL emulation for systems research Added a correct and performant emulation path for non-interleaved Compute Express Link (CXL) memory configurations in QEMU — used to study how datacenter memory should evolve.

🧠 Interests

CXL & memory systems · RDMA / high-performance networking · systems for AI/ML · datacenter energy efficiency

📫 Find me

🌐 sarsanaee.github.io · 📄 PhD dissertation

Pinned Loading

  1. machnett/Backdraft machnett/Backdraft Public

    Backdraft: a Lossless Virtual Switch that Prevents the Slow Receiver Problem. USENIX NSDI 2022

    C 15 5

  2. Morpheus-compiler/Morpheus Morpheus-compiler/Morpheus Public

    Morpheus: Domain Specific Run Time Optimization for Software Data Planes -- Presented at ASPLOS22

    Shell 36 1

  3. microsoft/machnet microsoft/machnet Public

    Machnet provides applications like databases and finance an easy way to access low-latency DPDK-based messaging on public cloud VMs. 750K RPS on Azure at 61 us P99.9.

    C++ 156 28

  4. qemu-lab qemu-lab Public

    QEMU sandbox tracking upstream for fast systems experiments (CXL, memory, emulation)

    C