Omar Abdrabo oabdrabo

Hi, I'm Omar 👋

Founder @pyxis3-ai · autonomous AI infrastructure operations · London

ex-Seldon (vLLM, LLM inference) · ex-AWS Industry Specialist (semiconductors, AI/ML) · ex-Dell EMC · ex-IBM · author of the canonical AWS guide on decoupling RDS from Elastic Beanstalk, which AWS's official YouTube channel points to with "Watch Omar's video to learn more".

I work on production AI and cloud infrastructure: autonomous infrastructure operations, LLM inference serving, and Kubernetes-native operations.

🚀 What I'm building

PYXIS3 — autonomous AI that runs your cloud and data-center infrastructure operations across AWS, Google Cloud, Azure, VMware, Nutanix, and on-prem, spanning cost, capacity, reliability, security, and governance, within your guardrails. One subscription, never a share of savings. Org: @pyxis3-ai.

🛠️ Public projects

All under @pyxis3-ai:

pyxis-arch — public architecture notes on model-agnostic LLM serving: design notes, runtime-adapter abstraction, decision rationale.
vllm-bench — throughput + latency benchmark for OpenAI-compatible LLM endpoints (vLLM, TGI, llama.cpp, Ollama). Measures TTFT, TPOT, request and token throughput at percentiles. Async; two-dependency footprint. MIT.
llm-serving-cookbook — production recipes for K8s-native vLLM-first serving. vLLM-on-EKS, KEDA autoscaling, token economics, TTFT optimisation, runtime selection. Apache-2.0.
awesome-model-agnostic-llm — curated list of model-agnostic LLM tooling: serving runtimes, routers, evaluators, observability, standards, open weights. CC0.
noor — semantic search over the Quran + Hadith corpus. Arabic-aware multilingual embeddings on sqlite-vec. FastAPI + Vue. Runs as a single Docker image, no external services.
lens — in-cluster Kubernetes observability with in-browser kubectl exec. Vue 3 + Bun. Single binary, ServiceAccount-token auth. Built for ML-serving and GPU clusters.
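For readers unfamiliar with the metrics vllm-bench reports, here is a minimal sketch of how time to first token (TTFT), time per output token (TPOT), and throughput are commonly derived from per-request timestamps. It is not taken from vllm-bench: the `RequestTiming` record, the function names, and the nearest-rank percentile method are all assumptions made for illustration, and the real tool may define or compute these differently.

```python
import math
from dataclasses import dataclass


@dataclass
class RequestTiming:
    """Timestamps in seconds for one streamed completion (hypothetical record)."""
    sent_at: float
    first_token_at: float
    last_token_at: float
    output_tokens: int


def ttft(t: RequestTiming) -> float:
    # Time to first token: delay between sending the request and the first token.
    return t.first_token_at - t.sent_at


def tpot(t: RequestTiming) -> float:
    # Time per output token: decode time averaged over tokens after the first.
    if t.output_tokens < 2:
        return 0.0
    return (t.last_token_at - t.first_token_at) / (t.output_tokens - 1)


def percentile(values: list[float], p: int) -> float:
    # Nearest-rank percentile (an assumption; other interpolation schemes exist).
    if not values:
        raise ValueError("percentile of an empty list is undefined")
    ordered = sorted(values)
    rank = max(1, math.ceil(p * len(ordered) / 100))
    return ordered[rank - 1]


def summarize(timings: list[RequestTiming]) -> dict[str, float]:
    # Wall-clock span of the whole run, used for both throughput figures.
    wall = max(t.last_token_at for t in timings) - min(t.sent_at for t in timings)
    if wall <= 0:
        raise ValueError("run duration must be positive")
    ttfts = [ttft(t) for t in timings]
    tpots = [tpot(t) for t in timings]
    return {
        "ttft_p50": percentile(ttfts, 50),
        "ttft_p99": percentile(ttfts, 99),
        "tpot_p50": percentile(tpots, 50),
        "tpot_p99": percentile(tpots, 99),
        "requests_per_s": len(timings) / wall,
        "tokens_per_s": sum(t.output_tokens for t in timings) / wall,
    }


if __name__ == "__main__":
    sample = [
        RequestTiming(sent_at=0.0, first_token_at=0.5, last_token_at=2.5, output_tokens=5),
        RequestTiming(sent_at=0.0, first_token_at=0.25, last_token_at=4.25, output_tokens=9),
    ]
    for name, value in summarize(sample).items():
        print(f"{name}: {value:.3f}")
```

In this sketch throughput is a single aggregate over the run, while the latency metrics are reported at percentiles, since tail latency usually matters more than the mean for interactive serving.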

📚 Published

Decouple Amazon RDS instances from Elastic Beanstalk environments — AWS Knowledge Center. Canonical AWS guidance; still ranks #1 on Google for the topic. Authorship attributed on AWS's official YouTube channel.
How do I safely decouple an Amazon RDS instance from an Elastic Beanstalk environment? — companion AWS Knowledge Center walkthrough + video on AWS's official channel.

🏛️ Background

Seldon Technologies · 2025–present · Senior Solutions Engineer on the production MLOps platform: vLLM-based LLM inference and multi-tenant model serving on Kubernetes.
AWS London · 2022–2025 · Solutions Architect, Industry Specialist for the semiconductor industry vertical — AI/ML workloads on Inferentia, Trainium, SageMaker, Bedrock.
AWS Cape Town · 2017–2022 · Cloud DevOps Engineer. Authored two AWS Knowledge Center articles + the companion AWS YouTube video.
Dell EMC · 2016–2017 · Storage engineering (Isilon).
IBM · 2016 · Cloud infrastructure.
Earlier · OrecX, HONEST · Egypt, 2007–2012.

🧰 Stack

vLLM · Triton · Kubernetes · KEDA · Helm · Prometheus · Caddy · AWS · GCP · Azure · Python · Go · TypeScript

📫 Reach me

Maintenance

This repository is maintained with small, reviewable updates. Supporting documentation lives in docs/, example inputs live in examples/, and lightweight validation notes live in tests/smoke/.
