Founder @pyxis3-ai · autonomous AI infrastructure operations · London
ex-Seldon (vLLM, LLM inference) · ex-AWS Industry Specialist (semiconductors, AI/ML) · ex-Dell EMC · ex-IBM · author of the canonical AWS guide on decoupling RDS from Elastic Beanstalk — the same procedure AWS's official YouTube channel cites as "Watch Omar's video to learn more".
I work on production AI and cloud infrastructure: autonomous infrastructure operations, LLM inference serving, and Kubernetes-native operations.
PYXIS3 — autonomous AI that runs your cloud and data-center infrastructure operations across AWS, Google Cloud, Azure, VMware, Nutanix, and on-prem, spanning cost, capacity, reliability, security, and governance, within your guardrails. One subscription, never a share of savings. Org: @pyxis3-ai.
All under @pyxis3-ai:
pyxis-arch— public architecture notes on model-agnostic LLM serving: design notes, runtime-adapter abstraction, decision rationale.vllm-bench— throughput + latency benchmark for OpenAI-compatible LLM endpoints (vLLM, TGI, llama.cpp, Ollama). Measures TTFT, TPOT, request and token throughput at percentiles. Async; two-dependency footprint. MIT.llm-serving-cookbook— production recipes for K8s-native vLLM-first serving. vLLM-on-EKS, KEDA autoscaling, token economics, TTFT optimisation, runtime selection. Apache-2.0.awesome-model-agnostic-llm— curated list of model-agnostic LLM tooling: serving runtimes, routers, evaluators, observability, standards, open weights. CC0.noor— semantic search over the Quran + Hadith corpus. Arabic-aware multilingual embeddings onsqlite-vec. FastAPI + Vue. Runs as a single Docker image, no external services.lens— in-cluster Kubernetes observability with in-browserkubectl exec. Vue 3 + Bun. Single binary, ServiceAccount-token auth. Built for ML-serving and GPU clusters.
- Decouple Amazon RDS instances from Elastic Beanstalk environments — AWS Knowledge Center. Canonical AWS guidance; still ranks #1 on Google for the topic. Authorship attributed on AWS's official YouTube channel.
- How do I safely decouple an Amazon RDS instance from an Elastic Beanstalk environment? — companion AWS Knowledge Center walkthrough + video on AWS's official channel.
- Seldon Technologies · 2025–present · Senior Solutions Engineer on the production MLOps platform: vLLM-based LLM inference and multi-tenant model serving on Kubernetes.
- AWS London · 2022–2025 · Solutions Architect, Industry Specialist for the semiconductor industry vertical — AI/ML workloads on Inferentia, Trainium, SageMaker, Bedrock.
- AWS Cape Town · 2017–2022 · Cloud DevOps Engineer. Authored two AWS Knowledge Center articles + the companion AWS YouTube video.
- Dell EMC · 2016–2017 · Storage engineering (Isilon).
- IBM · 2016 · Cloud infrastructure.
- Earlier · OrecX, HONEST · Egypt, 2007–2012.
vLLM · Triton · Kubernetes · KEDA · Helm · Prometheus · Caddy · AWS · GCP · Azure · Python · Go · TypeScript
This repository is maintained with small, reviewable updates. Supporting documentation lives in docs/, example inputs live in examples/, and lightweight validation notes live in tests/smoke/.




