AI Engineer building production Computer Vision, Multimodal AI, and Machine Learning systems.
🔭 Currently building computer vision pipelines for automated housing-code-violation detection at Third Estate Analytics. The pipelines process GPS/video telemetry and apply VLM-based classification, reaching 90% accuracy across a 300K-parcel dataset.
🎓 M.S. in Artificial Intelligence — University at Buffalo (2025–2026)
🌱 Currently exploring: Retrieval-Augmented Generation (RAG), LLM applications, and agentic AI systems
📫 Reach me: LinkedIn · Portfolio · [email protected]
Languages: Python · SQL · Java
ML / DL: PyTorch · TensorFlow · Scikit-learn · Transformers (DistilBERT, VLMs)
Computer Vision: OpenCV · ResNet · SMPL-X · Active Learning pipelines
Infra / MLOps: Docker · Kubernetes · FastAPI · Redis · AWS · GCP
Data: PostgreSQL · MongoDB · Spark · Hadoop · ETL pipelines

| Project | Description | Stack |
|---|---|---|
| Multimodal Movie Genre Prediction | Fused text (DistilBERT) + image (ResNet-18) deep learning model for multi-label genre classification. Live demo | PyTorch, Transformers, ResNet |
| Deepfake Detection (β-VAE) | Hybrid generative + discriminative deepfake detection — 98% accuracy, 0.997 ROC-AUC, on FFHQ + Stable Diffusion data | PyTorch, VAEs, Scikit-learn |
| 3D Human Mesh Reconstruction | SMPL-X-based 3D human reconstruction & rendering pipeline from monocular video | PyTorch, Trimesh, Pyrender, Gradio |
| AI Agent Microservice Platform | Distributed orchestration runtime simulating 50+ concurrent AI agents with health checks & retry logic | FastAPI, Docker, Redis |


