Skip to content
View astroanand-6e's full-sized avatar
🎯
Pre-Training
🎯
Pre-Training

Highlights

  • Pro

Block or report astroanand-6e

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
astroanand-6e/README.md

Hi, I'm Anand πŸ‘‹

I'm an MS in Artificial Intelligence student at Northeastern University (Khoury College), working at the intersection of foundation models, multimodal learning, and applied AI systems. Most of my recent work has been on adapting large pretrained models for new modalities and domains β€” vision transformers for medical imaging during my undergrad, and audio/language foundation models during my MS.

What I'm working on

  • CALM (Conformer Audio-Language Model) β€” a cross-modal fusion architecture pairing Gemma 4's audio Conformer with its text encoder via bidirectional attention. Reaches 83.9% on FMA-Medium 16-genre classification with ~4M trainable parameters, +9.1 pp over the best audio-only baseline. Currently extending toward a paper submission.
  • Local LLM serving infrastructure β€” a home GPU server (RTX 5060 Ti, Ubuntu) reachable over Tailscale, running Gemma and other open models via Ollama and llama.cpp for offline development workflows.
  • Coronary artery segmentation with SAM2 β€” fine-tuned SAM2 with two PEFT approaches achieving 92.0% Dice score, deployed as an interactive Hugging Face Space. (Live demo)

What I care about

Foundation models, multimodal learning, and systems that actually run in the real world. I lean toward research that produces a working artifact at the end, not just a paper β€” but I take papers seriously too: my undergrad work on transformer-based atrial fibrillation detection (ResFormer) was presented at ICIOT'25.

Currently

MS at Northeastern (Boston). Open to research assistant, co-op, and collaborator roles in applied ML, multimodal AI, and AI systems integration.

Stack

Python, PyTorch, Hugging Face, gRPC, Docker, Linux, SLURM/HPC (Northeastern Explorer cluster). Comfortable with Swift, TypeScript, and the local LLM stack (Ollama, llama.cpp, Continue). W&B for experiment tracking.

Let's connect

Pinned Loading

  1. ResFormerAF ResFormerAF Public

    Forked from Yashvardhan1103/ResFormer-Integrating-Deep-Learning-Models-for-Atrial-Fibrillation-Detection-Using-ECG

    ResFormerAF: Integrating deep learning models for Atrial fibrillation detection using ECG.

    Jupyter Notebook 1

  2. A_web_of_AI_startups_and_investors A_web_of_AI_startups_and_investors Public

    JavaScript

  3. EInops_and_Einsum_intro EInops_and_Einsum_intro Public

    A collection of few notebooks edited by me to be more understandable.

    Jupyter Notebook

  4. coronary_SAM2 coronary_SAM2 Public

    Jupyter Notebook

  5. Beta_VAE Beta_VAE Public

    Python

  6. Rock_Paper_Scissor Rock_Paper_Scissor Public

    JavaScript