Skip to content
View sanjay-dilip's full-sized avatar

Highlights

  • Pro

Block or report sanjay-dilip

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
sanjay-dilip/README.md

Hi, I'm Sanjay.

I build end-to-end analytics systems - clean data in, dashboards and decisions out. Most of my work sits at the intersection of BI (Power BI), analytics engineering (SQL + Python), and applied ML when it adds real value.

If you only click two things: start with Supply Chain Analytics (Azure + Power BI) and MovieLens (deployed Streamlit).

  • Supply Chain Disruption Analytics - Azure to Power BI pipeline, KPI monitoring, stats + delay risk modeling (Repo)
  • MovieLens Recommender - ALS + hybrid ranking, deployed Streamlit app (Repo)
  • Letterboxd Sentiment API - weak labels + NLP model, FastAPI service (Repo)
  • Sim2Real Engagement - sim vs real churn signals, Streamlit comparison dashboard (Repo)

What I work on

  • DA/BI: Power BI (DAX), Tableau, KPI reporting, dashboard storytelling
  • Analytics: SQL, Python (pandas), experimentation, forecasting
  • Data Engineering: ETL/ELT, curated layers (Parquet), data modeling, quality checks
  • ML (supporting): NLP, recommenders/ranking, model evaluation, deployment (FastAPI)

Highlights

  • I like projects where metrics tie to real decisions (late delivery risk, churn risk, ranking quality).
  • I care about reproducibility: clear READMEs, runnable steps, and basic checks.

Current Focus

  • Shipping more BI artifacts: dashboard screenshots, KPI definitions, and DAX notes
  • Strengthening data engineering habits: curated layers, data checks, and clean project structure
  • Improving project READMEs: clearer results, visuals, and 3-step run instructions
  • Adding lightweight automation: GitHub Actions for linting/tests and sanity checks

Tools I use

SQL, Power BI, Excel, Python (pandas, scikit-learn), Streamlit, FastAPI, Snowflake, Azure (ADF/Blob), GitHub Actions

Contact

Outside of work, I like exploring media datasets and building analytics around user behavior.

Pinned Loading

  1. letterboxd-sentiment-api letterboxd-sentiment-api Public

    Sentiment analysis system for Letterboxd reviews with TF-IDF baseline, DistilBERT, weak labels, manual evaluation, and FastAPI backend.

    Jupyter Notebook

  2. sim2real-engagement sim2real-engagement Public

    Sim2real comparison of user engagement and churn modeling using simulated anime viewing data and real Steam gameplay data, highlighting how data constraints shape modeling decisions.

    Jupyter Notebook

  3. Data-Science-Projects Data-Science-Projects Public

    A collection of Python data science projects with EDA, modeling workflows, and full end-to-end systems.

    Jupyter Notebook

  4. Deep-Learning-Projects Deep-Learning-Projects Public

    Jupyter Notebook