Skip to content

Anisca-hub/AetherVision

Repository files navigation

AetherVision: Image Based Weather Classification 🌦️

AetherVision is a high-performance computer vision framework designed to classify meteorological conditions with precision. Leveraging Vision Transformers (ViT) and custom Focal Loss optimization, the system achieves state-of-the-art accuracy in handling imbalanced environmental datasets.

🚀 Key Features

State-of-the-Art Architecture: Powered by timm (PyTorch Image Models) using pre-trained Vision Transformers for global contextual awareness.

Imbalance Mitigation: Implements custom Focal Loss to ensure robust performance on minority weather classes.

Interpretability: Built-in Grad-CAM integration to visualize model attention and debug feature identification.

Production-Ready Pipeline: Optimized data loaders with seamless support for large-scale image processing and hardware acceleration (CUDA).

📊 Results & Visualization

The AetherVision system has been optimized to ensure high-fidelity classification and reliable feature identification. Below is the performance trajectory of the model and a sample visualization of its decision-making process.

Performance Metrics: The graph above illustrates the model's convergence, showing minimal divergence between training and validation loss, confirming the system's ability to generalize to unseen data.

Interpretability: Using Grad-CAM, we can visualize the model's attention. The heatmap below highlights how the Vision Transformer (ViT) effectively isolates meteorological features, ignoring extraneous background noise.

alt text

alt text

alt text

📋 Project Structure

image

🏗️ Technical Stack

This stack highlights your proficiency in deep learning, data engineering, and model interpretability. alt text

🚀 Getting Started

This project is optimized for high-performance computing. Because training Vision Transformers (ViT) on local hardware can be time-prohibitive, this workflow is designed to run seamlessly on Google Colab with a T4 GPU.

Option A: Running on Google Colab (Recommended)

This workflow is optimized for the T4 GPU to significantly reduce training times.

Upload to Drive: Upload the entire AetherVision/ folder to your Google Drive.

Open Notebook: Open AetherVision.ipynb in Google Colab.

Configure Runtime: Navigate to Runtime > Change runtime type and select T4 GPU.

Mount & Setup: In the notebook, run the following cells:

from google.colab import drive drive.mount('/content/drive')

%cd /content/drive/MyDrive/AetherVision/

pip install -r requirements.txt -q

Execute Pipeline: python main.py

Option B: Running on Local Machine (More execution time)

If you prefer local environment:

  1. Clone the Repository:

git clone https://github.com/Anisca-hub/AetherVision.git

cd AetherVision

  1. Install Dependencies:

pip install -r requirements.txt

  1. Run the Pipeline:

Execute the main entry point to start training or evaluation:

python main.py

🎓 About the Author Built by [Anisca Jha] | [https://www.linkedin.com/in/anisca-jha-93ba83308]

Designed to showcase excellence in modern AI development, AetherVision implements high-performance Vision Transformers (ViT) and sophisticated interpretability modules (Grad-CAM). The project serves as a blueprint for robust machine learning workflows, emphasizing modular data engineering, hardware-accelerated training, and verifiable, reproducible model performance.

Releases

No releases published

Packages

 
 
 

Contributors