Generative Models

Introduction

This repository contains a project developed for the Deep Learning course during the 2024/2025 Summer Semester at the Warsaw University of Technology, Faculty of Mathematics and Information Science.

Dataset

The dataset can be obtained from various sources, but we recommend using the fferlito/Cat-faces-dataset repository.

To download the dataset on Linux, run the following commands:

wget [https://github.com/fferlito/Cat-faces-dataset/raw/master/dataset-part1.tar.gz](https://github.com/fferlito/Cat-faces-dataset/raw/master/dataset-part1.tar.gz)

wget [https://github.com/fferlito/Cat-faces-dataset/raw/master/dataset-part2.tar.gz](https://github.com/fferlito/Cat-faces-dataset/raw/master/dataset-part2.tar.gz)

wget [https://github.com/fferlito/Cat-faces-dataset/raw/master/dataset-part3.tar.gz](https://github.com/fferlito/Cat-faces-dataset/raw/master/dataset-part3.tar.gz)

Extract the downloaded archives into the cats/images directory:

mkdir -p cats/images
tar -xzvf dataset-part1.tar.gz -C cats/images --strip-components=1
tar -xzvf dataset-part2.tar.gz -C cats/images --strip-components=1
tar -xzvf dataset-part3.tar.gz -C cats/images --strip-components=1

Dependencies

To set up the environment on Linux, create a virtual environment and install the required packages:

python3 -m venv venv
source venv/bin/activate
pip install torch torchvision torchsummary matplotlib

Usage

This project implements four different architectures for image generation:

Autoencoders
Variational Autoencoders (VAEs)
Diffusion Models
GAN

Each model architecture is maintained in a separate branch within this repository.

Results

Generated Image Samples

Below are samples of the generated images.

Autoencoders

Note: Please note that all images generated using the standard Autoencoder exhibit a distinct blue artifact in the center. The root cause of this phenomenon requires further investigation.