ImageSentiment

A deep learning computer vision pipeline that classifies human emotion across multiple categories, using object detection to isolate the relevant image regions.

Overview

This repository contains a Python package for processing the Emotion6 sentiment classification dataset.

It implements a two-stage approach that predicts probabilities for eight emotion classes:

Object Detection: Uses YOLOv5 to extract bounding boxes around relevant regions (e.g., humans and faces).
Transfer Learning: Passes the YOLO-cropped regions through a fine-tuned VGG16 backbone topped with a custom head of Global Average Pooling and Dropout layers.
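
The hand-off between the two stages can be sketched as a simple crop. This is a minimal illustration, not the repository's code: the helper name crop_box and the pixel-coordinate box format (x_min, y_min, x_max, y_max) are assumptions, and the actual columns of yolo5_output.csv may differ.

```python
import numpy as np

def crop_box(image, x_min, y_min, x_max, y_max):
    """Crop an H x W x C image to a pixel-coordinate box, clamped to the image bounds.

    Hypothetical helper: the box format is an assumption for illustration.
    """
    height, width = image.shape[:2]
    x0, x1 = max(0, int(x_min)), min(width, int(x_max))
    y0, y1 = max(0, int(y_min)), min(height, int(y_max))
    if x1 <= x0 or y1 <= y0:
        # Degenerate or out-of-frame box: fall back to the full image.
        return image
    return image[y0:y1, x0:x1]

# A 100 x 200 dummy image and a box that overshoots the right edge.
image = np.zeros((100, 200, 3), dtype=np.uint8)
crop = crop_box(image, 50, 10, 250, 60)
print(crop.shape)  # (50, 150, 3)
```

Clamping keeps detections that spill past the frame usable, and falling back to the full image means a missing or empty detection still produces an input for the classifier.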

Architecture Improvements

The pipeline has been refactored from standalone Jupyter notebooks into a modular Python package:

src/dataset.py: Loads data through a streaming tf.data.Dataset pipeline. The generator crops the YOLO regions, resizes them to a common size, and normalizes them one sample at a time, which avoids the out-of-memory crashes caused by pre-loading the whole dataset into a single NumPy array.
src/model.py: Defines the VGG16 model with the Keras Functional API. Adds regularization through built-in data augmentation layers (RandomFlip, RandomRotation, RandomZoom) to improve generalization on small datasets.
src/train.py: Orchestrates training. Includes an EarlyStopping callback that halts training when the model stops improving, and automatically saves .h5 model checkpoints.
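
The streaming idea behind src/dataset.py can be sketched as follows. This is an illustration under stated assumptions rather than the module's actual code: the names crop_generator, preprocess, and make_dataset are hypothetical, the 224 x 224 target size and the division by 255 are assumed choices, and a real generator would read each image from disk and crop it instead of iterating over in-memory arrays.

```python
import numpy as np
import tensorflow as tf

IMG_SIZE = 224    # assumed target size (VGG16's usual input resolution)
NUM_CLASSES = 8   # matches the eight emotion classes mentioned in the Overview

def crop_generator(samples):
    """Yield one (crop, label) pair at a time (hypothetical helper).

    `samples` is any iterable of (uint8 image array, label vector) pairs.
    """
    for image, label in samples:
        yield image, np.asarray(label, dtype=np.float32)

def preprocess(image, label):
    """Resize to a fixed size and scale pixel values to [0, 1] (assumed normalization)."""
    image = tf.image.resize(image, (IMG_SIZE, IMG_SIZE))
    image = tf.cast(image, tf.float32) / 255.0
    return image, label

def make_dataset(samples, batch_size=32):
    """Build a batched tf.data pipeline that never materializes the full dataset."""
    dataset = tf.data.Dataset.from_generator(
        lambda: crop_generator(samples),
        output_signature=(
            # Crops arrive with varying height and width.
            tf.TensorSpec(shape=(None, None, 3), dtype=tf.uint8),
            tf.TensorSpec(shape=(NUM_CLASSES,), dtype=tf.float32),
        ),
    )
    return (
        dataset.map(preprocess, num_parallel_calls=tf.data.AUTOTUNE)
        .batch(batch_size)
        .prefetch(tf.data.AUTOTUNE)
    )
```

Because the generator yields a single crop at a time and batching happens after resizing, peak memory scales with the batch size instead of the dataset size.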

Getting Started

1. Requirements

Python 3.7+
TensorFlow 2.x
OpenCV
Pandas

2. Prepare Data

Ensure your data/ folder contains the raw images/ directory extracted from your source archive. If you have raw labels (e.g., ground_truth.txt), you can optionally convert them with Pandas.

3. Usage

You can run the entire pipeline from an interactive notebook:

Launch run_pipeline.ipynb.
Specify the absolute or relative paths to your generated yolo5_output.csv and label.csv files.
Run the top-level cell to invoke train_emotion_model:

from src.train import train_emotion_model

model, history = train_emotion_model(
    images_dir="data/images",
    labels_csv="data/label.csv",
    yolo_csv="data/yolo5_output.csv",
    output_dir="checkpoints",
    batch_size=32,
    epochs=20,
    fine_tune=False
)

By default, this freezes the VGG16 convolutional layers and trains only the final sentiment head. Set fine_tune=True to also update the deeper backbone weights in subsequent epochs.
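
The effect of the fine_tune flag can be illustrated with a minimal Keras sketch. It is not the code in src/model.py: the function name build_model, the augmentation strengths, the dropout rate, and the softmax output are all assumptions, and the real implementation may unfreeze only some of the VGG16 layers. The top-level augmentation layers assume TensorFlow 2.6 or newer.

```python
import tensorflow as tf
from tensorflow.keras import layers

def build_model(num_classes=8, fine_tune=False, weights="imagenet"):
    """Hypothetical VGG16 classifier with a frozen or trainable backbone."""
    inputs = tf.keras.Input(shape=(224, 224, 3))

    # Augmentation layers are active only during training.
    x = layers.RandomFlip("horizontal")(inputs)
    x = layers.RandomRotation(0.1)(x)   # assumed strength
    x = layers.RandomZoom(0.1)(x)       # assumed strength

    backbone = tf.keras.applications.VGG16(
        include_top=False, weights=weights, input_shape=(224, 224, 3)
    )
    # Frozen backbone: only the head below receives gradient updates.
    backbone.trainable = fine_tune
    x = backbone(x)

    # Custom head: Global Average Pooling and Dropout, then class probabilities.
    x = layers.GlobalAveragePooling2D()(x)
    x = layers.Dropout(0.5)(x)          # assumed rate
    outputs = layers.Dense(num_classes, activation="softmax")(x)
    return tf.keras.Model(inputs, outputs)
```

With fine_tune=False, the only trainable weights are the kernel and bias of the final Dense layer. With fine_tune=True, the convolutional weights are updated as well, which usually calls for a much lower learning rate to avoid destroying the pretrained features.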
