Skip to content

dannypctang528/python---Stock-Data-Extract-Transform-Load---ETL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 

Repository files navigation

Stock Data Extract-Transform-Load (ETL) Project

This project demonstrates a complete ETL (Extract, Transform, Load) pipeline for stock market data using Python. It was developed as part of the NPower Canada Junior Data Analyst program and showcases practical skills in data extraction, cleaning, transformation, and visualization.


Project Overview

The goal of this project is to automate the process of collecting historical stock data, transforming it into a clean and structured format, and preparing it for analysis or visualization. The project focuses on two popular stocks: Tesla (TSLA) and GameStop (GME).


Tools & Technologies

  • Python 3.x
  • Pandas for data manipulation
  • NumPy for numerical operations
  • Matplotlib / Seaborn for visualization
  • Google Colab for interactive development
  • Git & GitHub for version control and publishing

ETL Pipeline Breakdown

1. Extract

  • Pulls historical stock data from Coursera.
  • Handles file reading and basic validation.

2. Transform

  • Cleans missing or inconsistent data.
  • Normalizes column formats and renames headers.

3. Load

  • Data for visualization.

Sample Visualizations

  • Line charts
  • Chart size optimization

About

Extracting and Visualizing Stock Data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors