Captionizer

Captionizer is a caption generator program provide offline transcription and translation capability.

This project is my Final Year Project for Diploma.

Background of Project:

Captions are getting more attention nowadays as numerous of videos are posted everyday. Accessibility is the most important concern for creators and developers so that every viewer gets the opportunity to enjoy the media shared.

Objectives of this Project:

To automate speech to text transcription process
To automate pairing text with speech into caption
To enable easy bulk translation of text and caption files

Project Modules:

Transcription Module
Translation Module
Caption Pairing Module
File Import Module
Recording Module
Audio Player Module
History Module
Storage Optimizing Module
User Interface Module
Neural Machine Translation Model Training Module

Requirements:

Python 3.7+
PyAudio
PyDub
MoviePy
Librosa
SoundFile
PyQt5
SoundDevice
Proglog
PyTorch
Sacremoses
Transformers
SentencePiece
TensorFlow
PyWhisper
GoogleTrans 3.1.0a0

NMT Model

Two of the translation model is built and trained with the model training module of this project by using TensorFlow and Keras. However, the translation model is roughly built due to short timeframe, therefore it produces relatively low accuracy with little vocabulary of 5000 words. Hence, external model from MarianMT has been downloaded through HuggingFace for better translation result. The trained model is also included in the project marked as Experimental.

Limitations:

Relatively slow start up

Further Enhancement:

Further polish NMT model architecture and train with better dataset for accurate result

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
model		model
resources		resources
translator_AI		translator_AI
AcceptableFiles.py		AcceptableFiles.py
AudioRecord.py		AudioRecord.py
Captionizer.ui		Captionizer.ui
Dialog.py		Dialog.py
ExperimentalTranslator.py		ExperimentalTranslator.py
FileImports.py		FileImports.py
FileOperations.py		FileOperations.py
ModelTranslator.py		ModelTranslator.py
README.md		README.md
ThreadConversion.py		ThreadConversion.py
ThreadPlayer.py		ThreadPlayer.py
ThreadRecording.py		ThreadRecording.py
ThreadTranscribe.py		ThreadTranscribe.py
ThreadTranslate.py		ThreadTranslate.py
Transcribe.py		Transcribe.py
Transcriber.py		Transcriber.py
Translate.py		Translate.py
UiCaptionizer.py		UiCaptionizer.py
UiFunction.py		UiFunction.py
UiMain.py		UiMain.py
UiPageChanger.py		UiPageChanger.py
UiSideResizeGrip.py		UiSideResizeGrip.py
UiStyle.py		UiStyle.py
resources.qrc		resources.qrc
resources_rc.py		resources_rc.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Captionizer

Background of Project:

Objectives of this Project:

Project Modules:

Requirements:

NMT Model

Limitations:

Further Enhancement:

About

Uh oh!

Releases 1

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Captionizer

Background of Project:

Objectives of this Project:

Project Modules:

Requirements:

NMT Model

Limitations:

Further Enhancement:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Contributors

Uh oh!

Languages