Skip to content

abhishekmallav/SamvaadSetu

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

✨ SamvaadSetu

AI Powered Communication and Presentation Coach


🚀 Overview

SamvaadSetu is an innovative, AI-powered, web-based system designed to transform speakers into confident, self-aware, and impactful communicators.
By analyzing both verbal and non-verbal communication in real time, SamvaadSetu provides instant, holistic, and adaptive feedback to help users elevate their speaking skills.


🎯 What Makes SamvaadSetu Unique?

  • Multimodal Integration: Combines audio, video, and text to deliver a complete analysis of communication style.
  • Accessibility & Inclusivity: Tailored for marginalized communities, individuals with disabilities, non-native speakers, and anyone seeking affordable, personalized coaching.
  • Scenario-Based Training: Practice in real-world contexts like interviews, presentations, and team discussions.
  • Comprehensive Feedback: Integrates posture tracking, speech metrics, grammar correction, emotional and sentiment analysis, and facial expression recognition.

🛠️ Tech Stack

  • 🤖 Mediapipe: Gesture, posture, and face tracking
  • 🗣️ Deepgram STT API: Speech-to-text transcription
  • ✍️ LanguageTool: Grammar checks
  • 🧠 Google Gemini (LLM): Emotional analysis, sentiment summarization, and enhanced feedback
  • 🐍 Python & Flask: Backend logic and web server
  • 🎞️ ffmpeg: Video/audio processing and conversion
  • 🏷️ NLTK: Text analysis and sentence tokenization
  • 📄 Markdown: Displaying formatted feedback

🎉 Features

  • Speech Transcription: Converts spoken words into text
  • Grammar Correction: Identifies and suggests improvements for grammatical errors
  • Facial Expression Recognition: Analyzes emotions conveyed through facial cues
  • Hand Gesture & Posture Analysis: Evaluates body language for effectiveness
  • Sentiment Summarization: Provides an overview of emotional tone
  • Real-Time Feedback: On both verbal and non-verbal communication
  • Filler Word & Pause Detection: Highlights habits and helps reduce usage
  • Scenario-Based Training: Simulates real-life communication challenges
  • Comprehensive Reporting: Unified feedback integrating all analysis aspects
  • Playback & Review: Watch recordings with synchronized feedback overlays
  • Inclusivity: Adaptive interfaces and feedback for diverse user needs

🧑‍💻 Usage Guide

  1. Start the Application

    python app.py
  2. Access the Web Interface

    • Open your browser at http://localhost:5000.
  3. Begin Your Session

    • 🎤 Speak or present as you normally would.
    • 👀 The system tracks gestures, posture, and facial expressions.
    • 📝 Receive instant, actionable feedback on your delivery.
    • 📊 Review your performance with comprehensive, AI-generated reports.

📂 Folder Structure

SamvaadSetu/
├── app.py
├── requirements.txt
├── static/
│   └── (HTML/CSS/JS files)
├── templates/
│   └── (Web templates)
├── videos/
├── posture/
├── transcripts/
├── analysis/
├── reports/
├── LLM/
└── README.md

🔗 API Endpoints

Endpoint Method Description
/ GET Serves main recording interface
/save_video POST Receives and processes recordings
/video GET Serves latest processed video
/report GET Returns latest posture report
/transcript GET Returns latest speech transcript
/analysis GET Returns latest speech analysis
/combined_report GET Returns all analysis components
/playback GET Serves playback interface
/analyze_report POST Triggers AI enhancement of analysis
/llm_response GET Provides latest enhanced AI feedback

⚙️ Installation Guide

  1. Clone the repository

    git clone https://github.com/abhishekmallav/SamvaadSetu.git
    cd SamvaadSetu
  2. Create a virtual environment

    python -m venv venv
  3. Activate your virtual environment

    • On Windows:

      venv\Scripts\activate
    • On Mac/Linux:

      source venv/bin/activate
  4. Install required packages

    pip install -r requirements.txt


📈 Future Scope

  • Advanced AI Feedback: Deeper, context-aware analysis and recommendations
  • Multilingual Support: Feedback in multiple languages
  • VR/AR Integration: Immersive practice environments
  • Industry Modules: Legal, medical, technical, and more
  • Mobile App: Practice and feedback on the go
  • API Development: Integration with other platforms
  • Collaborative Features: Group practice and peer feedback

📚 References

  • Flask Documentation
  • MediaPipe
  • Deepgram
  • Google Gemini
  • NLTK
  • LanguageTool

📝 License

This project is licensed under the MIT License.

Author

Created by abhishekmallav, 09Namratakhade, PrashantTakale369, anushree612

Empower your voice with SamvaadSetu! 🎤✨

Let me know if you want to add anything specific or need a section tailored further!

LinkedIn Gmail Instagram Twitter

About

SamvaadSetu is a GenAi based application to enhance speech and presentation skills by analyzing video input and providing real-time transcript and posture improvement suggestions. Leveraged MediaPipe and Computer Vision to extract and analyze posture data for actionable feedback.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors