Speak Smart Flask App

Speech Recognition and Feedback Application

This project focuses on developing a web application that integrates machine learning models for real-time audio transcription and feedback processing.

Real-life applications

Helps individuals improve their public speaking skills.
Useful for students, professionals, and anyone preparing for presentations or speeches.
Can be used in educational settings for language learning and oral presentations.
Can be used to transcribe meetings or conferences, providing a real-time record and feedback.
Enhances accessibility for individuals with hearing impairments by providing accurate transcriptions and feedback.

Project Description

This Flask application lets users record audio in real time, convert it to text using Google's Speech-to-Text API, and process the transcript for feedback. The feedback combines improvement suggestions from GPT-based models with grammar checks from LanguageTool. Users can start and stop recording, view real-time transcription updates, and get detailed feedback on the transcript.

Key Features

Real-time audio recording and streaming.
Real-time transcription and confidence score updates.
Feedback processing for the transcript using GPT-based models.
Grammar checking using LanguageTool.
User-friendly web interface.

Here's a brief flow of how the web app operates:

User Interface Interaction:

Start Recording: User clicks the "Start Recording" button. The app sends a request to /start_recording with parameters like filename, phrases, and language codes. The server begins recording audio and streaming it to the Google Cloud Speech API. Real-time updates are sent to the client via Socket.IO, including transcript and confidence updates.
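As a rough illustration of the parameters mentioned above, the following sketch validates a decoded request body before recording starts. The key names (filename, language_codes, phrases) and the StartRecordingParams class are assumptions made for this example; the actual handler in the repository may name and validate them differently.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class StartRecordingParams:
    """Hypothetical container for the /start_recording parameters."""
    filename: str
    language_codes: List[str]
    phrases: List[str] = field(default_factory=list)


def parse_start_params(payload: dict) -> StartRecordingParams:
    """Validate a decoded request body and return normalized parameters."""
    filename = str(payload.get("filename") or "").strip()
    if not filename:
        raise ValueError("filename is required")

    # Accept either a single code ("en-US") or a list of codes.
    codes = payload.get("language_codes") or []
    if isinstance(codes, str):
        codes = [codes]
    codes = [c.strip() for c in codes if c and c.strip()]
    if not codes:
        raise ValueError("at least one language code is required")

    # Phrases are optional recognition hints: drop blanks and
    # duplicates while keeping the original order.
    raw_phrases = payload.get("phrases") or []
    phrases = list(dict.fromkeys(p.strip() for p in raw_phrases if p and p.strip()))

    return StartRecordingParams(filename, codes, phrases)
```

Rejecting bad input before the audio stream opens keeps a malformed request from starting a recording that cannot be saved or transcribed.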

Stop Recording: User clicks the "Stop Recording" button. The app sends a request to /stop_recording. The server stops recording, saves the audio, and updates the interface. A feedback modal is then shown so the user can review the transcript and request feedback.

Submit Feedback: User clicks the "Ready" button in the feedback modal. The app sends a request to /feedback with the transcript and language code. The server checks the transcript's grammar with LanguageTool and uses an OpenAI model to analyze it. The resulting feedback, including any grammar issues and improvement suggestions, is returned and displayed in the UI.
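The feedback step can be pictured as one function that merges the two analyses into a single response. In this sketch the grammar checker and the suggestion model are passed in as plain callables, so nothing here depends on the real LanguageTool or OpenAI clients. The function name build_feedback and the response keys are hypothetical, not taken from the repository.

```python
from typing import Callable, Dict, List


def build_feedback(
    transcript: str,
    language_code: str,
    check_grammar: Callable[[str, str], List[str]],
    suggest: Callable[[str], str],
) -> Dict[str, object]:
    """Combine grammar issues and improvement suggestions for one transcript."""
    text = transcript.strip()
    if not text:
        # Nothing was said, so skip both (potentially slow) backend calls.
        return {
            "transcript": "",
            "grammar_issues": [],
            "suggestions": "",
            "error": "empty transcript",
        }
    return {
        "transcript": text,
        "grammar_issues": check_grammar(text, language_code),
        "suggestions": suggest(text),
        "error": None,
    }
```

Injecting the two backends this way also makes the step easy to exercise with stub functions, without network access or API keys.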

Retry: User clicks the "Retry" button. The app sends a request to /retry to reset the session. The server clears the audio queue and recording state. The UI is reset to allow for a new recording session.
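The start, stop, and retry steps above all act on the same two pieces of server state: a recording flag and a queue of audio chunks. A minimal sketch of that state, using only the standard library, might look like the following. The RecordingSession class and its method names are illustrative assumptions, not the repository's actual code.

```python
import queue
import threading
from typing import List


class RecordingSession:
    """Hypothetical holder for the recording flag and the audio queue."""

    def __init__(self) -> None:
        self._lock = threading.Lock()
        self.audio_queue: "queue.Queue[bytes]" = queue.Queue()
        self.is_recording = False

    def start(self) -> bool:
        """Begin recording; return False if a recording is already active."""
        with self._lock:
            if self.is_recording:
                return False
            self.is_recording = True
            return True

    def add_chunk(self, chunk: bytes) -> None:
        """Queue one audio chunk, ignoring chunks that arrive while stopped."""
        if self.is_recording:
            self.audio_queue.put(chunk)

    def stop(self) -> bytes:
        """Stop recording and return all queued audio as one byte string."""
        with self._lock:
            self.is_recording = False
        return b"".join(self._drain())

    def retry(self) -> None:
        """Reset for a new attempt: clear the flag and discard queued audio."""
        with self._lock:
            self.is_recording = False
        self._drain()

    def _drain(self) -> List[bytes]:
        """Remove and return every chunk currently in the queue."""
        chunks = []
        while True:
            try:
                chunks.append(self.audio_queue.get_nowait())
            except queue.Empty:
                return chunks
```

A thread-safe queue fits here because audio capture and the streaming request typically run on different threads from the request handlers.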

Real-Time Updates: During recording, the server sends real-time updates about the transcript and confidence scores to the client via Socket.IO. The UI updates the transcript and confidence display accordingly.
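To make the update flow concrete, the sketch below turns a sequence of streaming results into the payloads a server could push to the client. Each result is modeled as a (text, confidence, is_final) tuple, which is a simplification assumed for this example rather than the real API response type, and the payload keys are likewise hypothetical. Confidence is only reported for final results, on the assumption that interim values are not meaningful.

```python
from typing import Dict, Iterable, Iterator, List, Tuple

Result = Tuple[str, float, bool]  # (text, confidence, is_final)


def transcript_updates(results: Iterable[Result]) -> Iterator[Dict[str, object]]:
    """Yield one UI update per streaming result.

    Final results are appended permanently to the transcript; interim
    results are shown after the finalized text and replaced by the
    next result.
    """
    finalized: List[str] = []
    for text, confidence, is_final in results:
        text = text.strip()
        if is_final:
            finalized.append(text)
            shown = " ".join(finalized)
        else:
            shown = " ".join(finalized + [text])
        yield {
            "transcript": shown,
            "confidence": round(confidence, 2) if is_final else None,
            "is_final": is_final,
        }
```

In the running app, each yielded dictionary would be the body of one Socket.IO message to the browser.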

Final Processing: Once recording stops, the app provides the option to review and process the transcript, integrating feedback and grammar checks before displaying the results.

Installation and Setup

Clone the repository: git clone https://github.com/FabianaMFZ/Speak-Smart-Web-App
Create a virtual environment and activate it: python -m venv venv, then venv\Scripts\activate on Windows or source venv/bin/activate on macOS/Linux
Install dependencies: pip install -r requirements.txt
Set up environment variables for Google Cloud credentials and OpenAI API key.
Run the application: python app.py
Access the application in your browser at http://localhost:5000.
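Before running the app, it can help to confirm that the credentials from the environment-variable step are actually set. The check below assumes the conventional variable names: GOOGLE_APPLICATION_CREDENTIALS, which the Google Cloud client libraries read for a service-account key path, and OPENAI_API_KEY, which the OpenAI Python library reads by default. This project may use different names, so adjust the tuple to match its configuration.

```python
import os
from typing import List, Mapping

# Assumed names; the application may expect different variables.
REQUIRED_ENV_VARS = ("GOOGLE_APPLICATION_CREDENTIALS", "OPENAI_API_KEY")


def missing_env_vars(env: Mapping[str, str] = os.environ) -> List[str]:
    """Return the required variables that are unset or blank in env."""
    return [name for name in REQUIRED_ENV_VARS if not env.get(name, "").strip()]
```

Calling missing_env_vars() with no arguments inspects the current process environment; an empty list means both variables are present.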
