General Introduction
MeetingMind is an advanced AI application designed to improve the efficiency of capturing and summarizing business meetings. The app integrates OpenAI's Whisper technology for accurate speech-to-text and uses IBM Watson's AI to analyze and extract key points in the transcribed text. The entire process is conducted through a user-friendly interface provided by Hugging Face's Gradio, making it easy to use even for non-technical users.
Function List
- Speech to text: Convert spoken content into accurate text using OpenAI's Whisper model.
- Meeting Summary: Integrates IBM Watson's AI processing of transcribed text to generate a summary of key points from the meeting.
- User-friendly interface: Hugging Face Gradio provides an intuitive web interface for easy user interaction.
- Real-time output display: display transcribed and summarized content directly in the Gradio interface.
- Environment settings: configured to deploy seamlessly using a Python virtual environment and the necessary libraries.
Using Help
Installation process
- clone warehouse::
git clone https://github.com/misbahsy/meetingmind.git cd meetingmind
- Setting up a virtual environment::
python -m venv env source env/bin/activate # Windows users use `env\Scripts\activate`.
- Installation of required libraries::
pip install -r requirements.txt
- Install FFmpeg: Install FFmpeg as required by your system to process audio files.
- Setting environment variables: Create the
.env
file and add the API key:OPENAI_API_KEY=your_openai_api_key_here IBM_WATSON_API_KEY=your_ibm_watson_api_key_here
- Running the application::
python app.py
- access interface: Open a browser and navigate to the address of the local server provided by Gradio (usually the
http://127.0.0.1:7860
) to interact with the MeetingMind application.
Usage Process
- Uploading audio files: Use the Gradio interface to upload recordings of meetings.
- make a copy of a recording: The app will use OpenAI's Whisper model to transcribe spoken content into text.
- summarize: IBM Watson AI processes transcribed text to generate a summary of key points from the meeting.
- View Results: View transcriptions and summaries in the Gradio interface.
MeetingMind is designed to make it easy for users to capture and summarize meeting content and increase productivity, especially for corporate and educational environments.