AI Personal Learning
and practical guidance

Weebo: a real-time voice chatbot that provides a natural language conversational experience

General Introduction

Weebo is an open source real-time voice chatbot that utilizes the Whisper Small for speech recognition, Llama 3.2 for natural language generation, and Kokoro-82M for speech synthesis. Developed by Amanvir Parhar, the project aims to provide an efficient voice dialog solution that runs on native devices.Weebo supports multiple voices and smoothly generates real-time responses for a wide range of application scenarios that require voice interaction.

 

Function List

  • Real-time speech recognition: Efficient speech-to-text processing using the Whisper Small model.
  • Natural Language Generation: Generate natural language responses with the Llama 3.2 model.
  • Speech Synthesis: Converts text to speech using the Kokoro-82M model.
  • Multi-sound support: Provides multiple sound options to enhance the user experience.
  • Runs locally: No need to rely on cloud services, all processing is done on the local device.
  • Open source code: the code is open, allowing users to freely modify and extend the functionality.

 

Using Help

Installation process

  1. Download the required model:
    • Download Kokoro-82M model file kokoro-v0_19.onnx and placed in the project folder.
    • utilization Ollama The tool pulls the Llama 3.2 model.
  2. Clone Weebo project code:
   git clone https://github.com/amanvirparhar/weebo.git
cd weebo
  1. Install the dependencies:
   pip install -r requirements.txt
  1. Run the chatbot:
   python main.py

Instructions for use

  1. After starting the program, Weebo will start listening for voice input.
  2. Users can speak naturally and Weebo will generate a voice response after a short pause.
  3. check or refer to Ctrl+C The program can be stopped.

Main function operation flow

  • speech recognition: Weebo uses the Whisper Small model for speech recognition and is able to accurately convert a user's speech into text.
  • natural language generation: Using the Llama 3.2 model, Weebo understands the user's speech input and generates a natural language response.
  • speech synthesis: Using the Kokoro-82M model, Weebo converts the generated text response into speech and plays it back over the loudspeaker.
  • Multi-Voice Support: Users can select different sound models in the configuration file to meet different application requirements.

Detailed steps

  1. Launch Weebo: Run python main.pyThe program will start listening to the user's voice input.
  2. voice input: Users can speak directly into the microphone and Weebo will automatically recognize and process the voice.
  3. Generating a Response: After recognizing the speech, Weebo generates a natural language response using the Llama 3.2 model and converts it to speech using the Kokoro-82M model.
  4. Playback Response: The generated voice response is played through the speaker and the user can hear Weebo's answer.
  5. stop program: Press Ctrl+C Weebo can be stopped at any time.

With the above steps, users can easily start using Weebo to have real-time voice conversations and experience natural and smooth voice interaction.


May not be reproduced without permission:Chief AI Sharing Circle " Weebo: a real-time voice chatbot that provides a natural language conversational experience

Chief AI Sharing Circle

Chief AI Sharing Circle specializes in AI learning, providing comprehensive AI learning content, AI tools and hands-on guidance. Our goal is to help users master AI technology and explore the unlimited potential of AI together through high-quality content and practical experience sharing. Whether you are an AI beginner or a senior expert, this is the ideal place for you to gain knowledge, improve your skills and realize innovation.

Contact Us
en_USEnglish