Chapta: AIGC-based audio storytelling and picture book creation platform with strong character consistency
Comprehensive Introduction: Chapta is an audiobook creation platform based on Artificial Intelligence Generated Content (AIGC) technology. The platform aims to give users an authoring environment that integrates text, image, sound, and video editing tools. Users can...
Retrieval-based Voice Conversion WebUI: A Framework for Retrieval-based Voice Conversion | Simulating Real-life Singing Voices
Comprehensive Introduction: Retrieval-based Voice Conversion WebUI is an easy-to-use, VITS-based voice conversion framework that converts voice between arbitrary speakers and supports song covers and real-time voice changing. It has low ...
ReechoAI: Ultra-Realistic AI Speech Synthesis and Instant Voice Cloning Platform
Comprehensive Introduction: Reecho AI (Reecho) is an ultra-fidelity AI voice synthesis and instant voice cloning platform that lets users quickly create and clone specific voice characters by uploading or recording an audio sample. The platform features the ability to create audio samples from shorter...
Zide Speech: Intelligent Speech Synthesis Platform | Speech Cloning
Comprehensive Introduction: Zide Voice is an AI-based voice synthesis platform. Users upload a voice sample and supply text to generate realistic, expressive voice clips. The platform offers quick character customization, cloud-based voice generation, and human-like voice synthesis. There is no need to download any software through...
VoiceCraft: open source zero-shot speech cloning and text-to-speech tool
Comprehensive Introduction: VoiceCraft is an open source speech editing and zero-shot speech synthesis tool based on a neural codec language model. It employs an innovative coded-sequence generation method that supports insertion, deletion, and replacement operations on existing speech sequences to generate natural, coherent edited speech...
Happy Scribe: Audio Transcription and Video Subtitling Platform | Free Video Subtitle Editing Software
General Description: Happy Scribe provides automated and manual audio transcription services that convert audio to text with high accuracy and support multiple languages and formats. It includes an interactive editor, collaboration tools, multiple export formats, machine translation, and other features...
Whisper GPGPU: OpenAI Whisper running on Windows | WhisperDesktop
Comprehensive Introduction: Whisper is an open source GitHub project developed by Const-me that focuses on high-performance inference of OpenAI's Whisper automatic speech recognition (ASR) model using GPGPU. This project is released under the MPL-2.0 license...
Buzz: open source offline audio transcription and translation tool | iOS voice transcription
General Introduction: Buzz is an open source project created by chidiwilliams that enables offline transcription and translation of audio on personal computers. The project relies on OpenAI's Whisper technology, so users do not need an Internet connection for audio text...
Deepgram: API service for high-accuracy speech recognition and synthesis
General Description: Deepgram is a company focused on speech recognition and natural language processing technologies, providing powerful Speech-to-Text and Text-to-Speech APIs. The platform utilizes advanced artificial intelligence...
Seaweed AI: Intelligent Speech Synthesis and Voice Cloning Platform
Comprehensive Introduction: Seaweed AI is an intelligent dubbing product that converts text into speech online, powered by the Yun Zhisheng AI open platform. Users can clone voices on their own, choose AI voices of different genders, accents, and languages, and dub directly after entering text. It can quickly dub short...









