50 Articles
Tags :AI text-to-speech
General Introduction Llasa-3B is an open source text-to-speech (TTS) model developed by the Audio Lab of the Hong Kong University of Science and Technology (HKUST Audio). The model is based on the Llama 3.2B architecture, which has been carefully tuned to provide high-quality speech generation that not only supports multiple languages, but also enables emotional expression and personality...
General Introduction Kokoro-ONNX is an open source text-to-speech (TTS) tool based on ONNX runtime. Developed by thewh1teagle, the project aims to provide efficient and fast speech synthesis solutions.Kokoro-ONNX supports multiple languages, including English, and plans to support French, Japanese, Korean...
General Introduction OpenAI Edge TTS is an open source project that provides a native text-to-speech (TTS) API compatible with OpenAI.The project uses Microsoft Edge's online text-to-speech service to allow users to generate high-quality speech output.OpenAI Edge TTS supports a wide range of speech options...
General Introduction Jellypod is a powerful AI podcast studio designed to help users easily create, edit, and publish high-quality AI podcasts. With Jellypod, users can design personalized podcast hosts, refine scripts, and publish podcasts to Spotify, YouTube, Apple P...
General Introduction sherpa-onnx is an open source project developed by the Next-gen Kaldi team to provide efficient offline speech recognition and speech synthesis solutions. It supports a variety of platforms , including Android, iOS, Raspberry Pi , etc., can be in the absence of network connectivity in real-time ...
General Introduction Audiblez is an open source project designed to convert eBooks (e.g. .epub format) into audiobooks (e.g. .m4b format). The project utilizes Kokoro's high-quality speech synthesis technology to support multiple languages and multiple voices. Users can convert eBooks with a simple command line ...
Acoust is an online AI speech generation and text-to-speech (TTS) service platform that utilizes the latest AI technology to generate realistic speech. The platform also provides powerful video editing tools that allow users to create videos without having to use multiple software programs.Acoust supports more than 30 languages...
Comprehensive Introduction Kokoro-FastAPI is a Docker-based FastAPI package designed to provide support for the Kokoro-82M text-to-speech model. The project supports NVIDIA GPU acceleration and provides queue processing and auto splicing to make speech output of raw grown text more efficient and coherent. The project ...
General Introduction Kokoro 82M is an efficient speech synthesis model provided by Hugging Face, designed to generate high quality speech with fewer parameters and data. The model has 82 million parameters, is distributed under the Apache 2.0 license, supports a variety of voice packs (Voicepacks), and can generate...
General Introduction ebook2audiobook is a powerful open source ebook to audiobook tool. It can convert multiple formats of ebooks into audiobooks with full chapter markers and metadata. The tool uses Calibre for e-book format conversion , using Coqui's XTTSv2 and Fairseq into...