🚀 Invitation to Experience: China's First AI IDE Intelligent Programming Software Trae Chinese version downloadThe DeepSeek-R1 and Doubao-pro are available for unlimited use!

Total 79 articles

Tags: ai text to speech Page 2

Cat and Star: a story-listening app that writes exclusive fairy tales with your child

Comprehensive Introduction "Cat & Star" (maoyuxing.com) is an interactive story creation platform designed for children, helping parents and children to create personalized fairy tales together through mobile applications. Users can enter the child's name, preferences and other information to generate unique story content, allowing the child to become the story...

2025-02-23AI tools AI educational tools AI Text-to-Speech

Azure TTS Importer：将语音合成服务集成到阅读软件中-首席AI分享圈

Azure TTS Importer: Integrating speech synthesis services into reading software

Comprehensive Introduction TTS Importer is an open source project designed to easily import Azure TTS (Text-to-Speech) speech synthesis service into various reading software. The tool supports several popular reading software, including Read (legado), Love Reader, Source Reader, and more. With TTS Importer, ...

2025-02-17AI tools AI open source project AI Text-to-Speech

Trae Chinese Version First Invitation to Download: Unlimited use of DeepSeek-R1 after registration!

Enable Builder Smart Programming Mode, unlimited use of DeepSeek-R1 and DeepSeek-V3, smoother experience than the overseas version. Just enter the Chinese commands, even a novice programmer can write his own apps with zero threshold.

2025-05-09

NVIDIA PDF to Podcast：设置引导提示词将PDF转换为播客的AI工具-首席AI分享圈

NVIDIA PDF to Podcast: AI Tool for Converting PDF to Podcast by Setting Guiding Prompts

General Introduction NVIDIA AI Blueprint: PDF to Podcast is an open source project developed by NVIDIA to convert PDF documents into engaging audio content. The project utilizes NVIDIA NIM (NVIDIA Inference Microservices) technology to be able to securely run on private networks...

2025-02-15AI News AI open source project AI Text-to-Speech

Kokoro WebGPU: A Text-to-Speech Service for Offline Operation in Browsers

General Introduction Kokoro WebGPU is the WebGPU version of the Kokoro text-to-speech (TTS) model, provided by WebML Community on the Hugging Face platform. The project utilizes WebGPU technology to enable users to run efficient text-to-speech conversions locally in their browsers.WebGPU is a modern...

2025-02-09AI tools AI open source project AI Text-to-Speech

Orate: A Unified API for Integrating Well-Known Speech Generation, Speech Transcription and Voice Change Models

General Description Orate is an AI toolkit focused on speech generation and transcription. It provides a unified API that seamlessly integrates with leading AI providers such as OpenAI, ElevenLabs, and AssemblyAI to help users create realistic, human-like speech and transcribe audio into text.Ora...

2025-02-01AI tools AI open source project AI Text-to-Speech AI Speech to Text

Weights: a voice-imitation cover song and text-to-speech authoring platform

General Introduction Weights is a social platform that utilizes AI for creation, allowing users to create voice covers, text-to-speech, images, music, and videos with simple operations. The platform provides a wealth of tools and templates to help users get started creating quickly and share their work with the community....

2025-01-30AI tools AI Text-to-Speech AI voice cloning

AnyVoice: free online voice cloning, just 3 seconds to realize the voice cloning

General Introduction AnyVoice is an advanced AI speech generation platform that provides ultra-realistic speech generation and voice cloning services. The platform allows users to convert text into natural speech and choose from hundreds of preset voices. If you can't find the right voice, just 3 seconds recording is...

2025-01-30AI tools AI Text-to-Speech AI voice cloning

Open NotebookLM: convert PDF to podcasts of open source tools

General Introduction Open NotebookLM is an open source project designed to convert any PDF document into a podcast. The tool utilizes open source Large Language Model (LLM) and Text-to-Speech (TTS) models to process PDF content, generate natural dialog suitable for audio podcasts, and output to MP3 files. The project is supported by the N...

2025-01-29AI tools AI open source project AI Text-to-Speech

Llasa 1~8B: an open source text-to-speech model for high quality speech generation and cloning

General Introduction Llasa-3B is an open source text-to-speech (TTS) model developed by the Audio Lab of the Hong Kong University of Science and Technology (HKUST Audio). The model is based on the Llama 3.2B architecture, which has been carefully tuned to provide high-quality speech generation that not only supports multiple languages, but also enables emotional expression and personality...

2025-01-27AI tools AI open source project AI Text-to-Speech AI voice cloning

Kokoro-ONNX: Efficient Text-to-Speech Tool with Multi-Language and Multi-Voice Support

General Introduction Kokoro-ONNX is an open source text-to-speech (TTS) tool based on ONNX runtime. Developed by thewh1teagle, the project aims to provide efficient and fast speech synthesis solutions.Kokoro-ONNX supports multiple languages, including English, and plans to support French, Japanese, Korean...

2025-01-19AI tools AI open source project AI Text-to-Speech

OpenAI Edge TTS：利用 Edge TTS 的免费文本转语音API，兼容 OpenAI 格式-首席AI分享圈

OpenAI Edge TTS: Free text-to-speech API utilizing Edge TTS, compatible with OpenAI formats

General Introduction OpenAI Edge TTS is an open source project that provides a native text-to-speech (TTS) API compatible with OpenAI.The project uses Microsoft Edge's online text-to-speech service to allow users to generate high-quality speech output.OpenAI Edge TTS supports a wide range of speech options...

2025-01-18AI tools AI open source project AI Text-to-Speech

Jellypod: produce multilingual AI podcasts, create, edit and distribute AI podcasts

General Introduction Jellypod is a powerful AI podcast studio designed to help users easily create, edit, and publish high-quality AI podcasts. With Jellypod, users can design personalized podcast hosts, refine scripts, and publish podcasts to Spotify, YouTube, Apple P...

2025-01-17AI tools AI Text-to-Speech

Sherpa-ONNX：使用ONNXRuntime实现离线语音识别和合成-首席AI分享圈

Sherpa-ONNX: Offline Speech Recognition and Synthesis with ONNXRuntime

General Introduction sherpa-onnx is an open source project developed by the Next-gen Kaldi team to provide efficient offline speech recognition and speech synthesis solutions. It supports a variety of platforms , including Android, iOS, Raspberry Pi , etc., can be in the absence of network connectivity in real-time ...

2025-01-16AI tools AI open source project AI Text-to-Speech AI Speech to Text

Audiblez：生成有声书，使用Kokoro将电子书转换为有声读物-首席AI分享圈

Audiblez: Generate Audiobooks, Convert eBooks to Audiobooks with Kokoro

General Introduction Audiblez is an open source project designed to convert eBooks (e.g. .epub format) into audiobooks (e.g. .m4b format). The project utilizes Kokoro's high-quality speech synthesis technology to support multiple languages and multiple voices. Users can convert eBooks with a simple command line ...

2025-01-16AI tools AI open source project AI Text-to-Speech

Acoust: Online AI Speech Generation and Text-to-Speech (TTS) Services Platform

Acoust is an online AI speech generation and text-to-speech (TTS) service platform that utilizes the latest AI technology to generate realistic speech. The platform also provides powerful video editing tools that allow users to create videos without having to use multiple software programs.Acoust supports more than 30 languages...

2025-01-10AI tools AI Text-to-Speech AI Speech to Text

Kokoro TTS API：快速文本转语音的Docker化FastAPI封装（Kokoro-82M模型）-首席AI分享圈

Kokoro TTS API: Dockerized FastAPI wrapper for fast text-to-speech (Kokoro-82M model)

Comprehensive Introduction Kokoro-FastAPI is a Docker-based FastAPI package designed to provide support for the Kokoro-82M text-to-speech model. The project supports NVIDIA GPU acceleration and provides queue processing and auto splicing to make speech output of raw grown text more efficient and coherent. The project ...

2025-01-09AI tools AI open source project AI Text-to-Speech

Kokoro: Efficient Speech Synthesis Models to Generate Natural and Smooth Speech

General Introduction Kokoro 82M is an efficient speech synthesis model provided by Hugging Face, designed to generate high quality speech with fewer parameters and data. The model has 82 million parameters, is distributed under the Apache 2.0 license, supports a variety of voice packs (Voicepacks), and can generate...

2025-01-08AI tools AI open source project AI Text-to-Speech

ebook2audiobook：将电子书转换为有声读物，支持多语言和语音克隆的开源工具-首席AI分享圈

ebook2audiobook: convert e-books to audiobooks, open-source tool that supports multilingualism and voice cloning

General Introduction ebook2audiobook is a powerful open source ebook to audiobook tool. It can convert multiple formats of ebooks into audiobooks with full chapter markers and metadata. The tool uses Calibre for e-book format conversion , using Coqui's XTTSv2 and Fairseq into...

2024-12-31AI tools AI open source project AI Text-to-Speech

Edge TTS Worker：使用Cloudflare部署微软语音合成API，兼容OpenAI 格式并封装Web界面-首席AI分享圈

Edge TTS Worker: Deploying Microsoft Speech Synthesis APIs with Cloudflare, OpenAI Compatible Format and Wrapped Web Interface

General Introduction Edge TTS Worker (depends on edge-tts ) is a proxy service deployed on Cloudflare Worker that encapsulates the Microsoft Edge TTS service into an API interface compatible with the OpenAI format. With this project, users can easily use without Microsoft certification...

2024-12-29AI tools AI Side Hustle Money Making Program AI open source project AI Text-to-Speech

preceding page
1
2
3
4
5
next page
Total 5 pages