AI Text-to-Speech

79 articles in total
MegaTTS3: A Lightweight Model for Synthesizing Chinese and English Speech

Comprehensive Introduction: MegaTTS3 is an open-source speech synthesis tool developed by ByteDance in cooperation with Zhejiang University that focuses on generating high-quality Chinese and English speech. Its core model has only 0.45B parameters, making it lightweight and efficient, and it supports mixed Chinese-English speech generation and voice cloning. The project is hosted on ...
4mos ago
01.3K
Llasa 1~8B: An Open-Source Text-to-Speech Model for High-Quality Speech Generation and Cloning

General Introduction: Llasa-3B is an open-source text-to-speech (TTS) model developed by the Audio Lab of the Hong Kong University of Science and Technology (HKUST Audio). The model is based on the Llama 3.2 3B architecture and has been carefully fine-tuned to provide high-quality speech generation that not only supports multiple...
6mos ago
01.7K
ViiTor AI: Multilingual Audio/Video Translation, Speech Synthesis, and Voice Cloning Service

Comprehensive Introduction: ViiTor AI is an artificial intelligence platform that provides video translation, voice cloning, AI-generated avatar videos, and speech synthesis services. The platform supports multiple languages and is designed to help users create multilingual content easily. ViiTo...
8mos ago
02.5K
GizAI: An All-in-One AI Assistant That Integrates Mainstream Generative AI Tools and Makes Commercial AI Tools Free for Everyone

General Introduction: GizAI is a one-stop platform that integrates AI generation, note-taking, and cloud storage. Users can generate images, videos, audio, text, characters, stories, and games with GizAI, and can take collaborative notes and store files in the cloud on the platform. GizAI provides multi...
8mos ago
02.8K
SoniTranslate: An Open-Source Video Translation and Dubbing Solution with Multi-Speaker Dubbing, Speech-Rate Adjustment, and Original-Voice Imitation

General Description: SoniTranslate is a user-friendly multilingual video dubbing tool that provides video translation with synchronized audio. It uses speech recognition and machine translation to translate video content into multiple languages while keeping the audio synchronized. The program ...
10mos ago
03.7K
Teaser Dubbing (逗哥配音): An Intelligent Dubbing Tool for Short-Video Narration and Creation

Comprehensive Introduction: Teaser Dubbing is a popular AI dubbing application with over 5 million users. It uses AI dubbing technology to produce professional, realistic voice-overs for scenarios such as short videos, advertisement production, and education and training. Teaser Dubbing is committed to providing users with fast...
10mos ago
01.8K
Resemble AI: AI Speech Synthesis Platform | Voice Cloning | Deepfake Audio Detection

Comprehensive Introduction: Resemble AI is an artificial intelligence speech synthesis platform designed for enterprises. The platform provides AI voice generation technology and deepfake audio detection for information security. Features include voice cloning, real-time deepfake audio detection, AI watermarking technology...
10mos ago
02K
XAudioPro: Professional Online Audio Editing Tool | Audiobook Production | Text-to-Speech | Accompaniment Separation

General Introduction: XAudioPro is an online real-time audio editing and transcoding tool that is both professional and portable. It supports professional audio editing functions such as cutting, cropping, copying, deleting, restoring, and amplitude gain control. It also provides denoising services such as spectral-subtraction noise reduction, low-pass...
10mos ago
01.6K
Magic Voice Workshop (魔音工坊): Professional Voice-Over and Short-Video Narration Platform | Human Voice-Over | Voice Cloning | One-Click Video Creation

Comprehensive Introduction: Magic Voice Workshop is a one-stop short-video and AI dubbing platform offering software dubbing, human voice-over, sound libraries, cloning services, and more. The platform integrates audio editing, AI copywriting, video editing, and collaboration tools for audio-related services and content creation. Users experience the audio editor...
10mos ago
01.6K
Record Cafe (录咖): One-Stop Audio/Video Processing Platform | Video Generation | AI Subtitles | Audio Extraction | Speech-to-Text

Comprehensive Introduction: Record Cafe is a one-stop audio/video processing platform that provides AI video dialog, AI subtitles, and AI speech-to-text services. Functions include screen recording, video editing, and GIF/audio conversion, and the platform supports cloud storage and sharing. The interface is intuitive and easy to use, and it also supports multi-screen recording and multi-language smart...
8mos ago
02K
IMS Toucan: Fast and Controllable Multilingual Text-to-Speech Toolkit (7,000+ Languages Supported)

General Introduction: IMS Toucan is a state-of-the-art text-to-speech (TTS) toolkit developed by the Institute for Natural Language Processing (IMS) at the University of Stuttgart, Germany. The toolkit supports more than 7,000 languages and is characterized by fast, controllable synthesis and low computational resource requirements. IMS...
6mos ago
01.7K
ChatTTS: A Speech Generation Model That Mimics Real Human Speech (ChatTTS One-Click Acceleration Package)

General Introduction: ChatTTS is a generative speech model designed for conversational scenarios. It generates natural and expressive speech, supports multiple languages and multiple speakers, and is suitable for interactive conversations. The model achieves this by predicting and controlling fine-grained prosodic features such as laughter, pauses, and interjections, sup...
6mos ago
01.9K
Tencent Smart Shadow (腾讯智影): Intelligent Video Creation Tool | AI Digital Humans and Anime Generation Suite

Comprehensive Introduction: Tencent Smart Shadow is an online intelligent video creation platform launched by Tencent. Its cloud-based AI tools support text dubbing, digital human broadcasting, automatic subtitle recognition, and other functions. It integrates material search, video editing, rendering, export, and publishing, bringing users a convenient visual...
1yr ago
02.2K
Sound Clipping (音剪): Himalaya's Audio Creation Platform with Natural Human Voices and Multi-Narrator Support

Comprehensive Introduction: Himalaya Audio Editor is a comprehensive AI audio creation platform. It supports professional-grade podcast production, multi-track recording, audio editing, and text-to-speech conversion. The platform also offers multiple professional voice options, helping users...
1yr ago
02.2K