AI Text-to-Speech

Total 79 articles posts

Sorting

Muyan-TTS: Personalized Podcast Speech Training and Synthesis

Synthesis Muyan-TTS is an open source text-to-speech (TTS) model designed for podcasting scenarios. It is pre-trained with over 100,000 hours of podcast audio data and supports zero-sample speech synthesis to generate high-quality natural speech. The model is based on Llama-3.2-3...

11mos ago

071.9K

Kimi-Audio: Open Source Audio Processing and Dialogue Base Modeling

Comprehensive Introduction Kimi-Audio is an open source audio base model developed by Moonshot AI that focuses on audio understanding, generation and dialog. It supports a wide range of audio processing tasks such as speech recognition, audio Q&A and speech emotion recognition. The model has been tested over 130...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech # AI Speech to Text

11mos ago

0124.6K

Audibit: turning popular tech articles into ready-to-listen audio podcasts

General Introduction Audibit is an open source project, the core function is to Hacker News, TechCrunch and other popular technology articles automatically turned into audio podcasts, so that users in the commute, fitness, or busy when listening to information through the Web or mobile. The project makes ...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

11mos ago

052.9K

Dia: text-to-speech modeling for generating hyper-realistic multiplayer conversations

General Introduction Dia is an open source text-to-speech (TTS) model developed by Nari Labs that focuses on generating hyper-realistic dialog audio. It transforms text scripts into realistic multi-character dialog in a single process, supports emotion and intonation control, and even generates non-verbal representations...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

12mos ago

075.6K

Orpheus-TTS: Text-to-Speech Tool for Generating Natural Chinese Speech

General Introduction Orpheus-TTS is an open source text-to-speech (TTS) system developed on the Llama-3b architecture, with the goal of generating audio that is close to natural human speech. It is launched by the Canopy AI team and supports English, Spanish, French...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

092K

ElevenLabs MCP: Speech Generation MCP Service

General Introduction ElevenLabs MCP is an official ElevenLabs open source project hosted on GitHub. It is a service based on the Model Control Protocol (Model Context Protocol, MCP)...

Latest AI Resources # AI text-to-speech # MCP services

1yrs ago

059.4K

Vapi: Helping developers quickly build low-latency voice assistants

Comprehensive Introduction Vapi is a voice AI platform for developers. It enables users to build, test and deploy voice AI assistants in minutes, solving the problem of time-consuming and difficult to scale traditional voice application development.Vapi provides complete tools and infrastructure to support real-time conversations, electric...

Latest AI Resources # AI Open Services # AI text-to-speech

1yrs ago

071.4K

Conch Speech (MiniMax Audio): AI tool for generating natural speech

Comprehensive Introduction MiniMax Audio is an AI speech generation tool from MiniMax, the core feature of which is to quickly convert text to natural speech with high similarity. It is based on the Speech-02 model, with a speech synthesis similarity of up to 99...

Latest AI Resources # AI text-to-speech # AI voice cloning

10mos ago

0131.8K

Text2Voice: A Text-to-Speech Graphical Interface Based on Silicon Flow APIs

General Introduction Text2Voice is an open source tool that provides text-to-speech functionality based on a silicon-based mobility API, and is best characterized as coming with a clean graphical user interface (GUI). It was created by developer Sheldon Lee on GitHub to allow...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

057.5K

Open source operational project integrating multiple advanced speech synthesis services

General Introduction Open-VoiceCanvas is an open source speech synthesis platform developed by the ItusiAI team. It supports more than 50 languages, and can convert text to natural speech, as well as clone personalized voices by uploading audio. The project integrates Ope...

Latest AI Resources # AI Side Hustle Money Making Programs # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

059.2K

Paper to Podcast: Converting Academic Papers to Multi-Person Conversation Podcasts

General Introduction Paper to Podcast is an open source tool that specializes in transforming academic research papers into lively and entertaining podcasts. It uses artificial intelligence technology to turn a PDF-formatted paper into a dialog between three characters - the host, the learner, and the expert - to make complex...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

049.4K

MegaTTS3: A Lightweight Model for Synthesizing Chinese and English Speech

Comprehensive Introduction MegaTTS3 is an open source speech synthesis tool developed by ByteDance in cooperation with Zhejiang University, focusing on generating high-quality Chinese and English speech. Its core model is only 0.45B parameters , lightweight and efficient , support for mixed Chinese and English speech generation and speech cloning . The project is hosted on ...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech # AI voice cloning

1yrs ago

069.7K

Podcastle: the AI tool for quickly creating high-quality podcasts

General Introduction Podcastle is an AI-based online platform that specializes in helping users quickly create and edit high-quality podcasts. It integrates recording, editing, and publishing features, and users can do it all through a browser without the need for specialized equipment or complex software. The platform utilizes ...

Latest AI Resources # AI text-to-speech # AI audio/video editor

1yrs ago

055.6K

IndexTTS: Text-to-Speech Tool with Chinese-English Mixing Support

General Introduction IndexTTS is an open source text-to-speech (TTS) tool hosted on GitHub and developed by the index-tts team. It is based on XTTS and Tortoise technology , by improving the module design , to provide efficient and ...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

0124.7K

csm-mlx: csm speech generation model for Apple devices

Comprehensive Introduction csm-mlx is based on the MLX framework developed by Apple, specifically optimized for Apple Silicon (Apple Silicon) CSM (Conversation Speech Model) voice conversation model. This project allows the use of ...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

062.4K

Autiobooks: convert epub ebooks to m4b audiobooks

General Introduction Autiobooks is an open source tool designed to help users quickly convert eBooks in .epub format to audiobooks in .m4b format. It uses quality speech synthesis technology provided by Kokoro to generate natural and smooth audio. This tool was developed by...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

058.5K

PlayHT: an AI tool for generating hyper-realistic speech

Comprehensive Introduction PlayHT is an efficient online platform focusing on AI speech generation, helping users quickly convert text into natural, realistic speech. It provides more than 600 AI voices supporting more than 60 languages and diverse accents for podcast production, educational content, marketing promotion...

Latest AI Resources # AI text-to-speech # AI voice cloning

1yrs ago

058.8K

MLX-Audio: A Text-to-Speech Tool Based on Apple's MLX Framework

General Introduction MLX-Audio is an open source tool developed based on Apple's MLX framework, focusing on Text-to-Speech (TTS) and Speech-to-Speech (STS) functionality. It leverages the power of Apple Silicon (e.g. M-series chips)...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

0107.6K

Spark-TTS: A Text-to-Speech Tool for Generating Natural Speech

General Introduction Spark-TTS is an open source Text-to-Speech (TTS) tool developed by the SparkAudio team, hosted on GitHub, designed to help users efficiently convert text into natural and fluent speech...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech # AI voice cloning

1yrs ago

073.4K

Cat and Star: a story-listening app that writes exclusive fairy tales with your child

Comprehensive Introduction "Cat & Star" (maoyuxing.com) is an interactive story creation platform designed for children, helping parents and children create personalized fairy tales together through a mobile application. Users can input information such as the child's name and preferences to generate unique story content...

Latest AI Resources # AI Educational Tools # AI text-to-speech

1yrs ago

058.2K

Azure TTS Importer: Integrating speech synthesis services into reading software

Comprehensive Introduction TTS Importer is an open source project designed to easily import Azure TTS (Text-to-Speech) speech synthesis services into a variety of reading software. The tool supports several popular reading programs, including Read (legado...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

055.2K

NVIDIA PDF to Podcast：设置引导提示词将PDF转换为播客的AI工具

NVIDIA PDF to Podcast: AI Tool for Converting PDF to Podcast by Setting Guiding Prompts

General Introduction NVIDIA AI Blueprint: PDF to Podcast is an open source project developed by NVIDIA to convert PDF documents into engaging audio content. The project utilizes NVIDIA NIM (NVID...

AI News # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

057.8K

Kokoro WebGPU: A Text-to-Speech Service for Offline Operation in Browsers

General Introduction Kokoro WebGPU is a WebGPU version of the Kokoro text-to-speech (TTS) model, provided by WebML Community on the Hugging Face platform. The project utilizes WebGPU technology to enable users to...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

080K

Orate: A Unified API for Integrating Well-Known Speech Generation, Speech Transcription and Voice Change Models

Comprehensive Introduction Orate is an AI toolkit focused on speech generation and transcription. It provides a unified API that seamlessly integrates with leading AI providers such as OpenAI, ElevenLabs, and AssemblyAI to help users create forced...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech # AI Speech to Text

1yrs ago

064.7K

Weights: a voice-imitation cover song and text-to-speech authoring platform

General Introduction Weights is a social platform that utilizes AI for creation, allowing users to create voice covers, text-to-speech, images, music, and videos with simple operations. The platform provides a wealth of tools and templates to help users get started creating quickly and share with the community since...

Latest AI Resources # AI text-to-speech # AI voice cloning

1yrs ago

0128.7K

AnyVoice: free online voice cloning, just 3 seconds to realize the voice cloning

General Introduction AnyVoice is an advanced AI speech generation platform that provides ultra-realistic speech generation and voice cloning services. The platform allows users to convert text into natural speech and choose from hundreds of preset voices. If you can't find the right voice, just...

Latest AI Resources # AI text-to-speech # AI voice cloning

1yrs ago

084.2K

Open NotebookLM: convert PDF to podcasts of open source tools

General Introduction Open NotebookLM is an open source project designed to convert any PDF document into a podcast. The tool utilizes open source Large Language Model (LLM) and Text-to-Speech (TTS) models to process PDF content and generate natural dialog suitable for audio podcasts...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

060.9K

Llasa 1~8B: an open source text-to-speech model for high quality speech generation and cloning

General Introduction Llasa-3B is an open source text-to-speech (TTS) model developed by the Audio Lab of the Hong Kong University of Science and Technology (HKUST Audio). The model is based on the Llama 3.2B architecture, which has been carefully tuned to provide high-quality speech generation that not only supports multiple...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech # AI voice cloning

1yrs ago

076K

Kokoro-ONNX: Efficient Text-to-Speech Tool with Multi-Language and Multi-Voice Support

General Introduction Kokoro-ONNX is an open source text-to-speech (TTS) tool based on ONNX runtime. Developed by thewh1teagle, the project aims to provide efficient and fast speech synthesis solutions.Kokoro-ONNX supports ...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

0107.2K

OpenAI Edge TTS：利用 Edge TTS 的免费文本转语音API，兼容 OpenAI 格式

OpenAI Edge TTS: Free text-to-speech API utilizing Edge TTS, compatible with OpenAI formats

General Introduction OpenAI Edge TTS is an open source project that provides an OpenAI-compatible native text-to-speech (TTS) API.The project uses Microsoft Edge's online text-to-speech service to allow users to generate high-quality...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

084.9K

Jellypod: produce multilingual AI podcasts, create, edit and distribute AI podcasts

General Introduction Jellypod is a powerful AI podcast studio designed to help users easily create, edit and publish high-quality AI podcasts. With Jellypod, users can design personalized podcast hosts, refine scripts, and publish podcasts to ...

Latest AI Resources # AI text-to-speech

1yrs ago

061.4K

Sherpa-ONNX: Offline Speech Recognition and Synthesis with ONNXRuntime

General Introduction sherpa-onnx is an open source project developed by the Next-gen Kaldi team to provide efficient offline speech recognition and speech synthesis solutions. It supports multiple platforms including Android, iOS, Raspber...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech # AI Speech to Text

1yrs ago

0286.6K

Audiblez: Generate Audiobooks, Convert eBooks to Audiobooks with Kokoro

General Introduction Audiblez is an open source project designed to convert eBooks (e.g. .epub format) into audiobooks (e.g. .m4b format). The project utilizes Kokoro's high-quality speech synthesis technology to support multiple languages and multiple voices. Users can simply...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

062.3K

Acoust: Online AI Speech Generation and Text-to-Speech (TTS) Services Platform

General Introduction Acoust is an online AI speech generation and text-to-speech (TTS) service platform that utilizes the latest AI technology to generate realistic speech. The platform also provides powerful video editing tools that allow users to complete video production without the need to use multiple software.Acou...

Latest AI Resources # AI text-to-speech # AI Speech to Text

1yrs ago

054.3K

Kokoro TTS API：快速文本转语音的Docker化FastAPI封装（Kokoro-82M模型）

Kokoro TTS API: Dockerized FastAPI wrapper for fast text-to-speech (Kokoro-82M model)

General Introduction Kokoro-FastAPI is a Docker-based FastAPI wrapper designed to provide support for the Kokoro-82M text-to-speech model. The project supports NVIDIA GPU acceleration and provides queue processing and auto splicing...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

0129.6K

Kokoro: Efficient Speech Synthesis Models to Generate Natural and Smooth Speech

General Introduction Kokoro 82M is an efficient speech synthesis model provided by Hugging Face, designed to generate high quality speech with fewer parameters and data. The model has 82 million parameters and is licensed under Apache 2.0...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

071.6K

ebook2audiobook：将电子书转换为有声读物，支持多语言和语音克隆的开源工具

ebook2audiobook: convert e-books to audiobooks, open-source tool that supports multilingualism and voice cloning

General Introduction ebook2audiobook is a powerful open source ebook to audiobook tool. It is capable of converting eBooks in multiple formats into audiobooks with full chapter markers and metadata. The tool uses Calibre for eBook format conversion using Co...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

089.2K

Edge TTS Worker：使用Cloudflare部署微软语音合成API，兼容OpenAI 格式并封装Web界面

Edge TTS Worker: Deploying Microsoft Speech Synthesis APIs with Cloudflare, OpenAI Compatible Format and Wrapped Web Interface

General Introduction Edge TTS Worker (dependent on edge-tts) is a proxy service deployed on Cloudflare Worker that encapsulates the Microsoft Edge TTS service in an OpenAI-compatible format ...

Latest AI Resources # AI Side Hustle Money Making Programs # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

0121.7K

ViiTor AI: Audio/Video Multilingual Translation Synthesis and Speech Cloning Service

Comprehensive Introduction ViiTor AI is a powerful artificial intelligence platform focused on providing high-quality video translation, voice cloning, AI-generated avatar videos, and speech synthesis services. The platform supports multiple languages and is designed to help users easily realize multilingual content creation.ViiTo...

Latest AI Resources # AI text-to-speech # AI voice cloning # AI audio/video editor

1yrs ago

083.1K

Wondercraft: text-to-audio tool focusing on commercial voiceovers, multiplayer audiobooks and podcasts

General Introduction Wondercraft is a revolutionary AI-driven audio and video creation platform that provides content creators with a one-stop solution for audio and video production. The platform utilizes advanced AI technology that can convert text content into natural and smooth speech, supporting more than 20 languages...

Latest AI Resources # AI text-to-speech

1yrs ago

052K

NotebookLM Podcast: Generate Multilingual Personalized AI Podcasts from Any Document (Paid)

General Description NotebookLM Podcast is an innovative platform that utilizes artificial intelligence technology to transform any textual content into dynamic, engaging audio podcasts. Whether you're a student, educator, content creator or busy professional, NotebookLM...

Latest AI Resources # AI text-to-speech

1yrs ago

049.1K

AivisSpeech: Generating Emotionally Rich Japanese Speech Synthesis Software

General Introduction AivisSpeech is a Japanese speech synthesizer based on the VOICEVOX editor UI. It integrates the AivisSpeech Engine to easily generate emotionally rich speech.AivisSpeech supports...

Latest AI Resources # AI text-to-speech

1yrs ago

083.1K

PlayAI: providing smooth and emotional voice dialog and speech synthesis services (English)

Comprehensive Introduction PlayAI is an artificial intelligence platform focused on speech generation and speech cloning. It offers a wide range of speech models capable of generating smooth and emotional conversations. Users can use the platform to create personalized voice agents to enhance the interactive experience.PlayAI's technology is applicable...

Latest AI Resources # AI text-to-speech

1yrs ago

063.9K

GizAI：全能AI助手，集成主流生成式AI工具，让每个人免费使用商业化AI工具

GizAI: All-in-one AI assistant, integrating mainstream generative AI tools, making commercialized AI tools free for everyone to use

General Introduction GizAI is a one-stop platform that integrates AI generation, note-taking and cloud storage capabilities. Users can generate images, videos, audios, texts, characters, stories and games with GizAI, and can take collaborative notes and cloud storage on the platform.GizAI provides multi...

Latest AI Resources # AI online image generation # AI text-to-speech # AI Integrated Multi-Model Dialog Platform

1yrs ago

090.4K

OuteTTS: an experimental text-to-speech model, TTS implemented using a pure language modeling approach

Comprehensive Introduction OuteTTS is an experimental text-to-speech (TTS) model that uses a pure language modeling approach to generate high-quality speech. Unlike traditional TTS systems, OuteTTS does not require external adapters or complex architectures. The model is based on the LLaMa architecture...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

077.9K

PodLM: Generate multilingual audio podcasts of conversations, web pages or long texts (paid)

General Introduction PodLM is a state-of-the-art AI podcast generation platform designed to help users quickly convert text, document or URL content into high-quality podcast audio. By utilizing cutting-edge AI technology, PodLM is able to automatically generate structured and engaging podcast scripts and...

Latest AI Resources # AI text-to-speech

1yrs ago

050.2K

SoniTranslate：开源视频翻译配音解决方案，多人配音、调整语速与模仿原声

SoniTranslate: open source video translation and dubbing solution, multi-person dubbing, adjust the speed of speech and mimic the original sound

General Description SoniTranslate is a powerful and user-friendly video multilingual dubbing tool designed to provide a solution for video translation and synchronized audio. It uses advanced speech recognition and machine translation technologies to translate video content into multiple languages and keep the audio synchronized. The program ...

Latest AI Resources # AI text-to-speech # AI Translation # AI Speech to Text

1yrs ago

0139K

Teaser Dubbing: Intelligent dubbing tool that focuses on short video narration and creation

Comprehensive Introduction Tease Dubbing is a popular AI dubbing software with over 5 million users. The software utilizes advanced AI intelligent dubbing technology to provide professional and realistic dubbing effects, which is applicable to a variety of scenarios such as short videos, advertisement production, education and training. Teaser Dubbing is committed to providing users with fast...

Latest AI Resources # AI text-to-speech # AI audio/video editor

1yrs ago

067.1K

YouTube Dubbing：实时将YouTube视频翻译为不同语言并同步配音

YouTube Dubbing: Translate YouTube videos into different languages and synchronize dubbing in real time

General Introduction YouTube Dubbing is an intelligent dubbing platform that specializes in multilingual dubbing for video creators and viewers. Through AI technology, the platform is able to automatically translate and generate dubs from YouTube videos, supporting multiple languages and voice styles. Users only need to install...

Latest AI Resources # AI text-to-speech

1yrs ago

066.8K

Podcastfy：多源内容转多语言音频对话工具，NotebookLM 播客功能的开源替代方案

Podcastfy: Multi-source Content to Multilingual Audio Conversation Tool, an Open Source Alternative to NotebookLM's Podcasting Capability

General Introduction Podcastfy is an open source Python package that utilizes Generative Artificial Intelligence (GenAI) technology to convert web content, PDF files, text, images, youtube videos, and many other sources into engaging multilingual...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

057.9K

QuickPiperAudiobook：一键生成自然音质的有声书,支持PDF、epub、docx等格式

QuickPiperAudiobook: a key to generate natural sound quality audiobooks, support for PDF, epub, docx and other formats

Comprehensive Introduction QuickPiperAudiobook is an open source project designed to convert various text formats (e.g. epub, mobi, txt, PDF, HTML, etc.) into natural-sounding audiobooks through a simple one command. The tool uses Pi...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

2yrs ago

052.9K

PDF2Audio: PDF to audio conversion tool, PDF converter

General Introduction PDF2Audio is an open source project designed to convert PDF files into audio content such as podcasts, lectures and summaries. The tool utilizes OpenAI's GPT model for text generation and text-to-speech conversion, and allows users to upload multiple PDF ...

Latest AI Resources # AI text-to-speech

2yrs ago

062.9K

Seaweed AI: Intelligent Speech Synthesis and Voice Cloning Platform

Comprehensive Introduction Seaweed AI is an intelligent dubbing product that can convert text into voice online, powered by the Yun Zhisheng AI open platform. Users can self-help realize voice cloning, and provide AI pronouncers of different genders, accents and languages, and directly dub the voice after inputting text. It can quickly dub short...

Latest AI Resources # AI text-to-speech # AI voice cloning

2yrs ago

051K

edge-tts: Text-to-Speech Python Module | Free Text-to-Speech Service

General Description edge-tts is an open source Python module that allows users to use Microsoft Edge's online text-to-speech service in Python code without the need for the Microsoft Edge browser, Windows operating system or API secret...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

2yrs ago

0108.6K

Descript: One-stop video and podcast editing, as simple as editing a document

Descript General Description Descript is a powerful yet easy to use video and podcast editing tool. It has industry-leading transcription accuracy and speed and powerful correction tools, as well as the ability to transcribe video to text with AI technology and edit video by editing text. In addition to...

Latest AI Resources # AI text-to-speech # AI audio/video editor

2yrs ago

065K

Murf AI: Voice Changer|Speech to Text|Text to Speech|Audio Editor

General Introduction Murf AI is a powerful online artificial intelligence voice generation tool that converts text into near real human speech. It offers up to 120+ AI voice options, supports 20+ languages, and is suitable for a variety of occasions, such as podcasts, videos, professional presentations, etc.Mu...

Latest AI Resources # AI text-to-speech # AI Speech to Text

2yrs ago

057K

Resemble AI: Artificial Intelligence Speech Synthesis Platform | Voice Cloning | Deep Fake Audio Detection

Comprehensive Introduction Resemble AI is an artificial intelligence speech synthesis platform designed for the enterprise. The platform provides cutting-edge AI voice generator technology and deep forged audio detection for future information security. Features include voice cloning, real-time deep fake audio detection, AI watermarking technology...

Latest AI Resources # AI text-to-speech # AI voice cloning

2yrs ago

058.5K

Ondoku: Online Text Reader|Text to Speech|Image to Speech Reader

Ondoku General Introduction Ondoku is an online text-to-speech software that allows users to enter text content into the text box provided by the website, and the software is able to convert the article into a voice readout according to the user's needs, and supports saving the voice as an MP3 format file. This service is suitable for both instant listening...

Latest AI Resources # AI text-to-speech

2yrs ago

0100.1K

XAudioPro: Professional Online Audio Editing Tool|Audiobook Maker|Text to Speech|Accompaniment Separation

General Introduction XAudioPro is an advanced online audio real-time editing and transcoding tool that is both professional and portable. It supports professional audio editing functions such as cutting, cropping, copying, deleting, restoring, and amplitude gain control. It also provides denoising services such as spectral subtraction noise reduction, low-pass...

Latest AI Resources # AI text-to-speech # AI audio/video editor

2yrs ago

064.5K

Hume AI：赋予AI情感识别能力|从声音和表情识别情感状态|生成具有情感状态的语音

Hume AI: Empowering AI with Emotion Recognition | Recognizing Emotional States from Sounds and Expressions | Generating Speech with Emotional States

General Introduction Hume AI is an AI company focused on emotional intelligence, developing multimodal AI technologies that understand and respond to human emotions. Its flagship product, the Empathic Voice Interface (EVI), is able to recognize and respond to a user's...

Latest AI Resources # AI Open Services # AI text-to-speech

2yrs ago

073.5K

Magic Voice Workshop: professional voice-over and short video narration creation platform | real person voice-over | clone voice | one-click into a film

Comprehensive Introduction Magic Voice Workshop is a one-stop short video and AI dubbing platform with information on software dubbing, real-life dubbing, sound libraries, cloning services and more. The platform integrates audio editing, AI copy generation, video editing and collaboration tools for audio-related services and content creation. Users experience the audio editor...

Latest AI Resources # AI text-to-speech # AI voice cloning # AI audio/video editor

2yrs ago

067.4K

EmotiVoice: Text-to-Speech Engine with Multi-Voice and Emotional Cueing Controls

Comprehensive Introduction EmotiVoice is a text-to-speech (TTS) engine with multiple voices and emotional cue control developed by NetEaseYoudao. This open source TTS engine supports English and Chinese, has more than 2000 different voices, and has the ability to synthesize emotions to create a voice with happy...

Latest AI Resources # AI text-to-speech

1yrs ago

087.3K

Listnr: Multilingual AI Speech Generator, Transformative Human Voice Synthesis Technology

General Introduction Listnr is a text-to-speech software with a generative AI engine that creates speech synthesis in 1,000+ different voices in 142+ languages, including cloning your own voice. The platform serves over 1 million users across short videos, YouTub...

Latest AI Resources # AI text-to-speech # AI voice cloning

2yrs ago

062.8K

Uberduck: AI-generated rap music and voice cloning platform|Text to Speech

General Introduction Uberduck AI is an innovative platform for creative agencies, music producers and programmers to synthesize singing and speaking voices with AI. Users can choose different musical rhythms, generate lyrics using AI or write their own, select specific sounds and ultimately create rap songs...

Latest AI Resources # AI text-to-speech # AI voice cloning # AI Music

2yrs ago

058.9K

NotebookLM: Knowledge Notes Retrieval Reading, Multi-Class Document Generation Voice Dialog Podcasts

General Introduction NotebookLM is a personalized AI collaboration tool from Google designed to help users use their minds to their full potential. Users can upload documents, and NotebookLM instantly masters the content from those sources, making it easy to read...

Latest AI Resources # AI Educational Tools # AI text-to-speech # AI Notes

10mos ago

060.6K

Record Cafe: One-stop Audio/Video Processing Platform|Video Generation|AI Subtitle|Audio Extraction|Speech to Text

Comprehensive Introduction Record Cafe is a one-stop audio/video processing platform that provides AI video dialog, AI subtitles and AI speech to text services. Functions include recording screen, editing video, converting GIF/audio, etc., and supports cloud storage and sharing. The interface is intuitive and easy to use, and it also supports multi-screen recording and multi-language smart...

Latest AI Resources # AI text to video # AI text-to-speech # AI Speech to Text

1yrs ago

066.6K

IMS Toucan: Fast and Controllable Multilingual (7000+ languages supported) Text-to-Speech Tool

General Introduction IMS Toucan is a state-of-the-art text-to-speech (TTS) toolkit developed by the Institute for Natural Language Processing (IMS) at the University of Stuttgart, Germany. The toolkit supports more than 7000 languages and is characterized by fast, controllable and low computational resource requirements.IMS...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

058K

ChatTTS: a speech generation model that mimics the voice of a real person speaking (ChatTTS one-click acceleration package)

General Introduction ChatTTS is a generative speech model designed for conversational scenarios. It generates natural and expressive speech, supports multiple languages and multiple speakers, and is suitable for interactive conversations. The model does this by predicting and controlling fine-grained prosodic features such as laughter, pauses and interjections, sup...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

070.6K

FreeTTS: Free Online Text-to-Speech Tool|Audio Enhancement|Audio Clips

FreeTTS General Description FreeTTS is a free online text-to-speech tool that allows users to convert text to natural sounding voice files. Supporting multiple languages and sound options, users can convert text to MP3, WAV, OGG and ACC formats...

Latest AI Resources # AI text-to-speech # AI Speech to Text # AI audio/video editor

2yrs ago

068.5K

ElevenLabs: High Quality AI Speech Generation Platform, Text Dubbing and Speech Cloning Tool

General Introduction ElevenLabs is a startup based in New York, USA, specializing in the field of generative AI speech. The company offers a range of powerful services for text-generated speech, speech-generated speech, speech cloning, and speech recognition.ElevenLabs excels in...

Latest AI Resources # AI text-to-speech # AI voice cloning

2yrs ago

058.1K

Easy Voice Toolkit: AI Voice Toolkit for Local Deployment

Comprehensive Introduction Easy-Voice-Toolkit is a multifunctional toolkit based on the Open Source Speech Project, providing a variety of automated audio tools for speech recognition, speech transcription, speech conversion, dataset creation and model training. Users can selectively use these tools as needed...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech # AI voice cloning

2yrs ago

063.5K

DupDub: AI-powered Video Editor|Dubbing|Video Translation|Photo Digitizer

General Description Dupdub is a side-heavy podcast and video presentation creation platform that offers a range of AI tools to support users' creativity. Features cover text to video creation, offering AI voice and video dubbing services, as well as video editing, transcription and subtitling. Dupdub is also ...

Latest AI Resources # AI Digital Man # AI text-to-speech # AI Speech to Text

2yrs ago

055.1K

TTSMaker: free online text-to-speech tool

General Introduction TTSMaker is a free online text-to-speech tool that supports more than 100 languages and 300 speech styles. Users can convert text to natural and smooth speech and download audio files for commercial use. The tool is suitable for video dubbing, audiobooks, education and training...

Latest AI Resources # AI text-to-speech

2yrs ago

068.3K

Vidnoz AI: Generate Digital Human Speaking Videos with Just a Photo, Multiple Free Video Generation Tools

General Description Vidnoz is a free AI video generation platform to quickly create AI videos in less than 1 minute. No cost, download or experience required. The platform offers 500+ AI avatars, 470+ realistic AI voiceovers and 500+ templates. With Vidnoz AI video...

Latest AI Resources # AI Image to Video # AI Digital Man # AI text to video

2yrs ago

083.4K

Memo AI: Native Client for Video to Subtitle, Converting Multilingual Subtitles

General Description MemoAI is a powerful video translation tool specialized in converting video and audio files to text, subtitles and notes. Whether it's a YouTube video, a podcast or a local file, MemoAI can handle it with ease. It supports more than 90 languages such as Chinese, English, Japanese...

Latest AI Resources # AI text-to-speech # AI Speech to Text # AI audio/video editor

1yrs ago

065.8K

Tencent Smart Shadow: Intelligent Video Creation Tool | AI Digital Man, Anime Generation Kit

Comprehensive Introduction Tencent Smart Shadow is an online intelligent video creation platform launched by Tencent, which can support text dubbing, digital human broadcasting, automatic subtitle recognition and other functions through powerful AI tools provided by cloud services.It integrates material search, video editing, rendering export and publishing, bringing users a convenient visual...

Latest AI Resources # AI Writing # AI Digital Man # AI text to video

2yrs ago

081.9K

pyvideotrans: Video Translation Dubbing Tool

pyVideoTrans General Introduction pyvideotrans is a video translation dubbing tool. Users are able to translate video content from one language to another, and add appropriate dubbing and subtitles to the video. It is based on openai-whisper offline...

Latest AI Resources # AI text-to-speech # AI Speech to Text # AI audio/video editor

2yrs ago

082.9K

Sound clipping: Himalaya's natural human voice, multi-narrator audio creation platform

Comprehensive Introduction Himalaya Audio Editor is a comprehensive AI audio creation platform. It offers powerful features that support users with professional-grade podcast production, multi-track recording, audio editing, and the ability to convert text to speech. The platform also contains multiple options for professional voice, helping users...

Latest AI Resources # AI text-to-speech # AI audio/video editor

2yrs ago

063.1K

Parler-TTS: Generating speaker-specific text-to-speech models from input text

General Introduction Parler-TTS is an open source text-to-speech (TTS) modeling library developed by Hugging Face, designed to generate high-quality, natural-sounding speech. The model is capable of generating speech based on input text with a specific speaker style (e.g. gender, pitch, speaking style...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

1yrs ago

068K

No more