🚀 Invitation to Experience: China's First AI IDE Intelligent Programming Software Trae Chinese version downloadThe DeepSeek-R1 and Doubao-pro are available for unlimited use!

Total 53 articles

Tags: ai speech to text Page 3

Buzz: open source offline audio transcription translation tool | IOS voice transcription

Buzz General Introduction Buzz is an open source project created by chidiwilliams that enables offline transcription and translation of audio on personal computers. The project relies on OpenAI's Whisper technology, which allows users to work on transcribing and translating audio files without relying on an Internet connection. Via GitHub, ...

2024-10-09AI tools AI Speech to Text

Deepgram: service API for high-precision speech recognition and synthesis solutions

General Description Deepgram is a company focused on speech recognition and natural language processing technologies, providing powerful Speech-to-Text and Text-to-Speech APIs.The platform utilizes advanced AI technology to help developers incorporate speech transcription and comprehension capabilities...

2024-10-09AI tools AI Open Services AI Speech to Text

Trae Chinese Version First Invitation to Download: Unlimited use of DeepSeek-R1 after registration!

Enable Builder Smart Programming Mode, unlimited use of DeepSeek-R1 and DeepSeek-V3, smoother experience than the overseas version. Just enter the Chinese commands, even a novice programmer can write his own apps with zero threshold.

2025-04-24

Murf AI: Voice Changer|Speech to Text|Text to Speech|Audio Editor

Comprehensive Introduction Murf AI is a powerful online artificial intelligence voice generation tool that converts text into near-life-like speech. It offers up to 120+ AI voice options, supports 20+ languages, and is suitable for a variety of occasions such as podcasts, videos, professional presentations, etc.Murf AI also features audio...

2024-10-08AI tools AI Text-to-Speech AI Speech to Text

VideoLingo：视频转录单词级时间轴字幕，视频字幕翻译和本地化配音开源工具-首席AI分享圈

VideoLingo: video transcription word-level timeline subtitles, video subtitle translation and localized dubbing open source tools

General Description VideoLingo is a one-stop video translation and localization dubbing tool designed to generate Netflix-grade, high-quality subtitles, eliminating raw machine translation and multi-line subtitles and adding high-quality voiceovers to enable global knowledge to be shared across language barriers. With intuitive Streamlit ...

2024-09-30AI tools AI Side Hustle Money Making Program AI translation AI Speech to Text

ALog: portable AI voice diary app with speech-to-text support.

General Introduction ALog is an AI-based voice diary application designed to help users record their daily lives by voice. The project is developed by duxins and open-sourced on GitHub. Users can record their diary through voice input, and the app will automatically convert the voice to text and analyze it intelligently...

2024-09-23AI tools AI open source project AI Speech to Text

录咖：一站式音视频处理平台|视频生成|AI字幕|提取音频|语音转文字-首席AI分享圈

Record Cafe: One-stop Audio/Video Processing Platform|Video Generation|AI Subtitle|Audio Extraction|Speech to Text

Comprehensive Introduction Record Cafe is a one-stop audio/video processing platform that provides AI video dialog, AI subtitles and AI speech to text services. Features include recording screen, editing video, converting GIF/audio, etc., and supports cloud storage and sharing. The interface is intuitive and easy to use, and it also supports multi-screen recording and multi-language intelligent reading...

2024-09-11AI tools AI Text to Video AI Text-to-Speech AI Speech to Text

CrisperWhisper: Accurate Verbatim Speech Transcription Tool

General Description CrisperWhisper is an advanced speech recognition tool based on OpenAI Whisper that focuses on fast, accurate and word-by-word speech transcription. It provides accurate word-level timestamps, even in the presence of speech fills and pauses.CrisperWhisper works by tuning...

2024-09-09AI tools AI open source project AI Speech to Text

Babelfish.ai: Browser-Run Real-Time Speech Transcription and Translation Application

General Introduction Babelfish.ai is a real-time transcription and translation application built on Huggingface Transformer.js and Supabase Realtime. The application can load large models in the browser and run them locally to realize real-time speech-to-text and translation functions. Users can use the simple...

2024-09-09AI tools AI open source project AI Speech to Text

FreeTTS: Free Online Text-to-Speech Tool|Audio Enhancement|Audio Clips

FreeTTS General Description FreeTTS is a free online text-to-speech tool that allows users to convert text to natural sounding voice files. Supporting multiple languages and sound options, users can convert text to MP3, WAV, OGG and ACC formats.FreeTTS also provides voice transcription, sound...

2024-09-05AI tools AI Text-to-Speech AI Speech to Text AI audio and video editing

Easy Voice Toolkit: AI Voice Toolkit for Local Deployment

Comprehensive Introduction Easy-Voice-Toolkit is a multifunctional toolkit based on the Open Source Speech Project that provides a wide range of automated audio tools for speech recognition, speech transcription, speech conversion, dataset creation and model training. Users can use these tools selectively or sequentially as needed...

2024-09-04AI tools AI open source project AI Text-to-Speech AI voice cloning AI Speech to Text

DupDub: AI-powered Video Editor|Dubbing|Video Translation|Photo Digitizer

General Description Dupdub is a side-heavy podcast and video presentation creation platform that offers a range of AI tools to support user creativity. Features cover text to video creation, offering AI voice and video dubbing services, as well as video editing, transcription and subtitling. Dupdub again out of the gate launched...

2024-08-31AI tools AI digital person AI Text-to-Speech AI Speech to Text AI audio and video editing

Tongyi Listening and Understanding: Ali Tongyi Audio and Video Content Transcription AI Assistant

Comprehensive Introduction Tongyi Listening and Understanding is a work-study AI assistant launched by Aliyun, focusing on transcribing and analyzing audio and video content. It relies on AliCloud's powerful AI models to transcribe audio and video content into text in real time, and provides translation, summarization, positioning and other functions. Tongyi Listening Woo supports multiple languages and scenarios...

2024-08-30AI tools AI text and audio/video summarization tool AI Speech to Text

Insanely Fast Whisper: fast and efficient transcription of speech to text open source project

Comprehensive Introduction insanely-fast-whisper is an audio transcription tool that combines OpenAI's Whisper model with various optimization techniques (e.g. Transformers, Optimum, Flash Attention) to provide a command line interface (CLI) designed to transcribe large amounts of audio quickly and efficiently. It uses Whi...

2024-08-17AI tools AI open source project AI Speech to Text

Memo AI: Native Client for Video to Subtitle, Converting Multilingual Subtitles

General Description MemoAI is a powerful video translation tool specialized in converting video and audio files to text, subtitles and notes. Whether it's a YouTube video, a podcast or a local file, MemoAI can handle it with ease. It supports transcription and translation in more than 90 languages such as Chinese, English, Japanese, etc.MemoAI...

2024-08-16AI tools AI Text-to-Speech AI Speech to Text AI audio and video editing

pyvideotrans: Video Translation Dubbing Tool

pyVideoTrans General Introduction pyvideotrans is a video translation dubbing tool. Users are able to translate video content from one language to another and add corresponding voiceovers and subtitles to the video. It is based on the openai-whisper offline model and supports a variety of translation and speech synthesis services, ex...

2024-08-07AI tools AI Text-to-Speech AI Speech to Text AI audio and video editing

preceding page
1
2
3
Total 3 pages