Vexa is an open source, real-time meeting transcription and knowledge management platform that provides meeting recording and intelligent knowledge extraction for enterprises and individuals. It automatically joins Google Meet, Zoom and other platforms through API-driven meeting bots, transcribes speech to text in real time, and...
realtime-transcription-fastrtc is an open source project focused on converting speech to text in real time. It uses FastRTC to process low-latency audio streams, combined with native Whisper models to achieve efficient speech recognition. The project is maintained by developer sofi444 and hosted on G...
Transkriptor is an AI-driven transcription tool that focuses on quickly converting audio and video to text. It supports over 100 languages with an accuracy rate of up to 99% and is suitable for a wide range of scenarios such as meetings, interviews, classroom notes and more. Users can upload files, record directly or transcribe via links...
Otter.ai is an AI-powered meeting management and voice transcription tool whose core functionality is converting speech to text in real time and automatically generating meeting notes, summaries and action items. It provides intelligent support through AI Meeting Agent, which can automatically join meetings on platforms such as Zoom, Google Meet...
TurboScribe is an AI-based transcription tool that focuses on quickly converting audio and video to text. It supports more than 98 languages with an accuracy rate of 99.8% and is suited to users who need to process voice content efficiently. Users can upload files to generate transcripts or subtitles with simple...
Aqua Voice is an intelligent speech-based text generation tool focused on quickly converting user speech into formatted text. It was created in 2023 by Finnian Brown and Jack McIntire and is based in San Francisco, USA, as part of the Y Combinator W24 incubation program. A...
Dolphin is an open source model developed by DataoceanAI and Tsinghua University, focusing on speech recognition and language recognition for Asian languages. It supports 40 languages from East Asia, South Asia, Southeast Asia, and the Middle East, as well as 22 Chinese dialects. The model is based on over 210,000 hours of...
TwinMind is a smart tool developed by ThirdEar AI, Inc. that "helps you remember everything". It records conversations, meetings or lectures and converts them into text in real time, in more than 100 languages, and it works offline, even with your phone in your pocket. Users don't have to take notes themselves; TwinM...
Wispr Flow is a voice-enabled text input tool that helps users write quickly on their computers. It promises an experience "3x faster than typing", letting users enter text into any application such as Word, Slack, or Gmail just by speaking naturally. Wispr Flow supports 100...
Meeting Minutes (aka Meetily) is a free and open source AI meeting assistant developed by Zackriya Solutions that focuses on capturing meeting audio in real time, generating transcribed text and automatically extracting meeting summaries. The tool runs entirely on local devices and supports macOS ...
Local-NotebookLM is an open source project that aims to provide locally run intelligent document processing and content generation tools. It is inspired by Google NotebookLM and focuses on helping users convert PDFs and other documents into a variety of output formats, such as podcasts, interviews or lectures, while supporting ...
AssemblyAI is a platform focused on speech AI technology, providing developers and enterprises with efficient speech-to-text and audio analysis tools. Its core highlight is the Universal family of models, especially the newly released Universal-2, which is AssemblyAI's most advanced speech...
FireRedASR is a speech recognition model developed and open-sourced by the FireRed team at Xiaohongshu (Little Red Book), focusing on high-precision, multilingual automatic speech recognition (ASR). The project is hosted on GitHub for developers and researchers, offers an industrial-grade design, and supports Mandarin, Chinese...
WhisperChain is an AI-based open source project hosted on GitHub and led by developer Chris Choy. It is mainly used to convert speech into text and automatically polish the wording with AI, removing redundant colloquial words (such as "ah", "hmmm" and other filler words...
LLPlayer is an open source media player designed for language learners, hosted on GitHub and created by developer umlx5h. It integrates a variety of useful features, such as bilingual subtitle display, AI auto-generated subtitles, real-time translation, and word search. It aims to help users watch video...
CapsWriter-Offline is a voice input and subtitle transcription tool for PC, hosted on GitHub and built by developer HaujetZhao. It runs completely offline, performing speech-to-text and audio/video-to-subtitle transcription without an internet connection, and supports unlimited hours of recording...
Whisper Input is an open source speech transcription tool that lets users start recording by holding down the Option key and stop by releasing it. The tool calls the Groq Whisper Large V3 Turbo model for speech translation and can return the translation results within 1-2 seconds....
LiberSonora, meaning "free sound", is a powerful AI-enabled open source audiobook toolset that supports intelligent subtitle extraction, AI title generation, and multi-language translation, and is capable of batch offline processing under GPU acceleration. LiberSonora is designed with the concept of modular...
AudioNotes is a system based on FunASR and Qwen2 that turns audio and video into structured notes. It can quickly extract audio/video content and call a large language model to organize it into structured Markdown notes, making it easy for users to read and find information quickly. The system supports multiple ...