AI Speech to Text

56 articles in total
Vexa: a real-time meeting transcription and intelligent knowledge extraction tool

Vexa is an open source platform for real-time meeting transcription and knowledge management, designed to provide efficient meeting recording and intelligent knowledge extraction for enterprises and individuals. It automatically joins meetings on platforms such as Google Meet and Zoom through API-driven meeting bots...
4mos ago
Transkriptor: an AI transcription tool that turns audio and video into text

Transkriptor is an AI-driven transcription tool that focuses on quickly converting audio and video to text. It supports over 100 languages with an accuracy rate of up to 99% and suits a wide range of scenarios such as meetings, interviews, and classroom notes. Users can upload files, direct...
4mos ago
TwinMind: a free app for offline voice-to-text transcription

TwinMind is a smart tool developed by ThirdEar AI, Inc. that "remembers everything for you". It can record conversations, meetings, or lectures in real time and convert them to text in more than 100 languages, even with your phone in your pocket...
4mos ago
LiberSonora: audiobook subtitle extraction and multilingual translation, audiobook transcription into multiple languages

LiberSonora, which means "free sound", is a powerful AI-enabled open source audiobook toolset. It supports intelligent subtitle extraction, AI title generation, multi-language translation, and more, and is capable of batch offline processing under GPU acceleration. LiberSo...
6mos ago
Notta: an AI meeting recording and audio transcription tool that automatically transcribes meetings, interviews, or recordings

Notta is a powerful AI meeting recording and audio transcription tool designed to help users automatically convert meetings, interviews, or audio recordings into searchable text. With Notta, users can easily transcribe, edit, summarize, and collaborate to boost productivity. Notta supports...
7mos ago
FunClip: intelligent editing of video content into short clips, with easy and accurate video clip extraction/cropping

FunClip is a fully open source, locally deployed automatic video editing tool developed by the Tongyi Speech Lab of Alibaba DAMO Academy. The tool integrates the industrial-grade Paraformer-Large speech recognition model, which can accurately recognize the speech in the video...
7mos ago
SoniTranslate: open source video translation and dubbing solution with multi-speaker dubbing, speech rate adjustment, and original voice imitation

SoniTranslate is a powerful and user-friendly multilingual video dubbing tool designed to provide video translation with synchronized audio. It uses advanced speech recognition and machine translation technologies to translate video content into multiple languages while keeping the audio synchronized. The program ...
10mos ago
FunASR: open source speech recognition toolkit with speaker diarization and multi-speaker conversation speech recognition

FunASR is an open source speech recognition toolkit developed by Alibaba DAMO Academy to bridge academic research and industrial applications. It supports a wide range of speech processing features, including automatic speech recognition (ASR), voice activity detection (VAD), punctuation restoration, language modeling, speaking...
10mos ago
AsrTools: speech-to-subtitle tool, a lightweight client with built-in Jianying (剪映), Kuaishou (快手), and Bcut (必剪) interfaces

AsrTools is an intelligent speech-to-text tool with built-in interfaces from major vendors such as Jianying, Kuaishou, and Bcut. It requires no GPU or cumbersome configuration and supports efficient multi-threaded batch processing. Built on PyQt5, it has an attractive, user-friendly interface and can output SRT and TXT format text...
10mos ago
VideoLingo: open source tool for video transcription with word-level timeline subtitles, subtitle translation, and localized dubbing

VideoLingo is a one-stop video translation and localized dubbing tool designed to generate Netflix-grade, high-quality subtitles, eliminating stilted machine translation and multi-line subtitles, and adding high-quality voiceovers so that knowledge can be shared worldwide across language barriers. By...
10mos ago
Record Cafe (录咖): one-stop audio/video processing platform | video generation | AI subtitles | audio extraction | speech to text

Record Cafe is a one-stop audio/video processing platform that provides AI video dialog, AI subtitles, and AI speech-to-text services. Functions include screen recording, video editing, and GIF/audio conversion, with support for cloud storage and sharing. The interface is intuitive and easy to use, and it also supports multi-screen recording and multi-language smart...
8mos ago
Tongyi Tingwu (通义听悟): Alibaba Tongyi AI assistant for transcribing audio and video content

Tongyi Tingwu is an AI assistant for work and study launched by Alibaba Cloud, focusing on transcribing and analyzing audio and video content. It relies on Alibaba Cloud's powerful AI models to transcribe audio and video content into text in real time, and provides translation, summarization, positioning, and other functions. Tongyi Tingwu supports multiple languages and scenarios...
11mos ago