AI Personal Learning
and practical guidance
Total 45 articles

Tags: ai speech to text Page 2

BetterWhisperX: automatic speech recognition and speaker diarization with high-precision word-level timestamps

General Introduction: BetterWhisperX is an optimized fork of WhisperX focused on efficient and accurate automatic speech recognition (ASR). The project is maintained by Federico Torrielli, who is committed to keeping it continuously updated and improving its performance...

Freed: AI medical transcription assistant that accurately transcribes doctor-patient conversations and reduces visit documentation paperwork

General Introduction: Freed is an AI medical transcription assistant designed for healthcare professionals. It helps doctors and other practitioners automate the recording of patient visits, reduce paperwork, and increase productivity. Freed's AI transcription assistant is able to listen in real time,...

Voice-Pro: open source multifunctional video translation tool, voice transcription and translation into multiple languages, Windows one-click installation

General Introduction: Voice-Pro is a multifunctional tool built on a Gradio WebUI that supports speech-to-text, text-to-speech, real-time translation, YouTube video downloads, and vocal separation. It integrates Whisper, Faster-Whisper and Whisper-Timestamped technologies to provide efficient...

AI Hear: real-time speech transcription and translation software that runs locally and offline

General Introduction: If you're using a MacBook, try AI Hear: it records audio, converts speech to text locally in real time, translates it, and exports subtitles. You can use it to follow international meetings and English audiobooks. AI Hear runs locally and provides one-click real-time translation and transcription, supports multiple...

SoniTranslate: open source video translation and dubbing solution with multi-speaker dubbing, speech-rate adjustment, and imitation of the original voice

General Introduction: SoniTranslate is a powerful, user-friendly multilingual video dubbing tool for video translation with synchronized audio. It uses speech recognition and machine translation to translate video content into multiple languages while keeping the audio synchronized. The program is based on Gradi...

FunASR: open source speech recognition toolkit with speaker diarization and multi-speaker conversation recognition

General Introduction: FunASR is an open source speech recognition toolkit developed by Alibaba's DAMO Academy to bridge academic research and industrial applications. It supports a wide range of speech features, including automatic speech recognition (ASR), voice activity detection (VAD), punctuation restoration, language modeling, speaker verification, speak...

AsrTools: speech-to-subtitle tool, a lightweight client with built-in interfaces to Jianying (剪映), Kuaishou (快手), and Bcut (必剪)

General Introduction: AsrTools is an intelligent speech-to-text tool with built-in interfaces to major platforms such as Jianying, Kuaishou, and Bcut. It requires no GPU or cumbersome configuration and supports efficient multi-threaded batch processing. It is built with PyQt5, has a clean and user-friendly interface, and can output subtitle files in SRT and TXT formats. The tool works by calling ...

Happy Scribe: Audio Transcription and Video Subtitling Platform | Free Video Subtitle Editing Software

General Introduction: Happy Scribe provides automated and manual audio transcription services that convert audio to text with high accuracy, with support for multiple languages and formats. It includes an interactive editor, collaboration tools, multiple export formats, machine translation, and more. The platform is safe and reliable,...

VideoLingo: open source tool for video transcription with word-level timeline subtitles, subtitle translation, and localized dubbing

General Introduction: VideoLingo is a one-stop video translation and localized dubbing tool designed to generate Netflix-grade subtitles, eliminating stiff machine translation and multi-line subtitles, and to add high-quality voiceovers so that knowledge can be shared across language barriers. With intuitive Streamlit ...

录咖 (Record Cafe): one-stop audio/video processing platform | video generation | AI subtitles | audio extraction | speech to text

General Introduction: Record Cafe is a one-stop audio/video processing platform that provides AI video dialog, AI subtitles, and AI speech-to-text services. Features include screen recording, video editing, and GIF/audio conversion, with support for cloud storage and sharing. The interface is intuitive and easy to use, and it also supports multi-screen recording and multi-language intelligent reading...
