Latest AI Resources

Total: 2756 articles
CosyVoice: Alibaba's open source project for rapid 3-second voice cloning, with support for emotion control tags

CosyVoice is a multilingual large-scale speech generation model that provides full-stack capabilities from inference and training to deployment. Developed by the FunAudioLLM team, it aims to achieve high-quality speech through advanced autoregressive transformers and ODE-based diffusion models...
8mos ago
065K
Codeium (Windsurf Editor): a free AI code-completion and chat tool; Windsurf writes complete project code conversationally

Codeium is a free AI code-completion and chat tool designed to improve developers' programming efficiency. It supports more than 70 programming languages and is compatible with more than 40 integrated development environments (IDEs). Codeium not only provides automatic code completion, but also has generation...
11mos ago
059.5K
Xiaozhi AI Chatbot: build your own AI chat companion with easy voice conversation and intelligent interaction

Xiaozhi AI Chatbot is an open source project based on the ESP32 development board, designed to help users build their own AI chat companion. The project was developed by Xiage (虾哥) and is mainly used for teaching purposes, to help more people get started with AI hardware development and to understand how to apply large language models to real...
7mos ago
059K
VisoMaster: powerful and easy-to-use image/video face-swapping and editing software

VisoMaster is a powerful and easy-to-use face-swapping and editing tool that uses artificial intelligence to achieve natural, realistic results. Whether the input is an image or a video, VisoMaster can generate high-quality face swaps with simple operations, suitable for general...
8mos ago
052.1K
IOPaint: all-round AI image processing tool for erasing, outpainting, replacing elements and drawing text

IOPaint is a free and open source AI image processing tool that supports image erasing, repairing and expanding. It uses state-of-the-art AI models to help users easily remove unwanted objects from an image, repair blemishes, add new content, and even expand an image. IOPa...
12mos ago
051.7K
EXO: run a distributed AI cluster on idle home devices, with support for multiple inference engines and automatic device discovery

Exo is an open source project designed to let you run your own AI cluster on everyday devices (e.g. iPhone, iPad, Android, Mac, Linux). Through dynamic model partitioning and automatic device discovery, Exo is able to unify multiple devices into one powerful...
11mos ago
049.2K
MinerU: extracts PDF documents and converts them to multimodal Markdown, with OCR support for scanned e-books

MinerU is an open source data extraction tool developed by the OpenDataLab team at the Shanghai Artificial Intelligence Laboratory, focusing on efficiently extracting content from complex PDF documents, web pages, and e-books. It can take multimodal PDFs containing images, formulas, tables and other elements...
1yr ago
048.9K
Yuanbao/Yuanqi: AI assistant and open agent design platform powered by Tencent Hunyuan

Tencent Yuanbao is a consumer-facing AI assistant app launched by Tencent, based on its self-developed Hunyuan large model. It not only provides core functions such as AI search, AI summarization and AI writing in work scenarios, but also parses multiple WeChat official account links, URLs, and documents in multiple formats. Yuanbao also supports ...
7mos ago
048.2K
Doubao: Douyin's AI assistant

Doubao is an AI assistant developed by a subsidiary of Douyin; its domestic version uses the latest Skylark large model. It is an intelligent assistant tool that can help users solve problems, get information and improve efficiency. Doubao supports Chinese and English, can be used online, and provides web version, Android...
9mos ago
047.5K
DeepMosaics: automatically remove mosaics from, or add mosaics to, images and videos

DeepMosaics is an open source project based on semantic segmentation and image-to-image translation techniques, designed to automatically remove mosaics from, or add mosaic effects to, images and videos. The project uses deep learning to provide users with an efficient way to deal with mosaic...
1yr ago
047.2K
FunASR: open source speech recognition toolkit with speaker diarization and multi-speaker conversation recognition

FunASR is an open source speech recognition toolkit developed by Alibaba's DAMO Academy to bridge academic research and industrial applications. It supports a wide range of speech processing features, including automatic speech recognition (ASR), voice activity detection (VAD), punctuation restoration, language modeling, speaking...
1yr ago
044.9K
SoniTranslate: open source video translation and dubbing solution with multi-speaker dubbing, speech-rate adjustment and original-voice imitation

SoniTranslate is a powerful and user-friendly multilingual video dubbing tool designed to provide a solution for video translation and synchronized audio. It uses advanced speech recognition and machine translation technologies to translate video content into multiple languages and keep the audio synchronized. The program ...
12mos ago
043.8K
Meetily: an AI assistant that transcribes meetings in real time and generates meeting minutes and summaries

Meetily is an AI-powered meeting assistant developed by Zackriya Solutions that captures meeting audio in real time, transcribes speech, and generates meeting summaries. It is unique in that all processing is done locally on the device, ensuring user privacy...
8mos ago
042K
Teacher's Help: AI work assistant for teachers, with one-click courseware-to-PPT conversion

Teacher's Help is an AI tool platform designed to help teachers improve their work efficiency and teaching quality. The platform provides a variety of functions, including lesson plan generation, one-click conversion of courseware to PPT, homework and test question design, student comment generation, and teaching plan writing. The platform supports text translation...
4mos ago
041.6K
Danswer: AI assistant focused on enterprise knowledge management and document search, integrating multiple work tools

Danswer is an open source AI assistant for enterprise document retrieval, designed to connect to team documents, applications and people, providing unified search and natural-language answers through an intelligent chat interface. Ensuring that user data and chats are fully controlled...
7mos ago
041K
FaceSwapper: free AI face-swapping website for photos and videos with one or more faces

FaceSwapper is a free online face-swapping platform based on artificial intelligence, which lets users upload photos or videos to quickly replace faces and generate funny or realistic effects. No specialized skills are required; with just a few clicks, you can change your face to others...
7mos ago
040.7K
SynClub: a virtual social platform offering safe AI character interaction and emotional support

SynClub is a virtual chat platform built on large AI model technology, aiming to provide users with diverse character interaction and emotional support. Users can have real-time conversations with AI characters of different styles, in text and voice modes, covering daily chit-chat, emotional counseling and scenario play...
8mos ago
039.7K
LiveTalking: open source real-time interactive digital human livestreaming system with synchronized audio and video dialogue

LiveTalking is an open source real-time interactive digital human system, committed to building a high-quality digital human livestreaming solution. The project uses the Apache 2.0 open source license and integrates a number of cutting-edge technologies, including ER-NeRF rendering, real-time audio and video streaming processing ...
9mos ago
039.6K
Galaxy.ai: a multifunctional platform integrating a library of 1700+ AI tools, for exploring the generative AI tools on the market (paid)

Galaxy.ai is a platform that integrates a wide range of AI tools, designed to provide users with comprehensive AI solutions. Whether for text generation, image processing, video production or speech synthesis, Galaxy.ai is able to satisfy a wide range of user needs. The platform offers...
11mos ago
039.4K
Dippy: an interactive tool for chatting with AI characters

Dippy is a mobile app that lets you chat with AI characters and is easy to use for people who like interaction and role-playing. It offers a wide range of virtual characters, such as friends, therapists or romantic interests, which the user is free to choose from. The app has no ads, remembers your preferences, and the chatting experience...
7mos ago
038.4K
FunClip: intelligently edits video content into short clips, with easy and accurate clip extraction and trimming

FunClip is a fully open source, locally run automatic video editing tool developed by the Tongyi Speech Lab at Alibaba's DAMO Academy. The tool integrates the industrial-grade Paraformer-Large speech recognition model, which can accurately recognize the speech in the video...
9mos ago
037.9K
PDFMathTranslate: AI translation tool that preserves the full layout of PDFs

PDFMathTranslate is an open source tool focused on translating scientific papers. It can translate PDF documents in full and generate a bilingual version. It uses AI technology to retain the full layout of the original document, including formulas, diagrams, tables of contents and notes, support ...
4mos ago
037.8K