Latest AI Resources

Total 2832 articles posts
小智 AI 聊天机器人:打造你的AI聊天伴侣,轻松实现语音对话和智能互动

Xiaozhi AI Chatbot: Build your AI chatting companion, easily realize voice conversation and intelligent interaction

Comprehensive Introduction Xiaozhi AI Chatbot is an open source project based on the ESP32 development board, designed to help users build their own AI chat companion. The project was developed by Shrimp and is mainly used for teaching purposes to help more people get started with AI hardware development and to understand how to apply large language models to real...
9mos ago
0111.4K
DeepMosaics:自动去除图像和视频中的马赛克,或向其添加马赛克

DeepMosaics: Automatically removing mosaics from, or adding mosaics to, images and videos

General Introduction DeepMosaics is an open source project based on semantic segmentation and image-to-image conversion techniques designed to automatically remove mosaics from, or add mosaic effects to, images and videos. The project utilizes the power of deep learning to provide users with an efficient way to deal with mosaic...
1yrs ago
0102.7K
Codeium(Windsurf Editor):免费的AI代码补全与聊天工具,Windsurf以对话方式编写完整项目代码

Codeium (Windsurf Editor): free AI code-completion and chat tool, Windsurf writes complete project code in a conversational manner

General Introduction Codeium is a free AI code-completion and chat tool designed to improve developers' programming efficiency. It supports more than 70 programming languages and is compatible with more than 40 integrated development environments (IDEs).Codeium not only provides automatic code completion, but also has generation...
1yrs ago
092.6K
CosyVoice:阿里推出的3秒急速语音克隆开源项目,支持情感控制标签

CosyVoice: 3-second rush voice cloning open source project launched by Ali with support for emotionally controlled tags

Comprehensive Introduction CosyVoice is a multilingual large-scale speech generation model that provides full-stack capabilities from inference, training to deployment. Developed by the FunAudioLLM team, it aims to achieve high quality speech through advanced autoregressive transformers and ODE-based diffusion models...
10mos ago
091.8K
VisoMaster:强大且易用的图片/视频换脸和编辑软件

VisoMaster: Powerful and easy-to-use photo/video face changing and editing software

General Introduction VisoMaster is a powerful and easy-to-use video face-swapping and editing tool that utilizes artificial intelligence technology to achieve natural and realistic face-swapping effects. Whether it's an image or a video, VisoMaster can generate high-quality face swap results with simple operations, suitable for general...
9mos ago
086.5K
FunASR:开源语音识别工具包,说话人分离/ 多人对话语音识别

FunASR: Open Source Speech Recognition Toolkit, Speaker Separation / Multi-Person Conversation Speech Recognition

Comprehensive Introduction FunASR is an open source speech recognition toolkit developed by Alibaba's Dharma Institute to bridge academic research and industrial applications. It supports a wide range of speech recognition features, including speech recognition (ASR), voice endpoint detection (VAD), punctuation recovery, language modeling, speaking...
1yrs ago
083.4K
元宝/元器:腾讯混元支持的AI助手和开放智能体设计平台

Yuanbao/yuanqi: Tencent Mixed Yuan supported AI assistant and open intelligent body design platform

Comprehensive Introduction Tencent Yuanbao is a C-end AI assistant app launched by Tencent based on its self-developed hybrid big model.It not only provides core functions such as AI search, AI summarization and AI writing in work scenarios, but also parses multiple WeChat public number links, URLs, and documents in multiple formats. Yuanbao also supports ...
9mos ago
083.4K
MinerU:PDF文档提取转换为多模态Markdown格式,支持电子书OCR扫描

MinerU: PDF document extraction and conversion to multimodal Markdown format, support e-book OCR scanning

Comprehensive Introduction MinerU is an open source data extraction tool developed by the OpenDataLab team at the Shanghai Artificial Intelligence Laboratory, focusing on efficiently extracting content from complex PDF documents, web pages, and eBooks. It can take multimodal PDFs containing images, formulas, tables and other elements...
1yrs ago
077K
EXO:利用闲置家用设备运行分布式AI集群,支持多种推理引擎和自动设备发现。

EXO: Running distributed AI clusters using idle home devices with support for multiple inference engines and automated device discovery.

General Introduction Exo is an open source project designed to run its own AI cluster using everyday devices (e.g. iPhone, iPad, Android, Mac, Linux, etc.). Through dynamic model partitioning and automated device discovery, Exo is able to unify multiple devices into one powerful...
1yrs ago
072.9K
豆包:抖音旗下AI智能助手

Doubao: Shake's AI Intelligent Assistant

Beanbag Comprehensive Introduction Beanbag is an artificial intelligence AI assistant developed by a subsidiary of Jitterbug, the domestic version of which uses the latest Lark Large model. It is an intelligent assistant tool that can help users solve problems, get information and improve efficiency. Beanbag supports Chinese and English, can be used online, and provides web version, Android...
11mos ago
072.1K
SoniTranslate:开源视频翻译配音解决方案,多人配音、调整语速与模仿原声

SoniTranslate: open source video translation and dubbing solution, multi-person dubbing, adjust the speed of speech and mimic the original sound

General Description SoniTranslate is a powerful and user-friendly video multilingual dubbing tool designed to provide a solution for video translation and synchronized audio. It uses advanced speech recognition and machine translation technologies to translate video content into multiple languages and keep the audio synchronized. The program ...
1yrs ago
071.2K
Meetily:生成会议纪要的AI助手,实时转录和生成会议摘要

Meetily: an AI assistant for generating meeting minutes, transcribing and generating meeting summaries in real-time

General Description Meetily is an AI-powered meeting assistant developed by Zackriya Solutions that captures meeting audio in real-time, performs voice transcription, and generates meeting summaries. It is unique in that all processing is done locally on the device, ensuring user privacy...
10mos ago
071.1K
PDFMathTranslate:保留PDF完整排版的AI翻译工具

PDFMathTranslate: AI translation tool that preserves the full typography of PDFs

Comprehensive introduction PDFMathTranslate is an open source tool focusing on the translation of scientific papers , PDF documents can be translated in full and generate a bilingual version . It uses AI technology to retain the full layout of the original document , including formulas , diagrams , tables of contents and notes , support ...
6mos ago
068.5K
IOPaint:全能AI图像处理工具,擦除、扩图、替换元素与绘制文本

IOPaint: All-around AI image processing tool, erasing, expanding, replacing elements and drawing text.

General Introduction IOPaint is a free and open source AI image processing tool that supports image erasing, repairing and expanding. It uses state-of-the-art AI models to help users easily remove unwanted objects from an image, repair blemishes, add new content, and even expand an image.IOPa...
1yrs ago
065.9K
LiveTalking:开源实时互动数字人直播系统,实现音视频同步对话

LiveTalking: open source real-time interactive digital human live system, to achieve synchronous audio and video dialogues

Comprehensive introduction LiveTalking is an open source real-time interactive digital human system , is committed to building high-quality digital human live solution . The project uses the Apache 2.0 open source protocol and integrates a number of cutting-edge technologies , including ER-NeRF rendering , real-time audio and video streaming processing ...
11mos ago
065.1K
SynClub 提供安全的AI角色互动与情感支持虚拟社交平台

SynClub Provides Secure AI Character Interaction and Emotionally Supportive Virtual Social Platforms

Comprehensive Introduction SynClub is a virtual chat platform that combines AI big model technology, aiming to provide users with diverse character interaction and emotional support experience. Users can have real-time conversations with AI characters of different styles, including text and voice modes, covering daily chit-chat, emotional counseling and scenario play...
9mos ago
062.5K
即创:依托巨量引擎生成电商营销物料,快速发布适合抖音推广的商品讲解视频

That is to create: relying on a huge engine to generate e-commerce marketing materials, rapid release of products suitable for jittery voice promotion of explaining the video

Introduction of Instant Creation Instant Creation is a one-stop intelligent creative production and management platform launched by Jitterbug, aiming to provide efficient, convenient and professional content creation services for creators. The platform integrates a variety of AI functions, such as intelligent filming, AI video scripts, graphic tools, merchandise card tools, AI live backgrounds, AI direct...
1yrs ago
062.3K
Dippy:与AI角色聊天的互动工具

Dippy: an interactive tool for chatting with AI characters

General Introduction Dippy is a mobile app that lets you chat with AI characters, easy to use for people who like interaction and role-playing. It offers a wide range of virtual characters, such as friends, therapists or romantic interests, which the user is free to choose from. The app has no ads, remembers your preferences, and the chatting experience...
9mos ago
061K
FunClip:智能剪辑视频内容为短片,轻松实现精准视频片段提取/裁剪

FunClip: Intelligent editing of video content into short clips, easy to realize accurate video clip extraction/cropping

Comprehensive Introduction FunClip is a fully open source localized automatic video editing tool developed by TONGYI Speech Lab of Alibaba Dharma Institute. The tool integrates the industrial-grade Paraformer-Large speech recognition model, which can accurately recognize the speech in the video...
11mos ago
059.1K