AI open source projects

1020 articles in total
IOPaint: All-in-one AI image processing tool for erasing, outpainting, replacing elements, and drawing text

General Introduction: IOPaint is a free and open source AI image processing tool that supports image erasing, repairing and expanding. It uses state-of-the-art AI models to help users easily remove unwanted objects from an image, repair blemishes, add new content, and even expand an image. IOPa...
10mos ago
09.6K
CosyVoice: Alibaba's open source project for rapid 3-second voice cloning, with support for emotion control tags

Comprehensive Introduction: CosyVoice is a multilingual large-scale speech generation model that provides full-stack capabilities from inference and training to deployment. Developed by the FunAudioLLM team, it aims to achieve high quality speech through advanced autoregressive transformers and ODE-based diffusion models...
6mos ago
03.2K
XHS-Downloader: Free Xiaohongshu data collection tool supporting batch note downloads, video extraction, and image watermark removal

General Introduction: XHS-Downloader is an open source tool for Xiaohongshu users that extracts and downloads watermark-free images and videos from Xiaohongshu. The tool provides a variety of features, including getting cookies from browsers, support for command line operations, batch download...
12mos ago
02.9K
PDFMathTranslate: AI translation tool that preserves the full layout of PDFs

Comprehensive Introduction: PDFMathTranslate is an open source tool focused on translating scientific papers. It translates PDF documents in full and can generate a bilingual version. It uses AI technology to retain the full layout of the original document, including formulas, diagrams, tables of contents and notes, support ...
2mos ago
02.8K
PraisonAI: A Low-Code Multi-Agent Framework That Simplifies Automation of Complex Tasks

Comprehensive Introduction: PraisonAI is an out-of-the-box multi-agent framework for production environments, designed to create AI agents that automate and solve problems ranging from simple tasks to complex challenges. The framework provides a low-code solution that simplifies the building of multi-agent LLM systems and...
6mos ago
02.8K
ElizaOS: A Full-Featured Open Source AI Agent Development Framework for Building Autonomously Executing Multi-Agent Systems

Comprehensive Introduction: Eliza is an advanced multi-agent development framework committed to simplifying the construction and deployment of autonomous agents. It supports the deployment of multiple agents with different role settings, can realize intelligent ...
7mos ago
02.7K
Dify: Generative AI application development platform with visual orchestration and private deployment support

Comprehensive Introduction: Dify is an open source generative AI application development platform designed to help developers rapidly build and operate native AI applications based on Large Language Models (LLMs). The platform provides everything from Agent building to AI workflow orchestration, RAG retrieval...
7mos ago
02.7K
Browser-Use: Building Intelligent Web Automation Tools That Let AI Agents Easily Operate Browsers

Comprehensive Introduction: Browser-Use is an innovative open source web automation tool specifically designed to enable large language models (LLMs) to interact naturally with websites. It provides a powerful and flexible framework that supports a wide range of mainstream language models, including GPT-4, Claud...
8mos ago
02.6K
FunASR: Open Source Speech Recognition Toolkit with Speaker Diarization and Multi-Speaker Conversation Recognition

Comprehensive Introduction: FunASR is an open source speech recognition toolkit developed by Alibaba's DAMO Academy to bridge academic research and industrial applications. It supports a wide range of speech features, including speech recognition (ASR), voice activity detection (VAD), punctuation restoration, language modeling, speaking...
10mos ago
02.6K
Browser Use Web UI: an open source framework that runs AI agents to browse the web, allowing AI to operate web pages automatically

Comprehensive Introduction: Browser Use Web UI is an innovative open source project focused on giving AI agents a graphical interface for browser interaction. The project is built on top of the browser-use core framework, built with Gradio ...
2mos ago
02.6K
MMAudio: generating synchronized sound effects and soundtracks for video, a multimodal joint training tool for video-to-audio

General Introduction: MMAudio is an open-source project aiming to generate high-quality synchronized audio through joint multimodal training. Developed by Ho Kei Cheng et al. at the Chinese University of Hong Kong, the project's main function is to generate synchronized audio based on video and/or text input. MM...
8mos ago
02.6K
Smolagents: open source project for rapid, lightweight development of AI agents

Comprehensive Introduction: Smolagents is a lightweight agent library developed by Hugging Face that focuses on simplifying the development of AI agent systems. The project is known for its clean design philosophy, with only about 1000 lines of core code, yet provides powerful feature integration capabilities. It is most ...
7mos ago
02.6K
AsrTools: speech-to-subtitle tool, a lightweight client with built-in interfaces to JianYing, Kuaishou, and BiJian

Comprehensive Introduction: AsrTools is an intelligent speech-to-text tool with built-in interfaces from major vendors such as JianYing, Kuaishou, and BiJian. It requires no GPU or cumbersome configuration and supports efficient multi-threaded batch processing. Built with PyQt5, it has a clean, user-friendly interface and can output SRT and TXT format words...
10mos ago
02.6K
Tencent Hunyuan3D: Generate high-resolution 3D assets with multiple 3D asset generation workflows

Comprehensive Introduction: Tencent Hunyuan3D (Hunyuan3D 2.0) is an advanced large-scale 3D synthesis system from Tencent designed to generate high-resolution textured 3D assets. The system consists of two core components: Hunyuan3D-DiT, a large-scale shape generation model, and Hunyuan3D-Paint, a large-scale texture...
7mos ago
02.5K
cognee: an open source RAG framework built on knowledge graphs, with a study of its core prompts

General Introduction: Cognee is a reliable data layer solution designed for AI applications and AI agents. It is designed to load and build LLM (Large Language Model) contexts to create accurate and interpretable AI solutions through knowledge graphs and vector stores. The framework favors cost-saving, interpretable...
7mos ago
02.5K
EXO: Run distributed AI clusters on idle home devices, with support for multiple inference engines and automatic device discovery

General Introduction: Exo is an open source project designed to run your own AI cluster using everyday devices (iPhone, iPad, Android, Mac, Linux, and so on). Through dynamic model partitioning and automatic device discovery, Exo is able to unify multiple devices into one powerful...
8mos ago
02.5K
OpenBB: Open source financial data analytics platform that integrates private datasets and AI to enhance investment decisions

General Introduction: OpenBB is a free and fully open source financial data analytics platform designed to give everyone easy access to financial data and analytics tools. The platform integrates over 100 different data sources covering stocks, options, cryptocurrencies, forex, macroeconomic indicators, fixed...
7mos ago
02.5K
NeoAI: Open source project that lets AI take over remote operation of computers and control them using natural language

General Introduction: NeoAI is an innovative open source AI assistant tool that allows users to easily control and manage their computers through natural language conversations. Without writing any code, users can simply use everyday conversation to find files, automate tasks, manage devices, etc. NeoAI...
7mos ago
02.5K
RAGFlow: an open source RAG engine based on deep document understanding, providing efficient retrieval-augmented generation workflows

Comprehensive Introduction: RAGFlow is an open source Retrieval-Augmented Generation (RAG) engine based on deep document understanding technology. It provides an efficient RAG workflow for organizations of all sizes, incorporating large language models (LLMs) capable of delivering data in complex formats based on real...
7mos ago
02.4K
DeOldify: the classic open-source tool for colorizing black-and-white photos and videos using AI

Comprehensive Introduction: DeOldify is an open source project based on deep learning, specifically designed for intelligent colorization and restoration of black and white photos and videos. The project uses an innovative NoGAN training method to address the common defects of traditional GANs in the image coloring process...
8mos ago
02.4K
Memary: an open-source project that enhances agent long-term memory using knowledge graphs

General Introduction: Memary is an innovative open source project focused on providing long-term memory management solutions for autonomous agents. The project helps agents break through the limitations of traditional context windows and achieve smarter interactions through knowledge graphs and specialized memory modules. Memary adopts...
7mos ago
02.4K
TRELLIS: Microsoft-developed 3D asset generation model with multiple format support and flexible editing

General Introduction: TRELLIS is a large-scale 3D asset generation model developed by Microsoft. It accepts text or image prompts and generates high-quality 3D assets in a variety of formats, such as radiance fields, 3D Gaussians, and meshes. At the heart of TRELLIS is a unified structured latent...
8mos ago
02.4K
AI Hedge Fund: open-source automated trading system that uses multiple agents for complex hedge fund trading decisions

General Introduction: AI Hedge Fund is an artificial intelligence hedge fund that uses a multi-agent system for trading decisions. The system coordinates multiple specialized agents, including market data agents, quantitative agents, risk management agents, and portfolio management agents, to achieve complex trading...
7mos ago
02.4K
OpenSPG: Open Source Knowledge Graph Engine

Comprehensive Introduction: OpenSPG is an open source knowledge graph engine developed by Ant Group in collaboration with OpenKG, based on the SPG (Semantic-enhanced Programmable Graph) framework. The engine is designed to provide features such as explicit semantic representation, logical rule definition and an operational framework to support the construction and management of domain knowledge graphs...
11mos ago
02.4K
Xiaozhi AI Chatbot: Build your own AI chat companion with easy voice conversation and intelligent interaction

Comprehensive Introduction: Xiaozhi AI Chatbot is an open source project based on the ESP32 development board, designed to help users build their own AI chat companion. The project was developed by Xia Ge (虾哥) and is mainly used for teaching purposes to help more people get started with AI hardware development and to understand how to apply large language models to real...
5mos ago
02.4K
Chatbot UI: an open source AI chat app that mimics ChatGPT's interface and functionality

General Introduction: Chatbot UI is an open source project designed to help developers create personalized and intelligent conversational interfaces. The project provides a series of interface components and interactive features that can be easily integrated into an existing chatbot system to provide users with a more fluent and intelligent dialog body...
11mos ago
02.3K
LiveTalking: open source real-time interactive digital human livestreaming system with synchronized audio and video dialogue

Comprehensive Introduction: LiveTalking is an open source real-time interactive digital human system committed to building a high-quality digital human livestreaming solution. The project uses the Apache 2.0 open source license and integrates a number of cutting-edge technologies, including ER-NeRF rendering, real-time audio and video streaming processing ...
7mos ago
02.3K
MinerU: PDF document extraction and conversion to multimodal Markdown format, with OCR support for scanned e-books

Comprehensive Introduction: MinerU is an open source data extraction tool developed by the OpenDataLab team at the Shanghai Artificial Intelligence Laboratory, focusing on efficiently extracting content from complex PDF documents, web pages, and eBooks. It can take multimodal PDFs containing images, formulas, tables and other elements...
10mos ago
02.3K