AI open source project

Total 1020 articles posts
Danswer: 专注企业知识管理与文档搜索的AI助手,集成多种工作工具

Danswer: AI assistant specializing in enterprise knowledge management and document search, integrating multiple work tools

General Introduction Danswer is an open source enterprise document retrieval AI assistant designed to connect to team documents, applications and people to provide unified search and natural language query answers through an intelligent chat interface and unified search capabilities. Ensuring that user data and chats are fully controlled...
1yrs ago
087.5K
Orion:小米开源的端到端自动驾驶推理与规划框架

Orion: Xiaomi's Open Source End-to-End Autonomous Driving Reasoning and Planning Framework

Comprehensive Introduction Orion is an open source project developed by Xiaomi Labs, focusing on end-to-end (E2E) autonomous driving technology. It solves the problem of insufficient causal reasoning in complex scenarios of traditional autonomous driving approaches through visual language modeling (VLM) and generative planners.Orion integrates long...
11mos ago
087K
Dify:生成式AI应用开发平台,可视化编排, 支持私有化部署

Dify: generative AI application development platform, visual orchestration, private deployment support

Comprehensive Introduction Dify is an open source generative AI application development platform designed to help developers rapidly build and operate native AI applications based on Large Language Models (LLMs). The platform provides everything from Agent building to AI workflow orchestration, RAG retrieval...
1yrs ago
086.8K
Sim Studio:开源的AI代理工作流构建工具

Sim Studio: open source workflow builder for AI agents

Comprehensive Introduction Sim Studio is an open source AI agent workflow building platform focused on helping users quickly design, test, and deploy large-scale language model (LLM) workflows through a lightweight, intuitive visual interface. Users can create complex workflows without deep programming by dragging and dropping...
9mos ago
086K
R2R:多模态内容解析并结合知识图谱与混合搜索的先进AI检索(RAG)系统

R2R: An Advanced AI Retrieval (RAG) System for Multimodal Content Parsing and Combining Knowledge Graph with Hybrid Search

Comprehensive Introduction R2R (RAG to Riches) is an advanced AI retrieval system supporting Retrieval Augmented Generation (RAG) functionality with production-ready features. Built on a containerized RESTful API, the system provides multimodal content parsing, hybrid search functionality...
1yrs ago
085.2K
RAGFlow:基于深度文档理解的开源RAG引擎,提供高效的检索增强生成工作流

RAGFlow: an open source RAG engine based on deep document understanding, providing efficient retrieval-enhanced generation workflows

Comprehensive Introduction RAGFlow is an open source Retrieval Augmented Generation (RAG) engine based on deep document understanding technology. It provides an efficient RAG workflow for organizations of all sizes, incorporating a large-scale language model (LLM) capable of delivering data in complex formats based on real...
1yrs ago
084.9K
VITA:开源视觉与语音实时交互的多模态大语言模型

VITA: Open Source Multimodal Large Language Model for Real-Time Interaction between Vision and Speech

General Introduction VITA is a leading open source interactive multimodal large language modeling project, pioneering the ability to achieve true full multimodal interaction. The project launched VITA-1.0 in August 2024, pioneering the first open source interactive fully-modal large language model.2024...
1yrs ago
082.4K
MakeSense:免费使用的图像标注工具,提升计算机视觉项目效率

MakeSense: a free-to-use image annotation tool to improve computer vision project efficiency

General Introduction Make Sense is a free online image annotation tool designed to help users quickly prepare datasets for computer vision projects. It requires no complicated installation, just open a browser access to use it, supports multiple operating systems, and is perfect for small deep learning projects. Users can...
1yrs ago
082.4K
LibreChat:模仿ChatGPT界面交互的AI对话开源项目

LibreChat: mimic ChatGPT interface interaction AI dialog open source project

General Introduction LibreChat is a free, open source AI chat platform with extensive customization options and support for multiple AI providers, services and integrations. It brings together all AI conversations in one place with a familiar interface and innovative features, supporting multiple AI models, plugins and multiple languages. By...
2yrs ago
081.3K
Linly-Talker:数字人智能对话系统,结合大语言模型与视觉模型,实现互动新体验

Linly-Talker: An Intelligent Dialogue System for Digital People, Combining Big Language Modeling and Visual Modeling for a New Interactive Experience

Comprehensive Introduction Linly-Talker is an innovative digital human dialog system that combines Large Language Models (LLMs) with visual models to create a novel approach to human-computer interaction. The system integrates a variety of technologies such as Whisper, Linly, Micros...
1yrs ago
081.1K
OmniSVG:从文本和图像生成SVG矢量图形的开源项目

OmniSVG: from text and images to generate SVG vector graphics open source project

General Introduction OmniSVG is an open source project focused on generating high-quality vector graphics (SVG) through a multimodal model. It utilizes pre-trained visual-linguistic models to support SVG generation from textual descriptions or image input, covering a wide range of scenarios from simple icons to complex anime characters. Item ...
11mos ago
080.5K
OpenSPG:开源知识图谱引擎

OpenSPG: Open Source Knowledge Graph Engine

Comprehensive Introduction OpenSPG is an open source knowledge graph engine developed by Ant Group in collaboration with OpenKG, based on the SPG (Semantic Augmented Programmable Graph) framework. The engine is designed to provide features such as explicit semantic representation, logical rule definition and operational framework to support the construction and management of domain knowledge graphs...
1yrs ago
079.7K
KrillinAI:一键翻译和配音的视频多语言全球化工具

KrillinAI: Multilingual Globalization Tool for Video with One-Click Translation and Dubbing

Comprehensive Introduction KrillinAI is an open-source video processing tool focused on using artificial intelligence to help users translate videos and automatically dub them. It can start from the video download, all the way to generating the finished product adapted to different platforms, the whole process is just a few clicks. The developers are available on GitHub...
9mos ago
079.5K
WrenAI:对话式数据分析AI助手,直接获取答案、SQL查询与分析报表

WrenAI: Conversational Data Analytics AI Assistant with Direct Access to Answers, SQL Queries & Analytics Reports

General Introduction WrenAI is an open source SQL AI assistant specifically designed to help data teams, product teams and business teams gain data insights through natural language conversations. It is capable of converting natural language into SQL queries, generating charts, spreadsheets and reports, supporting multilingual...
1yrs ago
078.3K
Smolagents: open source project for rapid development of AI intelligences and lightweight construction of intelligences

Smolagents: open source project for rapid development of AI intelligences and lightweight construction of intelligences

Comprehensive Introduction Smolagents is a lightweight intelligent agent library developed by HuggingFace that focuses on simplifying the development process of AI agent systems. The project is known for its clean design philosophy, with only about 1000 lines of core code, yet provides powerful feature integration capabilities. It is most ...
1yrs ago
078.3K
Linly-Dubbing:智能视频多语言AI配音/翻译工具

Linly-Dubbing: Intelligent Video Multilingual AI Dubbing/Translation Tool

Comprehensive Introduction Linly-Dubbing is an intelligent multilingual AI dubbing and translation tool designed to provide users with high-quality multilingual video dubbing and subtitle translation services by integrating advanced AI technology. The tool is especially suitable for international education, global content localization and other scenarios, helping...
1yrs ago
078.2K
MaxKB:开箱即用的AI知识库问答系统,适合智能客服和企业内部知识库

MaxKB: Out-of-the-box AI Knowledge Base Q&A System for Smart Customer Service and In-house Knowledge Base

Comprehensive Introduction MaxKB (Max Knowledge Base) is an open source knowledge base Q&A system based on large language modeling and RAG (Retrieval Augmented Generation). The system is widely used in intelligent customer service, enterprise internal knowledge base, academic research and education and other scenarios.MaxKB...
1yrs ago
077.6K
Qlib:微软开发的AI量化投资研究工具

Qlib: an AI quantitative investment research tool developed by Microsoft

Comprehensive Introduction Qlib is an open source platform developed by Microsoft that focuses on using AI technology to help users research quantitative investments. It starts from the most basic data processing and supports users to explore investment ideas and turn them into usable strategies. The platform is simple and easy to use, and is suitable for those who want to use machine learning to improve their investment research...
11mos ago
077K
NeoAI:让AI接管电脑远程操作,使用自然语言控制电脑的开源项目

NeoAI: Open source project that lets AI take over remote operation of computers and control them using natural language

General Introduction NeoAI is an innovative open source AI assistant tool that allows users to easily control and manage their computers through natural language conversations. Without writing any code, users can simply use everyday conversations to find files, automate tasks, manage devices, etc.NeoAI...
1yrs ago
076.2K
Deep Live Cam:开源的实时AI换脸工具,一张照片就能实现实时换脸直播

Deep Live Cam: open source real-time AI face-swapping tool, a photo can realize real-time face-swapping live

General Introduction Deep Live Cam is an open source artificial intelligence tool designed to enable real-time face replacement and deep fake video generation from a single photo. The tool utilizes advanced deep learning algorithms to enable real-time face replacement in live streams or video calls, protecting user privacy and adding fun...
1yrs ago
076.1K
Ragas:评估RAG召回QA准确率与答案相关性

Ragas: assessing RAG recall QA accuracy and answer correlation

Comprehensive Introduction Ragas is a tool specifically designed to evaluate and optimize Retrieval Augmented Generation (RAG) systems. It provides a comprehensive set of evaluation metrics by analyzing the relationships between queries, retrieval contexts, and generated answers. These metrics include fidelity, answer relevance, context relevance, on...
1yrs ago
076K
Browser Use Web UI:运行AI智能体浏览网页,让AI能够自动操作网页的开源框架

Browser Use Web UI: an open source framework for running AI intelligences to browse the web, allowing AI to automatically manipulate web pages

Comprehensive Introduction Browser Use Web UI is an innovative open source project focused on providing AI agents with a graphical interface tool for browser interaction capabilities. The project is built on top of the browser-use core framework, built with Gradio ...
9mos ago
075.8K
TRV:将幻灯片/PPT和讲解备注快速生成演讲视频

TRV: Rapidly Generate Presentation Videos from Slides/PPTs and Explanatory Notes

General Introduction TRV is an open source tool, hosted on GitHub, designed to help users quickly convert slides and presentation notes into videos with narration. It automatically generates audio and video content from incoming presentation files through simple command line operations, suitable for those who need to quickly create presentations...
1yrs ago
075.7K
cognee:基于知识图谱构建的RAG开源框架,核心prompts学习

cognee: a RAG open source framework for knowledge graph based construction, core prompts learning

General Introduction Cognee is a reliable data layer solution designed for AI applications and AI agents. Designed to load and build LLM (Large Language Model) contexts to create accurate and interpretable AI solutions through knowledge graphs and vector stores. The framework favors cost-saving, interpretable...
1yrs ago
075.5K
RMBG-2-Studio:批量移除图像和视频背景的开源程序,基于RMBG 2.0优化

RMBG-2-Studio: open source program for batch removal of image and video backgrounds, optimized for RMBG 2.0

General Introduction RMBG-2-Studio is an enhanced background removal and replacement application developed based on the BRIA-RMBG-2.0 model. The application is designed to provide users with efficient and accurate image background processing capabilities for a variety of image types, including e-commerce, gaming and...
1yrs ago
075.4K
MatAnyone: 提取视频指定目标人像的开源工具,生成目标人像视频

MatAnyone: Extract video to specify the target portrait of the open-source tool to generate the target portrait video

General Introduction MatAnyone is an open source project focusing on video keying, developed and released on GitHub by a research team at S-Lab, Nanyang Technological University, Singapore. It provides users with stable and efficient video processing capabilities through coherent memory propagation techniques, especially...
1yrs ago
075.2K
Agent S:像人类一样操作电脑的开源智能体框架

Agent S: An Open Source Framework for Intelligent Bodies to Operate Computers Like Humans

General Introduction Agent S is an open-source framework developed by Simular AI that lets intelligences operate computers like humans through a graphical user interface (GUI). It uses a multimodal large language model and empirical learning techniques to accomplish tasks such as browsing the web, editing documents, using software...
11mos ago
074.4K
SP-MangaEditer:专业四格漫画插图创作工具,生成图像、编辑漫画页面

SP-MangaEditer: Professional four-panel manga illustration creation tool, generating images, editing manga pages

General Introduction SP-MangaEditer is an independent manga editing platform designed for manga creators. The platform supports image generation, layer editing, image adjustment, filter application and many other functions to help users easily create high-quality manga illustrations. Users can simply manipulate...
1yrs ago
074.4K
Goose:开源可扩展的编程智能体,自动化执行编程全流程任务

Goose: open source scalable programming intelligences that automate the full range of programming tasks

General Introduction Goose is an open source AI agent tool developed by Block, Inc. designed to help developers automate everyday development tasks. It supports a wide range of Large Language Models (LLMs) and interacts with users via the command line or desktop application interfaces.Goose can perform a wide range of tasks from agent...
1yrs ago
073.9K
AI Hedge Fund:开源自动化交易系统,利用多智能体进行复杂对冲基金交易决策

AI Hedge Fund: open-source automated trading system utilizing multiple intelligences for complex hedge fund trading decisions

General Introduction AI Hedge Fund is an artificial intelligence hedge fund that utilizes a multi-agent system for trading decisions. The system works in concert with multiple specialized agents, including market data agents, quantitative agents, risk management agents, and portfolio management agents, to achieve complex trading...
1yrs ago
073.8K
RealtimeVoiceChat:低延迟与AI进行自然口语对话

RealtimeVoiceChat: low-latency natural spoken conversation with AI

General Introduction RealtimeVoiceChat is an open source project focused on real-time, natural conversations with artificial intelligence via voice. Users use a microphone to input their voice, and the system captures the audio through a browser, quickly converts it to text, and a large-scale language model (LLM) generates back...
10mos ago
073.5K
小红书AI运营助手:自动生成和发布小红书文章

Xiaohongshu AI operation assistant: automatically generate and publish Xiaohongshu articles

Comprehensive Introduction Xiaohongshu AI Operation Assistant (xhsaipublisher) is an automation tool designed for publishing articles on the Xiaohongshu platform. The program combines a graphical user interface with automation scripts that utilize big model technology to generate content and automatically log in and publish via browser...
1yrs ago
073.3K
Dify-WebUI:基于Dify API的桌面智能对话客户端,提供企业级AI对话能力

Dify-WebUI: Desktop Intelligent Conversation Client based on Dify API, providing enterprise-grade AI conversation capabilities

Comprehensive Introduction Dify-WebUI is a modern desktop smart conversation app based on the Dify API, designed to provide enterprises with powerful AI conversation capabilities. The application supports a variety of preset theme colors to meet the personalized needs of enterprises, and has a knowledge base management function to support...
1yrs ago
072.6K