AI open source project

Total 1020 articles posts
FinGPT:开源金融大语言模型平台,助力金融分析与预测

FinGPT: Open Source Financial Big Language Modeling Platform for Financial Analytics and Prediction

Comprehensive Introduction FinGPT is an open source financial big language modeling platform developed by the AI4Finance Foundation, designed for the financial sector to solve complex financial tasks and drive innovation in fintech.FinGPT utilizes lightweight adaptation techniques and reinforcement learning approaches...
7mos ago
02.6K
Ragas:评估RAG召回QA准确率与答案相关性

Ragas: assessing RAG recall QA accuracy and answer correlation

Comprehensive Introduction Ragas is a tool specifically designed to evaluate and optimize Retrieval Augmented Generation (RAG) systems. It provides a comprehensive set of evaluation metrics by analyzing the relationships between queries, retrieval contexts, and generated answers. These metrics include fidelity, answer relevance, context relevance, on...
7mos ago
03K
AutoGen:微软开发的多智能体对话框架

AutoGen: A Multi-Intelligent Body Dialog Framework Developed by Microsoft

Comprehensive Introduction AutoGen is an open source framework developed by a team of Microsoft researchers focused on simplifying the building of large language model (LLM) applications through multi-intelligent body conversations. It allows developers to create AI agents that can talk to each other and collaborate to solve tasks. This approach not only improves the performance of LLM...
7mos ago
03.4K
TankWork:通过语音和文字操作电脑,并提供实时语音反馈的智能体

TankWork: an intelligent body that operates computers via voice and text and provides real-time voice feedback

General Introduction TankWork is an open source desktop agent framework designed to enable AI to perceive and control your computer through computer vision and system-level interaction. The framework allows agents to directly control computers through voice and text commands, process real-time screen content, and provide continuous audio visual...
7mos ago
02.9K
XRAG:优化检索增强生成系统的可视化评估工具

XRAG: A Visual Evaluation Tool for Optimizing Retrieval Enhancement Generation Systems

Comprehensive Introduction XRAG (eXamining the Core) is a benchmarking framework designed for evaluating the underlying components of advanced retrieval augmentation generation (RAG) systems. By profiling and analyzing each core module, XRAG provides information on how different configurations and components affect RAG...
7mos ago
02.9K
CHRONOS:新闻时间线总结工具,提升新闻检索和时间线生成效率

CHRONOS: News Timeline Summarization Tool to Improve News Retrieval and Timeline Generation Efficiency

Comprehensive Introduction CHRONOS is a news timeline summarization tool developed by Alibaba NLP team. The tool generates timeline summaries of news events through iterative self-questioning.CHRONOS is not only capable of handling open-domain timeline summarization tasks, but also in terms of efficiency and scalability...
7mos ago
02.5K
X-Dyna:静态人像参考视频姿态生成视频,让小姐姐的照片跳舞

X-Dyna: Static Portrait Reference Video Pose Generation Video to Make Missy's Photos Dance

Comprehensive Introduction X-Dyna is an open source project developed by ByteDance to generate dynamic portrait animations using zero-sample diffusion techniques. The project utilizes facial expressions and body movements in drive video to animate individual portrait images, generating realistic and context-aware motion effects.X-D...
7mos ago
02.7K
腾讯混元3D(Hunyuan3D):生成高分辨率3D资产,多种3D素材生成工作流

Tencent Hybrid 3D (Hunyuan3D): Generate high-resolution 3D assets, multiple 3D material generation workflows

Comprehensive Introduction Tencent Hunyuan3D (Hunyuan3D 2.0) is an advanced large-scale 3D synthesis system from Tencent designed to generate high-resolution textured 3D assets. The system consists of two core components: Hunyuan3D-DiT, a large-scale shape generation model, and Hunyuan3D-DiT, a large-scale texture...
7mos ago
03.6K
RAG Web UI:构建智能文档问答系统,简单构建私有Web端知识库

RAG Web UI: Building an Intelligent Documentation Q&A System and Simply Building a Private Web-Side Knowledge Base

Comprehensive Introduction RAG Web UI is an intelligent dialog system based on RAG (Retrieval Augmented Generation) technology. It helps organizations and individuals build intelligent Q&A systems based on their own knowledge base. By combining document retrieval and large language modeling, RAG Web UI provides accurate and reliable...
7mos ago
02.6K
Kheish:多角色智能体,审查、验证和格式化输出以生成高质量结果

Kheish: multi-actor intelligences that review, validate and format output to produce high quality results

Comprehensive Introduction Kheish is an open source multi-role agent designed for Large Language Model (LLM) tasks that require structured, step-by-step collaboration.Kheish is more than just a simple coordinator, it is an intelligent agent in its own right, requesting modules on demand, integrating user-reversal...
7mos ago
02.8K
AI ContentCraft:生成短故事、对话脚本、配音、配图的多功能AI内容创作工具

AI ContentCraft: a versatile AI content creation tool for generating short stories, dialog scripts, voiceovers, and graphics

General Introduction AI ContentCraft is a versatile content creation tool that integrates text generation, speech synthesis, image generation and more. It helps creators quickly generate stories, podcast scripts, and accompanying audio and video content. The tool supports multiple language conversions and can batch...
7mos ago
03K
Unigraph:构建本地运行的知识图谱和个人搜索引擎

Unigraph: building locally running knowledge graphs and personal search engines

Comprehensive Introduction Unigraph is a local-first general-purpose knowledge graph and personal search engine designed to provide users with an integrated workspace to help manage and search for a wide variety of data in their personal lives. With Unigraph, users can integrate data from different sources into a...
7mos ago
02.7K
MaxKB:开箱即用的AI知识库问答系统,适合智能客服和企业内部知识库

MaxKB: Out-of-the-box AI Knowledge Base Q&A System for Smart Customer Service and In-house Knowledge Base

Comprehensive Introduction MaxKB (Max Knowledge Base) is an open source knowledge base Q&A system based on large language modeling and RAG (Retrieval Augmented Generation). The system is widely used in intelligent customer service, enterprise internal knowledge base, academic research and education and other scenarios.MaxKB...
7mos ago
03.6K
OmniThink:生成高质量长文的写作框架,搜索外部知识后反思并逐步构建知识树

OmniThink: a writing framework for generating high-quality long articles, searching for external knowledge and then reflecting on it and building a knowledge tree step by step

Comprehensive Introduction OmniThink is an innovative machine writing framework designed to generate high-quality, long-form essays by mimicking the iterative expansion and reflection of human cognitive processes. The framework focuses on extending the boundaries of knowledge and generating information that is rich and deep.OmniThink does this by constructing...
7mos ago
02.6K
MiniRAG:简化检索增强生成框架,实体图索引召回相关文本块

MiniRAG: Simplified Retrieval Enhanced Generation Framework, Entity Graph Index Recall Relevant Text Blocks

Comprehensive Introduction MiniRAG is an extremely simple Retrieval Augmented Generation (RAG) framework that aims to enable good RAG performance even for small models through heterogeneous graph indexing and lightweight topology-enhanced retrieval. It is developed by the Data Science Laboratory of the University of Hong Kong (HKUDS) to address ...
7mos ago
03.4K
百聆 (Bailing):低延时的开源语音对话助手,轻松实现自然对话交流

Bailing: a low-latency open source voice dialog assistant that easily realizes natural conversational exchanges

Comprehensive Introduction Bailing (Bailing) is an open source voice conversation assistant designed to engage in natural conversations with users through speech. The project combines speech recognition (ASR), voice activity detection (VAD), large language modeling (LLM) and speech synthesis (TTS) technologies to achieve...
7mos ago
03.3K
AIEvo:创建多智能体协作应用的高效框架

AIEvo: An Efficient Framework for Creating Multi-Intelligent Collaborative Applications

General Introduction AIEvo is Ant Group's open source multi-agent framework designed to efficiently create multi-agent applications. The framework strictly follows the SOP task graph to improve the execution success rate of complex tasks , and through feedback and monitoring mechanisms to ensure high flexibility and scalability.AIEvo has been produced within Ant Group ...
7mos ago
02.5K
BotSharp:基于.NET的多智能体AI应开发与管理平台

BotSharp: .NET-based multi-intelligence body AI should development and management platform

Comprehensive Introduction BotSharp is an open source project based on .NET Core dedicated to providing a comprehensive AI chatbot platform building tool. It uses C# programming, supports cross-platform operation, and aims to simplify the application of machine learning algorithms, enabling enterprise-level developers to efficiently ...
7mos ago
02.9K
OmAgent:构建多模态智能设备的智能体框架

OmAgent: an intelligent body framework for building multimodal smart devices

Comprehensive Introduction OmAgent is a multimodal intelligent body framework developed by Om AI Lab, aiming to provide powerful AI-powered features for smart devices. By integrating state-of-the-art multimodal base models and intelligent body algorithms, the project enables developers to create efficient smart devices on a variety of...
7mos ago
02.7K
Memora:构建人性化AI记忆模块,保存并更新与人类的互动信息

Memora: building humanized AI memory modules to save and update information about interactions with humans

General Introduction Memora is an agent designed to replicate human memories for each personalized AI. It helps AIs remember details of past interactions, emotions, and shared experiences just like humans do through features like timestamped memories, emotion markers, and multimodal memories.Memora supports multi-tenancy and is capable of handling...
7mos ago
03K
Search o1:赋予推理模型主动搜索能力,让大模型边思考边搜索外部知识

Search o1: Empowering inference models to actively search for external knowledge while the larger model is thinking

Comprehensive Introduction Search-o1 is an open source project that aims to enhance the performance of large-scale reasoning models (LRMs) by integrating advanced search mechanisms. The core idea is to solve the knowledge deficit problem encountered in the reasoning process through dynamic search and knowledge integration. The project was developed by sunn...
7mos ago
02.4K
LiveTalking:开源实时互动数字人直播系统,实现音视频同步对话

LiveTalking: open source real-time interactive digital human live system, to achieve synchronous audio and video dialogues

Comprehensive introduction LiveTalking is an open source real-time interactive digital human system , is committed to building high-quality digital human live solution . The project uses the Apache 2.0 open source protocol and integrates a number of cutting-edge technologies , including ER-NeRF rendering , real-time audio and video streaming processing ...
7mos ago
03.9K
JoyGen:音频驱动的3D深度感知人像说话视频编辑工具

JoyGen: Audio-Driven 3D Depth-Sensitive Portrait Talking Video Editing Tool

Comprehensive Introduction JoyGen is an innovative two-stage video generation framework for talking faces, focusing on solving the problem of audio-driven facial expression generation. Developed by a team from Jingdong Technology, the project uses advanced 3D reconstruction techniques and audio feature extraction methods to accurately capture the identity characteristics of the speaker and the expression...
7mos ago
02.8K
Browser Use Web UI:运行AI智能体浏览网页,让AI能够自动操作网页的开源框架

Browser Use Web UI: an open source framework for running AI intelligences to browse the web, allowing AI to automatically manipulate web pages

Comprehensive Introduction Browser Use Web UI is an innovative open source project focused on providing AI agents with a graphical interface tool for browser interaction capabilities. The project is built on top of the browser-use core framework, built with Gradio ...
2mos ago
04.4K