AI open source project

Total 1020 articles posts
Dynamiq:智能体编排框架,支持RAG和LLM代理,简化AI应用开发

Dynamiq: Intelligent Body Orchestration Framework with RAG and LLM Agent Support to Simplify AI Application Development

Comprehensive Introduction Dynamiq is an open source AI orchestration framework designed for agent AI and Large Language Model (LLM) applications. It is designed to simplify the development of AI-driven applications, especially in the area of Retrieval Augmented Generation (RAG) and the orchestration of LLM agents.Dynamiq proposes...
11mos ago
024.4K
FiveThirtyNine:基于搜索知识对未来事件发生概率预测

FiveThirtyNine: Predicting the probability of future events based on search knowledge

Comprehensive Introduction Forecast AI is a superb forecasting platform based on advanced artificial intelligence technology. It utilizes powerful data analytics and machine learning algorithms to provide users with highly accurate predictions of future events. Whether it's political elections, economic trends or social events, Forecast ...
1yrs ago
024.4K
BuffGPT:企业级生成式AI应用低代码开发平台

BuffGPT: A Low-Code Development Platform for Enterprise-Grade Generative AI Applications

Comprehensive Introduction BuffGPT is an open source AI application development platform based on the Large Language Model (LLM), providing out-of-the-box features such as data processing, model invocation, RAG retrieval, and visual workflow orchestration to help users easily build and operate generative AI applications. The platform supports privatization...
7mos ago
024.4K
DocsGPT:文档聊天助手,从单个文档、网站来源获取可靠的答案,支持本地部署

DocsGPT: Document Chat Assistant, get reliable answers from single documents, web sources, support local deployment

General Introduction DocsGPT is an open source documentation assistant designed to simplify the process of querying project documentation. By integrating a powerful GPT model , developers can easily ask questions about the project and get accurate answers.DocsGPT supports local deployment to ensure data privacy while...
11mos ago
024.4K
VideoRAG:理解超长视频的RAG框架,支持多模态检索和知识图谱构建

VideoRAG: A RAG framework for understanding ultra-long videos with support for multimodal retrieval and knowledge graph construction

Comprehensive Introduction VideoRAG is a retrieval-enhanced generative framework designed for processing and understanding very long contextual videos. The tool combines a graph-driven textual knowledge base with hierarchical multimodal context encoding to efficiently process on a single NVIDIA RTX 3090 GPU...
8mos ago
024.3K
AppAgent:利用多模态智能体自动操作智能手机

AppAgent: automated smartphone operation using multimodal intelligences

Comprehensive Introduction AppAgent is a large language model (LLM)-based multimodal agent framework designed to manipulate smartphone applications. The framework mimics human interactions such as taps and swipes through a simplified manipulation space, thus eliminating the need for system back-end access and extending its use across different app...
10mos ago
024.3K
飞桨 PP-TableMagic:复杂表格结构化信息提取神器

Flying Paddle PP-TableMagic: Structured Information Extraction for Complex Tables

The goal of table recognition is to parse tables in images, accurately identify table structures and cell locations, and reduce them to structured table formats (e.g., HTML). In today's information age, a large amount of important tabular data still exists in an unstructured state (e.g., scanned documents with pictures of statistical tables...).
7mos ago
024.2K
Knowledge Table:高效提取与探索结构化数据的开源工具

Knowledge Table: an open source tool for efficient extraction and exploration of structured data

Comprehensive Introduction Knowledge Table (Knowledge Table) is an open source project designed to simplify the process of extracting and exploring structured data from unstructured documents. Users can create structured knowledge representations such as tables and graphs through a natural language query interface. The tool supports customizing the extraction ...
1yrs ago
024.2K
Maxun:开源无代码平台,自动抓取网页数据并转换为API或电子表格

Maxun: open source no-code platform that automatically crawls web data and converts it to APIs or spreadsheets

Comprehensive Introduction Maxun is an open source no-code web data extraction platform that allows users to train robots in minutes to automatically crawl web data and convert it into APIs or spreadsheets. The platform supports paging and scrolling, can adapt to changes in website layout, provides powerful data crawling...
9mos ago
024.2K
Moondream:批量反推图像提示词的开源轻量级视觉语言模型

Moondream: an open source lightweight visual language model for batch backpropagation of image cue words

Comprehensive Introduction Moondream is an open source lightweight visual language model designed to enable image description capabilities through deep learning and computer vision techniques. The model is able to run efficiently on a variety of platforms and is particularly suitable for edge devices.Moondream uses advanced techniques and...
9mos ago
024.2K
PhiData:构建拥有记忆、知识和工具的AI智能体

PhiData: Building AI Intelligence with Memory, Knowledge and Tools

Comprehensive Introduction PhiData is a framework designed for developing intelligent AI assistants. It enables AI assistants to have long conversations, provide accurate business context, and perform various operations through enhanced memory, knowledge integration, and tool invocation capabilities.PhiData not only enhances AI assistant...
7mos ago
024.1K
Cooragent:一句话构建多智能体任务协作工具

Cooragent: building a multi-intelligence task collaboration tool in one sentence

General Introduction Cooragent is an open source AI agent collaboration framework developed by LeapLab at Tsinghua University and hosted on GitHub.It allows users to create intelligent AI agents with a one-sentence description and supports multiple agents to collaborate on complex tasks. The framework provides two...
5mos ago
024.1K
X-Dyna:静态人像参考视频姿态生成视频,让小姐姐的照片跳舞

X-Dyna: Static Portrait Reference Video Pose Generation Video to Make Missy's Photos Dance

Comprehensive Introduction X-Dyna is an open source project developed by ByteDance to generate dynamic portrait animations using zero-sample diffusion techniques. The project utilizes facial expressions and body movements in drive video to animate individual portrait images, generating realistic and context-aware motion effects.X-D...
9mos ago
024.1K
Diffbot GraphRAG LLM:依赖外部实时知识图谱数据的LLM推理服务

Diffbot GraphRAG LLM: LLM reasoning service relying on external real-time knowledge graph data

Comprehensive Introduction Diffbot LLM Reasoning Server is an innovative large-scale language modeling system with special optimizations and improvements based on the LLama model architecture. The most important feature of the project is the integration of real-time Knowledge Graph with retrieval-enhanced generation...
9mos ago
024.1K
AI投资系统:自动化A股投资决策系统,利用多智能体系统分析市场数据

AI investment system: automated A-share investment decision-making system that utilizes a multi-intelligence system to analyze market data

Comprehensive Introduction A_Share_investment_Agent is an A-share investment decision aid based on a multi-intelligence system. The system is designed to analyze market data, calculate the intrinsic value of stocks, analyze market sentiment, and fundamental data through multiple collaborative intelligences to...
9mos ago
024K
wdoc:从海量、多源文档中检索内容并总结知识

wdoc: retrieve content and summarize knowledge from massive, multi-source documents

Comprehensive Introduction wdoc is a powerful RAG (Retrieval Augmentation Generation) system designed for processing and analyzing large and diverse documents. It is capable of retrieving from a wide range of document types, including PDFs, web pages, YouTube videos, audio files, etc. wdoc is particularly well suited for processing...
8mos ago
024K
DB-GPT:构建AI原生数据应用开发框架,集成多模型管理与智能数据处理

DB-GPT: Building AI Native Data Application Development Framework, Integrating Multi-Model Management and Intelligent Data Processing

Comprehensive Introduction DB-GPT is an open source AI native data application development framework built using AWEL (Agentic Workflow Expression Language) and smart body technology. The project aims to build infrastructure in the field of large modeling...
7mos ago
024K
HN中文播客:自动抓取热门科技文章,AI生成中文总结并转换为播客

HN Chinese Podcast: Automatically grab popular tech articles, AI-generated Chinese summaries and convert them to podcasts

General Introduction The Hacker News Chinese Podcast project is an innovative platform based on AI technology, aiming to automatically grab popular articles on Hacker News every day and generate Chinese summaries and podcast content through AI. The project is led by ccbikai ...
8mos ago
024K
RealtimeVoiceChat:低延迟与AI进行自然口语对话

RealtimeVoiceChat: low-latency natural spoken conversation with AI

General Introduction RealtimeVoiceChat is an open source project focused on real-time, natural conversations with artificial intelligence via voice. Users use a microphone to input their voice, and the system captures the audio through a browser, quickly converts it to text, and a large-scale language model (LLM) generates back...
5mos ago
024K
VideoMind:视频按时间戳定位内容与问答的开源项目

VideoMind: video by timestamp positioning content and Q&A open source project

General Introduction VideoMind is an open source multimodal AI tool focused on inference, Q&A and summary generation for long videos. It was developed by Ye Liu of the Hong Kong Polytechnic University and a team from Show Lab at the National University of Singapore. The tool mimics human understanding of video...
4mos ago
023.9K
xyks:小猿口算逆向笔记,逆向工程与解密算法

xyks: small ape oral math reverse notes, reverse engineering and decryption algorithms

Comprehensive Introduction Ape Mouth Calculator Reverse Notes is an open source project that aims to document and share the process and methods of reverse engineering the Ape Mouth Calculator application. The project contains a variety of reverse tools and techniques to use the instructions , such as Frida, dexdump , etc., to help users understand and crack the little ape oral math add...
1yrs ago
023.9K
ExtractThinker:提取和分类文档为结构化数据,优化文档处理流程

ExtractThinker: extracting and classifying documents into structured data to optimize the document processing flow

Comprehensive Introduction ExtractThinker is a flexible document intelligence tool that utilizes Large Language Models (LLMs) to extract and classify structured data from documents, providing a seamless ORM-like document processing workflow. It supports a variety of document loaders, including Tess...
9mos ago
023.9K
TransRouter:基于Gemini多模态模型,实时中英互译的音频转换工具

TransRouter: A Real-Time Audio Conversion Tool for Chinese-to-English Translation Based on Gemini Multimodal Modeling

TransRouter is a real-time voice translation tool based on Google's Gemini model, specifically designed for real-time voice translation between English and Chinese. The tool can be seamlessly integrated into video conferencing software such as Zoom, providing an easy way for cross-language...
9mos ago
023.9K
ColorFlow:漫画着色,黑白图像自动着色,提升图像色彩一致性和质量

ColorFlow: Comic book coloring, automatic coloring of black and white images to improve image color consistency and quality

Comprehensive Introduction ColorFlow is an image sequence auto-coloring tool developed by Tencent's ARC team to solve the problem of auto-coloring black and white image sequences. The tool utilizes a retrieval-enhanced coloring pipeline to accurately generate the colors of various elements through a pool of reference images, including the character's hair color and service...
10mos ago
023.9K
OpenAOE:大模型群聊框架:同时与多个大语言模型聊天

OpenAOE: Large Model Group Chat Framework: Chatting with Multiple Large Language Models Simultaneously

Comprehensive Introduction OpenAOE is an open source large model group chat framework, aiming to solve the problem of the lack of chat frameworks in the current market with multiple models responding in parallel. With OpenAOE, users can talk to multiple Large Language Models (LLMs) at the same time and get parallel output. The framework supports ...
8mos ago
023.8K
Rankify:支持信息检索与重排序的Python工具包

Rankify: a Python toolkit supporting information retrieval and reordering

General Introduction Rankify is an open source Python toolkit developed by the Data Science Group at the University of Innsbruck, Austria. It focuses on information retrieval, reordering and retrieval augmentation generation (RAG), providing a unified framework. The toolkit comes with a built-in set of 40 pre-retrieved benchmarks...
7mos ago
023.7K