AI open source project

Total 1020 articles posts
dsRAG:用于处理非结构化数据和复杂查询的检索引擎

dsRAG: A Retrieval Engine for Unstructured Data and Complex Queries

Comprehensive Introduction dsRAG is a high-performance retrieval engine designed to handle complex queries on unstructured data. It performs particularly well in handling challenging queries in dense text such as financial reports, legal documents, and academic papers. dsRAG employs three key approaches to improve performance: language...
1yrs ago
056.6K
Unigraph:构建本地运行的知识图谱和个人搜索引擎

Unigraph: building locally running knowledge graphs and personal search engines

Comprehensive Introduction Unigraph is a local-first general-purpose knowledge graph and personal search engine designed to provide users with an integrated workspace to help manage and search for a wide variety of data in their personal lives. With Unigraph, users can integrate data from different sources into a...
1yrs ago
056.6K
Claude生成深度研究报告的MCP服务

Claude's MCP service for generating in-depth research reports

Comprehensive Introduction MCP Server Deep Research is an open source tool that automatically generates structured research reports for complex problems through artificial intelligence and web search. Users enter a research question, and the tool breaks down the question, searches for authoritative information, assesses source credibility...
11mos ago
056.6K
GPT Academic:最佳Arxiv学术论文翻译、纠错与代码解释

GPT Academic: Best Arxiv Academic Paper Translation, Error Correction and Code Interpretation

Comprehensive Introduction GPT Academic is a large language model interaction platform optimized for academic research, providing tools for pragmatic interaction interfaces for large language models such as GPT/GLM, specifically optimized for paper translation, paper reading, touch-ups and writing experience. It uses a modular design...
1yrs ago
056.4K
Pyramid Flow:快手推出的开源版

Pyramid Flow: an open source version of "Kringle" launched by Racer, based on SD3 and running on GPUs of less than 8GB (one-click deployment version)

Comprehensive Introduction Pyramid Flow is an efficient autoregressive video generation method based on the Flow Matching technique. The method achieves higher computational efficiency in generating and decompressing video content by interpolating between different resolutions and noise levels...
1yrs ago
056.2K
Dynamiq:智能体编排框架,支持RAG和LLM代理,简化AI应用开发

Dynamiq: Intelligent Body Orchestration Framework with RAG and LLM Agent Support to Simplify AI Application Development

Comprehensive Introduction Dynamiq is an open source AI orchestration framework designed for agent AI and Large Language Model (LLM) applications. It is designed to simplify the development of AI-driven applications, especially in the area of Retrieval Augmented Generation (RAG) and the orchestration of LLM agents.Dynamiq proposes...
1yrs ago
056.2K
SciToolAgent:整合500+科研工具,自动化研究科研任务的智能体

SciToolAgent: Integration of 500+ research tools and automation of research and scientific tasks for intelligent bodies

Comprehensive Introduction SciToolAgent is an open source tool platform developed by the Innovation Center of Zhejiang University in Hangzhou (HICAI-ZJU). It integrates more than 500 scientific tools through knowledge graph (SciToolKG) and big language modeling technologies to help researchers deal with...
1yrs ago
056.1K
XiaoYuanKouSuan_Auto:小猿口算自动答题工具,高效解决口算题目

XiaoYuanKouSuan_Auto: XiaoYuanKouSuan automatic question and answer tool, efficiently solving oral arithmetic questions

Comprehensive introduction Ape Mouth Calculator Automatic Question Answer Tool is a Python based open source project designed to efficiently solve the questions in the Ape Mouth Calculator application through OCR recognition and automation scripts. The tool utilizes technologies such as OpenCV and Tesseract to be able to recognize the questions on the screen in real time...
2yrs ago
055.9K
Rankify:支持信息检索与重排序的Python工具包

Rankify: a Python toolkit supporting information retrieval and reordering

General Introduction Rankify is an open source Python toolkit developed by the Data Science Group at the University of Innsbruck, Austria. It focuses on information retrieval, reordering and retrieval augmentation generation (RAG), providing a unified framework. The toolkit comes with a built-in set of 40 pre-retrieved benchmarks...
1yrs ago
055.8K
AgentIQ:灵活连接和管理AI智能体的开源工具

AgentIQ: An open source tool for flexible connection and management of AI intelligences

General Introduction AgentIQ is an open source tool from NVIDIA designed to help developers efficiently connect and manage AI intelligences. It enables intelligences from different frameworks to seamlessly collaborate, connect enterprise data and tools, and build workflows like calling functions. The tool's biggest...
1yrs ago
055.8K
ChainForge:测试和评估大型语言模型提示效果的开源可视化编程环境

ChainForge: An Open Source Visual Programming Environment for Testing and Evaluating the Effectiveness of Large Language Model Hints

Comprehensive Introduction ChainForge is an open source visual programming environment designed for testing and evaluating the effectiveness of Large Language Model (LLM) cues. It provides a data flow cueing engineering environment through which users can quickly explore and analyze the quality of different cues on LLM response...
1yrs ago
055.6K
Kheish:多角色智能体,审查、验证和格式化输出以生成高质量结果

Kheish: multi-actor intelligences that review, validate and format output to produce high quality results

Comprehensive Introduction Kheish is an open source multi-role agent designed for Large Language Model (LLM) tasks that require structured, step-by-step collaboration.Kheish is more than just a simple coordinator, it is an intelligent agent in its own right, requesting modules on demand, integrating user-reversal...
1yrs ago
055.5K
Story-Flicks:输入主题自动生成儿童短故事视频

Story-Flicks: Input topics to automatically generate children's short story videos

Comprehensive Introduction Story-Flicks is an open source AI tool focused on helping users quickly generate HD story videos. Users only need to input a story topic, and the system will generate the story content through a large language model, and combine the AI-generated images, audio and subtitles to output a complete video...
1yrs ago
055.5K
DisPose:生成人体姿态精准控制的视频,创作跳舞的小姐姐

DisPose: generating videos with precise control of human posture, creating dancing ladies

General Introduction DisPose is an innovative open source artificial intelligence project focused on controlled character image animation generation. Developed by a team of researchers and open-sourced on GitHub, the project uses advanced deep learning techniques to achieve precise character animation control by decomposing skeletal pose information.D...
1yrs ago
055.1K
Marco-o1:基于Qwen2-7B-Instruct微调的开源版OpenAI o1模型,探索开放式推理模型,解决复杂问题

Marco-o1: An Open Source Version of the OpenAI o1 Model Based on Qwen2-7B-Instruct Fine-Tuning to Explore Open Inference Models for Solving Complex Problems

Comprehensive Introduction Marco-o1 is an open reasoning model developed by Alibaba International Digital Commerce Group (AIDC-AI) to solve complex real-world problems. The model combines Chain of Thought (CoT) fine-tuning, Monte Carlo Tree Search (MCTS), and innovative reasoning strategies...
1yrs ago
055K
TextDistiller:一键总结一整本书,高效提炼书籍内容,快速掌握核心思想

TextDistiller: summarize an entire book in one click, efficiently distill the content of the book, quickly grasp the core ideas

Comprehensive Introduction TextDistiller is an advanced AI-driven tool designed to summarize books chapter-by-chapter or as a whole, providing a concise yet comprehensive overview. By using TextDistiller, users are able to quickly grasp the core ideas and key points of any book...
1yrs ago
054.9K
STORM:基于Topic搜索网络数据,生成带引用的论文、长文报告

STORM: Search web data based on Topic to generate papers with citations, long paper reports

General Introduction STORM is a knowledge integration and article generation system developed by the Oval team at Stanford University. It focuses on generating exhaustive Wikipedia-like articles (systematic papers) from scratch. The system utilizes large-scale language models for topic research, preparing synopses and simulating actual interconnected...
1yrs ago
054.6K
muAgent:由 LLM 和 EKG(行业知识)驱动的全新Agent编排框架

muAgent: A New Agent Orchestration Framework Driven by LLM and EKG (Industry Knowledge)

General Introduction muAgent is an innovative multi-intelligentsia framework developed by Ant Group. The framework collaborates with multi-intelligentsia, function calls, code interpreters and other technologies through canvas drag-and-drop and simple text writing to help users execute various complex standard operating procedures (SOPs) under human guidance...
1yrs ago
054.6K
中文基于满血 DeepSeek-R1 蒸馏数据集,支持中文R1蒸馏SFT数据集

Chinese based full-blooded DeepSeek-R1 distillation dataset, supports Chinese R1 distillation SFT dataset

Comprehensive Introduction The Chinese DeepSeek-R1 distillation dataset is an open source Chinese dataset containing 110K pieces of data designed to support machine learning and natural language processing research. The dataset is released by Cong Liu's NLP team. The dataset contains not only mathematical data, but also a large number of general types...
1yrs ago
054.4K
Rowfill:批量提取文档结构化信息并自动化分析

Rowfill: Batch Extraction of Structured Information from Documents and Automated Analysis

General Introduction Rowfill is an open source document processing platform designed for knowledge workers. It uses advanced artificial intelligence techniques to extract, analyze and process data from complex documents, images and PDFs.Rowfill supports Native Large Language Model (LLM) and Ope...
1yrs ago
054.3K
X-Dyna:静态人像参考视频姿态生成视频,让小姐姐的照片跳舞

X-Dyna: Static Portrait Reference Video Pose Generation Video to Make Missy's Photos Dance

Comprehensive Introduction X-Dyna is an open source project developed by ByteDance to generate dynamic portrait animations using zero-sample diffusion techniques. The project utilizes facial expressions and body movements in drive video to animate individual portrait images, generating realistic and context-aware motion effects.X-D...
1yrs ago
054.3K
LiberSonora:有声书字幕提取与多语言翻译,有声小说转录为多语言

LiberSonora: Audiobook Subtitle Extraction and Multilingual Translation, Audiobook Transcription into Multiple Languages

General Introduction LiberSonora, which means "free sound", is a powerful AI-enabled open source audiobook toolset. The toolset supports intelligent subtitle extraction, AI title generation, multi-language translation, etc., and is capable of batch offline processing under GPU acceleration.LiberSo...
1yrs ago
053.9K
DB-GPT:构建AI原生数据应用开发框架,集成多模型管理与智能数据处理

DB-GPT: Building AI Native Data Application Development Framework, Integrating Multi-Model Management and Intelligent Data Processing

Comprehensive Introduction DB-GPT is an open source AI native data application development framework built using AWEL (Agentic Workflow Expression Language) and smart body technology. The project aims to build infrastructure in the field of large modeling...
1yrs ago
053.9K