AI open source project

Total 1020 articles posts
LangManus:支持多智能体协作的开源AI自动化框架

LangManus: an open source AI automation framework supporting multi-intelligence collaboration

General Introduction LangManus is an open source AI automation framework hosted on GitHub. Developed by a group of former colleagues in their spare time, it is an academically-driven project with the goal of combining language models and specialized tools to accomplish web search, data crawling, and code execution...
7mos ago
022.5K
Aggregator:一站式代理爬取与聚合平台,免费代理池(请合规使用)

Aggregator: one-stop agent crawling and aggregation platform, free agent pool (please use in compliance)

Comprehensive introduction Aggregator is an open source project aimed at creating a free proxy pool that can crawl a variety of available proxy nodes. The platform has a flexible plug-in system , the user can according to the special needs of the target site , through plug-ins to achieve specific functions . The project is mainly used to learn to crawl ...
11mos ago
022.5K
CogView4:生成中英双语高清图片的开源文生图模型

CogView4: An Open Source Literature Graph Model for Generating Bilingual HD Images

General Introduction CogView4 is an open source text-to-graph model developed by the KEG Lab (THUDM) at Tsinghua University, focusing on converting text descriptions into high-quality images. It supports bilingual cue word input, and is especially good at understanding Chinese cues and generating images with Chinese characters, non...
7mos ago
022.5K
SmartRead:自动标注技术PDF文档并提供相关引用源

SmartRead: Automatically annotate technical PDF documents and provide relevant citation sources

Comprehensive Introduction SmartRead is an AI-based open source tool designed for technical documents. It can automatically analyze PDF files, mark key content, such as important terms, titles or core ideas to help users quickly understand complex documents. At the same time, it can also provide with the main document...
7mos ago
022.4K
Story-Flicks:输入主题自动生成儿童短故事视频

Story-Flicks: Input topics to automatically generate children's short story videos

Comprehensive Introduction Story-Flicks is an open source AI tool focused on helping users quickly generate HD story videos. Users only need to input a story topic, and the system will generate the story content through a large language model, and combine the AI-generated images, audio and subtitles to output a complete video...
7mos ago
022.4K
AIEvo:创建多智能体协作应用的高效框架

AIEvo: An Efficient Framework for Creating Multi-Intelligent Collaborative Applications

General Introduction AIEvo is Ant Group's open source multi-agent framework designed to efficiently create multi-agent applications. The framework strictly follows the SOP task graph to improve the execution success rate of complex tasks , and through feedback and monitoring mechanisms to ensure high flexibility and scalability.AIEvo has been produced within Ant Group ...
9mos ago
022.4K
AigoTools:自动收录网站并支持多语言的开源AI工具导航站

AigoTools: automatic inclusion of the site and support for multilingual open source AI tools navigation station

General Introduction AigoTools is an open source AI web site navigation designed to help users quickly create and manage navigation sites. It has built-in site management and AI-based auto-inclusion features , support for multi-language , dark/light theme switching , and SEO optimization.AigoTools proposes ...
12mos ago
022.3K
VideoSeal:先进的开源视频隐藏水印嵌入与提取工具,保护视频版权

VideoSeal: Advanced open source video hidden watermark embedding and extraction tools to protect video copyrights

General Introduction VideoSeal is an open source video watermarking tool developed by Facebook Research, designed to provide efficient video watermark embedding and extraction. The tool supports the latest open source models and contains pre-trained models, training code, inference code and evaluation tools...
10mos ago
022.3K
GPTme:在命令行终端中运行的智能编程助手,ChatGPT代码解释器的本地化替代方案

GPTme: Intelligent Programming Assistant Running in a Command Line Terminal, Localized Alternative to ChatGPT Code Interpreter

Comprehensive Introduction GPTMe is a revolutionary terminal AI assistant tool designed to enhance developers' work efficiency. It perfectly combines powerful AI capabilities with the terminal environment, supporting diverse functions such as code execution, file editing, web browsing and visual recognition. As ChatGPT code solving...
10mos ago
022.3K
BotSharp:基于.NET的多智能体AI应开发与管理平台

BotSharp: .NET-based multi-intelligence body AI should development and management platform

Comprehensive Introduction BotSharp is an open source project based on .NET Core dedicated to providing a comprehensive AI chatbot platform building tool. It uses C# programming, supports cross-platform operation, and aims to simplify the application of machine learning algorithms, enabling enterprise-level developers to efficiently ...
9mos ago
022.3K
TxAgent:帮医生分析药物作用和治疗方案的AI工具

TxAgent: the AI tool that helps doctors analyze drug effects and treatment options

Comprehensive Introduction TxAgent is an open-source AI tool developed by Harvard University's Medical and Scientific Artificial Intelligence Team (MIMS) to help physicians analyze drug interactions and develop personalized treatment plans. It combines patient-specific situations through multi-step reasoning and real-time retrieval of biomedical knowledge...
7mos ago
022.2K
Reactive Resume:支持多语言、多模板的开源免费简历生成器

Reactive Resume: open source free resume builder with multi-language and multi-template support

General Description Reactive Resume is a free and open source resume builder designed to simplify the process of creating, updating and sharing resumes. The platform focuses on user privacy with no user tracking or advertising. Users can self-host the app in less than 30 seconds, taking full control of their...
10mos ago
022.2K
TextDistiller:一键总结一整本书,高效提炼书籍内容,快速掌握核心思想

TextDistiller: summarize an entire book in one click, efficiently distill the content of the book, quickly grasp the core ideas

Comprehensive Introduction TextDistiller is an advanced AI-driven tool designed to summarize books chapter-by-chapter or as a whole, providing a concise yet comprehensive overview. By using TextDistiller, users are able to quickly grasp the core ideas and key points of any book...
10mos ago
022.2K
Claude生成深度研究报告的MCP服务

Claude's MCP service for generating in-depth research reports

Comprehensive Introduction MCP Server Deep Research is an open source tool that automatically generates structured research reports for complex problems through artificial intelligence and web search. Users enter a research question, and the tool breaks down the question, searches for authoritative information, assesses source credibility...
5mos ago
022.2K
ReCamMaster:从单一视频生成多视角视频的渲染工具

ReCamMaster: Rendering Tool for Generating Multi-View Videos from a Single Video

General Introduction ReCamMaster is an open source video processing tool, the core function is to generate new camera views from a single video. Users can specify the camera track and re-render the video to get a dynamic picture with different angles. It is developed by a team of Zhejiang University and Racer Technology, based on text-to...
6mos ago
022.1K
OmniThink:生成高质量长文的写作框架,搜索外部知识后反思并逐步构建知识树

OmniThink: a writing framework for generating high-quality long articles, searching for external knowledge and then reflecting on it and building a knowledge tree step by step

Comprehensive Introduction OmniThink is an innovative machine writing framework designed to generate high-quality, long-form essays by mimicking the iterative expansion and reflection of human cognitive processes. The framework focuses on extending the boundaries of knowledge and generating information that is rich and deep.OmniThink does this by constructing...
9mos ago
022.1K
CHRONOS:新闻时间线总结工具,提升新闻检索和时间线生成效率

CHRONOS: News Timeline Summarization Tool to Improve News Retrieval and Timeline Generation Efficiency

Comprehensive Introduction CHRONOS is a news timeline summarization tool developed by Alibaba NLP team. The tool generates timeline summaries of news events through iterative self-questioning.CHRONOS is not only capable of handling open-domain timeline summarization tasks, but also in terms of efficiency and scalability...
9mos ago
022.1K
VoAPI:高颜值的AI模型转发接口管理系统,官网每日提供免费API额度

VoAPI: High-value AI model forwarding interface management system, the official website provides free API quota on a daily basis

Comprehensive Introduction VoAPI is a new high-color and high-performance AI model interface management and distribution system, which is mainly used for personal or enterprise internal management and distribution channels. Developed based on NewAPI, the system provides rich functional modules and optimized user interface, aiming to enhance...
11mos ago
022K
TripoSF:快速生成高分辨率3D模型的实用工具

TripoSF: A useful tool for quickly generating high-resolution 3D models

Comprehensive Introduction TripoSF is an open source project built by the VAST-AI-Research team, specifically designed to quickly generate high-resolution 3D models from a single image. It uses a technique called SparseFlex, which has high processing efficiency and is able to generate high-resolution 3D models from a single image in a general...
7mos ago
022K
CogView3:智谱轻言开源的级联扩散文本生成图像模型

CogView3: Wisdom Spectrum Light Word open source cascade diffusion text to generate image models

Comprehensive Introduction CogView3 is an advanced text generation image system developed by Tsinghua University and Think Tank Team (Chi Spectrum Qingyan). It is based on a cascading diffusion model to generate high-resolution images through multiple stages.The key features of CogView3 include multi-stage generation, innovative architecture and efficient performance...
1yrs ago
022K
PantoMatrix(EMAGE):全身手势生成框架,从音频生成全身手势的3D动画框架

PantoMatrix (EMAGE): full-body gesture generation framework, 3D animation framework for generating full-body gestures from audio

Comprehensive Introduction PantoMatrix is an advanced full-body gesture generation framework capable of generating complete human movements from audio and partial gestures, including face, partial body, hand and full-body movements. The framework utilizes the latest multimodal datasets and deep learning techniques to provide high-quality 3D...
11mos ago
021.9K
MM-EUREKA:探索视觉推理的多模态强化学习工具

MM-EUREKA: A Multimodal Reinforcement Learning Tool for Exploring Visual Reasoning

Comprehensive Introduction MM-EUREKA is an open source project developed by Shanghai Artificial Intelligence Laboratory, Shanghai Jiao Tong University and other parties. It extends textual reasoning capabilities to multimodal scenarios through rule-based reinforcement learning techniques to help models process image and textual information. The core of this tool...
7mos ago
021.8K
dsRAG:用于处理非结构化数据和复杂查询的检索引擎

dsRAG: A Retrieval Engine for Unstructured Data and Complex Queries

Comprehensive Introduction dsRAG is a high-performance retrieval engine designed to handle complex queries on unstructured data. It performs particularly well in handling challenging queries in dense text such as financial reports, legal documents, and academic papers. dsRAG employs three key approaches to improve performance: language...
8mos ago
021.8K
Vision is All You Need:使用视觉语言模型构建智能文档检索系统(Vision RAG)

Vision is All You Need: Building an Intelligent Document Retrieval System Using Visual Language Models (Vision RAG)

Comprehensive Introduction Vision-is-all-you-need is an innovative visual RAG (Retrieval Augmented Generation) system demonstration project that breaks new ground in applying Visual Language Modeling (VLM) to the document processing domain. Unlike traditional text chunking methods, the system directly makes...
9mos ago
021.8K
XiaoYuanKouSuan_Auto:小猿口算自动答题工具,高效解决口算题目

XiaoYuanKouSuan_Auto: XiaoYuanKouSuan automatic question and answer tool, efficiently solving oral arithmetic questions

Comprehensive introduction Ape Mouth Calculator Automatic Question Answer Tool is a Python based open source project designed to efficiently solve the questions in the Ape Mouth Calculator application through OCR recognition and automation scripts. The tool utilizes technologies such as OpenCV and Tesseract to be able to recognize the questions on the screen in real time...
1yrs ago
021.8K
PraisonAI:低代码多智能体框架,简化复杂任务的自动化解决方案

PraisonAI: A Low-Code Multi-Intelligent Body Framework to Simplify Automation Solutions for Complex Tasks

Comprehensive Introduction PraisonAI is an out-of-the-box multi-intelligence body framework for production environments, designed to create AI intelligences to automate and solve problems ranging from simple tasks to complex challenges. The framework provides a low-code solution that simplifies the building of multi-intelligent body LLM systems and...
8mos ago
021.7K
SpeechGPT 2.0-preview:实时交互的端到端拟人语音对话大模型

SpeechGPT 2.0-preview: an end-to-end anthropomorphic speech dialog grand model for real-time interaction

SpeechGPT 2.0-preview is the first anthropomorphic real-time interaction system introduced by OpenMOSS, which is trained based on millions of hours of speech data. The system is equipped with anthropomorphic spoken expression and 100ms low latency response, supporting natural and smooth real...
9mos ago
021.7K