AI Open Source Projects

1020 articles in total
AigoTools: an open source AI tool navigation site with automatic site inclusion and multilingual support

General Introduction: AigoTools is an open source AI website navigation project designed to help users quickly create and manage navigation sites. It has built-in site management and AI-based automatic site inclusion, and it supports multiple languages, dark/light theme switching, and SEO optimization. AigoTools proposes ...
10mos ago
02.8K
AnkiAIUtils: an AI toolset for Anki flashcard learning, an intelligent assistant that automatically optimizes flashcards

General Description: AnkiAIUtils is a set of AI-enhanced tools for the Anki flashcard learning system. Developed by a medical student, it uses AI to automatically improve the cards that users struggle with while studying. It can intelligently provide users with personalized...
7mos ago
02.8K
DB-GPT: an AI-native data application development framework integrating multi-model management and intelligent data processing

Comprehensive Introduction: DB-GPT is an open source AI-native data application development framework built using AWEL (Agentic Workflow Expression Language) and agent technology. The project aims to build infrastructure in the field of large models...
5mos ago
02.8K
OmniSVG: an open source project for generating SVG vector graphics from text and images

General Introduction: OmniSVG is an open source project focused on generating high-quality vector graphics (SVG) with a multimodal model. It uses pre-trained vision-language models to support SVG generation from text descriptions or image input, covering a wide range of scenarios from simple icons to complex anime characters. The project ...
4mos ago
02.8K
tldraw: an open source infinite canvas whiteboard SDK, with AI generation of minimalist wireframes and UML diagrams

General Description: tldraw is a free, real-time collaborative drawing tool that provides an infinite canvas where users can quickly draw shapes, write text, and collaborate in real time. With an intuitive interface and excellent performance, it is suitable for team collaboration and remote work. Supported through the open source community, tldr...
9mos ago
02.8K
Story-Adapter: generating continuous, stylistically consistent illustrations from long stories

General Introduction: Story-Adapter is a story visualization framework that converts textual stories into coherent image sequences. Developed by researchers, the project uses a training-free iterative approach to generate high-quality story illustrations. The framework is characterized by its ability to handle long...
7mos ago
02.8K
MindSearch: an open source AI search engine framework to deploy your own Perplexity search engine

Comprehensive Introduction: MindSearch is an open source AI search engine framework launched by Shanghai Artificial Intelligence Laboratory (SAL), which aims to simulate the human thought process for complex information gathering and integration. The tool combines large language model (LLM) technology and search engines through multi-agent...
8mos ago
02.8K
MoneyPrinterTurbo: generate video copy and HD short videos in one click from a video topic

Comprehensive Introduction: MoneyPrinterTurbo is an open source project that uses large AI models to generate HD short videos with one click. Users only need to provide a video topic or keywords, and the system automatically generates video copy, video clips, video subtitles and...
5mos ago
02.8K
OmniParse: extract any unstructured data from documents and multimedia and parse it into structured data

Comprehensive Introduction: OmniParse is a data parsing and optimization platform designed to convert any unstructured data into structured, actionable data optimized for GenAI (generative AI) frameworks. Whether you are working with documents, tables, images, videos, audio files or...
9mos ago
02.8K
JoyGen: an audio-driven, 3D depth-aware talking-portrait video editing tool

Comprehensive Introduction: JoyGen is a two-stage video generation framework for talking faces that focuses on audio-driven facial expression generation. Developed by a team from JD Technology, the project uses 3D reconstruction techniques and audio feature extraction methods to accurately capture the identity characteristics of the speaker and the expression...
7mos ago
02.8K
Bambo: a lightweight and flexible agent framework with simple role and tool configuration that handles a variety of workloads

Comprehensive Introduction: Bambo is a new type of agent framework that is lighter and more flexible than mainstream frameworks and can handle a variety of workloads. Bambo achieves efficient agent functionality by defining all tools in a tool catalog and using asynchronous custom functions. Users can use the llm_c...
8mos ago
02.8K
OmAgent: an agent framework for building multimodal smart devices

Comprehensive Introduction: OmAgent is a multimodal agent framework developed by Om AI Lab that aims to provide AI-powered features for smart devices. By integrating state-of-the-art multimodal foundation models and agent algorithms, the project enables developers to create efficient smart devices on a variety of...
7mos ago
02.7K
GPTme: an intelligent programming assistant that runs in the command-line terminal, a local alternative to ChatGPT Code Interpreter

Comprehensive Introduction: GPTme is a terminal AI assistant designed to improve developers' productivity. It combines AI capabilities with the terminal environment and supports functions such as code execution, file editing, web browsing and visual recognition. As a ChatGPT Code Interpreter...
8mos ago
02.7K
NodeRAG: a heterogeneous graph-based tool for accurate information retrieval and generation

Comprehensive Introduction: NodeRAG is an open source Retrieval Augmented Generation (RAG) system hosted on GitHub and developed by Terry-Xu-666. It optimizes information retrieval and generation through heterogeneous graph structures, significantly improving retrieval accuracy and contextual relevance. Nod...
4mos ago
02.7K
X-Dyna: animate a static portrait with poses from a reference video and make portrait photos dance

Comprehensive Introduction: X-Dyna is an open source project developed by ByteDance that generates dynamic portrait animations using zero-shot diffusion techniques. The project uses the facial expressions and body movements in a driving video to animate a single portrait image, generating realistic and context-aware motion. X-D...
7mos ago
02.7K
Agent Laboratory: an automated code and research report writing assistant for researchers

Comprehensive Introduction: Agent Laboratory is an end-to-end autonomous research workflow designed to help researchers realize their research ideas. The system consists of dedicated agents driven by large language models that support the entire research workflow, from conducting literature reviews and developing plans to executing...
4mos ago
02.7K
MatAnyone: an open source tool that extracts a specified target portrait from a video and generates a video of that portrait

General Introduction: MatAnyone is an open source project focused on video matting, developed and released on GitHub by a research team at S-Lab, Nanyang Technological University, Singapore. It provides users with stable and efficient video processing capabilities through consistent memory propagation, especially...
6mos ago
02.7K
CogView3: an open source cascaded diffusion text-to-image model from Zhipu Qingyan

Comprehensive Introduction: CogView3 is an advanced text-to-image generation system developed by Tsinghua University and the Zhipu team (Zhipu Qingyan). It is based on a cascaded diffusion model that generates high-resolution images in multiple stages. The key features of CogView3 include multi-stage generation, innovative architecture and efficient performance...
10mos ago
02.7K
ChainForge: an open source visual programming environment for testing and evaluating the effectiveness of large language model prompts

Comprehensive Introduction: ChainForge is an open source visual programming environment designed for testing and evaluating the effectiveness of Large Language Model (LLM) prompts. It provides a data flow prompt engineering environment through which users can quickly explore and analyze the effect of different prompts on the quality of LLM response...
8mos ago
02.7K
ExtractThinker: extract and classify documents into structured data to streamline document processing

Comprehensive Introduction: ExtractThinker is a flexible document intelligence tool that uses Large Language Models (LLMs) to extract and classify structured data from documents, providing a seamless ORM-like document processing workflow. It supports a variety of document loaders, including Tess...
7mos ago
02.7K
Agentic Security: an open source LLM vulnerability scanner that provides comprehensive fuzz testing and attack techniques

General Introduction: Agentic Security is an open source LLM (Large Language Model) vulnerability scanning tool designed to provide developers and security professionals with comprehensive fuzz testing and attack techniques. The tool supports customized rule sets or agent-based attacks and is able to integrate LLM AP...
6mos ago
02.7K
Knowledge Table: an open source tool for efficient extraction and exploration of structured data

Comprehensive Introduction: Knowledge Table is an open source project designed to simplify the process of extracting and exploring structured data from unstructured documents. Users can create structured knowledge representations such as tables and graphs through a natural language query interface. The tool supports customizing the extraction ...
10mos ago
02.7K
Flow (Laminar): a lightweight task engine for building agents that keeps task management simple and flexible

Comprehensive Introduction: Flow is a lightweight task engine designed for building AI agents, emphasizing simplicity and flexibility. Unlike traditional node- and edge-based workflows, Flow uses a dynamic task queuing system that supports parallel execution, dynamic scheduling, and intelligent dependency management. Its core concept is ...
8mos ago
02.7K
MedRAX: an agent for chest X-ray analysis using multimodal large models

Comprehensive Introduction: MedRAX is a state-of-the-art AI agent designed for chest X-ray (CXR) analysis. It integrates state-of-the-art CXR analysis tools and multimodal large language models to dynamically process complex medical queries without additional training. MedRAX, through its modular design...
5mos ago
02.7K
Harbor: a containerized toolset for one-click deployment of local LLM development environments and easy management of AI services

Comprehensive Introduction: Harbor is a containerized LLM toolset focused on simplifying the deployment and management of local AI development environments. With a clean command line interface (CLI) and a companion application, it lets developers launch and manage components with a single click, including LLM backends, API interfaces, front...
7mos ago
02.7K
VideoSeal: an advanced open source tool for embedding and extracting invisible video watermarks to protect video copyright

General Introduction: VideoSeal is an open source video watermarking tool developed by Facebook Research, designed to provide efficient video watermark embedding and extraction. The tool supports the latest open source models and contains pre-trained models, training code, inference code and evaluation tools...
8mos ago
02.7K