AI open source project

Total 1020 articles posts
TRV:将幻灯片/PPT和讲解备注快速生成演讲视频

TRV: Rapidly Generate Presentation Videos from Slides/PPTs and Explanatory Notes

General Introduction TRV is an open source tool, hosted on GitHub, designed to help users quickly convert slides and presentation notes into videos with narration. It automatically generates audio and video content from incoming presentation files through simple command line operations, suitable for those who need to quickly create presentations...
6mos ago
02.1K
LazyLLM:商汤开源构建多智能体应用的低代码开发工具

LazyLLM: Shangtang's open source low-code development tool for building multi-intelligence body applications

Comprehensive Introduction LazyLLM is an open source tool developed by the LazyAGI team, focusing on simplifying the development process of multi-intelligence large model applications. It helps developers quickly build complex AI applications through one-click deployment and lightweight gateway mechanisms, saving tedious engineering configuration...
6mos ago
02.5K
MagicArticulate:将静态3D模型生成骨骼结构动画资产

MagicArticulate: generating skeletal structure animation assets from static 3D models

Comprehensive Introduction MagicArticulate is an AI framework developed by ByteDance in collaboration with Nanyang Technological University, focusing on rapidly transforming static 3D models into animation-enabled digital assets. It does this through an advanced autoregressive Transformer and functional diffusion modeling, self...
6mos ago
02.3K
MakeSense:免费使用的图像标注工具,提升计算机视觉项目效率

MakeSense: a free-to-use image annotation tool to improve computer vision project efficiency

General Introduction Make Sense is a free online image annotation tool designed to help users quickly prepare datasets for computer vision projects. It requires no complicated installation, just open a browser access to use it, supports multiple operating systems, and is perfect for small deep learning projects. Users can...
6mos ago
02.9K
AutoAgent:通过自然语言快速创建并部署AI智能体的框架

AutoAgent: a framework for rapid creation and deployment of AI intelligences through natural language

General Introduction AutoAgent is an open source AI intelligences framework developed by the Data Intelligence Laboratory of the University of Hong Kong (HKUDS) and hosted on GitHub.It allows users to rapidly create and deploy customized AI intelligences by describing their requirements in purely natural language, without any programming base...
2mos ago
03K
Crawl4LLM:为LLM预训练提供的高效网页爬取工具

Crawl4LLM: An Efficient Web Crawling Tool for LLM Pretraining

Comprehensive Introduction Crawl4LLM is an open source project jointly developed by Tsinghua University and Carnegie Mellon University, focusing on optimizing the efficiency of web crawling for pre-training of large models (LLM). It significantly reduces ineffective crawling by intelligently selecting high-quality web page data, claiming to be able to originally need to crawl 1...
6mos ago
02.4K
dsRAG:用于处理非结构化数据和复杂查询的检索引擎

dsRAG: A Retrieval Engine for Unstructured Data and Complex Queries

Comprehensive Introduction dsRAG is a high-performance retrieval engine designed to handle complex queries on unstructured data. It performs particularly well in handling challenging queries in dense text such as financial reports, legal documents, and academic papers. dsRAG employs three key approaches to improve performance: language...
6mos ago
02.2K
中文基于满血 DeepSeek-R1 蒸馏数据集,支持中文R1蒸馏SFT数据集

Chinese based full-blooded DeepSeek-R1 distillation dataset, supports Chinese R1 distillation SFT dataset

Comprehensive Introduction The Chinese DeepSeek-R1 distillation dataset is an open source Chinese dataset containing 110K pieces of data designed to support machine learning and natural language processing research. The dataset is released by Cong Liu's NLP team. The dataset contains not only mathematical data, but also a large number of general types...
6mos ago
02.5K
HealthGPT:支持医学图像分析与诊断问答的医疗大模型

HealthGPT: A Medical Big Model to Support Medical Image Analysis and Diagnostic Q&A

Comprehensive Introduction HealthGPT is a state-of-the-art medical grand visual language model designed to enable unified medical visual understanding and generation capabilities through heterogeneous knowledge adaptation. The goal of the project is to integrate medical visual understanding and generation capabilities into a unified autoregressive framework that significantly improves the medical graph...
6mos ago
01.9K
MatAnyone: 提取视频指定目标人像的开源工具,生成目标人像视频

MatAnyone: Extract video to specify the target portrait of the open-source tool to generate the target portrait video

General Introduction MatAnyone is an open source project focusing on video keying, developed and released on GitHub by a research team at S-Lab, Nanyang Technological University, Singapore. It provides users with stable and efficient video processing capabilities through coherent memory propagation techniques, especially...
6mos ago
02.5K
Step-Audio:多模态语音交互框架,识别语音并使用克隆语音交流等功能

Step-Audio: a multimodal voice interaction framework that recognizes speech and communicates using cloned speech, among other features

Comprehensive Introduction Step-Audio is an open source intelligent speech interaction framework designed to provide out-of-the-box speech understanding and generation capabilities for production environments. The framework supports multi-language dialog (e.g., Chinese, English, Japanese), emotional speech (e.g., happy, sad), regional dialects (e.g., Cantonese, Szechuan ...
6mos ago
02.8K
FoloUp:开源AI语音面试平台,生成定制面试题并进行智能分析

FoloUp: Open Source AI Voice Interview Platform Generates Customized Interview Questions and Performs Intelligent Analysis

General Introduction FoloUp is an open source platform that specializes in AI-powered voice interview solutions for enterprises. With FoloUp, enterprises can quickly generate customized interview questions for job descriptions and conduct natural conversational interviews with AI. The platform also provides detailed interview analysis...
5mos ago
02.4K
Confident AI:自动化大语言模型评估框架,对比不同大模型提示词输出质量

Confident AI: A Framework for Automated Large Language Model Evaluation, Comparing the Output Quality of Different Large Model Cue Words

Comprehensive Introduction DeepEval is an easy-to-use open source LLM evaluation framework for evaluating and testing large language modeling systems. It is similar to Pytest, but focuses on unit testing of LLM output.DeepEval combines the latest research results through G-Eval, phantom...
6mos ago
02.8K
PraisonAI:低代码多智能体框架,简化复杂任务的自动化解决方案

PraisonAI: A Low-Code Multi-Intelligent Body Framework to Simplify Automation Solutions for Complex Tasks

Comprehensive Introduction PraisonAI is an out-of-the-box multi-intelligence body framework for production environments, designed to create AI intelligences to automate and solve problems ranging from simple tasks to complex challenges. The framework provides a low-code solution that simplifies the building of multi-intelligent body LLM systems and...
6mos ago
03.8K
HN中文播客:自动抓取热门科技文章,AI生成中文总结并转换为播客

HN Chinese Podcast: Automatically grab popular tech articles, AI-generated Chinese summaries and convert them to podcasts

General Introduction The Hacker News Chinese Podcast project is an innovative platform based on AI technology, aiming to automatically grab popular articles on Hacker News every day and generate Chinese summaries and podcast content through AI. The project is led by ccbikai ...
6mos ago
02K
LangGraph Supervisor:利用监督智能体来管理多智能体协作的工具

LangGraph Supervisor: a tool for managing multi-intelligence collaboration using supervising intelligences

Comprehensive Introduction LangGraph Supervisor is a Python library based on the LangGraph framework, designed for creating and managing multi-intelligent body systems. The library coordinates the work of multiple specialized agents through a central supervisory agent, ensuring that communication flows and tasks are divided...
6mos ago
02.5K
Deep Research:基于AI的深度研究助手,提供高效的研究工具和报告生成功能

Deep Research: an AI-based deep research assistant that provides efficient research tools and report generation capabilities

General Introduction Deep Research is an AI-based research assistant designed to perform iterative deep research by combining search engines, web crawling, and large language models. The project was released by dzhng on GitHub with the goal of providing an easy-to-use deep research genera...
4mos ago
02.2K