AI open source project

Total 1020 articles posts
Sim Studio:开源的AI代理工作流构建工具

Sim Studio: open source workflow builder for AI agents

Comprehensive Introduction Sim Studio is an open source AI agent workflow building platform focused on helping users quickly design, test, and deploy large-scale language model (LLM) workflows through a lightweight, intuitive visual interface. Users can create complex workflows without deep programming by dragging and dropping...
3mos ago
01.2K
RealtimeVoiceChat:低延迟与AI进行自然口语对话

RealtimeVoiceChat: low-latency natural spoken conversation with AI

General Introduction RealtimeVoiceChat is an open source project focused on real-time, natural conversations with artificial intelligence via voice. Users use a microphone to input their voice, and the system captures the audio through a browser, quickly converts it to text, and a large-scale language model (LLM) generates back...
3mos ago
0798
Claude生成深度研究报告的MCP服务

Claude's MCP service for generating in-depth research reports

Comprehensive Introduction MCP Server Deep Research is an open source tool that automatically generates structured research reports for complex problems through artificial intelligence and web search. Users enter a research question, and the tool breaks down the question, searches for authoritative information, assesses source credibility...
3mos ago
0861
Deep Recall:为大模型提供企业级记忆框架的开源工具

Deep Recall: an open source tool that provides an enterprise-class memory framework for large models

Comprehensive Introduction Deep Recall is an open source, enterprise-class memory framework designed for large-scale language models (LLMs). It provides hyper-personalized responsiveness through efficient contextual retrieval and integration. The framework uses a three-tier architecture, including a memory service, a reasoning service, and a coordinator, supporting...
3mos ago
0981
Potpie AI:快速创建专属代码库的AI工程助手

Potpie AI: An AI engineering assistant for quickly creating proprietary code bases

Comprehensive Introduction Potpie AI is an open source platform focused on providing developers with customized AI engineering assistants. It allows AI agents to deeply understand code structure and logic and automate tasks such as debugging, testing, and code generation by building a knowledge graph of the code base. Users can use simple...
4mos ago
01.2K
Vexa:实时会议转录与智能知识提取工具

Vexa: a real-time meeting transcription and intelligent knowledge extraction tool

Comprehensive Introduction Vexa is an open source real-time meeting transcription and knowledge management platform designed to provide efficient meeting recording and intelligent knowledge extraction services for enterprises and individuals. It automatically joins platforms such as Google Meet, Zoom, etc. through API-driven meeting robots...
4mos ago
01K
LLManager:智能自动化流程审批与人类审核结合的管理工具

LLManager: a management tool that combines intelligent automated process approvals with human reviews

Comprehensive Introduction LLManager is an open source intelligent approval management tool, developed based on LangChain's LangGraph framework, focused on automating the processing of approval requests while optimizing decision making with human review. It does this through semantic search, sample less learning and...
4mos ago
01.1K
NodeRAG:基于异构图的精准信息检索与生成工具

NodeRAG: A Heterogeneous Graph-Based Tool for Accurate Information Retrieval and Generation

A Comprehensive Introduction NodeRAG is an open source Retrieval Augmented Generation (RAG) system hosted on GitHub and developed by Terry-Xu-666. It optimizes information retrieval and generation through heterogeneous graph structures, significantly improving retrieval accuracy and contextual relevance.Nod...
4mos ago
01.3K
FramePack:6G低显存快速生成长视频的开源项目

FramePack: 6G low graphics memory fast raw long video open source project

General Introduction FramePack is an open source video generation tool focused on making video diffusion techniques more practical. It decouples the generation workload from the video length by compressing the input frames to a fixed length through a unique next frame prediction neural network. This means that even when generating long videos, the video memory requirements...
3mos ago
0944
OmniSVG:从文本和图像生成SVG矢量图形的开源项目

OmniSVG: from text and images to generate SVG vector graphics open source project

General Introduction OmniSVG is an open source project focused on generating high-quality vector graphics (SVG) through a multimodal model. It utilizes pre-trained visual-linguistic models to support SVG generation from textual descriptions or image input, covering a wide range of scenarios from simple icons to complex anime characters. Item ...
4mos ago
01.4K
Orion:小米开源的端到端自动驾驶推理与规划框架

Orion: Xiaomi's Open Source End-to-End Autonomous Driving Reasoning and Planning Framework

Comprehensive Introduction Orion is an open source project developed by Xiaomi Labs, focusing on end-to-end (E2E) autonomous driving technology. It solves the problem of insufficient causal reasoning in complex scenarios of traditional autonomous driving approaches through visual language modeling (VLM) and generative planners.Orion integrates long...
4mos ago
0830
ReCamMaster:从单一视频生成多视角视频的渲染工具

ReCamMaster: Rendering Tool for Generating Multi-View Videos from a Single Video

General Introduction ReCamMaster is an open source video processing tool, the core function is to generate new camera views from a single video. Users can specify the camera track and re-render the video to get a dynamic picture with different angles. It is developed by a team of Zhejiang University and Racer Technology, based on text-to...
4mos ago
01K
A2A:谷歌发布AI智能间通信的开放协议

A2A: Google releases open protocol for communication between AI intelligences

General Introduction A2A (Agent2Agent) is an open source protocol developed by Google to allow AI intelligences developed by different frameworks or vendors to communicate and collaborate with each other. It provides a standardized set of methods for intelligences to discover each other's capabilities, share tasks, and complete work...
4mos ago
01.3K
自动解析PDF内容并提取文字与表格的开源服务

Automatically parse PDF content and extract text and tables of open source services

Comprehensive Introduction It can automatically analyze the layout of PDF documents, identify text, titles, images, tables, formulas and other elements in the page, and determine their correct order. The tool supports OCR functionality and can convert scanned PDF to searchable text. It runs on Docker and provides two models...
4mos ago
0968
WeClone:用微信聊天记录和语音训练数字分身

WeClone: training digital doppelgangers with WeChat chats and voices

Comprehensive introduction WeClone is an open source project that uses WeChat chat logs and voice messages, combined with large language models and speech synthesis technology, to allow users to create personalized digital doppelgangers. The project can analyze the user's chat habits to train the model , but also a small number of voice samples to generate realistic sound...
4mos ago
01.2K
KrillinAI:一键翻译和配音的视频多语言全球化工具

KrillinAI: Multilingual Globalization Tool for Video with One-Click Translation and Dubbing

Comprehensive Introduction KrillinAI is an open-source video processing tool focused on using artificial intelligence to help users translate videos and automatically dub them. It can start from the video download, all the way to generating the finished product adapted to different platforms, the whole process is just a few clicks. The developers are available on GitHub...
2mos ago
01.4K
AnimeGamer:用语言指令生成动漫视频和角色互动的开源工具

AnimeGamer: An Open Source Tool for Generating Anime Videos and Character Interactions with Language Commands

AnimeGamer is an open source tool launched by Tencent ARC Lab. Users can generate anime videos with simple language commands, such as "Sousuke drive around in a purple car", as well as allow different anime characters to interact with each other, such as Kiki from The Witch's House, and Sky City...
4mos ago
01.2K