AI Sharing Circle

Day arching a pawn and sharing for the king!
Trackers:用于视频对象跟踪的开源工具库

Trackers: open source tool library for video object tracking

General Introduction Trackers is an open source Python tool library focused on multi-object tracking in video. It integrates several leading tracking algorithms, such as SORT and DeepSORT, and allows users to combine different object detection models (such as YOLO...
3mos ago
02K
Kimi-Audio:开源音频处理与对话基础模型

Kimi-Audio: Open Source Audio Processing and Dialogue Base Modeling

Comprehensive Introduction Kimi-Audio is an open source audio base model developed by Moonshot AI that focuses on audio understanding, generation and dialog. It supports a wide range of audio processing tasks such as speech recognition, audio Q&A and speech emotion recognition. The model has been tested over 130...
3mos ago
02.3K
Describe Anything:为图像和视频区域生成详细描述的开源工具

Describe Anything: Open source tool for generating detailed descriptions of images and video regions

General Introduction Describe Anything is an open source project developed by NVIDIA and several universities, with the Describe Anything Model (DAM) at its core. This tool can be based on the user in the image or video tagged...
3mos ago
02.3K
Cooragent:一句话构建多智能体任务协作工具

Cooragent: building a multi-intelligence task collaboration tool in one sentence

General Introduction Cooragent is an open source AI agent collaboration framework developed by LeapLab at Tsinghua University and hosted on GitHub.It allows users to create intelligent AI agents with a one-sentence description and supports multiple agents to collaborate on complex tasks. The framework provides two...
3mos ago
02.3K
InstantCharacter:从单张图片生成一致性角色的开源工具

InstantCharacter: An Open Source Tool for Generating Consistent Characters from a Single Image

General Introduction InstantCharacter is an open source project developed by Tencent Hunyuan and the InstantX team, hosted on GitHub. It generates consistent-looking character maps with a reference image and a text description...
3mos ago
02.6K
Claude生成深度研究报告的MCP服务

Claude's MCP service for generating in-depth research reports

Comprehensive Introduction MCP Server Deep Research is an open source tool that automatically generates structured research reports for complex problems through artificial intelligence and web search. Users enter a research question, and the tool breaks down the question, searches for authoritative information, assesses source credibility...
3mos ago
01.9K
Deep Recall:为大模型提供企业级记忆框架的开源工具

Deep Recall: an open source tool that provides an enterprise-class memory framework for large models

Comprehensive Introduction Deep Recall is an open source, enterprise-class memory framework designed for large-scale language models (LLMs). It provides hyper-personalized responsiveness through efficient contextual retrieval and integration. The framework uses a three-tier architecture, including a memory service, a reasoning service, and a coordinator, supporting...
3mos ago
02.1K
CleverBee:开源AI研究助手,生成引证研究报告

CleverBee: open source AI research assistant generates citation studies

General Introduction CleverBee is an open source AI research assistant hosted on GitHub and developed by SureScaleAI. It helps users by combining web browsing technology with large language models (such as Gemini and Claude)...
3mos ago
02K
FantasyTalking:生成真实感说话肖像的开源工具

FantasyTalking: an open-source tool for generating realistic speaking portraits

General Introduction FantasyTalking is an open source project developed by the Fantasy-AMAP team, focusing on generating realism talking portrait videos through audio drive. The project is based on the advanced video diffusion model Wan2.1 , combined with the audio encoder Wa...
3mos ago
02.6K
Paper2Code:将机器学习论文自动转化为可运行代码

Paper2Code: Automatically Converting Machine Learning Papers into Runnable Code

General Introduction Paper2Code is an open source project that aims to solve the problem of lack of code implementations for machine learning papers. It automatically transforms scientific papers into runnable code repositories through the multi-agent Large Language Modeling (LLM) system PaperCoder. The system uses planning ...
3mos ago
02K