AI Sharing Circle

AI is changing the world!
UnifoLM-WMA-0 - 宇树科技开源的世界模型动作架构

UnifoLM-WMA-0 - Yu Shu Technology open source world model action architecture

UnifoLM-WMA-0 is an open source world model-action architecture across multiple classes of robot ontologies by Yu Shu Technology, designed for general robot learning. Composed of a world model and an action architecture, the world model understands the physical laws of robot-environment interaction, and the action architecture is responsible for specific...
8mos ago
049.7K
InfiniteTalk - 美团视觉AI开源的音频驱动视频生成工具

InfiniteTalk - Open Source Audio-Driven Video Generation Tool for Mission Vision AI

InfiniteTalk is an audio-driven video generation tool developed by the MeiGen-AI team that generates talking videos of unlimited length based on the input audio. The core advantage lies in the precise lip synchronization technology, which can perfectly match the audio with the character's mouth shape to generate natural and smooth...
8mos ago
059.2K
ROMA - 开源的元Agent框架,自动分解复杂任务并行处理

ROMA - Open Source Meta-Agent Framework for Automatic Decomposition of Complex Tasks for Parallel Processing

ROMA (Recursive-Open-Meta-Agent) is an open source meta-agent framework developed by Sentient AGI to efficiently solve complex problems through recursive task decomposition and parallel processing. Support for Python 3.12+, Docker and ...
8mos ago
046.8K
Lumina-DiMOO - 上海AI Lab联合华为昇腾开源的多模态大模型

Lumina-DiMOO - A Multimodal Large Model Open-Sourced by Shanghai AI Lab and Huawei Ascendant

Lumina-DiMOO is a new generation of unified model for multimodal generation and understanding launched by Shanghai Artificial Intelligence Laboratory (SAL) in conjunction with Huawei Rise at the World Artificial Intelligence Conference 2025. Based on the Rise AI basic hardware and software platform and the MindSpeed MM multimodal large model suite, it accomplishes...
8mos ago
041.9K
Hyprnote - 开源的本地优先AI会议笔记工具

Hyprnote - Open source, locally prioritized AI conference note-taking tool

Hyprnote is an open source, local-first AI meeting note-taking tool designed for professionals to protect user privacy and improve meeting efficiency. Adopting the "local first" principle, all data storage and processing is done on the user's local device to ensure data security and support offline operation.
8mos ago
041.5K
MobileLLM-R1 - Meta开源的专项高效推理模型系列

MobileLLM-R1 - Meta open source special efficient inference model series

MobileLLM-R1 is Meta's open source series of efficient inference models designed for mathematical, programming and scientific reasoning. It contains a base model and a final model, with 140 million, 360 million and 950 million parameter versions, respectively. The models are not generic chat models and are supervised fine-tuned (SFT...
8mos ago
034.3K
ERNIE-4.5-21B-A3B-Thinking - 百度开源的推理思考模型

ERNIE-4.5-21B-A3B-Thinking - Baidu open source reasoning thinking model

ERNIE-4.5-21B-A3B-Thinking is Baidu's open source large-scale language model focused on reasoning tasks. Using the Mixed Expert (MoE) architecture , the total number of references to 21 billion , each token activates 3 billion parameters to support 128K long context window ...
8mos ago
032.8K
MobiAgent - 上海交大开源的移动端智能体全栈构建框架

MobiAgent - Shanghai Jiaotong University open source mobile intelligent body full-stack building framework

MobiAgent is an open source mobile intelligent body toolchain from IPADS Lab of Shanghai Jiaotong University, which helps users to build their own mobile intelligent assistants. By recording the user's operation trajectory and generating high-quality data, it trains an intelligent body that can understand natural language commands. Core features include efficient...
8mos ago
039.6K
ZipVoice - 小米开源的语音合成系列模型

ZipVoice - Xiaomi's open source speech synthesis model series

ZipVoice is a series of speech synthesis (TTS) models based on the Flow Matching architecture released by Xiaomi, including ZipVoice (zero-sample single-speaker speech synthesis model) and ZipVoice-Dialog (zero-sample conversational speech synthesis...
8mos ago
048.9K
PP-OCRv5 - 百度开源的新一代文字识别AI模型

PP-OCRv5 - Baidu's open source AI model for next-generation text recognition

PP-OCRv5 is the latest generation of text recognition AI model released by Baidu. With a lightweight design and a reference count of only 0.07B, it is suitable for efficient operation on CPU and edge devices, and can process more than 370 characters per second. The model supports Simplified Chinese, Traditional Chinese, English, Japanese and Pinyin...
8mos ago
062.5K