AI Sharing Circle

AI is changing the world!
Step-GUI - 阶跃星辰开源的AI Agent系列模型

Step-GUI - Step-Star Open Source AI Agent Series Models

Step-GUI is Step-Star's open source AI Agent series of models, including the cloud model Step-GUI, the first MCP protocol for GUI Agents, and the industry's first open source end-side model Step-GUI Edge to support cell phone deployment.Specialized...
5mos ago
040.9K
A2UI - 谷歌开源的Agent驱动型用户交互界面声明式协议

A2UI - Google's open source declarative protocol for Agent-driven user interaction interfaces

A2UI (Agent-to-User Interface) is Google's open-source Agent-driven interface protocol that solves the problem of generating complex interactive interfaces for AI agents. Through a declarative JSON format that allows AI agents to describe the structure of the user interface , client applications ...
5mos ago
046.9K
SAM Audio - Meta推出的开源多模态音频分割模型

SAM Audio - Open Source Multimodal Audio Segmentation Model from Meta

SAM Audio is an open source multimodal audio segmentation model introduced by Meta to accurately separate arbitrary target sounds from complex audio mixes. By combining textual, visual, and temporal dimensional cues, it enables flexible and efficient audio processing for tasks such as audio editing, denoising, sound extraction, and...
5mos ago
035.8K
混元世界模型1.5 - 腾讯混元开源的实时世界模型生成框架

Mixed World Model 1.5 - Tencent Mixed Open Source Real-time World Model Generation Framework

Mixed World Model 1.5 (Tencent HY WorldPlay) is the industry's first open source real-time world modeling framework released by Tencent, covering the entire chain of data, training, and streaming inference deployment. The core is the WorldPlay autoregressive diffusion model, which uses Next-F...
5mos ago
036.7K
Molmo 2 - Ai2开源的多模态视频图像理解模型系列

Molmo 2 - Ai2 open source multimodal video image understanding model series

Molmo 2 is an open source multimodal model released by the Allen Institute for AI (Ai2) to improve video and multi-image understanding. Three variants are included; Molmo 2 (8B), Molmo 2 (4B) and Molmo 2-O...
5mos ago
041.2K
LongCat-Video-Avatar - 美团开源的虚拟人视频生成模型

LongCat-Video-Avatar - MeiTuan open source avatar video generation model

LongCat-Video-Avatar is an advanced audio-driven video generation model built on LongCat-Video open-sourced by Meituan, focusing on generating hyper-realistic, lip-synchronized long videos with natural dynamics and consistent identity.
5mos ago
042.4K
MiMo-V2-Flash - 小米发布的开源MoE架构大模型

MiMo-V2-Flash - a large model of the open source MoE architecture released by Xiaomi

MiMo-V2-Flash is an open source MoE architecture large model released by Xiaomi, with 309 billion total parameters and 15 billion active parameters, focusing on efficient reasoning and intelligent body applications. The model adopts hybrid attention architecture and multi-word meta-prediction technology, with an inference speed of 150 tokens/second, into...
5mos ago
037.7K
Nemotron 3 - 英伟达发布的开源 AI 模型系列

Nemotron 3 - A family of open source AI models released by NVIDIA

Nemotron 3 is a family of open source AI models released by NVIDIA in Nano, Super and Ultra sizes. It adopts the hybrid potential expert hybrid (latent MoE) architecture to significantly improve inference efficiency and reduce operating costs. Among them...
5mos ago
035.5K
Wan-Move - 阿里通义联合清华等开源的AI视频生成框架

Wan-Move - Ali Tongyi's open source AI video generation framework with Tsinghua and others

Wan-Move is an open source AI video generation framework jointly developed by Ali Tongyi Labs, Tsinghua University and other organizations, focusing on high-quality video synthesis through precise motion control technology. The core technology is "potential trajectory guidance", which can seamlessly add point-level motion control to the existing image-to-video model...
5mos ago
035.5K
PaCoRe - 阶跃星辰开源的并行协同AI推理框架

PaCoRe - Step Star's open source parallel collaborative AI reasoning framework

PaCoRe (Parallel Coordinated Reasoning) is StepFun's open source innovative parallel collaborative reasoning framework, through a massively parallel thinking mechanism, from multiple perspectives to simultaneously explore the problem solution, breaking through the traditional...
5mos ago
038.6K