AI Sharing Circle

Daily sharing of the latest AI products, projects, frameworks, paper interpretations, etc.~
RynnEC - 阿里达摩院开源的世界理解模型

RynnEC - Ali Dharma Institute's open source world understanding model

RynnEC is a world understanding model introduced by Alibaba Dharma Institute, focusing on embodied intelligence tasks. The model is based on multimodal fusion technology, combining video data and natural language, and can parse objects in a scene from multiple dimensions, supporting functions such as object understanding, spatial perception and video target segmentation.
5mos ago
040.2K
Matrix-3D - 昆仑万维开源的3D世界生成框架

Matrix-3D - Kunlun World Wide open source 3D world generation framework

Matrix-3D is an open source framework from Skywork AI team, focusing on generating explorable panoramic 3D worlds. The framework combines panoramic video generation and 3D reconstruction techniques to generate high-quality, omni-directional explorable 3D worlds from a single image or text prompt...
5mos ago
037.6K
GLM-4.5V - 智谱推出的多模态开源视觉推理模型

GLM-4.5V - Multimodal Open Source Visual Reasoning Model by Smart Spectrum

GLM-4.5V is the world's leading open source visual inference model introduced by Smart Spectrum, with 106 billion total parameters and 12 billion activated parameters. The model is trained based on the new generation text base model GLM-4.5-Air, with powerful visual understanding and reasoning capabilities, capable of handling images, video...
5mos ago
041K
Genie 3 - 谷歌推出的通用世界模型

Genie 3 - A Universal World Model from Google

Genie 3 is a next-generation universal world model from Google DeepMind that enables the generation of highly dynamic and coherent virtual worlds in real time.Genie 3 simulates physical phenomena, natural ecosystems, and supports the creation of fantasy and historical scenarios. With text prompts, users can...
6mos ago
035.6K
Claude Opus 4.1 - Anthropic推出的最强编程模型

Claude Opus 4.1 - The Most Powerful Programming Model from Anthropic

Claude Opus 4.1 is a state-of-the-art large-scale language model from Anthropic, designed for efficient processing of complex tasks. The model excels in the programming domain, generating high-quality code, supporting up to 32k of single output, and adapting to a wide range of programming styles...
6mos ago
035.3K
gpt-oss - OpenAI推出的开源推理模型系列

gpt-oss - a family of open source inference models from OpenAI

gpt-oss is a family of open source inference models from OpenAI that enable efficient, flexible, and easy-to-deploy AI solutions for developers. gpt-oss consists of two versions, gpt-oss-120B with 117 billion parameters and support for 8...
6mos ago
035K
MiDashengLM - 小米开源的声音理解模型

MiDashengLM - Xiaomi's open source sound understanding model

MiDashengLM is Xiaomi's open source large model for efficient sound understanding, with specific parameter version MiDashengLM-7B , focusing on audio processing and understanding. The model is based on Xiaomi Dasheng audio encoder and Qwen2.5-Omn...
6mos ago
035.5K
MOSS-TTSD - 清华实验室开源的双语对话语音生成模型

MOSS-TTSD - Tsinghua Lab's open source speech generation model for bilingual dialogs

MOSS-TTSD is an open source spoken dialog speech generation model developed by the Speech and Language Laboratory of Tsinghua University. MOSS-TTSD can convert text dialog scripts into natural, smooth and expressive conversational speech, and supports bilingual generation in English and Chinese.
6mos ago
038.6K
AudioGen-Omni - 快手推出的多模态音频生成模型

AudioGen-Omni - Multimodal Audio Generation Model from Racer

AudioGen-Omni is a multimodal audio generation model from Racer that generates high-quality audio, speech, and songs based on inputs such as video, text, etc.AudioGen-Omni is based on advanced techniques such as multimodal diffusionTransformer and phase-aligned...
6mos ago
037.2K
RedOne - 小红书最新推出的社交大模型

RedOne - the latest social mega-model from Little Red Book

RedOne is a large language model customized for social networks introduced by Little Red Book. The model is trained through a three-stage training strategy that incorporates social and cultural knowledge, strengthens multitasking capabilities, and aligns human preferences.RedOne significantly outperforms the base model in social task performance, in harmful content detection and browsing...
6mos ago
036.3K