AI Sharing Circle

AI is changing the world!
Ouro - 字节跳动Seed团队开源的新型循环语言模型

Ouro - A new cyclic language model open-sourced by the ByteHopper Seed team

Ouro is a new type of Looped Language Models (LLMs) developed by the ByteDance Seed team, with the core innovation of directly building inference capabilities in the pre-training phase through a parameter-sharing recurrent computation structure. The model uses 24 layers as the base block through...
7mos ago
038.4K
ChronoEdit - 英伟达与多伦多大学联合开源的AI图像编辑框架

ChronoEdit - AI image editing framework jointly open-sourced by NVIDIA and the University of Toronto

ChronoEdit, an open-source AI image editing framework developed by NVIDIA in conjunction with the University of Toronto, redefines the image editing task as a video generation task to ensure that the editing results are temporally and physically consistent. By distilling a pre-trained video generation model with 14B parameters from a...
7mos ago
033.4K
LongCat-Flash-Omni - 美团开源的全模态大语言模型

LongCat-Flash-Omni - A Fully Modal Large Language Model for Meituan Open Source

LongCat-Flash-Omni is an open source fully modal big language model released by the LongCat team of Meituan. With a parameter scale of 560 billion (27 billion activated parameters), it realizes millisecond-level real-time audio and video interaction capabilities while maintaining a large number of parameters.
7mos ago
031.8K
Petri - Anthropic开源的 AI 安全审计框架

Petri - Anthropic's open source AI security auditing framework

Petri is an open source AI security auditing framework developed by Anthropic that systematically assesses the security and behavioral alignment of AI models. By simulating a real-world scenario where an automated auditor engages in multiple rounds of conversations with a target model, followed by a judge agent that acts on the model's...
7mos ago
027.9K
Kimi Linear - 月之暗面开源的新型混合线性注意力架构

Kimi Linear - A New Hybrid Linear Attention Architecture Open-Sourced by Dark Side of the Moon

Kimi Linear is a new hybrid linear attention architecture open-sourced by Dark Side of the Moon, with Kimi Delta Attention (KDA) as the core, optimizing the traditional attention model through a finer-grained gating mechanism, which significantly improves the hardware efficiency and memory control ability ...
7mos ago
041K
FIBO - 全球首个开源原生支持JSON的文本生成图像模型

FIBO - The world's first open-source native JSON-enabled text to image modeling

FIBO is the world's first open source text generation image model with native JSON support developed by Bria AI. Based on the DiT (Diffusion Transformer) architecture with 8B parameters, it adopts the Flow Matching training method...
7mos ago
032.8K
SoulX-Podcast - Soul AI Lab开源的对话式语音合成模型

SoulX-Podcast - Soul AI Lab's Open Source Conversational Speech Synthesis Model

SoulX-Podcast is Soul AI Lab's open source advanced multi-speaker conversational speech synthesis model designed for generating high quality podcast content. SoulX-Podcast has the ability to generate multiple rounds of conversations, which can simulate smooth conversations in real podcasting scenarios, and supports Mandarin, English, and multiple Chinese...
7mos ago
042.2K
GigaBrain-0 - 开源的具身基础模型,由世界模型生成数据驱动

GigaBrain-0 - Open source embodied base model driven by world model generation data

GigaBrain-0 is the first end-to-end Vision-Language-Action (VLA) embodied base model in China that uses world model generation data to realize real machine generalization, and it is jointly released as open source by GigaVision and Hubei Humanoid Robot Innovation Center. It adopts the hybrid Transformer architecture, integrating ...
7mos ago
030.1K
Ming-flash-omni-Preview - 蚂蚁集团开源的全模态大模型

Ming-flash-omni-Preview - Ant Group's open source fully modal large models

Ming-flash-omni-Preview is an open-source full-modal macromodel released by Ant Group inclusionAI, with a parameter scale of hundreds of billions, based on the sparse MoE architecture of Ling 2.0, with total parameters of 103B and activations of 9B. in full-modal understanding and generating...
7mos ago
034.3K
OmniVinci - NVIDIA开源的全模态大语言模型

OmniVinci - NVIDIA's Open Source Omnimodal Large Language Model

OmniVinci is an open-source, fully modal large-scale language model developed by NVIDIA that solves the problem of modal fragmentation in multimodal models through architectural innovation and data optimization. Alignment of visual and audio embeddings is enhanced by OmniAlignNet, which utilizes temporally embedded group capture...
7mos ago
033.9K