AI Sharing Circle

AI is changing the world!
dots.vlm1 - Multimodal large model open-sourced by Xiaohongshu hi lab

dots.vlm1 is the first multimodal large model open-sourced by Xiaohongshu (Little Red Book) hi lab. Built on NaViT, a 1.2-billion-parameter vision encoder trained from scratch, and the DeepSeek V3 large language model (LLM), it has powerful visual perception and text reasoning...
2mos ago
LangExtract - Google's open-source Python library for extracting structured information

LangExtract is an open-source Python library from Google that uses large language models (LLMs) to extract structured information from unstructured text. With user-defined instructions and a handful of examples, it can efficiently identify and organize key details, such as clinical notes from...
2mos ago
Qwen-Image - Open-source text-to-image foundation model from Tongyi Qianwen

Qwen-Image is an open-source image generation foundation model released by Alibaba's Tongyi Qianwen team. With 20 billion parameters, it adopts the Multimodal Diffusion Transformer (MMDiT) architecture, which integrates three modules: multimodal understanding, high-resolution encoding, and diffusion modeling. Qwen-Image's...
2mos ago
Gemini 2.5 Deep Think - AI reasoning model from Google

Gemini 2.5 Deep Think is an AI reasoning model from Google. It is a variant of the model that won the gold medal at the International Mathematical Olympiad (IMO) 2025, and is designed to solve complex tasks through Parallel ...
2mos ago
MindLink - Open-source reasoning large model from Kunlun Wanwei

MindLink is an open-source reasoning large model launched by Kunlun Wanwei (Kunlun Tech). Its adaptive reasoning mechanism switches reasoning modes according to task complexity: simple tasks are answered quickly, while complex tasks receive in-depth reasoning, balancing efficiency and accuracy. Its plan-driven reasoning paradigm removes the "think" tag, reducing ...
2mos ago
MirageLSD - Decart AI launches the first real-time AI video generation model

MirageLSD is the world's first real-time streaming diffusion AI video model from the Decart AI team, enabling unlimited real-time video generation with latency as low as 40 milliseconds and smooth output at 24 frames/second.
3mos ago
k2 - The latest MoE-architecture foundation model from Moonshot AI's Kimi

k2 is an MoE-architecture foundation model from Moonshot AI with strong coding and agent capabilities, with 1T total parameters and 32B activated parameters. In benchmark tests across the main categories of general knowledge reasoning, programming, mathematics, and agent tasks, the k2 model...
3mos ago
Grok 4 - The latest large model from Musk's xAI

Grok 4 is the latest large AI model from xAI, delivering a 10x improvement in reasoning power over its predecessor. Its reasoning ability enables it to score near-perfect on difficult exams such as the SAT and GRE, and to outperform other cutting-edge models in a number of benchmark tests...
3mos ago
GenFlow Super Buddy (GenFlow超能搭子) - General-purpose AI Agent from Baidu Wenku

GenFlow Super Buddy is a general-purpose AI Agent launched by Baidu Wenku. Given a natural-language instruction from the user, it autonomously breaks down the task, draws on Baidu Wenku's library of 1.4 billion documents and online resources, and quickly generates PPTs, reports, charts, posters, and other content across all modalities.
3mos ago
Step-Audio-AQAA - End-to-end large audio language model from StepFun

Step-Audio-AQAA is an end-to-end large audio language model for Audio Query-Audio Answer (AQAA) tasks from the StepFun team. It processes audio input directly and generates natural, accurate speech responses without relying on traditional automatic speech recognition (A...
3mos ago