Intern-S1-mini - Lightweight scientific multimodal model open-sourced by Shanghai AI Lab
Intern-S1-mini is a lightweight scientific multimodal large model with 8B parameters, released by Shanghai Artificial Intelligence Laboratory (Shanghai AI Lab). It inherits the capabilities of Intern-S1, combining general and specialized scientific abilities, and is suited to rapid deployment and secondary development. In terms of performance, I...
Nano Banana - AI image editing model launched by Google
Nano Banana is the codename for Gemini 2.5 Flash Image, an AI image generation and editing model from Google that generates detailed, photorealistic images from simple text prompts and makes high-quality modifications to existing images.
Genie Envisioner - Open-source general-purpose robotics platform from AgiBot with Beihang University and others
Genie Envisioner (GE) is a unified platform for robotic manipulation developed by the AgiBot team in collaboration with the National University of Singapore, Beihang University (Beijing University of Aeronautics and Astronautics) and other organizations. It lets robots better understand and perform tasks by "imagining first, then acting".
DINOv3 - Next-Generation Self-Supervised Vision Foundation Model from Meta AI
DINOv3 is a next-generation self-supervised vision foundation model from Meta AI. It uses a self-supervised learning paradigm to learn image features without labeled data. It addresses the feature degradation problem by improving data preparation and introducing Gram anchoring, and improves the generalization...
Matrix-Game 2.0 - Interactive World Model developed by Kunlun Wanwei
Matrix-Game 2.0 is an interactive world model developed in-house and released by Kunlun Wanwei's Skywork AI. It is billed as the industry's first open-source, real-time, long-sequence interactive generation model for general-purpose scenarios. The model is able to run at 25 FPS through a visually-driven interaction scheme in multiple...
Baichuan-M2 - Open-Source Medical-Enhanced Large Model from Baichuan Intelligence
Baichuan-M2 is an open-source medical-enhanced large model released by Baichuan Intelligence. It performs well in the medical field, notably scoring 60.1 on the HealthBench evaluation, surpassing OpenAI's gpt-oss-120b and many other open-source models, becoming a global...
Qwen-Flash - A high-performance, low-cost language model from Tongyi Qianwen
Qwen-Flash is a high-performance, low-cost language model in Alibaba's Tongyi Qianwen (Qwen) series, designed for fast responses and efficient handling of simple tasks. Based on a Mixture-of-Experts (MoE) architecture, it is realized by sparse expert network...
SkyReels-A3 - Audio-Driven Digital Human Creation Tool from Kunlun Wanwei
SkyReels-A3 is an audio-driven digital human creation tool from Kunlun Wanwei. It generates high-quality dynamic video content from simple inputs (e.g., a portrait image and a voice clip), makes static photos "come alive", and can replace lines in existing videos with new lip-syncs that the characters will automatically...
MiniMax Speech 2.5 - Speech Generation Model from MiniMax
MiniMax Speech 2.5 is an advanced speech generation model developed by the MiniMax team. It makes significant progress in speech synthesis, especially in multilingual expressiveness, timbre reproduction accuracy, and language coverage. The model supports 40 languages...
GPT-5 - OpenAI's Strongest Language Model to Date, a Unified Intelligence System
GPT-5 is the latest language model released by OpenAI, with several upgrades. It is a unified intelligence system with a built-in real-time router that automatically switches between an efficient mode and a deep thinking mode according to the complexity of the problem, aiming for both fast responses and accurate answers. GPT-5 has several versions, including the one for general...