MiniCPM-V 4.5 - Faceted Intelligent Open Source 8B Parameter Multimodal Modeling
MiniCPM-V 4.5 is an open source 8B parametric multimodal model of Facade Intelligence, built based on Qwen3-8B and SigLIP2-400M, with the ability to efficiently process images and videos. It has excellent performance in visual token consumption, processing ...
Aivilization - A Multi-Agent Social Simulation Platform Launched by HKUST
Aivilization is the world's first AI multi-intelligent body social simulation platform developed by the Hong Kong University of Science and Technology. It builds a visual digital sandbox where users can create and guide thousands of AI intelligences to observe the social evolution of future human-AI coexistence. The platform supports...
Grok 2.5 - Musk's xAI open source AI model
Grok 2.5 is an open source AI model from Elon Musk's xAI. With 269 billion parameters, it is based on the Mixed Expert (MoE) architecture for powerful performance and inference. The model has been tested at graduate level scientific knowledge (GPQA), generalized knowledge (MMLU, MM...
Draw A Fish - free online AI fish drawing site with shared virtual fish tanks
Draw A Fish is simple and fun online AI fish drawing site where users can draw fish patterns and place them in a globally shared virtual fish tank.Draw A Fish requires no registration and is easy to use, taking only seconds to create and share.
ToonComposer - Tencent open source generative AI animation tool
ToonComposer is a generative AI animation tool jointly launched by The Chinese University of Hong Kong, Tencent PCG ARC Lab and Peking University. Through generative post keyframe technology, the intermediate frame generation and coloring process is integrated into an automated process, requiring only a sketch and a...
Intern-S1-mini - Lightweight scientific multimodal model open source by Shanghai AI Lab
Intern-S1-mini is a lightweight scientific multimodal macromodel with parameter scale of 8B launched by Shanghai Artificial Intelligence Laboratory (SAL).It inherits the powerful capabilities of Intern-S1, combining both general and specialized scientific capabilities, and is suitable for rapid deployment and secondary development. In terms of performance, I...
Nano Banana - AI image editing model launched by Google
Nano Banana is the Gemini 2.5 Flash Image codename for Gemini, an AI image generation and editing model from Google that generates detailed, photorealistic images based on simple text prompts to make high-quality modifications to existing images.
Genie Envisioner - Jiyuan's open-source general-purpose robotics platform with Beihang and others
Genie Envisioner (GE) is a unified platform for robot operation developed by the Genie Robotics team in collaboration with the National University of Singapore, Beijing University of Aeronautics and Astronautics and other organizations. It allows robots to better understand and perform tasks by "imagining first, then acting".
DINOv3 - Next Generation Self-Supervised Vision Base Model from Meta AI
DINOv3 is a next-generation self-supervised vision base model from Meta AI, which adopts a self-supervised learning paradigm to learn image features without labeling data. It solves the feature degradation problem by improving data preparation and introducing Gram anchoring, and improves the generalization...
Matrix-Game 2.0 - Interactive World Model developed by KunlunWanwei
Matrix-Game 2.0 is a self-developed interactive world model released by Kunlun SkyWork AI. Matrix-Game 2.0 is the industry's first open-source, real-time, long-sequence interactive generation model for general-purpose scenarios. The model is able to run at 25 FPS through a visually-driven interaction scheme in multiple...