Latest AI Resources

Total 3048 articles

Gemini 2.5 Flash Image - The Most Powerful Image Generation and Editing Model from Google

Gemini 2.5 Flash Image (codename nano banana) is a state-of-the-art image generation and editing model from Google that maintains the consistency of characters across different scenes and supports precise image editing through natural language, such as blurring backgrounds and removing stains.

Latest AI Resources

8mos ago

043.9K

Wan2.2-S2V - Open-Source Audio-Driven Video Generation Model from Alibaba Tongyi

Wan2.2-S2V is an open-source multimodal video generation model from Alibaba Tongyi. Given only a static image and an audio clip, it generates high-quality digital human video, and it supports a variety of image types and frame sizes.

Latest AI Resources

8mos ago

045.1K

Free Course on ChatGPT Prompt Engineering for Developers by Andrew Ng

ChatGPT Prompt Engineering for Developers is a joint DeepLearning.AI and OpenAI course designed for developers, in which Isa Fulford and Andrew Ng teach how to use Large Language Models (LLMs...

Latest AI Resources Course materials

8mos ago

047K

Wenxiaobai o4 - A parallel thinking model from Wenxiaobai that opens 8 thinking paths at the same time

Wenxiaobai o4 is a parallel thinking model that opens 8 thinking paths at the same time, analyzes the problem from multiple perspectives, and automatically selects the best solution. The model incorporates Long-CoT reinforcement learning and process reward learning techniques, has strong deep reasoning capabilities, and performs well in complex tasks.

Latest AI Resources

8mos ago

037.7K

VibeVoice - Text-to-Speech Model from Microsoft

VibeVoice is a new text-to-speech (TTS) model from Microsoft. The model generates conversational audio from up to four different speakers and supports up to 90 minutes of continuous voice output, breaking the length limitations of traditional TTS systems.

Latest AI Resources

8mos ago

065.4K

SpatialGen - Open Source 3D Scene Generation Model by Manycore Tech

SpatialGen is an open-source 3D scene generation model from Manycore Tech. Based on a diffusion model architecture, it generates spatio-temporally consistent multi-view images from text descriptions, reference images, and 3D spatial layouts, and can further produce 3D Gaussian scenes and render walkthrough videos.

Latest AI Resources

8mos ago

043.6K

EchoMimicV3 - Open-Source Multimodal Digital Human Animation Generation Model from Ant Group

EchoMimicV3 is a multimodal digital human video generation model introduced by Ant Group, with 1.3 billion parameters, capable of handling multiple inputs such as audio, text, images, etc. to generate high-quality digital human animations.

Latest AI Resources

8mos ago

043.1K

Fun-ASR - A New-Generation Speech Recognition Model Jointly Launched by DingTalk and Tongyi

Fun-ASR is a large speech recognition model jointly launched by DingTalk and Tongyi Lab. Trained on massive audio data, it can accurately recognize terminology from many industries, such as internet, technology, and home decoration, significantly improving recognition accuracy. The model uses DingTalk enterprise information for inference optimization to reduce the hallucination problem...

Latest AI Resources

8mos ago

066.1K

Squibler - AI-assisted novel writing platform that supports the entire process from idea to creation

Squibler is a powerful AI-assisted writing platform designed for writers that helps users with the entire process from conception to creation to publication. The platform provides a variety of story templates covering novels, screenplays, short stories, etc. Users only need to enter the initial concept, and the AI can generate outlines, characters, scenes...

Latest AI Resources

8mos ago

046.1K

91Writing - Open Source AI Novel Creation Platform

91Writing is a fully open-source AI novel creation tool built with Vue 3 and Element Plus that integrates a variety of AI models, such as GPT, Claude, and Gemini. The tool provides creators with a complete tool chain from idea to text, including project creation...

Latest AI Resources

8mos ago

047K

Aivilization - A Multi-Agent Social Simulation Platform Launched by HKUST

Aivilization is a multi-agent AI social simulation platform developed by the Hong Kong University of Science and Technology and described as the world's first of its kind. It builds a visual digital sandbox where users can create and guide thousands of AI agents to observe the social evolution of future human-AI coexistence. The platform supports...

Latest AI Resources

8mos ago

082.3K

Grok 2.5 - Open-source AI model from Elon Musk's xAI

Grok 2.5 is an open-source AI model from Elon Musk's xAI. With 269 billion parameters, it is based on the Mixture-of-Experts (MoE) architecture for strong performance and inference. The model has been tested at graduate level scientific knowledge (GPQA), generalized knowledge (MMLU, MM...

Latest AI Resources

8mos ago

047.7K

Draw A Fish - Free online AI fish drawing site with a shared virtual fish tank

Draw A Fish is a simple and fun online AI fish drawing site where users can draw fish and place them in a globally shared virtual fish tank. Draw A Fish requires no registration and is easy to use, taking only seconds to create and share.

Latest AI Resources

8mos ago

067.2K

MIT's new report, "The Generative AI Divide: the State of Business AI in 2025"

MIT's latest report, The Generative AI Divide: the State of Business AI in 2025, examines how companies are adopting generative AI (GenAI), drawing on in-depth research of more than 300 AI projects, interviews with 52 organizations, and a survey of 153 executives...

Latest AI Resources Course materials

8mos ago

080.8K

AutoClip - Open source AI video clipping tool that generates thematic video collections with one click

AutoClip is an open-source AI video editing tool that automates the whole video processing workflow. It can automatically identify a video's highlights, accurately extract valuable content, and cluster clips by topic similarity to generate collections. AutoClip support...

Latest AI Resources

8mos ago

063.6K

ToonComposer - Open-source generative AI animation tool from Tencent

ToonComposer is a generative AI animation tool jointly launched by The Chinese University of Hong Kong, Tencent PCG ARC Lab and Peking University. Through generative post-keyframing, it integrates in-between frame generation and coloring into a single automated process, requiring only a sketch and a...

Latest AI Resources

8mos ago

053.8K

Seed-OSS - A new AI model open-sourced by the ByteDance team

Seed-OSS is a family of large language models open-sourced by the ByteDance Seed team, focusing on long-text and reasoning tasks. The model performs well in complex logical reasoning and multi-step reasoning, with high accuracy and efficient problem solving. Seed-OSS supports long text contexts up to 512K...

Latest AI Resources

8mos ago

050.5K

Intern-S1-mini - Lightweight scientific multimodal model open-sourced by Shanghai AI Lab

Intern-S1-mini is a lightweight scientific multimodal large model with 8B parameters launched by Shanghai Artificial Intelligence Laboratory (Shanghai AI Lab). It inherits the capabilities of Intern-S1, combining both general and specialized scientific capabilities, and is suitable for rapid deployment and secondary development. In terms of performance, I...

Latest AI Resources

8mos ago

044.2K

Nano Banana - AI image editing model launched by Google

Nano Banana is the codename for Gemini 2.5 Flash Image, an AI image generation and editing model from Google that generates detailed, photorealistic images from simple text prompts and makes high-quality modifications to existing images.

Latest AI Resources

8mos ago

070.1K

Klear-Reasoner - The New Reasoning Model Introduced by Kuaishou

Klear-Reasoner is a high-performance reasoning model from Kuaishou, based on Qwen3-8B-Base. The model is trained with long chain-of-thought supervised fine-tuning and reinforcement learning to perform well in mathematical and code reasoning. Klear-Reasoner...

Latest AI Resources

8mos ago

040.5K

CombatVLA - Efficient VLA Model by Taotian Group

CombatVLA is a model built specifically for 3D action role-playing games (ARPGs) by the Future Life Lab team of Taotian Group. CombatVLA is a vision-language-action (VLA) model, built at a 3B-parameter scale, that collects human player's through a motion tracker...

Latest AI Resources

8mos ago

045K

DeepSeek V3.1 - Latest Open Source AI Model from DeepSeek

DeepSeek V3.1 is a new generation of AI models introduced by DeepSeek, with important upgrades based on its predecessor, V3. DeepSeek V3.1 introduces a hybrid reasoning architecture that allows the model to flexibly switch between thinking and non-thinking modes, significantly improving the thinking...

Latest AI Resources

8mos ago

047.8K

Qwen-Image-Edit - Open-source image editing model from Alibaba Tongyi

Qwen-Image-Edit is an all-purpose image editing model introduced by Alibaba Tongyi, built on the Qwen-Image architecture with 20 billion parameters. The model combines both semantic and appearance editing capabilities, and can perform low-level visual appearance editing on images (e.g., adding, deleting...

Latest AI Resources

8mos ago

044.8K

MoE-TTS - The Latest Speech Generation Framework from Kunlun Wanwei

MoE-TTS is a speech synthesis framework introduced by Kunlun Wanwei, based on the Mixture-of-Experts (MoE) architecture, which combines pre-trained Large Language Models (LLMs) with speech expert modules. MoE-TTS retains the strong textual reasoning by freezing the textual module parameters and updating only the speech module parameters...

Latest AI Resources

8mos ago

044.4K

Genie Envisioner - Open-source general-purpose robot manipulation platform from AgiBot, Beihang University, and others

Genie Envisioner (GE) is a unified platform for robot manipulation developed by the AgiBot team in collaboration with the National University of Singapore, Beihang University and other organizations. It allows robots to better understand and perform tasks by "imagining first, then acting".

Latest AI Resources

8mos ago

044.8K

DINOv3 - Next Generation Self-Supervised Vision Foundation Model from Meta AI

DINOv3 is a next-generation self-supervised vision foundation model from Meta AI, which adopts a self-supervised learning paradigm to learn image features without labeled data. It solves the feature degradation problem by improving data preparation and introducing Gram anchoring, and improves the generalization...

Latest AI Resources

8mos ago

055.4K

Mureka V7.5 - Advanced AI Music Creation Model from Kunlun Wanwei

Mureka V7.5 is a state-of-the-art AI music generation model from Kunlun Wanwei, focusing on Chinese songwriting. The model can accurately reproduce timbres and playing techniques to generate natural, smooth and emotional vocals. Based on optimized automatic speech recognition (ASR) technology, Mureka V...

Latest AI Resources

8mos ago

044.6K

Skywork Deep Research Agent v2 - An Upgraded Deep Research Agent from Kunlun Wanwei

Skywork Deep Research Agent v2 is a deep research agent launched by Kunlun Wanwei, focusing on the integration and analysis of multimodal information. Skywork Deep Research Agent v2 can process text, graph...

Latest AI Resources

8mos ago

043.8K

Hunyuan-GameCraft - Tencent Hunyuan's open source framework for generating interactive video for next-generation games

Hunyuan-GameCraft is an open-source interactive game video generation framework from the Tencent Hunyuan team. The framework generates highly dynamic game video from a single image and a prompt, and lets users control the video content in real time with keyboard and mouse.

Latest AI Resources

8mos ago

047.9K

Skywork UniPic 2.0 - Efficient Open Source Multimodal Model from Kunlun Wanwei

Skywork UniPic 2.0 is an efficient multimodal model open-sourced by Kunlun Wanwei, focusing on image generation, editing and understanding. The model is based on a 2B-parameter SD3.5-Medium architecture, which is realized through pre-training, progressive dual-task reinforcement strategies and co-training...

Latest AI Resources

8mos ago

045.2K

RynnRCP - First Open Source Robot Context Protocol from Alibaba DAMO Academy

RynnRCP is an open-source Robot Context Protocol (RCP) from Alibaba DAMO Academy that lowers the barrier to developing embodied intelligence and connects the entire development process. RynnRCP consists of the RCP framework and the RobotMotion module. The RCP framework, through capability abstraction and multi-protocol support, will...

Latest AI Resources

8mos ago

050.3K

RynnEC - Open-source world understanding model from Alibaba DAMO Academy

RynnEC is a world understanding model introduced by Alibaba DAMO Academy, focusing on embodied intelligence tasks. The model is based on multimodal fusion technology, combining video data and natural language, and can parse objects in a scene along multiple dimensions, supporting functions such as object understanding, spatial perception and video object segmentation.

Latest AI Resources

8mos ago

051K

Matrix-3D - Open-source 3D world generation framework from Kunlun Wanwei

Matrix-3D is an open-source framework from the Skywork AI team, focusing on generating explorable panoramic 3D worlds. The framework combines panoramic video generation and 3D reconstruction techniques to generate high-quality, omni-directional explorable 3D worlds from a single image or text prompt...

Latest AI Resources

8mos ago

051.7K

GLM-4.5V - Multimodal Open Source Visual Reasoning Model by Zhipu AI

GLM-4.5V is an open-source visual reasoning model from Zhipu AI, described as world-leading, with 106 billion total parameters and 12 billion active parameters. The model is trained on the new-generation text base model GLM-4.5-Air and has strong visual understanding and reasoning capabilities, capable of handling images, video...

Latest AI Resources

8mos ago

050.7K

Matrix-Game 2.0 - Open-source interactive world model developed by Kunlun Wanwei

Matrix-Game 2.0 is a self-developed interactive world model released by Kunlun Wanwei's Skywork AI. It is described as the industry's first open-source, real-time, long-sequence interactive generation model for general-purpose scenarios. The model is able to run at 25 FPS through a visually-driven interaction scheme in multiple...

Latest AI Resources

8mos ago

050.5K

Baichuan-M2 - Open Source Medical-Enhanced Large Model from Baichuan Intelligence

Baichuan-M2 is an open-source medical-enhanced large model launched by Baichuan Intelligence. It performs well in the medical field, scoring 60.1 on the HealthBench evaluation and surpassing OpenAI's gpt-oss-120b and many other open source models, becoming a global...

Latest AI Resources

8mos ago

050.9K

Qwen-Flash - A high-performance, low-cost language model from Tongyi Qianwen

Qwen-Flash is a high-performance, low-cost language model in Alibaba's Tongyi Qianwen series, designed for fast response and efficient processing of simple tasks. Based on the Mixture-of-Experts (MoE) architecture, it is realized by sparse expert network...

Latest AI Resources

8mos ago

046.7K

SkyReels-A3 - Audio-Driven Digital Human Creation Tool from Kunlun Wanwei

SkyReels-A3 is an audio-driven digital human creation tool from Kunlun Wanwei. It can generate high-quality dynamic video content from simple inputs (e.g., portrait images and voice), make static photos "come alive", and replace lines in existing videos with new lip-syncs that the characters will automatically...

Latest AI Resources

8mos ago

042.1K

MiniMax Speech 2.5 - Speech Generation Model from MiniMax

MiniMax Speech 2.5 is an advanced speech generation model developed by MiniMax team. It has made significant progress in the field of speech synthesis, especially in multilingual expressiveness, timbre reproduction accuracy and language coverage. The model supports 40 languages...

Latest AI Resources

8mos ago

049.3K

GPT-5 - The Strongest Language Model Introduced by OpenAI, a Unified Intelligence System

GPT-5 is the latest language model released by OpenAI, with several upgrades. It is a unified intelligence system with a built-in real-time router that automatically switches between efficient and deep thinking modes according to the complexity of the problem, balancing fast responses and accurate answers. GPT-5 has several versions, including the one for general...

Latest AI Resources

8mos ago

047.2K

dots.vlm1 - Open-source multimodal large model from Xiaohongshu hi lab

dots.vlm1 is the first multimodal large model open-sourced by Xiaohongshu hi lab. Based on NaViT, a 1.2 billion parameter visual encoder trained from scratch, and the DeepSeek V3 Large Language Model (LLM), it has strong visual perception and text reasoning...

Latest AI Resources

8mos ago

046.4K

Genie 3 - A General-Purpose World Model from Google

Genie 3 is a next-generation general-purpose world model from Google DeepMind that generates highly dynamic and coherent virtual worlds in real time. Genie 3 simulates physical phenomena and natural ecosystems, and supports the creation of fantasy and historical scenarios. With text prompts, users can...

Latest AI Resources

8mos ago

045.2K

Claude Opus 4.1 - The Most Powerful Programming Model from Anthropic

Claude Opus 4.1 is a state-of-the-art large language model from Anthropic, designed for efficient processing of complex tasks. The model excels at programming, generating high-quality code, supporting a single output of up to 32K tokens, and adapting to a wide range of programming styles...

Latest AI Resources

8mos ago

045.1K

gpt-oss - a family of open source reasoning models from OpenAI

gpt-oss is a family of open source reasoning models from OpenAI that enable efficient, flexible, and easy-to-deploy AI solutions for developers. gpt-oss consists of two versions, gpt-oss-120b with 117 billion parameters and support for 8...

Latest AI Resources

8mos ago

043K

MiDashengLM - Xiaomi's open source sound understanding model

MiDashengLM is Xiaomi's open-source large model for efficient sound understanding, released as MiDashengLM-7B and focusing on audio processing and understanding. The model is based on the Xiaomi Dasheng audio encoder and Qwen2.5-Omn...

Latest AI Resources

8mos ago

045K

MOSS-TTSD - Tsinghua Lab's open source speech generation model for bilingual dialogs

MOSS-TTSD is an open source spoken dialog speech generation model developed by the Speech and Language Laboratory of Tsinghua University. MOSS-TTSD can convert text dialog scripts into natural, smooth and expressive conversational speech, and supports bilingual generation in English and Chinese.

Latest AI Resources

8mos ago

047.8K

AudioGen-Omni - Multimodal Audio Generation Model from Kuaishou

AudioGen-Omni is a multimodal audio generation model from Kuaishou that generates high-quality audio, speech, and songs based on inputs such as video, text, etc. AudioGen-Omni is based on advanced techniques such as a multimodal diffusion Transformer and phase-aligned...

Latest AI Resources

8mos ago

047.6K

LangExtract - Google's open source Python library to extract structured information

LangExtract is an open-source Python library from Google that uses large language models (LLMs) to extract structured information from unstructured text. With user-defined instructions and a handful of examples, it can efficiently identify and organize key details, such as clinical notes from...

Latest AI Resources

8mos ago

052.5K

Qwen-Image - Open-Source Text-to-Image Foundation Model from Tongyi Qianwen

Qwen-Image is an open-source image generation foundation model released by Alibaba's Tongyi Qianwen team. With 20 billion parameters, it adopts the Multimodal Diffusion Transformer (MMDiT) architecture, which integrates three modules: multimodal understanding, high-resolution encoding and diffusion modeling. Qwen-Image's...

Latest AI Resources

8mos ago

047.2K

RedOne - the latest social large model from Xiaohongshu

RedOne is a large language model customized for social networks, introduced by Xiaohongshu. The model is trained through a three-stage strategy that incorporates social and cultural knowledge, strengthens multitasking capabilities, and aligns with human preferences. RedOne significantly outperforms the base model in social task performance, in harmful content detection and browsing...

Latest AI Resources

8mos ago

044.7K

FastDeploy - Baidu's high-performance large model inference and deployment tool

FastDeploy is a high-performance inference and deployment tool from Baidu, designed for Large Language Models (LLMs) and Visual Language Models (VLMs). FastDeploy is built on the PaddlePaddle framework, and supports a variety of hardware platforms...

Latest AI Resources

8mos ago

045.7K

InteriorGS - 3D Gaussian Semantic Dataset launched by Manycore Tech

InteriorGS is a high-quality 3D Gaussian semantic dataset introduced by Manycore Tech. The dataset contains 1,000 3D scenes covering more than 80 indoor environments such as homes, convenience stores, wedding halls and museums. The dataset has more than 554,000 object instances in 755 categories...

Latest AI Resources

8mos ago

045.1K

DragonV2.1 - Zero-Shot Speech Synthesis Model from Microsoft

DragonV2.1 is an advanced zero-shot text-to-speech (TTS) model from Microsoft. Based on the Transformer architecture, the model supports multiple languages and zero-shot voice cloning, and generates natural, expressive speech with only 5-90 seconds of voice prompts.

Latest AI Resources

8mos ago

043.1K

ScreenCoder - Open Source Tool for Generating Front-End Code from UI Screenshots

ScreenCoder is an open-source intelligent tool that quickly converts UI design screenshots into high-quality HTML/CSS code. Built on a modular multi-agent architecture, it combines visual understanding, layout planning and code synthesis to support the generation of high-precision and semantic front-end ...

Latest AI Resources

8mos ago

054.6K

Gemini 2.5 Deep Think - AI reasoning model from Google

Gemini 2.5 Deep Think is an AI reasoning model from Google. It is a variant of the model that won the gold medal at the International Mathematical Olympiad (IMO) 2025, and is designed to solve complex tasks through Parallel ...

Latest AI Resources

8mos ago

042.1K

MindLink - Open Source Large Reasoning Model from Kunlun Wanwei

MindLink is an open-source large reasoning model launched by Kunlun Wanwei. With an adaptive reasoning mechanism, it can flexibly switch reasoning modes according to task complexity, generating quickly for simple tasks and reasoning in depth for complex ones, balancing efficiency and accuracy. Plan-driven reasoning paradigm to remove the "think" label, down ...

Latest AI Resources

8mos ago

042.4K

Kimi K2 High-Speed Edition - High-speed edition of the language model released by Moonshot AI's Kimi

Kimi K2 High-Speed Edition (kimi-k2-turbo-preview) is a high-performance language model introduced by Moonshot AI's Kimi. The model is optimized on the basis of Kimi K2, with greatly increased output speed, and can generate 40 tokens per second...

Latest AI Resources

8mos ago

060.7K

dots.ocr - the open source multilingual document parsing model launched by Xiaohongshu hi lab

dots.ocr is a multilingual document parsing model open-sourced by Xiaohongshu hi lab, based on a 1.7 billion-parameter visual language model (VLM), which can efficiently perform document layout detection and content recognition while maintaining a good reading order.

Latest AI Resources

8mos ago

066.7K

HYPIR - A new large model for image restoration introduced by a team from the Chinese Academy of Sciences

HYPIR is a large model for image restoration introduced by Dong Chao's team at Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences. The model combines the score prior of diffusion models with generative adversarial networks to achieve efficient, high-quality image restoration. HYPIR can quickly restore old photos and improve resolution while keeping text clear...

Latest AI Resources

8mos ago

055.9K

FLUX.1 Krea [dev] - Text-to-image model jointly launched by Black Forest Labs and Krea AI

FLUX.1 Krea [dev] is a text-to-image model from Black Forest Labs and Krea AI. The model is capable of generating high-quality, photorealistic images based on input text descriptions with a unique aesthetic style that avoids traditional A...

Latest AI Resources

8mos ago

050.8K

Qwen3-Coder-Flash - an open source high performance programming model from Alibaba Tongyi

Qwen3-Coder-Flash is a high-performance programming model introduced by Alibaba's Tongyi Qianwen team, which has excellent agentic programming and tool-calling capabilities, and is good at handling complex programming tasks. The model supports 256K tokens of long context understanding, and can scale to 1M ...

Latest AI Resources

8mos ago

044.7K

Wide Research - Multi-Agent Collaboration Feature Introduced on the Manus Platform

Wide Research is a feature of the Manus platform designed to handle complex and large-scale tasks. The platform supports hundreds of general-purpose agents working simultaneously through system-level parallel processing mechanisms and agent collaboration protocols.

Latest AI Resources

8mos ago

040.7K

Seed Diffusion - the newest diffusion language model from ByteDance

Seed Diffusion is an experimental diffusion language model introduced by ByteDance that handles code generation tasks. The model is based on techniques such as two-stage diffusion training, constrained-order learning, and enhanced efficient parallel decoding, which significantly improves inference speed to 2146 tokens/s, which is faster than...

Latest AI Resources

8mos ago

045.3K

Xiao Xingxu - AI Emotion Comic Generation Product Launched by JD Health

Xiao Xingxu is an AI emotion comic generation product launched by JD Health, currently in the testing stage. Its core function is emotion-driven comic story generation: users express emotions or tell a story through voice or text input, and the AI generates matching four-panel comics and a story interpretation based on the input.

Latest AI Resources

8mos ago

046.6K

1688 AI Edition - AI business assistant launched by Alibaba's 1688 platform

1688 AI Edition is an intelligent business assistant application launched by Alibaba's 1688 platform, designed for small-business buyers and merchants. Based on the massive data of the 1688 platform, the application provides business opportunity alerts, product recommendations, idea generation, company lookup and other functions to help users accurately grasp the market dynamics, rapid...

Latest AI Resources

8mos ago

070.1K

StepFun Deep Research (阶跃深研) - AI Deep Research Tool by StepFun

StepFun Deep Research is an efficient AI research tool launched by StepFun, which can autonomously complete research on complex issues and generate professional reports in a short period of time. The tool is designed for finance, consulting, healthcare, law and other fields, and performs well in industry evaluations thanks to its in-depth search and information integration capabilities.

Latest AI Resources

8mos ago

039K

Runway Aleph - New AI Video Editing Model from Runway

Runway Aleph is an advanced AI video editing model launched by Runway, which uses simple text commands to quickly add and remove video content, change styles, adjust environments and optimize camera movement. Users can easily remove redundant elements, change scenes without complex operations...

Latest AI Resources

8mos ago

050K

WebShaper - Open-source AI training data synthesis system from Alibaba Tongyi

WebShaper is an AI training data synthesis system launched by Alibaba's Tongyi Lab, which uses formal modeling and an agentic expansion mechanism to generate high-quality and scalable training data that helps AI agents improve complex information retrieval capabilities. The system introduces the concept of "knowledge projection"...

Latest AI Resources

8mos ago

065.6K

Skywork UniPic - An Open Source Multimodal Unified Pre-Training Model from Kunlun Wanwei

Skywork UniPic is an open-source multimodal pre-training model from Kunlun Wanwei, with three core capabilities: image understanding, text-to-image generation and image editing. The model is based on an autoregressive architecture, incorporating a MAR encoder and a SigLIP2 backbone, with 1.5B parameter scale...

Latest AI Resources

8mos ago

048.9K

ChatGPT Study - An Innovative Learning Mode Introduced by OpenAI

ChatGPT Study is a learning mode from OpenAI that helps users learn and understand more efficiently. ChatGPT Study guides users to think actively and solve problems step-by-step through Socratic questioning, scaffolded responses, and personalized instruction...

Latest AI Resources

8mos ago

046.7K

通义万相Wan2.2 - 阿里推出的开源AI视频生成模型

Tongyi Wanxiang Wan2.2 - Open Source AI Video Generation Model from Alibaba

Tongyi Wanxiang Wan2.2 is an advanced AI video generation model open-sourced by Alibaba, with 27 billion total parameters. The model offers three modes, text-to-video, image-to-video, and unified video generation, and can generate high-quality videos from text descriptions, images, or a combination of both.

Latest AI Resources

8mos ago

057K

GLM-4.5 - 智谱开源的面向推理、代码与智能体的SOTA模型

GLM-4.5 - Zhipu's Open Source SOTA Model for Reasoning, Code, and Agents

GLM-4.5 is an open source SOTA model from Zhipu, designed for agent applications and combining reasoning, code generation, and agent capabilities. The model is based on the Mixture-of-Experts (MoE) architecture and comes in two versions, GLM-4.5 with 355 billion parameters and 106 billion...

Latest AI Resources

8mos ago

046.7K

Coze Studio - 字节跳动推出的开源AI Agent开发平台

Coze Studio - Open Source AI Agent Development Platform from ByteDance

Coze Studio is ByteDance's open source AI agent development platform, designed to help developers simplify the building, deployment, and management of AI applications. Coze Studio provides a one-stop development environment that supports Prompt, RAG, Plugin...

Latest AI Resources

8mos ago

053.3K

Coze Loop – 字节Coze开源的AI Agent开发与调试平台

Coze Loop - Open Source AI Agent Development and Debugging Platform from ByteDance's Coze

Coze Loop is the open source AI agent development and operations management platform from Coze, a platform under ByteDance. It provides developers with full lifecycle management from development and debugging to evaluation and monitoring, covering prompt engineering, Agent effect evaluation, performance monitoring and tuning...

Latest AI Resources

8mos ago

052.1K

悟能 - 商汤科技最新推出的具身智能平台

Wuneng - The Latest Embodied Intelligence Platform from SenseTime

Wuneng is an embodied intelligence platform designed for robots and smart devices. Based on the "Kaiwu" world model and multimodal large model technology, Wuneng integrates multi-sensor inputs such as vision, voice, and touch, and offers strong perception, decision-making, and action capabilities.

Latest AI Resources

8mos ago

044.1K

Intern-S1 - 上海AI Lab开源的科学多模态大模型

Intern-S1 - Shanghai AI Lab's Open Source Scientific Multimodal Large Model

Intern-S1 is a scientific multimodal large model launched by Shanghai Artificial Intelligence Laboratory. The model deeply integrates language and multimodal capabilities, with functions such as cross-modal scientific parsing, language and vision fusion, scientific data processing, scientific question answering, and experiment design and optimization.

Latest AI Resources

9mos ago

050.5K

混元3D世界模型 1.0 - 腾讯推出的开源3D世界生成模型

Hunyuan 3D World Model 1.0 - Tencent's Open Source 3D World Generation Model

Hunyuan 3D World Model 1.0 (Hunyuan World 1.0) is an open source model from Tencent, described as the industry's first world generation model to support immersive roaming, interaction, and simulation. The model integrates panoramic visual generation and hierarchical 3D reconstruction technology, and supports text or image input to quickly generate 36...

Latest AI Resources

9mos ago

050.5K

日日新 V6.5 - 商汤科技推出的最新多模态推理大模型

SenseNova V6.5 - The Latest Multimodal Reasoning Large Model from SenseTime

SenseNova V6.5 is an advanced multimodal reasoning large model from SenseTime, designed to handle mixed image and text inputs. It supports accurate understanding of image content and can generate descriptions or answer questions in combination with text.

Latest AI Resources

9mos ago

045.2K

Opal - 谷歌推出的AI工作流创建平台

Opal - AI Workflow Creation Platform from Google

Opal is an AI mini-app generation platform from Google Labs that helps users quickly create and share AI apps without writing code. Through natural language interaction and a visual editing interface, Opal makes it easy for users to string together prompts, model calls, and tools into a multi-step process...

Latest AI Resources

9mos ago

055.6K

Qwen-MT - 阿里通义推出的机器翻译模型

Qwen-MT - A Machine Translation Model from Alibaba Tongyi

Qwen-MT is a machine translation model launched by Alibaba's Tongyi Qianwen team. Built on the Qwen3 architecture, it supports translation among 92 languages, covering more than 95% of the world's population. The model is based on lightweight MoE ...

Latest AI Resources

9mos ago

055.5K

Agentar-Fin-R1 - 蚂蚁数科推出的金融领域推理大模型

Agentar-Fin-R1 - A Financial Reasoning Large Model from Ant Digital Technologies

Agentar-Fin-R1 is a large language model for the financial domain introduced by Ant Digital Technologies. Built on the Qwen3 architecture, the model comes in two parameter scales, 8B and 32B, and can accurately handle complex financial reasoning tasks, including multi-step analysis, risk assessment and war...

Latest AI Resources

9mos ago

044.1K

MonkeyCode - 开源的企业级AI编程助手

MonkeyCode - Open Source Enterprise AI Programming Assistant

MonkeyCode is an open source, enterprise-grade AI programming assistant designed for privacy- and security-conscious development teams. MonkeyCode supports private deployment and offline use to ensure the security of code data.

Latest AI Resources

9mos ago

046.9K

Seed LiveInterpret 2.0 - 字节跳动推出的同声传译模型

Seed LiveInterpret 2.0 - Simultaneous Interpretation Model from ByteDance

Seed LiveInterpret 2.0 is a simultaneous interpretation model launched by ByteDance's Seed team, supporting two-way translation between Chinese and English. The model has translation accuracy close to that of human interpreters and very low latency, with an average speech-to-speech delay of only 2-3 seconds, which is much lower than that of...

Latest AI Resources

9mos ago

042K

Excel MCP Server - 基于MCP的AI Excel处理工具

Excel MCP Server - MCP-based AI Excel Processing Tool

Excel MCP Server is a Model Context Protocol (MCP)-based server tool for manipulating Excel files without installing Microsoft Excel. Excel MC...

Latest AI Resources

9mos ago

054.5K

ChatFlow - 开源AI工作流自动化工具

ChatFlow - Open Source AI Workflow Automation Tool

ChatFlow is an open source AI workflow automation tool that turns complex requirements into efficient workflows. The tool uses AI to help users quickly generate code frameworks and test cases, and can assist with writing and with designing software architecture.

Latest AI Resources

9mos ago

045.8K

Mureka V7 - 昆仑万维推出的AI音乐生成模型

Mureka V7 - AI Music Generation Model from Kunlun Wanwei

Mureka V7 is an AI music generation model launched by Kunlun Wanwei. The model is based on MusiCoT technology, which plans the overall structure of the music before filling in the details, producing more coherent and artistic musical works.

Latest AI Resources

9mos ago

044.2K

Seed GR-3 - 字节跳动Seed团队推出的通用机器人模型

Seed GR-3 - General-Purpose Robot Model from the ByteDance Seed Team

Seed GR-3 is a general-purpose robot model introduced by ByteDance, with strong generalization that lets it adapt to new environments and complex instructions. The model fuses visual, language, and action information, and uses a three-part training method combining robot data, VR human trajectory data, and publicly available image-text data to enhance the ability to respond to new objects...

Latest AI Resources

9mos ago

046.2K

Qwen3-Coder - 阿里通义千问开源的的代码生成模型

Qwen3-Coder - Alibaba Tongyi Qianwen's Open Source Code Generation Model

Qwen3-Coder is a code generation model introduced by Alibaba's Tongyi Qianwen team. The model has 480B total parameters with 35B active parameters, natively supports a 256K-token context, and can scale to 1M tokens. The model is based on a Mixture-of-Experts architecture...

Latest AI Resources

9mos ago

048.9K

OpenReasoning-Nemotron - 英伟达推出的开源系列推理模型

OpenReasoning-Nemotron - Open Source Series of Reasoning Models from NVIDIA

OpenReasoning-Nemotron is a series of large language models open-sourced by NVIDIA for reasoning tasks in math, science, and code. The models are distilled from the DeepSeek R1 0528 model, with parameter scales of 1.5B...

Latest AI Resources

9mos ago

041.7K

Seed-X - 字节跳动推出的开源多语言翻译模型

Seed-X - Open Source Multilingual Translation Model from ByteDance

Seed-X is a multilingual translation model launched by ByteDance's Seed team, with 7 billion parameters and support for two-way translation in 28 languages. The model combines multilingual pre-training, instruction fine-tuning, and reinforcement learning techniques to efficiently handle complex language patterns and make translation quality better...

Latest AI Resources

9mos ago

066.5K

JoyAgent-JDGenie - 京东开源的轻量化通用多智能体系统

JoyAgent-JDGenie - JD.com's Open Source Lightweight General-Purpose Multi-Agent System

JoyAgent-JDGenie is a lightweight, general-purpose multi-agent system open-sourced by JD.com that can be used directly without secondary development. JoyAgent-JDGenie can handle complex tasks, such as generating reports and analyzing data, and supports a variety of delivery formats, such as web pages, PPT ...

Latest AI Resources

9mos ago

054.2K

TRAE SOLO - 字节跳动TRAE推出的AI自动开发助手

TRAE SOLO - AI Automated Development Assistant from ByteDance's TRAE

TRAE SOLO is an AI automated development assistant introduced by TRAE, ByteDance's AI programming assistant, to simplify the software development process. TRAE SOLO understands the user's needs, accepts requirements as text descriptions, voice commands, and file uploads, and automatically plans...

Latest AI Resources

9mos ago

072.7K

雾象Fogsight - AI动画生成Agent，输入主题生成完整动画

Fogsight - AI Animation Generation Agent, input theme to generate full animation

Fogsight is an AI animation generation agent that uses large language models (LLMs) to transform abstract concepts into vivid animations. Users input a topic and Fogsight generates a complete animation with bilingual narration and cinematic visuals.

Latest AI Resources

9mos ago

055.8K

Goedel-Prover-V2 - 普林斯顿联合清华和英伟达等开源的定理证明模型

Goedel-Prover-V2 - An Open-Source Theorem Proving Model from Princeton, Tsinghua, NVIDIA, and Others

Goedel-Prover-V2 is an open-source theorem proving model jointly released by leading organizations such as Princeton University, Tsinghua University, and NVIDIA. The model is based on innovative techniques such as hierarchical data synthesis, verifier-guided self-correction, and model averaging to significantly improve the performance of automated formal proofs...

Latest AI Resources

9mos ago

048K

BytePlus - 字节跳动推出的企业级智能云服务平台

BytePlus - Enterprise-Grade Intelligent Cloud Services Platform from ByteDance

BytePlus is an enterprise-level intelligent service platform launched by ByteDance to provide a range of services to overseas markets. The platform covers functions such as content delivery and acceleration (CDN), personalized recommendation, augmented reality, data processing and analysis, real-time audio and video communication, artificial intelligence, and machine learning.

Latest AI Resources

9mos ago

061.4K

飞书妙搭 - 飞书推出的AI原生系统搭建平台

Feishu Miaoda - AI-Native System Building Platform from Feishu

Feishu Miaoda is an enterprise-level AI-native system building platform launched by Feishu. The platform uses a multi-agent architecture to quickly turn enterprise business requirements into practical applications, supporting the whole process from requirements analysis to functional design, application development, and problem repair. Through conversation, users can easily build lightweight...

Latest AI Resources

9mos ago

049.9K

MirageLSD - Decart AI推出首个实时AI视频生成模型

MirageLSD - Decart AI Launches First Real-Time AI Video Generation Model

MirageLSD is the world's first real-time streaming diffusion AI video model from the Decart AI team, enabling unlimited real-time video generation with latency as low as 40 milliseconds and smooth output at 24 frames/second.

Latest AI Resources

9mos ago

047.2K

Kimi Playground - 月之暗面推出的一站式AI工具调用体验平台

Kimi Playground - A One-Stop AI Tool-Calling Experience Platform from Moonshot AI

Kimi Playground is an AI tool-calling experience platform for developers from Moonshot AI. Kimi Playground enables AI to call a variety of tools (e.g., weather lookups, hotel bookings, data analytics, etc.) to accomplish complex tasks, not just...

Latest AI Resources

9mos ago

050.6K

ChatGPT Agent – OpenAI推出的通用智能AI Agent

ChatGPT Agent - General-Purpose AI Agent from OpenAI

ChatGPT Agent is a general-purpose AI Agent from OpenAI that combines multiple capabilities to autonomously accomplish complex tasks. Users only need to describe their needs in natural language, and the Agent can automatically select the appropriate tools, such as browsing the web, extracting information, running code...

Latest AI Resources

9mos ago

043.3K