Featured AI Tools List | page 5 | AI Sharing Circle

超人工智能 ASI（Artificial Super Intelligence）是什么，一文看懂

Super Artificial Intelligence (ASI) What is ASI (Artificial Super Intelligence) in one article?

Artificial Super Intelligence (ASI) is an intelligent system that exceeds human intelligence, with capabilities that surpass those of humans in all domains, including cognition, creativity, problem solving, and decision-making.

7mos ago

050.3K

迁移学习（Transfer Learning）是什么，一文看懂

Transfer Learning (Transfer Learning) what is it, an article to read and understand

Transfer Learning (Transfer Learning) is an important branch in the field of machine learning, the core idea is to apply the knowledge learned from one task or domain to another related but different task or domain.

7mos ago

036.7K

HuMo - 清华大学联合字节开源的多模态视频生成框架

HuMo - Tsinghua University United Bytes open source multimodal video generation framework

HuMo is a multi-modal video generation framework jointly open-sourced by Tsinghua University and ByteDance Intelligent Creation Lab, focusing on human-centered video generation. It can generate high-quality, fine-grained and controllable human videos from a variety of modal inputs such as text, images and audio.HuMo supports a powerful text cue-following capability...

Latest AI Resources

7mos ago

0119.8K

AnyI2V - 复旦联合阿里达摩院等开源的智能图像动画生成框架

AnyI2V - Fudan, Ali Dharma Institute and other open source framework for intelligent image animation generation

AnyI2V is an image animation generation framework jointly launched by Fudan University, Alibaba Dharma Institute and others, which supports the conversion of static conditional images (e.g., grids, point clouds, etc.) into dynamic videos without the need for complex training processes and large amounts of data.

Latest AI Resources

7mos ago

034.5K

SRPO - 腾讯混元推出的文本到图像生成模型

SRPO - Text-to-Image Generation Model launched by Tencent Mixed Meta

SRPO (Semantic Relative Preference Optimization) is a text-to-image generation model introduced by Tencent Hybrid, which optimizes the reward mechanism through text conditioned signals to achieve online adjustment of rewards and reduce offline fine-tuning dependency.

Latest AI Resources

7mos ago

047.9K

Qwen3-Next - 阿里通义推出的最新基础模型

Qwen3-Next - the latest base model from Ali Tongyi

Qwen3-Next is a new generation of hybrid architecture big model open source by Ali Tongyi, combining Gated DeltaNet and Gated Attention technology, good at dealing with long text, fast inference and saving computing resources.

Latest AI Resources

7mos ago

033K

文心大模型X1.1 - 百度推出的深度思考模型，理解能力更强

Wenshin Big Model X1.1 - Baidu's Deep Thinking Model for Better Understanding

Wenxin Big Model X1.1 is a deep thinking model launched by Baidu, based on a hybrid reinforcement learning framework that focuses on improving language understanding and generation. The model excels in handling complex questions, following instructions and simulating the behavior of intelligences, and can accurately provide knowledgeable answers and high-quality text content.

Latest AI Resources

7mos ago

040.3K

混元图像2.1 - 腾讯推出的开源文生图模型

Hybrid Image 2.1 - Tencent's Open Source Vendor Graph Model

HunyuanImage 2.1 is Tencent's open source graphic model, designed for high-quality image generation. The model supports native 2K resolution, can accurately render complex scenes and details, so that the character's expression and movement can be vividly reproduced.

Latest AI Resources

7mos ago

036.5K

AntSK FileChunk - 免费的AI语义文档切片工具，动态切片调整

AntSK FileChunk - Free AI Semantic Document Slicing Tool, Dynamic Slicing Adjustment

AntSK FileChunk is a free intelligent document slicing tool designed for RAG (Retrieval Augmented Generation) applications. Semantic as the core, the document will be intelligently sliced into semantically complete, coherent segments , support for multi-language , can dynamically adjust the size of the slice to ensure that the context of coherence.

Latest AI Resources

7mos ago

040.6K

UnifiedTTS - 一站式TTS API服务平台，实时性能监控

UnifiedTTS - One-stop TTS API Service Platform, Real-time Performance Monitoring

UnifiedTTS is a one-stop platform for text-to-speech (TTS) services. It supports multiple languages, including Chinese, English, Japanese and Korean, to meet the needs of global business. Through a unified API interface, it integrates many mainstream TTS services, including Micro...

Latest AI Resources

7mos ago

044.3K

MiniCPM 4.1 - 面壁智能推出的超高效端侧大模型

MiniCPM 4.1 - Ultra-efficient end-side grand model introduced by Facing Face Intelligence

MiniCPM 4.1 is an ultra-efficient end-side large language model introduced by Facade Intelligence. With InfLLM v2 sparse attention architecture, each lexeme only needs to calculate the relevance to less than 5% lexemes, which significantly reduces the processing overhead of long text. In a 128K long text scenario...

Latest AI Resources

7mos ago

035.6K

WeKnora - 腾讯微信开源的文档理解与语义检索框架

WeKnora - Tencent WeChat Open Source Document Understanding and Semantic Retrieval Framework

WeKnora is Tencent WeChat team open source based on the Large Language Model (LLM) document understanding and semantic retrieval framework , designed for the structure of complex, heterogeneous document content scenarios and designed to use a modularized architecture , integration of multimodal preprocessing , semantic vector indexing , intelligent recall and large model generative reasoning ...

Latest AI Resources

7mos ago

077.3K

XTuner V1 - 上海AI Lab开源的大模型训练引擎

XTuner V1 - Shanghai AI Lab open source large model training engine

XTuner V1 is a new generation of large model training engine open-sourced by Shanghai Artificial Intelligence Laboratory (SAL), designed for ultra-large scale sparse Mixed Expert (MoE) model training. Developed based on PyTorch FSDP, it achieves high performance through multi-dimensional optimization of memory, communication and load ...

Latest AI Resources

7mos ago

036.2K

Qwen3-ASR-Flash - 阿里通义千问推出的系列语音识别模型

Qwen3-ASR-Flash - A series of speech recognition models launched by Ali Tongyi Qianqian

Qwen3-ASR-Flash is Alibaba's latest high-precision speech recognition model, based on the Qwen3 base model, trained on massive multimodal data. It supports 11 languages and multiple accents, including Mandarin, Sichuan, Minnan, Wu, Cantonese and other dialects...

Latest AI Resources

7mos ago

049.2K

人工智能治理（AI Governance）是什么，一文看懂

What is Artificial Intelligence Governance (AI Governance) in One Article

AI governance is a comprehensive framework covering technology, ethics, law, and society that effectively guides, manages, and oversees the entire lifecycle of AI systems-from design, development, deployment, and end use. The core goal is not to hinder technological innovation, but to ensure that the development and application of AI technologies begin...

7mos ago

044.4K

吴恩达的LangChain for LLM应用开发免费课程

Free LangChain for LLM Application Development Course by Ernest Ng

LangChain for LLM Application Development is an online course presented by DeepLearning.AI, featuring LangChain founder Harrison Chase and Andrew Ng.

Latest AI Resources Course materials

7mos ago

058K

吴恩达的Transformer LLMs工作原理免费课程

Free course on how Transformer LLMs work by Enda Wu

Transformer LLMs work on the principle that DeepLearning.AI and Jay Alammar and Maarten Grootend, authors of Hands-On Large Language Models...

Latest AI Resources Course materials

7mos ago

051.9K

半监督学习（Semi-Supervised Learning）是什么，一文看懂

What is Semi-Supervised Learning (SSL) in one article?

Semi-supervised learning is an important branch in the field of machine learning, which uses a small amount of labeled data and a large amount of unlabeled data to co-train a model to improve the learning effect and generalization ability.

7mos ago

042.6K

无监督学习（Unsupervised Learning）是什么，一文看懂

What is Unsupervised Learning (ULS) in one article?

Unsupervised Learning (ULS) is an important branch of machine learning that focuses on processing data sets that are not pre-labeled.

7mos ago

035.3K

Seedream 4.0 - 字节推出的最新一代图像创作模型

Seedream 4.0 - the latest generation of image creation models launched by Bytes

Seedream 4.0 is an advanced image generation and editing tool launched by ByteDance, centered on the integration of generation and editing, with powerful features such as precise command editing, high feature retention, and deep intent understanding.

Latest AI Resources

7mos ago

080.3K

rStar2-Agent - 微软开源的高效AI推理模型

rStar2-Agent - Microsoft's Open Source Efficient AI Reasoning Model

rStar2-Agent is an advanced AI mathematical reasoning model open-sourced by Microsoft that demonstrates strong mathematical problem solving capabilities by achieving an accuracy of 80.61 TP3T in the AIME24 test. The model is equipped with scientific reasoning capabilities, achieving in the GPQA-Diamond benchmark...

Latest AI Resources

7mos ago

037.5K

Qwen3-Max-Preview - 通义千问推出的旗舰大语言模型

Qwen3-Max-Preview - The Flagship Big Language Model from Tongyi Qianqian

Qwen3-Max-Preview is the latest flagship large language model released by Tongyi Qianwen. It is the model with the largest number of parameters in the Qwen3 family, with a parameter size of over 1 trillion. The model has significant improvements in inference, instruction following, multi-language support and long-tail knowledge coverage...

Latest AI Resources

7mos ago

040.8K

OneCAT - 美团联合上海交大开源的多模态模型

OneCAT - Open source multimodal modeling by Meituan and Shanghai Jiaotong University

OneCAT is a new unified multimodal model launched by Meituan in conjunction with Shanghai Jiaotong University, which adopts a pure decoder architecture and can seamlessly integrate multimodal comprehension, text-to-image generation and image editing functions. The model abandons the design of traditional multimodal models that rely on external visual coders and disambiguators through modality-specific...

Latest AI Resources

7mos ago

039.2K

Claudable - 开源AI Web应用构建器，自然语言生成代码

Claudable - Open Source AI Web Application Builder, Natural Language Generated Code

Claudable is an open source web application builder based on Next.js that combines the advanced AI agent capabilities of Claude Code and Cursor CLI with Lovable's simple and intuitive application building experience....

Latest AI Resources

7mos ago

043.2K

FineVision - Hugging Face推出的开源视觉语言数据集

FineVision - Open Source Visual Language Dataset from Hugging Face

FineVision is Hugging Face's open source visual language dataset for training advanced visual language models. It contains 17.3 million images, 24.3 million samples, 88.9 million rounds of dialog, and 9.5 billion answer tokens. The dataset aggregates...

Latest AI Resources

7mos ago

041.2K

InfinityHuman - 字节联合浙大推出的长视频数字人生成模型

InfinityHuman - Long video digital human generation model launched by Bytes in collaboration with ZJU

InfinityHuman is a commercial-grade long time-series audio-driven character video generation model jointly launched by ByteDance and Zhejiang University. The model is audio-driven and can generate high-resolution, long duration and visually consistent character videos.

Latest AI Resources

7mos ago

037.3K

Kimi K2-0905 - 月之暗面推出的最新模型版本

Kimi K2-0905 - The latest model release from Dark Side of the Moon!

Kimi K2-0905 is an advanced AI model from Dark Side of the Moon Technologies Ltd. that excels in programming assistance, generates code efficiently, and supports the generation of neat and standardized code in front-end development. The model context length is extended to 256K to handle complex tasks.

Latest AI Resources

7mos ago

074.7K

强化学习（Reinforcement Learning）是什么，一文看懂

What is Reinforcement Learning in one article?

Reinforcement learning is an important branch of machine learning that centers on allowing intelligences to autonomously learn how to make optimal decisions to maximize long-term cumulative rewards through continuous interaction with the environment.

7mos ago

036K

监督学习（Supervised Learning）是什么，一文看懂

Supervised Learning (Supervised Learning) what is it, an article to understand

Supervised learning is one of the most common and basic methods of machine learning, the core idea is to teach computer models how to make predictions or judgments through existing data sets with "correct answers".

7mos ago

038.2K

深度学习（Deep Learning）是什么，一文看懂

Deep Learning (Deep Learning) is what, an article to understand

Deep Learning (DL) is a branch of machine learning that centers on the use of multi-layer artificial neural networks to learn and represent complex patterns in data.

7mos ago

039.3K

HunyuanWorld-Voyager - 腾讯开源的超长漫游世界模型

HunyuanWorld-Voyager - Tencent open source ultra-long roaming world model

HunyuanWorld-Voyager (Hunyuan Voyager for short) is the industry's first ultra-long roaming world model released by Tencent that supports native 3D reconstruction. It is a novel video diffusion framework that generates a 3D point cloud sequence of user-defined camera paths from a single image, supporting...

Latest AI Resources

7mos ago

040.3K

Hunyuan-MT-7B - 腾讯混元开源的轻量级翻译模型

Hunyuan-MT-7B - Tencent Mixed Meta Open Source Lightweight Translation Model

Hunyuan-MT-7B is a lightweight translation model introduced by Tencent's Mixed Meta Team, with 7 billion references, supporting the mutual translation of 33 languages and 5 folk-Chinese languages/dialects, including Cantonese, Uyghur, and Tibetan. In the International Association for Computational Linguistics (ACL) WMT2025 competition...

Latest AI Resources

7mos ago

037.4K

Step-Audio 2 mini - 阶跃星辰开源的语音大模型

Step-Audio 2 mini - Step-Star Open Source Speech Megamodels

Step-Audio 2 mini is an open source end-to-end speech grand model of Step-Audio. It breaks through the traditional speech model structure and adopts the true end-to-end multimodal architecture, which directly transforms the original audio input into speech response output with lower latency, and understands paralinguistic information and non-vocal signals.

Latest AI Resources

7mos ago

046.3K

MobileCLIP2 - 苹果公司开源的高效端侧多模态模型

MobileCLIP2 - Apple's Open Source Efficient End-Side Multi-Modal Modeling

MobileCLIP2 is an upgraded version of MobileCLIP, an efficient end-side multimodal model introduced by Apple researchers. It is optimized in terms of multimodal reinforcement training by training better-performing CLIP instructor model integration on DFN datasets and improved graphical raw...

Latest AI Resources

7mos ago

050.1K

InternVL3.5 - 上海AI实验室开源的多模态大模型

InternVL3.5 - Shanghai AI Lab Open Source Multimodal Large Models

InternVL3.5 (Shusheng-Wanxiang 3.5) is an open source multimodal large model of the Shanghai Artificial Intelligence Laboratory, the model is fully upgraded in terms of general ability, reasoning ability and deployment efficiency, providing nine sizes of versions from 1 billion to 241 billion parameters, covering different resource demand scenarios, including thick...

Latest AI Resources

7mos ago

048.7K

FastVLM - 苹果公司推出的视觉语言模型

FastVLM - Visual Language Model from Apple

FastVLM (Fast Vision Language Model) is an efficient visual language model introduced by Apple Inc. With FastViTHD hybrid visual coder as the core, it incorporates convolutional and Transformer architectures to significantly reduce visual...

Latest AI Resources

7mos ago

046.4K

Meeseeks - 美团开源的评估模型指令遵循能力的评测集

Meeseeks - Meeseeks open-source assessment set for evaluating the ability to follow model instructions

Meeseeks is an open source large model evaluation set used by the Meituan M17 team to evaluate the model's ability to follow instructions.Meeseeks uses a three-tiered evaluation framework to comprehensively measure whether the model is able to generate answers in strict accordance with the user's instructions from the macro to the micro level, without evaluating the knowledge of the content of the answers positively ...

Latest AI Resources

7mos ago

041.1K

gpt-realtime - OpenAI最新推出的AI语音模型

gpt-realtime - OpenAI's newest AI speech model

gpt-realtime is an advanced speech model from OpenAI that supports direct audio processing to generate natural and smooth speech. The model supports multiple languages and styles, understands non-verbal cues such as laughter, and can switch between languages.

Latest AI Resources

7mos ago

042.7K

Youtu-agent - 腾讯开源的高效智能体框架

Youtu-agent - Tencent open source efficient intelligent body framework

Youtu-agent is an open source framework for building and running autonomous intelligences from Tencent Youtu Labs. The framework performs well in WebWalkerQA and GAIA benchmarks, with an accuracy of 71.47% and 72.8% respectively.The framework...

Latest AI Resources

7mos ago

052.6K

HunyuanVideo-Foley - 腾讯推出的开源视频音效生成模型

HunyuanVideo-Foley - Tencent's Open Source Video Sound Generation Model

HunyuanVideo-Foley is an open source video sound generation model by the Tencent Mixed Yuan team that supports adding accurately matched sound effects to silent videos. The model is based on a large-scale dataset training , with a multimodal diffusion transformer architecture , combined with the characterization of the alignment loss function and audio VAE optimization techniques ...

Latest AI Resources

7mos ago

052K

PixVerse V5 - 爱诗科技推出的自研AI视频模型

PixVerse V5 - Self-developed AI video model launched by Aishi Technologies

PixVerse V5 is a big model of AI video generation launched by Aishi Technology. The model can generate high-quality video content based on user-input text descriptions or images, and supports multiple styles, such as anime, sci-fi, and national style.

Latest AI Resources

7mos ago

046.2K

问小白5 - 问小白推出的全能AI模型

Ask White 5 - All-in-One AI Model from Ask White

Ask White 5 is the flagship "All in One" model with a very high level of intelligence. The model has excellent performance in many assessments, such as the AA-Index composite assessment score of 64.7 and the STEM ability assessment score of 86, which is close to the world's leading GPT-5.

Latest AI Resources

7mos ago

042.1K

MiniCPM-V 4.5 - 面壁智能开源的8B参数多模态模型

MiniCPM-V 4.5 - Faceted Intelligent Open Source 8B Parameter Multimodal Modeling

MiniCPM-V 4.5 is an open source 8B parametric multimodal model of Facade Intelligence, built based on Qwen3-8B and SigLIP2-400M, with the ability to efficiently process images and videos. It has excellent performance in visual token consumption, processing ...

Latest AI Resources

7mos ago

052.4K

Gemini 2.5 Flash Image - 谷歌推出的最强图像生成与编辑模型

Gemini 2.5 Flash Image - The Most Powerful Image Generation and Editing Model from Google

Gemini 2.5 Flash Image (codename nano banana) is a state-of-the-art image generation and editing model from Google that maintains the consistency of characters across different scenes and supports precise image editing through natural language, such as blurring backgrounds and removing stains.

Latest AI Resources

7mos ago

043.8K

Wan2.2-S2V - 阿里通义开源的音频驱动视频生成模型

Wan2.2-S2V - Ali Tongyi open source audio-driven video generation model

Wan2.2-S2V is Ali Tongyi open source multimodal video generation model , only a static picture and a piece of audio , you can generate high-quality digital human video , and supports a variety of image types and frame .

Latest AI Resources

7mos ago

044.8K

吴恩达面向开发者的ChatGPT提示工程免费课程

Free Course on ChatGPT Tip Engineering for Developers by Ernest Ng

ChatGPT Tip Engineering for Developers is a joint DeepLearning.AI and OpenAI course designed for developers, featuring Isa Fulford, Andrew Ng to teach how to use Large Language Models (LLMs...

Latest AI Resources Course materials

7mos ago

046.8K

问小白o4 - 问小白推出的并行思考模型，同时开启8条思考路径

Ask Whitey o4 - A parallel thinking model introduced by Ask Whitey that opens 8 thinking paths at the same time

Ask White o4 is an innovative parallel thinking model that opens 8 thinking paths at the same time, analyzes the problem from multiple perspectives and automatically filters out the optimal solution. The model incorporates advanced Long-CoT reinforcement learning and process reward learning techniques, has powerful deep reasoning capabilities, and performs well in complex tasks.

Latest AI Resources

7mos ago

037.5K

VibeVoice - 微软推出的文本到语音模型

VibeVoice - Text-to-Speech Model from Microsoft

VibeVoice is a new text-to-speech (TTS) model from Microsoft. The model generates conversational audio from up to four different speakers and supports up to 90 minutes of continuous voice output, breaking the length limitations of traditional TTS systems.

Latest AI Resources

7mos ago

065K

SpatialGen - 群核科技推出的开源3D场景生成模型

SpatialGen - Open Source 3D Scene Generation Model by Qunar Technology

SpatialGen is an open source 3D scene generation model of Qunar Technology, based on the diffusion model architecture, supporting the generation of spatio-temporally consistent multi-view images based on textual descriptions, reference images and 3D spatial layouts, and further generating 3D Gaussian scenes and rendering roaming videos.

Latest AI Resources

7mos ago

043.2K

EchoMimicV3 - 蚂蚁开源的多模态数字人动画生成模型

EchoMimicV3 - Ant open source multimodal digital human animation generation model

EchoMimicV3 is a multimodal digital human video generation model introduced by Ant Group, with 1.3 billion parameters, capable of handling multiple inputs such as audio, text, images, etc. to generate high-quality digital human animations.

Latest AI Resources

7mos ago

042.7K

人工智能伦理（AI Ethics）是什么，一文看懂

What are AI Ethics, in one article?

Artificial Intelligence Ethics (AI Ethics) is a cross-disciplinary field that examines the ethical principles, values and social responsibilities that should be followed in the development, deployment and use of AI systems.

7mos ago

040.5K

AI论文写作工具有哪些？推荐15个免费AI学术论文助手

What are the best AI essay writing tools? 15 Recommended Free AI Academic Essay Assistants

In the era of booming artificial intelligence, AI tools have changed our lives and greatly boosted academic research and paper writing. In order to help users work and study more efficiently, this compilation carefully selects and introduces 15 cutting-edge free AI academic paper assistants.

7mos ago

047.4K

Fun-ASR - 钉钉、通义联合推出的新一代语音识别模型

Fun-ASR - A New Generation of Speech Recognition Models Jointly Launched by Nail and Tongyi

Fun-ASR is a big model of speech recognition jointly launched by Nail and Tongyi Labs. The model has been trained with massive audio data and can accurately recognize multi-industry terminology, such as Internet, technology, home decoration, etc., significantly improving the recognition accuracy. The model combines with Nail enterprise information for inference optimization to reduce the illusion problem...

Latest AI Resources

7mos ago

065.7K

Squibler - AI小说辅助写作平台，助力构思到创作全过程

Squibler - AI novel-assisted writing platform that facilitates the entire process from idea to creation

Squibler is a powerful AI-assisted writing platform designed for writers that helps users with the entire process from conception to creation to publication. The platform provides a variety of story templates covering novels, screenplays, short stories, etc. Users only need to enter the initial concept, and the AI can generate outlines, characters, scenes...

Latest AI Resources

7mos ago

045.9K

91写作 - 开源的AI智能小说创作平台

91Writing - Open Source AI Intelligent Novel Creation Platform

91 Writing is a fully open source AI novel creation tool, developed based on Vue 3 and Element Plus, integrating a variety of advanced AI models, such as GPT, Claude, Gemini, and so on. The tool provides creators with a complete creation tool chain from idea to text, including project creation...

Latest AI Resources

7mos ago

046.6K

Aivilization - 港科大推出的多Agent社会模拟平台

Aivilization - A Multi-Agent Social Simulation Platform Launched by HKUST

Aivilization is the world's first AI multi-intelligent body social simulation platform developed by the Hong Kong University of Science and Technology. It builds a visual digital sandbox where users can create and guide thousands of AI intelligences to observe the social evolution of future human-AI coexistence. The platform supports...

Latest AI Resources

7mos ago

081.6K

弱人工智能（Narrow AI）是什么，一文看懂

What is Weak Artificial Intelligence (Narrow AI), in one article

Weak Artificial Intelligence (Narrow AI) is currently the dominant form of AI technology development in our real world. Weak AI is designed and trained to perform a specific, well-defined task with a level of intelligence that may surpass that of humans in that particular domain.

7mos ago

045.5K

Grok 2.5 - 马斯克旗下xAI开源的人工智能模型

Grok 2.5 - Musk's xAI open source AI model

Grok 2.5 is an open source AI model from Elon Musk's xAI. With 269 billion parameters, it is based on the Mixed Expert (MoE) architecture for powerful performance and inference. The model has been tested at graduate level scientific knowledge (GPQA), generalized knowledge (MMLU, MM...

Latest AI Resources

7mos ago

047.2K

Draw A Fish - 免费的在线AI画鱼网站，共享虚拟鱼缸

Draw A Fish - free online AI fish drawing site with shared virtual fish tanks

Draw A Fish is simple and fun online AI fish drawing site where users can draw fish patterns and place them in a globally shared virtual fish tank.Draw A Fish requires no registration and is easy to use, taking only seconds to create and share.

Latest AI Resources

8mos ago

066.7K

MIT最新报告《生成式AI鸿沟：2025年商业人工智能现状》

MIT's new report, "The Generative AI Divide: the State of Business AI in 2025

MIT's latest report, The Generative AI Divide: the State of Business AI in 2025, reveals the core of the generative AI (GenAI) adoption process that companies are experiencing through in-depth research of more than 300 AI projects, interviews with 52 organizations, and a survey of 153 executives...

Latest AI Resources Course materials

8mos ago

080.1K

AutoClip - 开源的AI视频切片工具，一键生成专题视频合集

AutoClip - Open source AI video slicing tool to generate thematic video collections with one click

AutoClip is open source AI video editing tool, based on advanced AI technology to realize the whole process of automated video processing. Tools can automatically identify the highlights of the video, accurate extraction of valuable content, can be based on the similarity of the theme of intelligent clustering, to generate a collection of content.AutoClip support...

Latest AI Resources

8mos ago

063.1K

《动手学AI：人工智能通识与实践》 - 阿里云推出的免费AI通识课程

Hands-On AI: Artificial Intelligence Liberalization and Practice - Free AI Liberalization Course by AliCloud

Hands-On Learning AI: Artificial Intelligence General Knowledge and Practice" of AliCloud, in conjunction with Superstar Erlang, is a systematic learning course on AI for learners of different professional backgrounds. The course is taught by master teachers from five top universities, with comprehensive content, from the development history of AI, core technology to ethical security, etc., to build a complete body of knowledge...

Course materials

8mos ago

043.9K

ToonComposer - 腾讯开源的生成式AI动画制作工具

ToonComposer - Tencent open source generative AI animation tool

ToonComposer is a generative AI animation tool jointly launched by The Chinese University of Hong Kong, Tencent PCG ARC Lab and Peking University. Through generative post keyframe technology, the intermediate frame generation and coloring process is integrated into an automated process, requiring only a sketch and a...

Latest AI Resources

8mos ago

053.3K

Seed-OSS - 字节跳动团队开源的全新AI模型

Seed-OSS - A new AI model open-sourced by the Wordpress team

Seed-OSS is a large family of language models open-sourced by the Byte Jump Seed team, focusing on long text and reasoning tasks. The model performs well in complex logical reasoning and multi-step reasoning, with high accuracy and efficient problem solving.Seed-OSS supports long text contexts up to 512K...

Latest AI Resources

8mos ago

050.2K

Intern-S1-mini - 上海AI Lab开源的轻量化科学多模态模型

Intern-S1-mini - Lightweight scientific multimodal model open source by Shanghai AI Lab

Intern-S1-mini is a lightweight scientific multimodal macromodel with parameter scale of 8B launched by Shanghai Artificial Intelligence Laboratory (SAL).It inherits the powerful capabilities of Intern-S1, combining both general and specialized scientific capabilities, and is suitable for rapid deployment and secondary development. In terms of performance, I...

Latest AI Resources

8mos ago

043.9K

人工智能 AI（Artificial Intelligence）是什么，一文看懂

Artificial Intelligence What is AI (Artificial Intelligence) in one article?

Artificial Intelligence (AI) is a core branch of computer science that aims to build theoretical and technological systems that can simulate, extend, and even surpass human intelligence, so that machines have the ability to learn, reason, perceive, and make decisions that usually require human intelligence to...

7mos ago

057.6K

Nano Banana - 谷歌推出的AI图像编辑模型

Nano Banana - AI image editing model launched by Google

Nano Banana is the Gemini 2.5 Flash Image codename for Gemini, an AI image generation and editing model from Google that generates detailed, photorealistic images based on simple text prompts to make high-quality modifications to existing images.

Latest AI Resources

7mos ago

069.7K

Klear-Reasoner - 快手推出的全新推理模型

Klear-Reasoner - The New Reasoning Model Introduced by Racer

Klear-Reasoner is a high-performance inference model from Racer, based on Qwen3-8B-Base. The model is trained by long thought chain supervised fine-tuning and reinforcement learning to perform well in mathematical and code reasoning.Klear-Reasoner...

Latest AI Resources

8mos ago

040.2K

CombatVLA - 淘天集团推出的高效VLA模型

CombatVLA - Efficient VLA Model by Amoy Group

CombatVLA is an innovative 3D action role-playing game (ARPG)-specific model from the Future Life Lab team of the Amoy Sky Group.CombatVLA is a visual-linguistic-action (VLA) model, built on a 3B parametric scale, that collects human player's through a motion tracker...

Latest AI Resources

8mos ago

044.3K

DeepSeek V3.1 - DeepSeek推出的最新开源AI模型

DeepSeek V3.1 - Latest Open Source AI Models from DeepSeek

DeepSeek V3.1 is a new generation of AI models introduced by DeepSeek, with important upgrades based on its predecessor, V3. DeepSeek V3.1 introduces a hybrid reasoning architecture that allows the model to flexibly switch between thinking and non-thinking modes, significantly improving the thinking...

Latest AI Resources

8mos ago

047.3K

Qwen-Image-Edit - 阿里通义开源的图像编辑模型

Qwen-Image-Edit - Ali Tongyi open source image editing model

Qwen-Image-Edit is an all-purpose image editing model introduced by Ali Tongyi, built on the Qwen-Image architecture with 20 billion parameters. The model combines both semantic and appearance editing capabilities, and can perform low-level visual appearance editing on images (e.g., adding, deleting...

Latest AI Resources

8mos ago

044.6K

MoE-TTS - 昆仑万维推出的最新语音生成框架

MoE-TTS - The Latest Speech Generation Framework from KunlunWei

MoE-TTS is a speech synthesis framework introduced by KunlunWanwei, based on the Mixed Expert (MoE) architecture, which combines pre-trained Large Language Models (LLMs) with speech expert modules.MoE-TTS retains the powerful textual reasoning by freezing the textual module parameters and updating only the speech module parameters...

Latest AI Resources

8mos ago

043.8K

Genie Envisioner - 智元联合北航等开源的通用机器人操作平台

Genie Envisioner - Jiyuan's open-source general-purpose robotics platform with Beihang and others

Genie Envisioner (GE) is a unified platform for robot operation developed by the Genie Robotics team in collaboration with the National University of Singapore, Beijing University of Aeronautics and Astronautics and other organizations. It allows robots to better understand and perform tasks by "imagining first, then acting".

Latest AI Resources

8mos ago

044.8K

DINOv3 - Meta AI推出的新一代自监督视觉基础模型

DINOv3 - Next Generation Self-Supervised Vision Base Model from Meta AI

DINOv3 is a next-generation self-supervised vision base model from Meta AI, which adopts a self-supervised learning paradigm to learn image features without labeling data. It solves the feature degradation problem by improving data preparation and introducing Gram anchoring, and improves the generalization...

Latest AI Resources

8mos ago

054.9K

Mureka V7.5 - 昆仑万维推出的先进AI音乐创作模型

Mureka V7.5 - Advanced AI Music Creation Model from Quintessence

Mureka V7.5 is a state-of-the-art AI music generation model from Kunlun World Wide, focusing on Chinese songwriting. The model can accurately reproduce tones and playing techniques to generate natural, smooth and emotional vocals. Based on optimized automatic speech recognition (ASR) technology, Mureka V...

Latest AI Resources

8mos ago

044.2K

Skywork Deep Research Agent v2 - 昆仑万维推出的深度研究智能体升级版

Skywork Deep Research Agent v2 - An Upgraded Version of Deep Research Intelligence from Kunlun

Skywork Deep Research Agent v2 is a deep research intelligent body launched by Kunlun Wave, focusing on the integration and analysis of multimodal information.Skywork Deep Research Agent v2 can process text, graph...

Latest AI Resources

8mos ago

043.6K

Hunyuan-GameCraft - 腾讯混元开源的下一代游戏交互式视频生成框架

Hunyuan-GameCraft - Tencent Hunyuan's open source framework for generating interactive video for next-generation games.

Hunyuan-GameCraft is Tencent Hunyuan team open source interactive game video generation framework. Framework from a single picture and prompts to generate highly dynamic game video , support the user through the keyboard and mouse to control the video content in real time .

Latest AI Resources

8mos ago

047.6K

Skywork UniPic 2.0 - 昆仑万维开源的高效多模态模型

Skywork UniPic 2.0 - Open Source Efficient Multi-Modal Modeling by KunlunWanwei

Skywork UniPic 2.0 is an efficient multimodal model open-sourced by KunlunWei, focusing on image generation, editing and understanding. The model is based on a 2B-parameter SD3.5-Medium architecture, which is realized through pre-training, progressive dual-task reinforcement strategies and co-training...

Latest AI Resources

8mos ago

044.8K

RynnRCP - 阿里达摩院推出的首个开源机器人上下文协议

RynnRCP - First Open Source Robotics Context Protocol from Ali Dharma Institute

RynnRCP is an open source Robot Context Protocol (RCP) from Ali Dharma Institute that lowers the threshold for development of embodied intelligence and opens up the entire development process.RynnRCP consists of the RCP framework and the RobotMotion module.The RCP framework, through capability abstraction and multi-protocol support, will...

Latest AI Resources

8mos ago

049.7K

RynnEC - 阿里达摩院开源的世界理解模型

RynnEC - Ali Dharma Institute's open source world understanding model

RynnEC is a world understanding model introduced by Alibaba Dharma Institute, focusing on embodied intelligence tasks. The model is based on multimodal fusion technology, combining video data and natural language, and can parse objects in a scene from multiple dimensions, supporting functions such as object understanding, spatial perception and video target segmentation.

Latest AI Resources

8mos ago

050.2K

Matrix-3D - 昆仑万维开源的3D世界生成框架

Matrix-3D - Kunlun World Wide open source 3D world generation framework

Matrix-3D is an open source framework from Skywork AI team, focusing on generating explorable panoramic 3D worlds. The framework combines panoramic video generation and 3D reconstruction techniques to generate high-quality, omni-directional explorable 3D worlds from a single image or text prompt...

Latest AI Resources

8mos ago

051.3K

GLM-4.5V - 智谱推出的多模态开源视觉推理模型

GLM-4.5V - Multimodal Open Source Visual Reasoning Model by Smart Spectrum

GLM-4.5V is the world's leading open source visual inference model introduced by Smart Spectrum, with 106 billion total parameters and 12 billion activated parameters. The model is trained based on the new generation text base model GLM-4.5-Air, with powerful visual understanding and reasoning capabilities, capable of handling images, video...

Latest AI Resources

8mos ago

050.3K

Matrix-Game 2.0 - 昆仑万维开源自研的交互式世界模型

Matrix-Game 2.0 - Interactive World Model developed by KunlunWanwei

Matrix-Game 2.0 is a self-developed interactive world model released by Kunlun SkyWork AI. Matrix-Game 2.0 is the industry's first open-source, real-time, long-sequence interactive generation model for general-purpose scenarios. The model is able to run at 25 FPS through a visually-driven interaction scheme in multiple...

Latest AI Resources

8mos ago

050.2K

Baichuan-M2 - 百川智能推出开源的医疗增强大模型

Baichuan-M2 - Baichuan Intelligence Launches Open Source Healthcare Enhanced Big Model

Baichuan-M2 is an open source medical augmented large model launched by Baichuan Intelligence. It performs well in the medical field, especially in the HealthBench review with a score of 60.1, surpassing OpenAI's gpt-oss120b and many other open source models, becoming a global...

Latest AI Resources

8mos ago

050.5K

Qwen-Flash - 通义千问推出的高性能、低成本语言模型

Qwen-Flash - A high-performance, low-cost language model from Tongyi Chien-quan

Qwen-Flash is a high-performance, low-cost language model introduced in the Alibaba Tongyi Thousand Questions series, designed for fast response and efficient processing of simple tasks. Based on the advanced Mixture-of-Experts (MoE) architecture, it is realized by sparse expert network...

Latest AI Resources

8mos ago

046.4K

SkyReels-A3 - 昆仑万维推出的音频驱动数字人创作工具

SkyReels-A3 - Audio-Driven Digital Human Creation Tool from KunlunWangwei

SkyReels-A3 is an audio-driven digital human creation tool from Kunlun World Wide Group. SkyReels-A3 is an audio-driven digital human creation tool, which can generate high-quality dynamic video content through simple inputs (e.g., portrait images and voice), make static photos "come alive", and replace lines for existing videos with new lip-syncs that the characters will automatically...

Latest AI Resources

8mos ago

041.9K

通用人工智能 AGI（Artificial General Intelligence）是什么，一文看懂

General Artificial Intelligence (AGI) What is AGI (Artificial General Intelligence) in one article?

Generalized Artificial Intelligence (AGI) is intelligent systems that can understand, learn, reason, adapt and create like or even beyond humans on any cognitive task.

7mos ago

043.4K

MiniMax Speech 2.5 - MiniMax推出的语音生成模型

MiniMax Speech 2.5 - Speech Generation Model from MiniMax

MiniMax Speech 2.5 is an advanced speech generation model developed by MiniMax team. It has made significant progress in the field of speech synthesis, especially in multilingual expressiveness, timbre reproduction accuracy and language coverage. The model supports 40 languages...

Latest AI Resources

8mos ago

049.1K

GPT-5 - OpenAI推出的最强语言模型，统一智能系统

GPT-5 - The Strongest Language Model Introduced by OpenAI, Unified Intelligence System

GPT-5 is the latest language model released by OpenAI with several upgrades. It is a unified intelligence system with a built-in real-time router that automatically switches between efficient and deep thinking modes according to the complexity of the problem, realizing fast response and accurate answers.GPT-5 has several versions, including the one for general...

Latest AI Resources

8mos ago

046.8K

dots.vlm1 - 小红书hi lab开源的多模态大模型

dots.vlm1 - Small red book hi lab open source multimodal big model

dots.vlm1 is the first multimodal big model open-sourced by Little Red Book hi lab. Based on NaViT, a 1.2 billion parameter visual encoder trained from scratch, and DeepSeek V3 Large Language Model (LLM), it has powerful visual perception and text inference...

Latest AI Resources

8mos ago

045.8K

Genie 3 - 谷歌推出的通用世界模型

Genie 3 - A Universal World Model from Google

Genie 3 is a next-generation universal world model from Google DeepMind that enables the generation of highly dynamic and coherent virtual worlds in real time.Genie 3 simulates physical phenomena, natural ecosystems, and supports the creation of fantasy and historical scenarios. With text prompts, users can...

Latest AI Resources

8mos ago

044.9K

Claude Opus 4.1 - Anthropic推出的最强编程模型

Claude Opus 4.1 - The Most Powerful Programming Model from Anthropic

Claude Opus 4.1 is a state-of-the-art large-scale language model from Anthropic, designed for efficient processing of complex tasks. The model excels in the programming domain, generating high-quality code, supporting up to 32k of single output, and adapting to a wide range of programming styles...

Latest AI Resources

8mos ago

044.6K

gpt-oss - OpenAI推出的开源推理模型系列

gpt-oss - a family of open source inference models from OpenAI

gpt-oss is a family of open source inference models from OpenAI that enable efficient, flexible, and easy-to-deploy AI solutions for developers. gpt-oss consists of two versions, gpt-oss-120B with 117 billion parameters and support for 8...

Latest AI Resources

8mos ago

042.6K

MiDashengLM - 小米开源的声音理解模型

MiDashengLM - Xiaomi's open source sound understanding model

MiDashengLM is Xiaomi's open source large model for efficient sound understanding, with specific parameter version MiDashengLM-7B , focusing on audio processing and understanding. The model is based on Xiaomi Dasheng audio encoder and Qwen2.5-Omn...

Latest AI Resources

8mos ago

044.6K

MOSS-TTSD - 清华实验室开源的双语对话语音生成模型

MOSS-TTSD - Tsinghua Lab's open source speech generation model for bilingual dialogs

MOSS-TTSD is an open source spoken dialog speech generation model developed by the Speech and Language Laboratory of Tsinghua University. MOSS-TTSD can convert text dialog scripts into natural, smooth and expressive conversational speech, and supports bilingual generation in English and Chinese.

Latest AI Resources

8mos ago

047.7K

可解释性人工智能（Explainable AI）是什么，一文看懂

What Explainable AI (AI) is, in one article

Explainable AI (XAI) is an overarching set of programs that encompasses concepts methods technologies and governance frameworks.

7mos ago

037.5K

AudioGen-Omni - 快手推出的多模态音频生成模型

AudioGen-Omni - Multimodal Audio Generation Model from Racer

AudioGen-Omni is a multimodal audio generation model from Racer that generates high-quality audio, speech, and songs based on inputs such as video, text, etc.AudioGen-Omni is based on advanced techniques such as multimodal diffusionTransformer and phase-aligned...

Latest AI Resources

8mos ago

047.1K

LangExtract - 谷歌开源的Python库，提取结构化信息

LangExtract - Google's open source Python library to extract structured information

LangExtract is a Google Open Source Python library that uses large language models (LLMs) to extract structured information from unstructured text. With user-defined commands and a handful of examples, it can efficiently identify and organize key details, such as clinical notes from...

Latest AI Resources

8mos ago

052.3K

Qwen-Image - 通义千问推出开源的文生图基础模型

Qwen-Image - Tongyi Qianqian Launches Open Source Basic Model of Qwen-Image

Qwen-Image is an open source image generation base model released by Alibaba Tongyi Qianqian team. With 20 billion parameters, it adopts the Multimodal Diffusion Transformer Architecture (MMDiT), which integrates three modules: multimodal understanding, high-resolution coding and diffusion modeling.Qwen-Image's...

Latest AI Resources

8mos ago

046.8K