Can't find AI tools? Try here!

Just type in the keyword Accessibility Bing SearchYou can quickly find all the AI tools on this site.

SongBloom - 腾讯联合港中文、南大开源的歌曲生成模型

SongBloom - Tencent's open source song generation model with HKCNU and NTU.

SongBloom is an open source song generation model developed by Tencent AI Lab in collaboration with The Chinese University of Hong Kong (Shenzhen) and Nanjing University, which solves the problem of "plasticity" in AI music generation, and realizes high-quality, structurally complete song generation. Simply enter 10 seconds of reference audio and corresponding lyrics, and you can...
6mos ago
036K
SAIL-VL2 - 字节跳动开源的多模态视觉语言模型

SAIL-VL2 - ByteHop's open source multimodal visual language model

SAIL-VL2 is an open source multimodal visual language model by the Byte Jump team, focusing on joint modeling of multimodal inputs such as images and text. Using the sparse mixture of experts (MoE) architecture and progressive training strategy, it achieves high performance at parameter scales from 2B to 8B, especially in the areas of graphic comprehension, math...
6mos ago
027.1K
决策树(Decision Tree)是什么,一文看懂

Decision Tree (Decision Tree) is what, an article to see and understand

Decision Tree (DT) is a tree-shaped predictive model that simulates the human decision-making process, classifying or predicting data through a series of rules. Each internal node represents a feature test, branches correspond to test results, and leaf nodes store the final decision. This algorithm uses a divide-and-conquer strategy...
6mos ago
029.7K
MineContext - 字节开源的主动式上下文感知AI伙伴

MineContext - Bytes Open Source Active Context-Aware AI Partner

MineContext is an active context-aware AI partner open-sourced by the ByteDance Viking team to help users efficiently manage massive amounts of information and improve the efficiency of knowledge work. Over the screenshot and content understanding technology, automatically record the user's daily operations (such as browsing the web, editing documents, etc.), support...
6mos ago
048K
吴恩达的《Agentic AI》最新智能体免费课程

Free Course on the Latest Intelligentsia from Agentic AI by Ernest Ng

Agentic AI is the newest course on intelligent bodies launched by Ernest Ng.The course focuses on the design and construction of intelligent bodies, covering the four major design patterns of reflection, tool use, planning, and multi-intelligent body collaboration. Learners will master how to make intelligent bodies check outputs, autonomously adjust through theoretical explanations and code practice...
6mos ago
053.8K
正则化(Regularization)是什么,一文看懂

Regularization (Regularization) is what, an article to see and understand

Regularization is a core technique in machine learning and statistics to prevent model overfitting. Regularization controls the degree of fitting by adding a penalty term to the objective function that is related to the complexity of the model. Common forms include L1 and L2 regularization: the L1 produces sparse solutions and applies...
6mos ago
031.8K
聆音EchoCare - 香港科学院开源的超声基座大模型

EchoCare - Hong Kong Academy of Sciences open source ultrasound base large model

EchoCare is a large model of ultrasound base developed by the Center for Artificial Intelligence and Robotics Innovation (CAIR) at the Hong Kong Institute of Innovation and Research of the Chinese Academy of Sciences (CAS), trained based on the world's largest ultrasound image dataset (more than 4.5 million images), covering multi-center, multi-region, multi-ethnicity, and more than 50 individuals...
6mos ago
033.1K
RoboBrain-X0 - 智源研究院开源的零样本跨本体泛化具身模型

RoboBrain-X0 - Wisdom Source Research Institute open source zero-sample cross ontology generalized embodiment model

RoboBrain-X0 is the world's first open source embodied model that supports zero-sample cross-ontology generalization open-sourced by Wisdom Source Research Institute, which is of great industrial significance. It can drive multiple real robots of different configurations to complete basic operation tasks without fine-tuning, and after a small amount of sample fine-tuning, it demonstrates the ability to replicate ...
6mos ago
034.1K
模型微调(Fine-tuning)是什么,一文看懂

What is Fine-tuning, in one article?

Model fine-tuning (Fine-tuning) is a specific implementation of transfer learning in machine learning. The core process is based on pre-trained models, which utilize large-scale datasets to learn generic patterns and develop extensive feature extraction capabilities. The fine-tuning phase then introduces task-specific datasets to ...
6mos ago
034.4K
Transformer 架构(Transformer Architecture)是什么,一文看懂

What is the Transformer Architecture in one article?

The Transformer architecture is a deep learning model designed for processing sequence-to-sequence tasks such as machine translation or text summarization. The core innovation is the complete reliance on self-attention mechanisms, eschewing traditional loops or convolutional structures. Allowing the model to process all elements of a sequence in parallel, large...
6mos ago
038.8K
大语言模型(Large Language Model)是什么,一文看懂

What is the Large Language Model (LLM) in one article?

Large Language Model (LLM) is a deep learning system trained on massive text data, with the Transformer architecture at its core. The self-attention mechanism of this architecture can effectively capture long-distance dependencies in language. The model's "large ...
6mos ago
037.8K
CWM - Meta FAIR开源的代码世界语言模型

CWM - Meta FAIR open source code world language model

CWM (Code World Model) is a 32-billion-parameter open-source world language model released by the Meta FAIR team, designed for code generation and reasoning. Introducing the concept of "world model", it can simulate the code execution process, predict the variable state changes, and advance...
6mos ago
034.9K
Neovate Code - 蚂蚁开源的智能编程助手

Neovate Code - Ant Open Source's Intelligent Programming Assistant

Neovate Code is an open source intelligent programming assistant from Ant Group's Alipay Experience Technology Department, which improves development efficiency through artificial intelligence technology. With conversational development features, developers can describe the requirements through natural language, Neovate Code can understand and generate the corresponding generation...
6mos ago
038.7K
Qwen3Guard - 阿里Qwen开源的安全模型

Qwen3Guard - Ali Qwen open source security model

Qwen3Guard is a fine-tuned security protection model based on the Qwen3 base model, designed for security detection. It provides accurate security categorization of prompts and responses, provides risk levels, and supports English, Chinese, and multi-language environments.Qwen3Guard comes with two pro...
6mos ago
043.3K
Qwen3-TTS-Flash - 阿里通义推出的语音合成模型

Qwen3-TTS-Flash - Speech Synthesis Models by Ali Tongyi

Qwen3-TTS-Flash is an advanced speech synthesis model introduced by Ali Tongyi, supporting 17 tones and 10 languages, covering Mandarin, English, dialects, etc. It has excellent stability and high expressiveness of Chinese and English speech, and the model can automatically adjust the tone of voice to make it more vivid.
7mos ago
053K
InternVLA-A1 - 上海AI Lab开源一体化操作能力的具身大模型

InternVLA-A1 - Shanghai AI Lab Open Source Integration of Operational Capabilities for Embodied Large Models

InternVLA-A1 is a large model of embodied operation open-sourced by Shanghai Artificial Intelligence Laboratory. It has the ability to understand, imagine, and execute the integration, and can accurately complete the task. The model fuses real and simulated operational data, and automates the construction of massive multimodal through large-scale virtual-real hybrid scene assets...
7mos ago
041.2K
VoxCPM - 面壁智能联合清华开源的端到端TTS模型

VoxCPM - Faceted Intelligence and Tsinghua Open Source End-to-End TTS Model

VoxCPM is a speech generation model jointly open-sourced by Facade Intelligence and Shenzhen International Graduate School of Tsinghua University.VoxCPM adopts an end-to-end diffusion autoregressive architecture to generate continuous speech representations directly from text, breaking through the limitations of traditional discrete disambiguation. Through hierarchical language modeling and finite state quantization...
7mos ago
045.1K
InternVLA·N1 - 上海AI Lab开源的端到端双系统导航大模型

InternVLA-N1 - Shanghai AI Lab Open Source End-to-End Dual System Navigation Large Model

InternVLA-N1 is an open source end-to-end dual-system navigation macromodel from Shanghai Artificial Intelligence Laboratory. Using a dual-system architecture, System 2 is responsible for understanding linguistic commands and planning long-range paths, while System 1 focuses on high-frequency response and agile obstacle avoidance. The model is trained entirely based on synthetic data through large-scale digital ...
7mos ago
040.7K
VLAC - 上海AI Lab开源的具身奖励大模型

VLAC - Shanghai AI Lab's Open Source Large Model of Embodied Reward

VLAC is an open source embodied reward macromodel from Shanghai Artificial Intelligence Laboratory. Based on InternVL multimodal macromodel, it integrates Internet video data and robot operation data to provide process reward and task completion estimation for robot reinforcement learning in the real world.VLAC can effectively ...
7mos ago
033.5K
浙江大学免费PDF资料《大模型基础》 - 附下载链接

Free PDF of Fundamentals of Large Models from Zhejiang University - with download link

Fundamentals of Large Models provides an in-depth analysis of the core technologies and practical paths of Large Language Models (LLMs). Starting from the fundamental theory of language modeling, it systematically explains the principles of model design based on statistics, recurrent neural networks (RNN), and Transformer architecture, focusing on the three major big language model...
7mos ago
044K
循环神经网络(Recurrent Neural Network)是什么,一文看懂

What is Recurrent Neural Network (RNN) in one article?

Recurrent Neural Network (RNN) is a neural network architecture designed for processing sequential data. Sequential data refers to a collection of data with temporal order or dependencies, such as linguistic text, speech signals, or time series.
7mos ago
040.7K
MobiAgent - 上海交大开源的移动端智能体全栈构建框架

MobiAgent - Shanghai Jiaotong University open source mobile intelligent body full-stack building framework

MobiAgent is an open source mobile intelligent body toolchain from IPADS Lab of Shanghai Jiaotong University, which helps users to build their own mobile intelligent assistants. By recording the user's operation trajectory and generating high-quality data, it trains an intelligent body that can understand natural language commands. Core features include efficient...
7mos ago
038.2K
Youtu-GraphRAG - 腾讯优图实验室开源的图检索增强生成框架

Youtu-GraphRAG - Tencent Youtu Labs Open Source Graph Retrieval Augmentation Generation Framework

Youtu-GraphRAG is an open source graph retrieval augmentation generation framework from Tencent's Youtu Labs to help large language models handle complex Q&A tasks more accurately. By constructing a four-layer knowledge tree, the knowledge is disassembled into four levels of attributes, relationships, keywords and communities to realize the self-directed performance of cross-domain knowledge...
7mos ago
039.5K
MiniMax Music 1.5 - MiniMax最新推出的AI音乐生成模型

MiniMax Music 1.5 - MiniMax's latest AI music generation model

MiniMax Music 1.5 is an advanced AI music generation tool that supports generating up to 4 minutes of music based on users' natural language descriptions. The model supports a variety of music styles and mood customization, generating a natural and full vocal color, smooth transitions, richly layered arrangements...
7mos ago
039.4K
人工智能安全(AI Safety)是什么,一文看懂

What is Artificial Intelligence Safety (AI Safety), in one article

Artificial Intelligence Safety (AI Safety) is the cutting-edge interdisciplinary field of ensuring that AI systems, especially those that are increasingly powerful and autonomous, act reliably and predictably throughout their lifecycle in accordance with human intent, without harmful consequences.
7mos ago
035.4K
自监督学习(Self-Supervised Learning)是什么,一文看懂

What is Self-Supervised Learning (SSL) in one article?

Self-Supervised Learning (SSL) is an emerging learning paradigm in the field of machine learning, where the core idea is to automatically generate supervised signals from unlabeled data and train models to learn useful representations of the data.
7mos ago
036.2K