Featured AI Tools List | page 4

Kandinsky 5.0 - Russian AI Team's Open Source Video Generation Model Series

Kandinsky 5.0 is the latest video generation model series developed by Russian AI team, focusing on lightweight design and high performance performance. The first model in the series, Kandinsky 5.0 Video Lite, has only 2 billion parameters but surpasses similar 14B models, especially...

Latest AI Resources

6mos ago

045.1K

SongBloom - Tencent's open source song generation model with HKCNU and NTU.

SongBloom is an open source song generation model developed by Tencent AI Lab in collaboration with The Chinese University of Hong Kong (Shenzhen) and Nanjing University, which solves the problem of "plasticity" in AI music generation, and realizes high-quality, structurally complete song generation. Simply enter 10 seconds of reference audio and corresponding lyrics, and you can...

Latest AI Resources

6mos ago

036K

Pyscn - Free AI code quality analysis tool open-sourced specifically for Python developers

Pyscn is an intelligent code quality analysis tool designed for Python developers to detect potential problems in code to improve maintainability. It analyzes dead code through control flow diagrams, identifies duplicate code using APTED+LSH algorithm, calculates metrics such as module coupling and circle complexity...

Latest AI Resources

6mos ago

028.7K

Youtu-Embedding - Tencent Youtu open source generalized text representation model

Youtu-Embedding is a generalized text representation model open-sourced by Tencent's Youtu Lab, designed for enterprise-level applications. Through deep neural networks to map the text to a high-dimensional vector space, so that semantically similar sentences are closer in that space, to achieve accurate semantic retrieval.

Latest AI Resources

6mos ago

034K

SAIL-VL2 - ByteHop's open source multimodal visual language model

SAIL-VL2 is an open source multimodal visual language model by the Byte Jump team, focusing on joint modeling of multimodal inputs such as images and text. Using the sparse mixture of experts (MoE) architecture and progressive training strategy, it achieves high performance at parameter scales from 2B to 8B, especially in the areas of graphic comprehension, math...

Latest AI Resources

6mos ago

027.1K

Hyperparameter (Hyperparameter) is what, an article to see and understand

In machine learning, a hyperparameter is a configuration option that is preset manually before model training begins, rather than learned from data. The central role is to control the learning process itself, as if setting a set of operating rules for the algorithm. For example, the learning...

AI Answers

6mos ago

031.4K

Decision Tree (Decision Tree) is what, an article to see and understand

Decision Tree (DT) is a tree-shaped predictive model that simulates the human decision-making process, classifying or predicting data through a series of rules. Each internal node represents a feature test, branches correspond to test results, and leaf nodes store the final decision. This algorithm uses a divide-and-conquer strategy...

AI Answers

6mos ago

029.7K

What is Gradient Descent (Gradient Descent), an article to read and understand

Gradient Descent is the core optimization algorithm for solving function minimization. The algorithm determines the direction of descent by calculating the gradient of the function (the vector consisting of the partial derivatives of each), and iteratively updating the parameters according to the rule θ = θ - η - ∇J(θ).

AI Answers

6mos ago

030.4K

MineContext - Bytes Open Source Active Context-Aware AI Partner

MineContext is an active context-aware AI partner open-sourced by the ByteDance Viking team to help users efficiently manage massive amounts of information and improve the efficiency of knowledge work. Over the screenshot and content understanding technology, automatically record the user's daily operations (such as browsing the web, editing documents, etc.), support...

Latest AI Resources

6mos ago

048K

nanochat - Karpathy's free and open source low-cost model training program

nanochat is an open source project released by AI legend and former Tesla AI Director Andrej Karpathy that allows individuals to quickly train a small ChatGPT-like language model at a very low cost and simplicity. The entire project uses only about 800...

Latest AI Resources

6mos ago

033.6K

LLaVA-OneVision-1.5 - 免费开源的多模态模型，高性能多模态理解

LLaVA-OneVision-1.5 - Free and open source multimodal modeling, high performance multimodal understanding

LLaVA-OneVision-1.5 is an open-source multimodal model by the EvolvingLMMS-Lab team, using 8B parameter scale, through a compact three-phase training process (language-image alignment, conceptual equalization and knowledge injection, and instruction fine-tuning) on 128 A800...

Latest AI Resources

6mos ago

032K

What is Logistic Regression (Logistic Regression), an article to read and understand

Logistic Regression is a statistical learning method used to solve binary classification problems. The central goal is to predict the probability that a sample belongs to a particular category based on input features. The model maps the linear output to between 0 and 1 by linearly combining the eigenvalues using an S-shaped function...

AI Answers

6mos ago

028.1K

Paper2Video - NUS open source project to automatically generate demo videos for academic papers

Paper2Video is an open-source presentation video project for automatic generation of academic papers by Show Lab at National University of Singapore. Using the PaperTalker multi-intelligence framework, papers are transformed into full presentation videos containing slides, subtitles, voiceover and speaker avatar...

Latest AI Resources

6mos ago

034.3K

NeuTTS Air - Free and Lightweight Speech Synthesis Model with Offline CPU Running Support

NeuTTS Air is open source lightweight speech synthesis model, developed by Neuphonic team, which can run in real time on local devices (e.g. cell phones, laptops, Raspberry Pi) without relying on the cloud. Using 0.5B parameter Qwen architecture and self-developed NeuCodec codec...

Latest AI Resources

6mos ago

040.1K

KAT-Dev-72B-Exp - Racer open source free programming-specific models

KAT-Dev-72B-Exp is an open-source programming-specific large language model launched by the Racer team, optimized based on reinforcement learning technology, which achieved an accuracy rate of 74.6% in the SWE-Bench Verified benchmark test, the best performance of any open-source model at present. The model uses innovative...

Latest AI Resources

6mos ago

031.4K

Jamba Reasoning 3B - 以色列AI21 Labs开源的轻量级推理模型

Jamba Reasoning 3B - Israel AI21 Labs open source lightweight reasoning model

Jamba Reasoning 3B is a lightweight inference model open-sourced by Israeli AI startup AI21 Labs with strong performance and potential for a wide range of applications. It utilizes a hybrid SSM-Transformer architecture that combines Trans...

Latest AI Resources

6mos ago

028.8K

Free Course on the Latest Intelligentsia from Agentic AI by Ernest Ng

Agentic AI is the newest course on intelligent bodies launched by Ernest Ng.The course focuses on the design and construction of intelligent bodies, covering the four major design patterns of reflection, tool use, planning, and multi-intelligent body collaboration. Learners will master how to make intelligent bodies check outputs, autonomously adjust through theoretical explanations and code practice...

Latest AI Resources Course materials

6mos ago

053.8K

OpenAgents - Open Source Free Open Collaboration Project for Building AI Agent Networks

OpenAgents is the open source project that creates a network of AI agents and facilitates open collaboration between agents. A basic network infrastructure is provided to enable AI agents to seamlessly connect and collaborate. Users can quickly start their own agent network, extend functionality through a modular architecture, support...

Latest AI Resources

6mos ago

030.7K

Androidify - Google open sources free resources on how to build AI apps on Android

Androidify is Google's open source project to help developers learn how to build AI-driven apps on Android. The project uses Google's latest technologies such as Jetpack Compose, Gemini API (via Fire...

Latest AI Resources

6mos ago

032K

Regularization (Regularization) is what, an article to see and understand

Regularization is a core technique in machine learning and statistics to prevent model overfitting. Regularization controls the degree of fitting by adding a penalty term to the objective function that is related to the complexity of the model. Common forms include L1 and L2 regularization: the L1 produces sparse solutions and applies...

AI Answers

6mos ago

031.8K

生成对抗网络（Generative Adversarial Network）是什么，一文看懂

What is Generative Adversarial Network (GAN) in one article?

Generative Adversarial Network (GAN) is a deep learning model proposed by Ian Goodfellow et al. in 2014. The framework implements generative modeling by training two neural networks against each other...

AI Answers

6mos ago

031.2K

Ling-1T - Ant Group's open source universal language model for trillions of parameters

Ling-1T is a trillion-parameter general-purpose language model open-sourced by Ant Group, which belongs to the flagship product of the Ling 2.0 series of Bering's large models. The model adopts a highly efficient MoE architecture, supports 128K context windows, and surpasses GPT in 7 benchmarks including code generation, mathematical reasoning, and logic test...

Latest AI Resources

6mos ago

056.6K

EchoCare - Hong Kong Academy of Sciences open source ultrasound base large model

EchoCare is a large model of ultrasound base developed by the Center for Artificial Intelligence and Robotics Innovation (CAIR) at the Hong Kong Institute of Innovation and Research of the Chinese Academy of Sciences (CAS), trained based on the world's largest ultrasound image dataset (more than 4.5 million images), covering multi-center, multi-region, multi-ethnicity, and more than 50 individuals...

Latest AI Resources

6mos ago

033.1K

Self-Attention (Self-Attention) is what, an article to read and understand

Self-Attention is a key mechanism in deep learning, originally proposed and widely used in the Transformer architecture. The core idea is to allow the model to simultaneously attend to all positions in the input sequence, and compute each position by weighted aggregation of...

AI Answers

6mos ago

041.2K

What is Multi-Task Learning (MTL) in one article?

Multi-Task Learning (MTL) is not an isolated algorithm, but an intelligent machine learning paradigm.

AI Answers

6mos ago

033K

Code2Video - Show Lab open source AI teaching video generation framework

Code2Video is innovative open source project that automatically converts code snippets into high quality video content (mp4 format). The project through a unique code-centric paradigm , the use of carbon-now-cli tools to generate code into beautiful images , the use of ffmpeg will be these ...

Latest AI Resources

6mos ago

037.9K

SceneGen - Shanghai Jiaotong University open source single image to generate 3D scene framework

SceneGen is an open source method for generating 3D scenes from a single image at Shanghai Jiao Tong University. From a single scene image and a target resource mask, a complete scene containing multiple 3D resources is efficiently generated, including the geometric structure of the resources, texture and relative spatial location.

Latest AI Resources

6mos ago

029.2K

Ming-UniAudio - Ant open source unified audio multimodal generation model

Ming-UniAudio is Ant Group's open source unified audio multimodal generation model that supports mixed input and output of text, audio, image and video. Using multi-scale Transformer and hybrid expert (MoE) architecture , through modality-aware routing mechanism to efficiently handle cross-modal ...

Latest AI Resources

6mos ago

035.7K

AIMangaStudio - Free AI manga authoring tool with complete authoring flow

AIMangaStudio is a free AI manga creation tool that provides creators with a complete manga creation pipeline, including plot generation, sub-scene design, character setting and other functions, which can simplify the production process from script to manga page. It supports natural language generation of comic scripts, including plot, dialog...

Latest AI Resources

6mos ago

042.5K

FireRedChat - Little Red Book's open source full-duplex voice interaction system

FireRedChat is an open source full-duplex voice interaction system for Xiaohongshu with real-time bidirectional dialog capabilities and support for controlled interruptions. Adopts a modular design , including transcription control module , interaction module and dialogue manager , etc., supports cascade and semi-cascade architecture , can be flexibly deployed .

Latest AI Resources

6mos ago

042.6K

Logics-Parsing - Ali open source document parsing model

Logics-Parsing is an open source Ali end-to-end document parsing model , based on Qwen2.5-VL-7B. Optimize document layout analysis and reading order inference through reinforcement learning , PDF images can be converted to structured HTML output to support a variety of content ...

Latest AI Resources

6mos ago

041.1K

Ring-1T-preview - Ant Group's open-source trillion-parameter macromodel

Ring-1T-preview is an open source trillion-parameter big model of Ant Group, based on Ling 2.0 MoE architecture, pre-trained on 20T corpus, and trained in reasoning ability by self-developed reinforcement learning system ASystem. In natural language reasoning ...

Latest AI Resources

6mos ago

048.8K

RoboBrain-X0 - Wisdom Source Research Institute open source zero-sample cross ontology generalized embodiment model

RoboBrain-X0 is the world's first open source embodied model that supports zero-sample cross-ontology generalization open-sourced by Wisdom Source Research Institute, which is of great industrial significance. It can drive multiple real robots of different configurations to complete basic operation tasks without fine-tuning, and after a small amount of sample fine-tuning, it demonstrates the ability to replicate ...

Latest AI Resources

6mos ago

034.1K

Diffusion Model (Diffusion Model) what is it, an article to read and understand

Diffusion Model (Diffusion Model) is a generative model specialized for creating new data samples such as images, audio or text. The core of the model is inspired by the process of diffusion in physics, which simulates the natural diffusion of particles from a region of high concentration to a region of low concentration. In the machine...

AI Answers

6mos ago

042.4K

What is Fine-tuning, in one article?

Model fine-tuning (Fine-tuning) is a specific implementation of transfer learning in machine learning. The core process is based on pre-trained models, which utilize large-scale datasets to learn generic patterns and develop extensive feature extraction capabilities. The fine-tuning phase then introduces task-specific datasets to ...

AI Answers

6mos ago

034.4K

Lynx - ByteHop's open source high-fidelity video generation model

Lynx is a high-fidelity personalized video generation model open-sourced by ByteDance that can generate identity-consistent videos with only a single portrait photo. Built on the diffusion Transformer (DiT) base model , the introduction of ID-adapter and Ref-adapte...

Latest AI Resources

6mos ago

036.4K

Claude Sonnet 4.5 - Anthropic推出的最强AI编程模型

Claude Sonnet 4.5 - The Most Powerful AI Programming Model from Anthropic

Claude Sonnet 4.5 is an artificial intelligence model from Anthropic designed for programming, computer operations, and complex task automation. The model excels in code generation, long-duration task processing, reasoning, and mathematical computation, supporting everything from initial planning...

Latest AI Resources

6mos ago

041.2K

DeepSeek-V3.2-Exp - DeepSeek最新开源的实验性AI模型

DeepSeek-V3.2-Exp - DeepSeek's latest open source experimental AI model

DeepSeek-V3.2-Exp is a DeepSeek open source experimental AI model that significantly improves the efficiency of long text processing by introducing the DeepSeek Sparse Attention (DSA) mechanism. The model is based on DeepSeek...

Latest AI Resources

6mos ago

037.8K

HunyuanImage 3.0 - Tencent open source free multimodal image generation model

HunyuanImage 3.0 (HunyuanImage 3.0) is a native multimodal image generation model released and open-sourced by Tencent. The model parameter size of 80B, is currently the best evaluation results, the largest number of parameters of the open source image generation model. Hybrid Image 3.0 supports real-time image generation, users can side...

Latest AI Resources

6mos ago

047.4K

Hunyuan3D-Part - Tencent open source free 3D components to generate models

Hunyuan3D-Part (Hybrid 3D-Part) is a 3D generation model released and open-sourced by Tencent. Composed of P3 - SAM and X - Part, it realizes high-precision and controllable component-based 3D generation for the first time, and supports 50 + components to be generated automatically. Users can first use...

Latest AI Resources

6mos ago

047.5K

AudioFly - KU Xunfei open source text generation sound AI models

AudioFly is KDDI open source AI model for text to generate sound effects. Based on the potential diffusion model architecture, with 1 billion parameters, trained on large-scale, diverse audio text datasets, covering AudioSet, AudioCaps, TUT and other public datasets and internal...

Latest AI Resources

6mos ago

041.6K

Hunyuan3D-Omni - Tencent Mixed-Year Open Source 3D Model Generation Framework

Hunyuan3D-Omni (Hybrid 3D-Omni) is an open source 3D asset generation framework by Tencent's Hybrid 3D team, which realizes accurate 3D model generation through multiple control signals. Based on Hunyuan3D 2.1 architecture, it introduces a unified control encoder that can handle point...

Latest AI Resources

6mos ago

045.3K

FLM-Audio - Wisdom Source and Nanyang Polytechnic Open Source Full-Duplex Audio Dialog Modeling

FLM-Audio is a native full-duplex audio dialog grand model released by Beijing Zhiyuan Artificial Intelligence Research Institute in conjunction with Spin Matrix and Nanyang Technological University of Singapore, supporting both Chinese and English. Adopting native full-duplex architecture, it can merge listening, speaking and monologue at each time step...

Latest AI Resources

6mos ago

038.7K

Attention Mechanism (Attention Mechanism) is what, an article to read and understand

Attention Mechanism (Attention Mechanism) is a computational technique that mimics human cognitive processes, initially applied in the field of machine translation, and later becoming an important part of deep learning.

AI Answers

6mos ago

040.6K

Transformer 架构（Transformer Architecture）是什么，一文看懂

What is the Transformer Architecture in one article?

The Transformer architecture is a deep learning model designed for processing sequence-to-sequence tasks such as machine translation or text summarization. The core innovation is the complete reliance on self-attention mechanisms, eschewing traditional loops or convolutional structures. Allowing the model to process all elements of a sequence in parallel, large...

AI Answers

6mos ago

038.8K

What is Pre-trained Model (Pre-trained Model), an article to read and understand

Pre-trained Model is a fundamental and powerful technique in the field of Artificial Intelligence, representing machine learning models that are pre-trained on large-scale datasets. Models form a broad knowledge base by processing massive amounts of information and learning generalized patterns and features from the data...

AI Answers

6mos ago

038.2K

What is the Large Language Model (LLM) in one article?

Large Language Model (LLM) is a deep learning system trained on massive text data, with the Transformer architecture at its core. The self-attention mechanism of this architecture can effectively capture long-distance dependencies in language. The model's "large ...

AI Answers

6mos ago

037.8K

What is Long Short-Term Memory (LSTM) network, an article to read and understand

Long Short-Term Memory (LSTM) is a recurrent neural network variant specialized in processing sequence data. In the field of artificial intelligence, sequence data is widely used in tasks such as time series prediction, natural language processing and speech recognition.

AI Answers

6mos ago

032.6K

CWM - Meta FAIR open source code world language model

CWM (Code World Model) is a 32-billion-parameter open-source world language model released by the Meta FAIR team, designed for code generation and reasoning. Introducing the concept of "world model", it can simulate the code execution process, predict the variable state changes, and advance...

Latest AI Resources

6mos ago

034.9K

Neovate Code - Ant Open Source's Intelligent Programming Assistant

Neovate Code is an open source intelligent programming assistant from Ant Group's Alipay Experience Technology Department, which improves development efficiency through artificial intelligence technology. With conversational development features, developers can describe the requirements through natural language, Neovate Code can understand and generate the corresponding generation...

Latest AI Resources

6mos ago

038.7K

Audio2Face - NVIDIA open source AI 3D facial animation generation model

Audio2Face is NVIDIA's open source AI tool capable of transforming audio input into realistic 3D facial animation. By analyzing speech features in the audio, such as phonemes and intonation, it generates precise lip synchronization and subtle emotional expressions to give vivid human expressions to virtual characters.

Latest AI Resources

6mos ago

040.3K

Qwen3-VL - AliCloud Tongyi Qianqian open source multimodal visual language big model

Qwen3-VL is an open source multimodal visual language large model by AliCloud Tongyi Qianqian team, the number of references reaches 235 billion, and the model file is about 471GB.Containing instruction version and thinking version, it adopts enhanced MRope interleaved layout, DeepStack and other technologies, which can effectively utilize the visual transform...

Latest AI Resources

6mos ago

052.7K

Qwen3Guard - Ali Qwen open source security model

Qwen3Guard is a fine-tuned security protection model based on the Qwen3 base model, designed for security detection. It provides accurate security categorization of prompts and responses, provides risk levels, and supports English, Chinese, and multi-language environments.Qwen3Guard comes with two pro...

Latest AI Resources

6mos ago

043.3K

Qwen3-TTS-Flash - Speech Synthesis Models by Ali Tongyi

Qwen3-TTS-Flash is an advanced speech synthesis model introduced by Ali Tongyi, supporting 17 tones and 10 languages, covering Mandarin, English, dialects, etc. It has excellent stability and high expressiveness of Chinese and English speech, and the model can automatically adjust the tone of voice to make it more vivid.

Latest AI Resources

7mos ago

053K

Qwen3-Omni - Omnimodal AI model launched by Ali Tongyi

Qwen3-Omni is a fully modal AI model introduced by the Ali Tongyi team that can handle multiple data types such as text, images, audio and video, and supports text interaction in 119 languages with low latency and high controllability.

Latest AI Resources

7mos ago

038.2K

DeepSeek-V3.1-Terminus - DeepSeek推出的最新版AI模型

DeepSeek-V3.1-Terminus - The latest version of the AI model introduced by DeepSeek

DeepSeek-V3.1-Terminus is an upgraded version of DeepSeek-V3.1, an artificial intelligence language model from the DeepSeek team. The model is optimized in terms of language consistency, code generation, and search capabilities to more accurately...

Latest AI Resources

7mos ago

036.1K

What is Federated Learning (FL) in one article?

Federated Learning (FL) is an innovative machine learning approach first proposed by a Google research team in 2016 to address challenges in data privacy and distributed computing.

AI Answers

7mos ago

037.7K

Granite-Docling-258M - IBM Open Source Visual Language Modeling

Granite-Docling-258M is an ultra-compact open source visual language model from IBM designed for efficient document conversion. The model converts documents into machine-readable formats while leaving layout, tables, formulas, and other elements intact.

Latest AI Resources

7mos ago

034.7K

Lucy Edit - open source AI video editing tool, natural language description editing

Lucy Edit is an open source AI video editing tool developed by Decart AI. Allows users to edit video through simple natural language descriptions, such as "change the character into a polar bear" or "turn the scene into a 2D cartoon style", without the need for complex fine-tuning or the use of masks ...

Latest AI Resources

7mos ago

043.9K

LongCat-Flash-Thinking - An Efficient Reasoning Model for Meituan Open Source

LongCat-Flash-Thinking is a highly efficient reasoning model released by the LongCat team at Mission LongCat that has become more powerful and specialized while maintaining the extreme speed of LongCat-Flash-Chat. The model is based on logic, math, code, intelligence...

Latest AI Resources

7mos ago

034.1K

Ling-V2 - The MoE Architecture Language Model Series of Ant Centurion Open Source

Ling-V2 is a family of large-scale language models based on the MoE architecture introduced by the Ant-Belling team. The first version, Ling-mini-2.0, has 16 billion total parameters, with only 1.4 billion parameters activated per input token.

Latest AI Resources

6mos ago

035.7K

Kronos - Tsinghua and Microsoft joint open source financial K chart base model

Kronos is the first K-line chart base model for financial markets jointly open-sourced by Tsinghua University and Microsoft Research Asia. It analyzes K-line data of stocks, cryptocurrencies and other assets, including opening, high, low, closing and volume, to predict future price movements.

Latest AI Resources

7mos ago

058.8K

Wan2.2-Animate - A Generative Model for Action Generation of the Tongyi Wanphase Open Source

Wan2.2-Animate is an open source action generation model , support for action imitation and role-playing mode . Users only need to input a character picture and a reference video , the model can migrate the video character's movements and expressions to the picture character , giving the picture character dynamic expression ...

Latest AI Resources

7mos ago

037K

Xiaomi-MiMo-Audio - Xiaomi Open Source's First Native End-to-End Speech Big Model

Xiaomi-MiMo-Audio is Xiaomi's open source 7-billion-parameter end-to-end speech macromodel with powerful features such as multi-language dialog, speech continuation, less-sample generalization, and audio understanding, which is able to reach the SOTA level in speech intelligence and audio understanding benchmarks, surpassing Google Gemi...

Latest AI Resources

7mos ago

040.6K

InternVLA-A1 - Shanghai AI Lab Open Source Integration of Operational Capabilities for Embodied Large Models

InternVLA-A1 is a large model of embodied operation open-sourced by Shanghai Artificial Intelligence Laboratory. It has the ability to understand, imagine, and execute the integration, and can accurately complete the task. The model fuses real and simulated operational data, and automates the construction of massive multimodal through large-scale virtual-real hybrid scene assets...

Latest AI Resources

7mos ago

041.2K

VoxCPM - Faceted Intelligence and Tsinghua Open Source End-to-End TTS Model

VoxCPM is a speech generation model jointly open-sourced by Facade Intelligence and Shenzhen International Graduate School of Tsinghua University.VoxCPM adopts an end-to-end diffusion autoregressive architecture to generate continuous speech representations directly from text, breaking through the limitations of traditional discrete disambiguation. Through hierarchical language modeling and finite state quantization...

Latest AI Resources

7mos ago

045.1K

InternVLA-N1 - Shanghai AI Lab Open Source End-to-End Dual System Navigation Large Model

InternVLA-N1 is an open source end-to-end dual-system navigation macromodel from Shanghai Artificial Intelligence Laboratory. Using a dual-system architecture, System 2 is responsible for understanding linguistic commands and planning long-range paths, while System 1 focuses on high-frequency response and agile obstacle avoidance. The model is trained entirely based on synthetic data through large-scale digital ...

Latest AI Resources

7mos ago

040.7K

WebWeaver - Ali Tongyi open source new dual-intelligence body framework

WebWeaver is a new dual-intelligence body framework introduced by Alibaba Tongyi team, which is mainly used in open deep research, and can simulate the human research process, which is divided into two intelligences: planning and writing.

Latest AI Resources

7mos ago

039.2K

MCP Registry - The official MCP server management platform from GitHub.

The MCP Registry is a centralized platform from GitHub that helps developers discover and install MCP servers more easily.The MCP Registry is here to help developers quickly find the AI tools they need in one place, greatly simplifying...

Latest AI Resources

7mos ago

037.9K

VLAC - Shanghai AI Lab's Open Source Large Model of Embodied Reward

VLAC is an open source embodied reward macromodel from Shanghai Artificial Intelligence Laboratory. Based on InternVL multimodal macromodel, it integrates Internet video data and robot operation data to provide process reward and task completion estimation for robot reinforcement learning in the real world.VLAC can effectively ...

Latest AI Resources

7mos ago

033.5K

Tongyi DeepResearch - Ali Tongyi Open Source Deep Research Intelligence Body

Tongyi DeepResearch (Tongyi DeepResearch) is an open source intelligent body launched by Alibaba, designed for deep information retrieval and complex task reasoning, with 30 billion parameters, supporting multiple reasoning modes, including ReAct mode and deep mode...

Latest AI Resources

7mos ago

042K

InternVLA-M1 - Shanghai AI Lab's Open Source Embodied Dual System Operation "Brain"

InternVLA-M1 is an open-source embodied operating "brain" of Shanghai Artificial Intelligence Laboratory, which is a large model of two-system operation oriented to instruction following. It builds a complete closed loop covering "think-act-learn" and is responsible for high-level spatial reasoning and task planning. The model adopts a two-phase training cur...

Latest AI Resources

7mos ago

033.2K

OpenAI's PDF Guide to Staying Ahead in the Age of AI - with Download Links

Staying ahead in the age of AI is an AI leadership guide from OpenAI that helps business leaders maintain a competitive edge in the age of AI. The guide points to the rapid growth of AI, with faster model releases, lower costs, and faster enterprise adoption...

Latest AI Resources Course materials

7mos ago

042.5K

Free PDF of Fundamentals of Large Models from Zhejiang University - with download link

Fundamentals of Large Models provides an in-depth analysis of the core technologies and practical paths of Large Language Models (LLMs). Starting from the fundamental theory of language modeling, it systematically explains the principles of model design based on statistics, recurrent neural networks (RNN), and Transformer architecture, focusing on the three major big language model...

Latest AI Resources Course materials

7mos ago

044K

循环神经网络（Recurrent Neural Network）是什么，一文看懂

What is Recurrent Neural Network (RNN) in one article?

Recurrent Neural Network (RNN) is a neural network architecture designed for processing sequential data. Sequential data refers to a collection of data with temporal order or dependencies, such as linguistic text, speech signals, or time series.

AI Answers

7mos ago

040.7K

What is Neural Network (Neural Network), an article to read and understand

Neural Network (NN) is a computational model inspired by the way neurons work in the biological brain.

AI Answers

7mos ago

032.5K

PromptEnhancer - Tencent Mixed Meta Open Source AI Prompt Word Enhancement Tool

PromptEnhancer is an open source prompt word enhancement tool from Tencent's Mixed Meta team to improve the generation of text-to-image (Text-to-Image, T2I) models. Through the chain of reasoning (Chain-of-Thought, CoT) approach to the use of ...

Latest AI Resources

7mos ago

038.6K

LLaSO - The Industry's First Fully Open Source Speech Model from Logic Intelligence

LLaSO is an open source speech model launched by Beijing Depth Logic Intelligence Technology Co. Ltd, which solves the problems of data dispersion and insufficient task coverage in the field of large-scale speech language modeling by integrating speech and text data and providing alignment datasets, command fine-tuning datasets and evaluation benchmarks.

Latest AI Resources

7mos ago

029.6K

Hybrid 3D 3.0 - Tencent's 3D generated models with UHD modeling support

Hybrid 3D 3.0 is an advanced 3D generation model launched by Tencent, based on 3D-DiT hierarchical sculpting technology, with a geometric resolution of up to 1536³, capable of generating ultra-high-definition, detail-rich 3D models, and excelling in character modeling, with the ability to accurately shape the five senses and body shape.

Latest AI Resources

7mos ago

047.7K

UnifoLM-WMA-0 - Yu Shu Technology open source world model action architecture

UnifoLM-WMA-0 is an open source world model-action architecture across multiple classes of robot ontologies by Yu Shu Technology, designed for general robot learning. Composed of a world model and an action architecture, the world model understands the physical laws of robot-environment interaction, and the action architecture is responsible for specific...

Latest AI Resources

7mos ago

047.3K

InfiniteTalk - Open Source Audio-Driven Video Generation Tool for Mission Vision AI

InfiniteTalk is an audio-driven video generation tool developed by the MeiGen-AI team that generates talking videos of unlimited length based on the input audio. The core advantage lies in the precise lip synchronization technology, which can perfectly match the audio with the character's mouth shape to generate natural and smooth...

Latest AI Resources

7mos ago

056.6K

Mini-o3 - Bytes, HKU Joint Open Source Visual Reasoning Model

Mini-o3 is an open source model jointly launched by ByteDance and the University of Hong Kong, focusing on solving complex visual search problems. The model has a powerful multi-round interactive reasoning capability, and can locate the target through deep exploration and trial-and-error.

Latest AI Resources

7mos ago

034.5K

GPT-5-Codex - The Most Powerful Programming Model Introduced by OpenAI

GPT-5-Codex is a powerful programming optimization model from OpenAI, further enhanced by GPT-5 and designed for software engineers. The model generates high-quality code quickly, supports multiple programming languages, and optimizes existing code to improve performance.

Latest AI Resources

7mos ago

030.8K

ROMA - Open Source Meta-Agent Framework for Automatic Decomposition of Complex Tasks for Parallel Processing

ROMA (Recursive-Open-Meta-Agent) is an open source meta-agent framework developed by Sentient AGI to efficiently solve complex problems through recursive task decomposition and parallel processing. Support for Python 3.12+, Docker and ...

Latest AI Resources

7mos ago

045K

Lumina-DiMOO - A Multimodal Large Model Open-Sourced by Shanghai AI Lab and Huawei Ascendant

Lumina-DiMOO is a new generation of unified model for multimodal generation and understanding launched by Shanghai Artificial Intelligence Laboratory (SAL) in conjunction with Huawei Rise at the World Artificial Intelligence Conference 2025. Based on the Rise AI basic hardware and software platform and the MindSpeed MM multimodal large model suite, it accomplishes...

Latest AI Resources

7mos ago

040.4K

Hyprnote - Open source, locally prioritized AI conference note-taking tool

Hyprnote is an open source, local-first AI meeting note-taking tool designed for professionals to protect user privacy and improve meeting efficiency. Adopting the "local first" principle, all data storage and processing is done on the user's local device to ensure data security and support offline operation.

Latest AI Resources

7mos ago

040.3K

MobileLLM-R1 - Meta open source special efficient inference model series

MobileLLM-R1 is Meta's open source series of efficient inference models designed for mathematical, programming and scientific reasoning. It contains a base model and a final model, with 140 million, 360 million and 950 million parameter versions, respectively. The models are not generic chat models and are supervised fine-tuned (SFT...

Latest AI Resources

7mos ago

032.7K

ERNIE-4.5-21B-A3B-Thinking - 百度开源的推理思考模型

ERNIE-4.5-21B-A3B-Thinking - Baidu open source reasoning thinking model

ERNIE-4.5-21B-A3B-Thinking is Baidu's open source large-scale language model focused on reasoning tasks. Using the Mixed Expert (MoE) architecture , the total number of references to 21 billion , each token activates 3 billion parameters to support 128K long context window ...

Latest AI Resources

7mos ago

030.6K

What is Artificial Intelligence Fairness (AI Fairness) in one article

AI fairness is the interdisciplinary field of ensuring that AI systems treat all individuals and groups in a fair and unbiased manner throughout their design, development, deployment, and operation lifecycle.

AI Answers

7mos ago

037.3K

What is Meta-Learning (Meta-Learning) in one article?

Meta-Learning, or learning how to learn, is an important branch of the machine learning field that focuses on developing learning algorithms that can quickly adapt to new tasks.

AI Answers

7mos ago

041.6K

MobiAgent - Shanghai Jiaotong University open source mobile intelligent body full-stack building framework

MobiAgent is an open source mobile intelligent body toolchain from IPADS Lab of Shanghai Jiaotong University, which helps users to build their own mobile intelligent assistants. By recording the user's operation trajectory and generating high-quality data, it trains an intelligent body that can understand natural language commands. Core features include efficient...

Latest AI Resources

7mos ago

038.2K

ZipVoice - Xiaomi's open source speech synthesis model series

ZipVoice is a series of speech synthesis (TTS) models based on the Flow Matching architecture released by Xiaomi, including ZipVoice (zero-sample single-speaker speech synthesis model) and ZipVoice-Dialog (zero-sample conversational speech synthesis...

Latest AI Resources

7mos ago

046.4K

PP-OCRv5 - Baidu's open source AI model for next-generation text recognition

PP-OCRv5 is the latest generation of text recognition AI model released by Baidu. With a lightweight design and a reference count of only 0.07B, it is suitable for efficient operation on CPU and edge devices, and can process more than 370 characters per second. The model supports Simplified Chinese, Traditional Chinese, English, Japanese and Pinyin...

Latest AI Resources

7mos ago

059.7K

Youtu-GraphRAG - Tencent Youtu Labs Open Source Graph Retrieval Augmentation Generation Framework

Youtu-GraphRAG is an open source graph retrieval augmentation generation framework from Tencent's Youtu Labs to help large language models handle complex Q&A tasks more accurately. By constructing a four-layer knowledge tree, the knowledge is disassembled into four levels of attributes, relationships, keywords and communities to realize the self-directed performance of cross-domain knowledge...

Latest AI Resources

7mos ago

039.5K

Stand-In - Tencent WeChat Visual Open Source Lightweight Video Generation Framework

Stand-In is a lightweight, plug-and-play identity-preserving video generation framework from Tencent's WeChat Vision team. Focusing on preserving specific identity features in video generation, it only needs to train the additional parameters of the base model 1%, and can achieve excellent results in face similarity and naturalness.

Latest AI Resources

7mos ago

037.6K

IndexTTS2 - B station open source free TTS model, the first to support precise duration control

IndexTTS2 is a new free text-to-speech (TTS) model open-sourced by the B station voice team, which realizes a major breakthrough in emotional expression and duration control, the first autoregressive TTS model that supports precise duration control. Supports zero-sample voice cloning, only one audio file can accurately copy the sound...

Latest AI Resources

7mos ago

099.2K

MiniMax Music 1.5 - MiniMax最新推出的AI音乐生成模型

MiniMax Music 1.5 - MiniMax's latest AI music generation model

MiniMax Music 1.5 is an advanced AI music generation tool that supports generating up to 4 minutes of music based on users' natural language descriptions. The model supports a variety of music styles and mood customization, generating a natural and full vocal color, smooth transitions, richly layered arrangements...

Latest AI Resources

7mos ago

039.4K

What is Artificial Intelligence Safety (AI Safety), in one article

Artificial Intelligence Safety (AI Safety) is the cutting-edge interdisciplinary field of ensuring that AI systems, especially those that are increasingly powerful and autonomous, act reliably and predictably throughout their lifecycle in accordance with human intent, without harmful consequences.

AI Answers

7mos ago

035.4K

What is Self-Supervised Learning (SSL) in one article?

Self-Supervised Learning (SSL) is an emerging learning paradigm in the field of machine learning, where the core idea is to automatically generate supervised signals from unlabeled data and train models to learn useful representations of the data.

AI Answers

7mos ago

036.2K

Can't find AI tools? Try here!