Latest AI Resources

Total 2985 articles posts
浙江大学免费PDF资料《大模型基础》 - 附下载链接

Free PDF of Fundamentals of Large Models from Zhejiang University - with download link

Fundamentals of Large Models provides an in-depth analysis of the core technologies and practical paths of Large Language Models (LLMs). Starting from the fundamental theory of language modeling, it systematically explains the principles of model design based on statistics, recurrent neural networks (RNN), and Transformer architecture, focusing on the three major big language model...
6mos ago
040K
QVQ-Max - 阿里通义推出视觉推理模型

QVQ-Max - Ali Tongyi Launches Visual Reasoning Models

QVQ-Max is a state-of-the-art visual reasoning model introduced by Alitonix, which is an upgraded version of QVQ-72B-Preview. QVQ-Max is an advanced visual reasoning model that can "read" images and video content and combine the information for analysis, reasoning and problem solving.QVQ-Max's main functions include image parsing, video analysis...
9mos ago
039.8K
Ovis-U1 - 阿里推出的多模态统一AI模型

Ovis-U1 - Multimodal Unified AI Model Introduced by Ali

Ovis-U1 is a multimodal unified model introduced by the Ovis team of Alibaba Group with a parameter scale of 3 billion. The model is equipped with three core capabilities: multimodal understanding, text-to-image generation, and image editing. With advanced architectural design and collaborative and unified training methods, the model supports the realization of high-fidelity image...
9mos ago
039.7K
万兴天幕 – 万兴科技推出AIGC视频创作平台

Wanxing Canopy - Wanxing Technology Launches AIGC Video Creation Platform

Wanxing Canopy is the AIGC video creation platform launched by Wanxing Technology, covering the three major creation fields of video, picture and audio generation, which is specially designed for media and cultural industry workers, film and television/post-production workers, art and design workers, advertising and marketing practitioners, etc. to provide one-stop professional creation solutions.
9mos ago
039.3K
优雅YOYA - 中科闻歌推出的AI音视频内容创作平台

Elegant YOYA - AI Audio/Video Content Creation Platform Launched by Sinotech Winkler

Elegant YOYA is a multimodal literate video platform launched by Zhongke Wenge, the platform is based on AI multimodal technology to empower the whole chain of video content creation. Users only need to input the theme requirements, the platform can quickly generate scripts, images, videos, and can complete intelligent editing, voice synthesis and character mouth drive and other operations, the output...
9mos ago
039.2K
Higress MCP - 今日投资推出的MCP服务平台

Higress MCP - Invest Today Launches MCP Services Platform

Higress MCP is an innovative platform launched by Invest Today that supports the rapid transformation of traditional financial data APIs into modern MCP services.Higress MCP enables the transformation of REST APIs to MCP Server based on a simple configuration without the need to program...
8mos ago
039.2K
Meeseeks - 美团开源的评估模型指令遵循能力的评测集

Meeseeks - Meeseeks open-source assessment set for evaluating the ability to follow model instructions

Meeseeks is an open source large model evaluation set used by the Meituan M17 team to evaluate the model's ability to follow instructions.Meeseeks uses a three-tiered evaluation framework to comprehensively measure whether the model is able to generate answers in strict accordance with the user's instructions from the macro to the micro level, without evaluating the knowledge of the content of the answers positively ...
7mos ago
039.1K
SkyReels-A3 - 昆仑万维推出的音频驱动数字人创作工具

SkyReels-A3 - Audio-Driven Digital Human Creation Tool from KunlunWangwei

SkyReels-A3 is an audio-driven digital human creation tool from Kunlun World Wide Group. SkyReels-A3 is an audio-driven digital human creation tool, which can generate high-quality dynamic video content through simple inputs (e.g., portrait images and voice), make static photos "come alive", and replace lines for existing videos with new lip-syncs that the characters will automatically...
7mos ago
038.7K
职达AI简历 - AI简历生成与优化平台,精准分析问题、提供优化建议

Vinda AI Resume - AI Resume Generation and Optimization Platform, Precise Analysis of Problems and Optimization Suggestions

Job AI resume is an efficient and convenient intelligent resume generation and optimization platform. Based on AI technology, the platform helps users quickly generate professional and personalized resumes. Users only need to enter basic information and experience, the platform can generate high-quality resume in a short time, providing 2800+ beautiful templates, covering a variety of positions.
9mos ago
038.7K
MindLink - 昆仑万维推出的开源推理大模型

MindLink - Open Source Reasoning Big Model from KunlunWei

MindLink is a large model of open source reasoning launched by Kunlun World Wide Web. With adaptive reasoning mechanism , according to the complexity of the task can be flexibly switched inference mode , simple tasks quickly generated , complex tasks in-depth reasoning , taking into account the efficiency and accuracy . Plan-driven reasoning paradigm to remove the "think" label , down ...
7mos ago
038.7K
有道小P - 网易有道推出的新一代AI全科学习助手

Youdao Xiao P - A new generation of AI general learning assistant launched by NetEase Youdao

Youdao Little P is an AI all-subject learning assistant launched by NetEase Youdao, designed for K12 students, equipped with the Youdao Ziyi education big model, covering elementary school, middle school and high school all-subject Q&A, providing personalized learning advice. With AI word search and AI translation functions, Youdao Little P helps students quickly solve language problems...
9mos ago
038.6K
DeckSpeed - AI PPT制作工具,自然语言生成演示文稿

DeckSpeed - AI PPT Maker, Natural Language Generated Presentation

DeckSpeed is an AI presentation creation tool based on conversational interaction, where users express their needs based on natural language and quickly generate personalized slides without relying on traditional templates. The tool supports real-time feedback adjustment, users can modify the color, style and content of the slide at any time to ensure that the presentation is complete...
9mos ago
038.5K
InternVLA-A1 - 上海AI Lab开源一体化操作能力的具身大模型

InternVLA-A1 - Shanghai AI Lab Open Source Integration of Operational Capabilities for Embodied Large Models

InternVLA-A1 is a large model of embodied operation open-sourced by Shanghai Artificial Intelligence Laboratory. It has the ability to understand, imagine, and execute the integration, and can accurately complete the task. The model fuses real and simulated operational data, and automates the construction of massive multimodal through large-scale virtual-real hybrid scene assets...
6mos ago
038.5K
CRIC深度智联 - 克而瑞推出的中国房地产首个AI Agent

CRIC - The First AI Agent for Real Estate in China Launched by CRIC

CRIC Depth Intelligence is the first AI intelligent body of Chinese real estate independently developed by CRIC, based on CRIC's 20 years of experience in the real estate industry and data accumulation and multimodal big model technology, which opens up the whole chain from data integration, intelligent analysis to content generation.
9mos ago
037.9K
文心大模型X1.1 - 百度推出的深度思考模型,理解能力更强

Wenshin Big Model X1.1 - Baidu's Deep Thinking Model for Better Understanding

Wenxin Big Model X1.1 is a deep thinking model launched by Baidu, based on a hybrid reinforcement learning framework that focuses on improving language understanding and generation. The model excels in handling complex questions, following instructions and simulating the behavior of intelligences, and can accurately provide knowledgeable answers and high-quality text content.
6mos ago
037.6K
InternVLA·N1 - 上海AI Lab开源的端到端双系统导航大模型

InternVLA-N1 - Shanghai AI Lab Open Source End-to-End Dual System Navigation Large Model

InternVLA-N1 is an open source end-to-end dual-system navigation macromodel from Shanghai Artificial Intelligence Laboratory. Using a dual-system architecture, System 2 is responsible for understanding linguistic commands and planning long-range paths, while System 1 focuses on high-frequency response and agile obstacle avoidance. The model is trained entirely based on synthetic data through large-scale digital ...
6mos ago
037.1K
飞算JavaAI - AI Java开发助手,自然语言实现全流程智能化开发

Flycount JavaAI - AI Java development assistant, natural language implementation of the whole process of intelligent development

Flycount JavaAI is an intelligent Java development assistant launched by Flycount Technology. The platform supports natural language input to realize the whole process of intelligent development from requirements analysis to code generation. Developers only need to enter a description of the requirements, Flycount JavaAI can accurately understand and generate a complete engineering code framework, the platform...
9mos ago
037K
Youtu-GraphRAG - 腾讯优图实验室开源的图检索增强生成框架

Youtu-GraphRAG - Tencent Youtu Labs Open Source Graph Retrieval Augmentation Generation Framework

Youtu-GraphRAG is an open source graph retrieval augmentation generation framework from Tencent's Youtu Labs to help large language models handle complex Q&A tasks more accurately. By constructing a four-layer knowledge tree, the knowledge is disassembled into four levels of attributes, relationships, keywords and communities to realize the self-directed performance of cross-domain knowledge...
6mos ago
036.8K
美间:在线软装(家装)设计工具,快速生成设计方案,软装辅助AI工具箱

Meiman: online soft furnishing (home furnishing) design tools, rapid generation of design plans, soft furnishing auxiliary AI toolkit

Comprehensive Introduction Meiman is an online platform specializing in home design and marketing negotiation. The site provides a wealth of design materials, soft furnishings and proposal PPT templates, poster templates, etc. to help designers and homeowners quickly generate high-quality design solutions. Meiman's online soft furnishing design tool can be used in as little as 10 seconds...
9mos ago
036.5K
Neovate Code - 蚂蚁开源的智能编程助手

Neovate Code - Ant Open Source's Intelligent Programming Assistant

Neovate Code is an open source intelligent programming assistant from Ant Group's Alipay Experience Technology Department, which improves development efficiency through artificial intelligence technology. With conversational development features, developers can describe the requirements through natural language, Neovate Code can understand and generate the corresponding generation...
6mos ago
036.4K
MiniMax Music 1.5 - MiniMax最新推出的AI音乐生成模型

MiniMax Music 1.5 - MiniMax's latest AI music generation model

MiniMax Music 1.5 is an advanced AI music generation tool that supports generating up to 4 minutes of music based on users' natural language descriptions. The model supports a variety of music styles and mood customization, generating a natural and full vocal color, smooth transitions, richly layered arrangements...
6mos ago
035.8K
MobiAgent - 上海交大开源的移动端智能体全栈构建框架

MobiAgent - Shanghai Jiaotong University open source mobile intelligent body full-stack building framework

MobiAgent is an open source mobile intelligent body toolchain from IPADS Lab of Shanghai Jiaotong University, which helps users to build their own mobile intelligent assistants. By recording the user's operation trajectory and generating high-quality data, it trains an intelligent body that can understand natural language commands. Core features include efficient...
6mos ago
035.8K
OneCAT - 美团联合上海交大开源的多模态模型

OneCAT - Open source multimodal modeling by Meituan and Shanghai Jiaotong University

OneCAT is a new unified multimodal model launched by Meituan in conjunction with Shanghai Jiaotong University, which adopts a pure decoder architecture and can seamlessly integrate multimodal comprehension, text-to-image generation and image editing functions. The model abandons the design of traditional multimodal models that rely on external visual coders and disambiguators through modality-specific...
6mos ago
035.6K
问小白o4 - 问小白推出的并行思考模型,同时开启8条思考路径

Ask Whitey o4 - A parallel thinking model introduced by Ask Whitey that opens 8 thinking paths at the same time

Ask White o4 is an innovative parallel thinking model that opens 8 thinking paths at the same time, analyzes the problem from multiple perspectives and automatically filters out the optimal solution. The model incorporates advanced Long-CoT reinforcement learning and process reward learning techniques, has powerful deep reasoning capabilities, and performs well in complex tasks.
7mos ago
035.4K
阶跃深研 - 阶跃星辰推出的AI深入研究工具

Steps Deep Research - AI Deep Research Tool by Steps Star

Steps Deep Research is an efficient AI research tool launched by Steps Star, which can autonomously complete research on complex issues and generate professional reports in a short period of time. The tool is designed for finance, consulting, healthcare, law and other fields, and excels in industry reviews with its in-depth search and information integration capabilities.
7mos ago
034.8K
DiaMoE-TTS - 清华联合巨人网络开源的多方言语音合成框架

DiaMoE-TTS - Tsinghua and Giant Networks open source multi-dialect speech synthesis framework

DiaMoE-TTS is a multi-dialect speech synthesis framework jointly open-sourced by Tsinghua University and Giant Network, based on the International Phonetic Alphabet (IPA), to solve the problems of dialect data scarcity, orthographic inconsistency, and complex phonological changes. Through a unified IPA front-end standardized phoneme representation to eliminate cross-dialect differences ...
5mos ago
034.3K
XTuner V1 - 上海AI Lab开源的大模型训练引擎

XTuner V1 - Shanghai AI Lab open source large model training engine

XTuner V1 is a new generation of large model training engine open-sourced by Shanghai Artificial Intelligence Laboratory (SAL), designed for ultra-large scale sparse Mixed Expert (MoE) model training. Developed based on PyTorch FSDP, it achieves high performance through multi-dimensional optimization of memory, communication and load ...
6mos ago
033.8K
SongBloom - 腾讯联合港中文、南大开源的歌曲生成模型

SongBloom - Tencent's open source song generation model with HKCNU and NTU.

SongBloom is an open source song generation model developed by Tencent AI Lab in collaboration with The Chinese University of Hong Kong (Shenzhen) and Nanjing University, which solves the problem of "plasticity" in AI music generation, and realizes high-quality, structurally complete song generation. Simply enter 10 seconds of reference audio and corresponding lyrics, and you can...
5mos ago
033.8K
VoxCPM 1.5 - 面壁智能开源的端到端文本到语音模型

VoxCPM 1.5 - Faceted Intelligence Open Source End-to-End Text-to-Speech Modeling

VoxCPM 1.5 is an open source speech generation model released by Facade Intelligence, based on text-to-speech (TTS) technology without the need for a splitter, featuring several innovations and improvements. Adopting an end-to-end diffusion autoregressive architecture, it generates continuous speech waveforms directly from text, avoiding the limitations of traditional segmentation methods...
3mos ago
033.4K