Latest AI Resources

3112 articles in total
SenseTime Ruying (商汤如影) - AI Digital Human Video Production Platform from SenseTime

SenseTime Ruying is an AI digital human video production platform launched by SenseTime. Built on large-model technology, the platform supports creating highly realistic digital human avatars and personalizing them, including facial features, clothing, and hairstyles. It also offers voice cloning, video generation, automated data labeling, real-time interaction, and other functions...
1yr ago
051.8K
Higress MCP - MCP Service Platform Launched by Invest Today

Higress MCP is a platform launched by Invest Today that supports rapidly converting traditional financial data APIs into MCP services. Higress MCP converts REST APIs into an MCP Server through simple configuration, without the need to program...
12mos ago
051.8K
Elegant YOYA (优雅YOYA) - AI Audio/Video Content Creation Platform Launched by Zhongke Wenge

Elegant YOYA is a multimodal text-to-video platform launched by Zhongke Wenge. It uses multimodal AI technology to support the whole chain of video content creation. Users only need to enter a theme and requirements, and the platform quickly generates scripts, images, and videos, and can complete intelligent editing, voice synthesis, lip-sync driving of characters, and other operations, the output...
1yr ago
051.4K
MagicTryOn - Video Virtual Try-On Framework from Zhejiang University, vivo, and Others

MagicTryOn is a video virtual try-on framework launched by the School of Computer Science and Technology of Zhejiang University in collaboration with vivo and other organizations. The framework replaces the traditional U-Net architecture with a Diffusion Transformer (DiT) architecture, combined with a full self-attention mechanism...
1yr ago
051.4K
gpt-realtime - OpenAI's Latest AI Speech Model

gpt-realtime is a speech model from OpenAI that processes audio directly to generate natural, fluent speech. The model supports multiple languages and styles, understands non-verbal cues such as laughter, and can switch between languages.
10mos ago
051.3K
InternVLA-A1 - Embodied Manipulation Large Model with Integrated Capabilities, Open-Sourced by Shanghai AI Lab

InternVLA-A1 is an embodied manipulation large model open-sourced by Shanghai Artificial Intelligence Laboratory. It integrates understanding, imagination, and execution, and can complete tasks accurately. The model fuses real and simulated manipulation data, and automates the construction of massive multimodal through large-scale virtual-real hybrid scene assets...
9mos ago
051.3K
Qwen3Guard - Open-Source Safety Model from Alibaba's Qwen

Qwen3Guard is a safety guardrail model fine-tuned from the Qwen3 base model and designed for safety detection. It classifies the safety of prompts and responses, reports risk levels, and supports English, Chinese, and other languages. Qwen3Guard comes with two pro...
9mos ago
051.2K
MoE-TTS - The Latest Speech Generation Framework from Kunlun Wanwei

MoE-TTS is a speech synthesis framework introduced by Kunlun Wanwei. It is based on the Mixture of Experts (MoE) architecture and combines pre-trained Large Language Models (LLMs) with speech expert modules. MoE-TTS retains the powerful textual reasoning by freezing the text module parameters and updating only the speech module parameters...
10mos ago
051.2K
MindLink - Open-Source Reasoning Large Model from Kunlun Wanwei

MindLink is an open-source reasoning large model launched by Kunlun Wanwei. Its adaptive reasoning mechanism switches reasoning modes according to task complexity: simple tasks are answered quickly, while complex tasks get in-depth reasoning, balancing efficiency and accuracy. Its plan-driven reasoning paradigm removes the "think" label, down ...
11mos ago
050.6K
SkyReels-A3 - Audio-Driven Digital Human Creation Tool from Kunlun Wanwei

SkyReels-A3 is an audio-driven digital human creation tool from Kunlun Wanwei. It generates high-quality dynamic video content from simple inputs (e.g., portrait images and voice), makes static photos "come alive", and replaces lines in existing videos with new lip-syncs that the characters will automatically...
10mos ago
050.6K
DeckSpeed - AI PPT Maker That Generates Presentations from Natural Language

DeckSpeed is an AI presentation creation tool based on conversational interaction. Users describe their needs in natural language and quickly get personalized slides without relying on traditional templates. The tool supports real-time feedback and adjustment: users can modify the color, style, and content of the slides at any time to ensure that the presentation is complete...
1yr ago
050.4K
InternVLA-N1 - Open-Source End-to-End Dual-System Navigation Large Model from Shanghai AI Lab

InternVLA-N1 is an open-source end-to-end dual-system navigation large model from Shanghai Artificial Intelligence Laboratory. In its dual-system architecture, System 2 is responsible for understanding language instructions and planning long-range paths, while System 1 focuses on high-frequency response and agile obstacle avoidance. The model is trained entirely on synthetic data through large-scale digital ...
9mos ago
050.2K
HeyGen - AI Digital Human Video Creation Platform with Multi-Language Translation and Dubbing

HeyGen is an AI-driven digital human video creation platform with a streamlined video production process that lets users quickly generate professional-level digital human videos. The platform gives users full control over the appearance and voice of digital humans and provides a rich library of material, including diverse background...
1yr ago
050.2K
VoxCPM 1.5 - Open-Source End-to-End Text-to-Speech Model from ModelBest

VoxCPM 1.5 is an open-source speech generation model released by ModelBest (面壁智能). It is a tokenizer-free text-to-speech (TTS) model with several innovations and improvements. Using an end-to-end diffusion autoregressive architecture, it generates continuous speech waveforms directly from text, avoiding the limitations of traditional tokenization methods...
6mos ago
050.1K
CRIC Depth Intelligence (CRIC深度智联) - China's First Real Estate AI Agent, Launched by CRIC

CRIC Depth Intelligence is the first AI agent for China's real estate industry, developed independently by CRIC. It draws on CRIC's 20 years of real estate industry experience and data accumulation, together with multimodal large-model technology, and covers the whole chain from data integration and intelligent analysis to content generation.
1yr ago
049.7K
Wenxiaobai 5 (问小白5) - All-in-One AI Model from Wenxiaobai

Wenxiaobai 5 is Wenxiaobai's flagship "All in One" model. It performs well on many benchmarks, for example scoring 64.7 on the AA-Index composite assessment and 86 on the STEM ability assessment, close to the leading GPT-5.
10mos ago
049.6K
Zhida AI Resume (职达AI简历) - AI Resume Generation and Optimization Platform with Precise Problem Analysis and Optimization Suggestions

Zhida AI Resume is an intelligent resume generation and optimization platform. Using AI, it helps users quickly generate professional, personalized resumes. Users only need to enter basic information and experience, and the platform generates a high-quality resume in a short time. It provides 2800+ templates covering a variety of positions.
1yr ago
049.4K
Wondershare Tianmu (万兴天幕) - AIGC Video Creation Platform Launched by Wondershare

Wondershare Tianmu is an AIGC video creation platform launched by Wondershare. It covers three major creation fields (video, image, and audio generation) and provides one-stop professional creation solutions for media and cultural industry workers, film, television, and post-production workers, art and design workers, advertising and marketing practitioners, and others.
12mos ago
049.3K
Meeseeks - Meituan's Open-Source Benchmark for Evaluating Models' Instruction-Following Ability

Meeseeks is an open-source large model benchmark from the Meituan M17 team for evaluating a model's ability to follow instructions. Meeseeks uses a three-tiered evaluation framework to measure, from the macro to the micro level, whether the model generates answers in strict accordance with the user's instructions, without evaluating the correctness of the knowledge in the answers ...
10mos ago
048.9K
Wenxin Large Model X1.1 (文心大模型X1.1) - Baidu's Deep Thinking Model with Stronger Understanding

Wenxin Large Model X1.1 is a deep thinking model launched by Baidu. It is based on a hybrid reinforcement learning framework that focuses on improving language understanding and generation. The model excels at handling complex questions, following instructions, and simulating agent behavior, and can provide knowledgeable answers and high-quality text content.
9mos ago
048.8K
Feisuan JavaAI (飞算JavaAI) - AI Java Development Assistant for End-to-End Intelligent Development in Natural Language

Feisuan JavaAI is an intelligent Java development assistant launched by Feisuan Technology. The platform accepts natural language input and supports the whole process of intelligent development, from requirements analysis to code generation. Developers only need to enter a description of the requirements, and Feisuan JavaAI understands it and generates a complete engineering code framework, the platform...
1yr ago
048K
Youtu-GraphRAG - Open-Source Graph Retrieval-Augmented Generation Framework from Tencent Youtu Lab

Youtu-GraphRAG is an open-source graph retrieval-augmented generation framework from Tencent Youtu Lab that helps large language models handle complex Q&A tasks more accurately. It constructs a four-layer knowledge tree that breaks knowledge down into four levels (attributes, relationships, keywords, and communities) to realize the self-directed performance of cross-domain knowledge...
9mos ago
047.8K
OneCAT - Open-Source Multimodal Model from Meituan and Shanghai Jiao Tong University

OneCAT is a unified multimodal model launched by Meituan together with Shanghai Jiao Tong University. It adopts a decoder-only architecture and integrates multimodal understanding, text-to-image generation, and image editing. The model abandons the design of traditional multimodal models that rely on external visual encoders and tokenizers through modality-specific...
10mos ago
047.5K
MiniMax Music 1.5 - MiniMax's Latest AI Music Generation Model

MiniMax Music 1.5 is an AI music generation model that generates up to 4 minutes of music from users' natural language descriptions. The model supports customizing a variety of music styles and moods, and generates natural, full vocal timbre, smooth transitions, richly layered arrangements...
9mos ago
047.5K
StepFun Deep Research (阶跃深研) - AI Deep Research Tool from StepFun

StepFun Deep Research is an AI research tool launched by StepFun that autonomously completes research on complex questions and generates professional reports in a short time. The tool is designed for finance, consulting, healthcare, law, and other fields, and performs well in industry evaluations thanks to its in-depth search and information integration capabilities.
11mos ago
046.4K