Latest AI Resources

Total 3100 articles posts
Ovis-U1 - 阿里推出的多模态统一AI模型

Ovis-U1 - Multimodal Unified AI Model Introduced by Ali

Ovis-U1 is a multimodal unified model introduced by the Ovis team of Alibaba Group with a parameter scale of 3 billion. The model is equipped with three core capabilities: multimodal understanding, text-to-image generation, and image editing. With advanced architectural design and collaborative and unified training methods, the model supports the realization of high-fidelity image...
10mos ago
047.1K
HeyGen - AI 数字人视频创作平台,支持多语言翻译配音

HeyGen - AI Digital Human Video Creation Platform with Multi-Language Translation and Dubbing Support

HeyGen is an AI-driven digital human video creation platform that supports a streamlined video production process, allowing users to quickly generate professional-level digital human videos. The platform is based on advanced AI technology, giving users full control over the image and voice of digital people, providing a rich library of material, including diverse background...
11mos ago
047K
Higress MCP - 今日投资推出的MCP服务平台

Higress MCP - Invest Today Launches MCP Services Platform

Higress MCP is an innovative platform launched by Invest Today that supports the rapid transformation of traditional financial data APIs into modern MCP services.Higress MCP enables the transformation of REST APIs to MCP Server based on a simple configuration without the need to program...
10mos ago
046.9K
优雅YOYA - 中科闻歌推出的AI音视频内容创作平台

Elegant YOYA - AI Audio/Video Content Creation Platform Launched by Sinotech Winkler

Elegant YOYA is a multimodal literate video platform launched by Zhongke Wenge, the platform is based on AI multimodal technology to empower the whole chain of video content creation. Users only need to input the theme requirements, the platform can quickly generate scripts, images, videos, and can complete intelligent editing, voice synthesis and character mouth drive and other operations, the output...
11mos ago
046.7K
有道小P - 网易有道推出的新一代AI全科学习助手

Youdao Xiao P - A new generation of AI general learning assistant launched by NetEase Youdao

Youdao Little P is an AI all-subject learning assistant launched by NetEase Youdao, designed for K12 students, equipped with the Youdao Ziyi education big model, covering elementary school, middle school and high school all-subject Q&A, providing personalized learning advice. With AI word search and AI translation functions, Youdao Little P helps students quickly solve language problems...
11mos ago
046.4K
gpt-realtime - OpenAI最新推出的AI语音模型

gpt-realtime - OpenAI's newest AI speech model

gpt-realtime is an advanced speech model from OpenAI that supports direct audio processing to generate natural and smooth speech. The model supports multiple languages and styles, understands non-verbal cues such as laughter, and can switch between languages.
8mos ago
046.1K
MindLink - 昆仑万维推出的开源推理大模型

MindLink - Open Source Reasoning Big Model from KunlunWei

MindLink is a large model of open source reasoning launched by Kunlun World Wide Web. With adaptive reasoning mechanism , according to the complexity of the task can be flexibly switched inference mode , simple tasks quickly generated , complex tasks in-depth reasoning , taking into account the efficiency and accuracy . Plan-driven reasoning paradigm to remove the "think" label , down ...
9mos ago
045.9K
InternVLA-A1 - 上海AI Lab开源一体化操作能力的具身大模型

InternVLA-A1 - Shanghai AI Lab Open Source Integration of Operational Capabilities for Embodied Large Models

InternVLA-A1 is a large model of embodied operation open-sourced by Shanghai Artificial Intelligence Laboratory. It has the ability to understand, imagine, and execute the integration, and can accurately complete the task. The model fuses real and simulated operational data, and automates the construction of massive multimodal through large-scale virtual-real hybrid scene assets...
7mos ago
045.4K
万兴天幕 – 万兴科技推出AIGC视频创作平台

Wanxing Canopy - Wanxing Technology Launches AIGC Video Creation Platform

Wanxing Canopy is the AIGC video creation platform launched by Wanxing Technology, covering the three major creation fields of video, picture and audio generation, which is specially designed for media and cultural industry workers, film and television/post-production workers, art and design workers, advertising and marketing practitioners, etc. to provide one-stop professional creation solutions.
10mos ago
045.3K
问小白5 - 问小白推出的全能AI模型

Ask White 5 - All-in-One AI Model from Ask White

Ask White 5 is the flagship "All in One" model with a very high level of intelligence. The model has excellent performance in many assessments, such as the AA-Index composite assessment score of 64.7 and the STEM ability assessment score of 86, which is close to the world's leading GPT-5.
8mos ago
045.3K
Meeseeks - 美团开源的评估模型指令遵循能力的评测集

Meeseeks - Meeseeks open-source assessment set for evaluating the ability to follow model instructions

Meeseeks is an open source large model evaluation set used by the Meituan M17 team to evaluate the model's ability to follow instructions.Meeseeks uses a three-tiered evaluation framework to comprehensively measure whether the model is able to generate answers in strict accordance with the user's instructions from the macro to the micro level, without evaluating the knowledge of the content of the answers positively ...
8mos ago
045.1K
职达AI简历 - AI简历生成与优化平台,精准分析问题、提供优化建议

Vinda AI Resume - AI Resume Generation and Optimization Platform, Precise Analysis of Problems and Optimization Suggestions

Job AI resume is an efficient and convenient intelligent resume generation and optimization platform. Based on AI technology, the platform helps users quickly generate professional and personalized resumes. Users only need to enter basic information and experience, the platform can generate high-quality resume in a short time, providing 2800+ beautiful templates, covering a variety of positions.
11mos ago
045K
InternVLA·N1 - 上海AI Lab开源的端到端双系统导航大模型

InternVLA-N1 - Shanghai AI Lab Open Source End-to-End Dual System Navigation Large Model

InternVLA-N1 is an open source end-to-end dual-system navigation macromodel from Shanghai Artificial Intelligence Laboratory. Using a dual-system architecture, System 2 is responsible for understanding linguistic commands and planning long-range paths, while System 1 focuses on high-frequency response and agile obstacle avoidance. The model is trained entirely based on synthetic data through large-scale digital ...
7mos ago
044.9K
CRIC深度智联 - 克而瑞推出的中国房地产首个AI Agent

CRIC - The First AI Agent for Real Estate in China Launched by CRIC

CRIC Depth Intelligence is the first AI intelligent body of Chinese real estate independently developed by CRIC, based on CRIC's 20 years of experience in the real estate industry and data accumulation and multimodal big model technology, which opens up the whole chain from data integration, intelligent analysis to content generation.
11mos ago
044.8K
文心大模型X1.1 - 百度推出的深度思考模型,理解能力更强

Wenshin Big Model X1.1 - Baidu's Deep Thinking Model for Better Understanding

Wenxin Big Model X1.1 is a deep thinking model launched by Baidu, based on a hybrid reinforcement learning framework that focuses on improving language understanding and generation. The model excels in handling complex questions, following instructions and simulating the behavior of intelligences, and can accurately provide knowledgeable answers and high-quality text content.
8mos ago
044.3K
VoxCPM 1.5 - 面壁智能开源的端到端文本到语音模型

VoxCPM 1.5 - Faceted Intelligence Open Source End-to-End Text-to-Speech Modeling

VoxCPM 1.5 is an open source speech generation model released by Facade Intelligence, based on text-to-speech (TTS) technology without the need for a splitter, featuring several innovations and improvements. Adopting an end-to-end diffusion autoregressive architecture, it generates continuous speech waveforms directly from text, avoiding the limitations of traditional segmentation methods...
5mos ago
044.2K
DeckSpeed - AI PPT制作工具,自然语言生成演示文稿

DeckSpeed - AI PPT Maker, Natural Language Generated Presentation

DeckSpeed is an AI presentation creation tool based on conversational interaction, where users express their needs based on natural language and quickly generate personalized slides without relying on traditional templates. The tool supports real-time feedback adjustment, users can modify the color, style and content of the slide at any time to ensure that the presentation is complete...
11mos ago
044.2K
Youtu-GraphRAG - 腾讯优图实验室开源的图检索增强生成框架

Youtu-GraphRAG - Tencent Youtu Labs Open Source Graph Retrieval Augmentation Generation Framework

Youtu-GraphRAG is an open source graph retrieval augmentation generation framework from Tencent's Youtu Labs to help large language models handle complex Q&A tasks more accurately. By constructing a four-layer knowledge tree, the knowledge is disassembled into four levels of attributes, relationships, keywords and communities to realize the self-directed performance of cross-domain knowledge...
8mos ago
043.7K
阶跃深研 - 阶跃星辰推出的AI深入研究工具

Steps Deep Research - AI Deep Research Tool by Steps Star

Steps Deep Research is an efficient AI research tool launched by Steps Star, which can autonomously complete research on complex issues and generate professional reports in a short period of time. The tool is designed for finance, consulting, healthcare, law and other fields, and excels in industry reviews with its in-depth search and information integration capabilities.
9mos ago
043.7K
MiniMax Music 1.5 - MiniMax最新推出的AI音乐生成模型

MiniMax Music 1.5 - MiniMax's latest AI music generation model

MiniMax Music 1.5 is an advanced AI music generation tool that supports generating up to 4 minutes of music based on users' natural language descriptions. The model supports a variety of music styles and mood customization, generating a natural and full vocal color, smooth transitions, richly layered arrangements...
8mos ago
043.6K
Neovate Code - 蚂蚁开源的智能编程助手

Neovate Code - Ant Open Source's Intelligent Programming Assistant

Neovate Code is an open source intelligent programming assistant from Ant Group's Alipay Experience Technology Department, which improves development efficiency through artificial intelligence technology. With conversational development features, developers can describe the requirements through natural language, Neovate Code can understand and generate the corresponding generation...
7mos ago
043.4K
OneCAT - 美团联合上海交大开源的多模态模型

OneCAT - Open source multimodal modeling by Meituan and Shanghai Jiaotong University

OneCAT is a new unified multimodal model launched by Meituan in conjunction with Shanghai Jiaotong University, which adopts a pure decoder architecture and can seamlessly integrate multimodal comprehension, text-to-image generation and image editing functions. The model abandons the design of traditional multimodal models that rely on external visual coders and disambiguators through modality-specific...
8mos ago
043K
飞算JavaAI - AI Java开发助手,自然语言实现全流程智能化开发

Flycount JavaAI - AI Java development assistant, natural language implementation of the whole process of intelligent development

Flycount JavaAI is an intelligent Java development assistant launched by Flycount Technology. The platform supports natural language input to realize the whole process of intelligent development from requirements analysis to code generation. Developers only need to enter a description of the requirements, Flycount JavaAI can accurately understand and generate a complete engineering code framework, the platform...
11mos ago
042.3K
美间:在线软装(家装)设计工具,快速生成设计方案,软装辅助AI工具箱

Meiman: online soft furnishing (home furnishing) design tools, rapid generation of design plans, soft furnishing auxiliary AI toolkit

Comprehensive Introduction Meiman is an online platform specializing in home design and marketing negotiation. The site provides a wealth of design materials, soft furnishings and proposal PPT templates, poster templates, etc. to help designers and homeowners quickly generate high-quality design solutions. Meiman's online soft furnishing design tool can be used in as little as 10 seconds...
11mos ago
042.3K
MobiAgent - 上海交大开源的移动端智能体全栈构建框架

MobiAgent - Shanghai Jiaotong University open source mobile intelligent body full-stack building framework

MobiAgent is an open source mobile intelligent body toolchain from IPADS Lab of Shanghai Jiaotong University, which helps users to build their own mobile intelligent assistants. By recording the user's operation trajectory and generating high-quality data, it trains an intelligent body that can understand natural language commands. Core features include efficient...
8mos ago
041.5K
DiaMoE-TTS - 清华联合巨人网络开源的多方言语音合成框架

DiaMoE-TTS - Tsinghua and Giant Networks open source multi-dialect speech synthesis framework

DiaMoE-TTS is a multi-dialect speech synthesis framework jointly open-sourced by Tsinghua University and Giant Network, based on the International Phonetic Alphabet (IPA), to solve the problems of dialect data scarcity, orthographic inconsistency, and complex phonological changes. Through a unified IPA front-end standardized phoneme representation to eliminate cross-dialect differences ...
7mos ago
041.4K