Latest AI Resources

Total 2985 articles posts
Doppl - 谷歌推出的AI虚拟试衣应用

Doppl - AI virtual fitting app from Google

Doppl is an AI virtual fitting application launched by Google. After the user uploads a full body photo, the application supports the clothing picture or screenshot "wear" in the digital version of their own body, and can be converted from static pictures to AI-generated video, so that users can more truly feel the effect of clothing on the body.
9mos ago
042.9K
JoyHallo - 京东开源的AI数字人模型

JoyHallo - Jingdong's open source AI digital human model

JoyHallo is an open source AI digital human model from Jingdong, designed for Mandarin, supporting the conversion of audio into realistic speaking video.JoyHallo embeds audio features based on the wav2vec2 model, using a semi-decoupled structure to improve the accuracy of lip movement prediction, and supports the generation of English video...
9mos ago
042.8K
VoxCPM - 面壁智能联合清华开源的端到端TTS模型

VoxCPM - Faceted Intelligence and Tsinghua Open Source End-to-End TTS Model

VoxCPM is a speech generation model jointly open-sourced by Facade Intelligence and Shenzhen International Graduate School of Tsinghua University.VoxCPM adopts an end-to-end diffusion autoregressive architecture to generate continuous speech representations directly from text, breaking through the limitations of traditional discrete disambiguation. Through hierarchical language modeling and finite state quantization...
6mos ago
042.7K
探饭 - 字节跳动推出的AI美食推荐工具

Scouting Rice - AI food recommendation tool launched by Wordpress

TanRice is an AI food recommendation tool launched by Jitterbug, a subsidiary of ByteDance, which relies on the Beanbag Big Model to provide users with personalized food recommendations, store scouting comparisons, food tips and other services. TanRice can accurately recommend nearby restaurants and dishes based on users' taste preferences and location, support assisted ordering, and provide group-buying and takeaway services...
9mos ago
042.6K
Seed GR-3 - 字节跳动Seed团队推出的通用机器人模型

Seed GR-3 - Generalized Robotics Model from the Wordpress Seed Team

Seed GR-3 is a general-purpose robot model introduced by ByteDance with strong generalization ability to adapt to new environments and complex commands. The model fuses visual, verbal, and motion information, and is based on a three-in-one training method of robot data, VR human trajectory data, and publicly available graphic data to enhance the ability to respond to new objects...
8mos ago
042.6K
Tizzy.ai - 百度推出的AI搜索应用

Tizzy.ai - AI search app launched by Baidu

Tizzy.ai is an AI intelligent search application launched by Baidu.Tizzy.ai is based on Baidu's big model technology, with powerful intelligent search functions, can quickly answer questions, deep thinking and assist in decision-making.Tizzy.ai has a simple interface, no ads and pop-ups, and the bottom of the guide...
8mos ago
042.5K
MuseSteamer - 百度推出的视频生成大模型

MuseSteamer - Baidu Launches Big Model for Video Generation

MuseSteamer is a large model for multimodal video generation launched by Baidu. The model can quickly generate high-quality dynamic video content based on text descriptions or images provided by the user, supporting a variety of clarity and functionality versions to meet the needs of different scenarios of creation.
9mos ago
042.5K
宠TA - 京东推出的AI宠物互动产品

Pet TA - AI pet interaction product launched by Jingdong

Pet TA is an AIGC pet interactive product launched by Jingdong, which can provide a fun and cozy online interactive platform for pet lovers. It supports users to choose a variety of cute clothes and accessories for their pets, personalized dress up, and can create a digital image of their pets for rich interaction with them. The platform provides...
8mos ago
042.4K
ChatFlow - 开源AI工作流自动化工具

ChatFlow - Open Source AI Workflow Automation Tool

ChatFlow is an open source AI workflow automation tool that supports the transformation of complex requirements into efficient workflows. Tools based on AI technology to help users quickly generate code frameworks, test cases, can assist in writing and designing software architecture.
8mos ago
042.4K
Make - AI无代码自动化工作流搭建平台

Make - AI's no-code automated workflow building platform

Make is an AI-driven no-code automation platform that helps organizations improve efficiency and innovation based on automated processes. The platform offers more than 2,000 pre-built apps that support a variety of business scenarios, such as marketing, sales, finance, etc. Make's core features include no-code visual process creation, AI...
9mos ago
042.3K
EXAONE 4.0 - LG推出的混合推理模型

EXAONE 4.0 - Hybrid Reasoning Model introduced by LG

EXAONE 4.0 is a hybrid reasoning grand model from LG AI Research in Korea, blending general-purpose natural language processing and advanced reasoning capabilities. The model supports Korean, English, and Spanish, and is divided into a 32B professional version and a 1.2B end-side version. The professional version is suitable for legal, accounting...
8mos ago
042.2K
RoboOS 2.0 - 智谱开源的跨本体具身大小脑协作框架

RoboOS 2.0 - Wisdom Spectrum's Open Source Cross-Ontology Embodied Brain-Size Collaboration Framework

RoboOS 2.0 is an open-source framework for cross ontology brain collaboration, promoting the transformation of robots from single intelligence to collaborative group intelligence. The framework realizes efficient division of labor with a "big brain" architecture, where the cloud brain is responsible for complex decision-making and collaboration, and the small brain module focuses on executing specific skills.
8mos ago
042.2K
灵码 IDE - 通义灵码推出 AI 原生开发环境工具

Linguaphone IDE - Tongyi Linguaphone Launches AI Native Development Environment Tools

Spirit Code IDE is the AI native integrated development environment (IDE) launched by Tongyi Spirit Code, which is deeply adapted to the 3 major models of Thousand Questions, and has a powerful programming intelligent body mode to support the autonomous completion of tasks such as project perception, code retrieval, and execution of terminal operations. It supports MCP tools and integrates Magic Hitch MCP Square's 3...
9mos ago
042.1K
RedOne - 小红书最新推出的社交大模型

RedOne - the latest social mega-model from Little Red Book

RedOne is a large language model customized for social networks introduced by Little Red Book. The model is trained through a three-stage training strategy that incorporates social and cultural knowledge, strengthens multitasking capabilities, and aligns human preferences.RedOne significantly outperforms the base model in social task performance, in harmful content detection and browsing...
7mos ago
042.1K
Genie 3 - 谷歌推出的通用世界模型

Genie 3 - A Universal World Model from Google

Genie 3 is a next-generation universal world model from Google DeepMind that enables the generation of highly dynamic and coherent virtual worlds in real time.Genie 3 simulates physical phenomena, natural ecosystems, and supports the creation of fantasy and historical scenarios. With text prompts, users can...
7mos ago
041.9K
11ai - ElevenLabs推出个人AI语音助理

11ai - ElevenLabs Launches Personal AI Voice Assistant

11ai is an AI voice assistant launched by ElevenLabs, with voice interaction as the core, through natural and smooth dialogue to enhance the user's work efficiency. 11ai supports more than 5,000 voices, and users can customize the exclusive voice, the assistant is more personalized. With low-latency voice inter...
9mos ago
041.8K
MoE-TTS - 昆仑万维推出的最新语音生成框架

MoE-TTS - The Latest Speech Generation Framework from KunlunWei

MoE-TTS is a speech synthesis framework introduced by KunlunWanwei, based on the Mixed Expert (MoE) architecture, which combines pre-trained Large Language Models (LLMs) with speech expert modules.MoE-TTS retains the powerful textual reasoning by freezing the textual module parameters and updating only the speech module parameters...
7mos ago
041.8K
绘想 - 百度推出的AI视频生成平台

Painting Thinking - AI Video Generation Platform Launched by Baidu

Painting is an AI video generation platform launched by Baidu, based on AI technology to help users easily create personalized videos. Painting intuitive interface, powerful tools, with inspiration recommendation function, can provide creators with creative inspiration, support a key to the same operation, can quickly generate similar videos, simplify the creative process.
9mos ago
041.8K
CombatVLA - 淘天集团推出的高效VLA模型

CombatVLA - Efficient VLA Model by Amoy Group

CombatVLA is an innovative 3D action role-playing game (ARPG)-specific model from the Future Life Lab team of the Amoy Sky Group.CombatVLA is a visual-linguistic-action (VLA) model, built on a 3B parametric scale, that collects human player's through a motion tracker...
7mos ago
041.7K
ScienceOne - 中国科学院自动化研究所等机构推出的智能科研平台

ScienceOne - Intelligent Research Platform Launched by Institute of Automation, Chinese Academy of Sciences and Other Institutions

ScienceOne is an intelligent scientific research platform jointly launched by Institute of Automation, Chinese Academy of Sciences. The platform is based on the construction of large models of scientific foundation, and promotes a new paradigm of intelligent scientific research with multidisciplinary collaboration, providing support for the whole process of scientific research.The core products of ScienceOne include S1...
9mos ago
041.7K
ThinkSound - 阿里通义推出的音频生成模型

ThinkSound - Audio Generation Model launched by Ali Tongyi

ThinkSound is the first CoT (Chain Thinking) audio generation model introduced by Ali Tongyi's speech team. The model can generate accurately matched sound effects for video images, based on the introduction of CoT reasoning, to solve the problem of traditional technology is difficult to capture the dynamic details of the screen and spatial relationships.
9mos ago
041.6K
Seed Diffusion - 字节跳动最新推出的扩散语言模型

Seed Diffusion - the newest diffusion language model from ByteHopper

Seed Diffusion is an experimental diffusion language model introduced by ByteHop that handles code generation tasks. The model is based on techniques such as two-stage diffusion training, constrained sequential learning, and enhanced efficient parallel decoding, which significantly improves inference speed to 2146 tokens/s, which is faster than...
8mos ago
041.6K
Gemini CLI - 谷歌开源的编程Agent

Gemini CLI - Google Open Source Programming Agent

Gemini CLI is Google's open source AI programming tool based on incorporating the Gemini Big Model into the developer's endpoint to provide developers with powerful AI capabilities. The tool understands code, manipulates files, executes commands, and dynamically troubleshoots problems to help developers efficiently write generation...
9mos ago
041.5K
企鹅读伴 - 腾讯推出的中小学生AI阅读助手

Penguin Reading Companion - Tencent's AI Reading Assistant for Primary and Secondary School Students

Penguin Reading Companion is an AI reading assistant designed for primary and secondary school students by Tencent. Penguin Reading Companion relies on Tencent's hybrid big model and metamachine platform, combined with the Compulsory Education Language Curriculum Program and Curriculum Standards (2022 Edition), to provide students with personalized reading recommendations, multiple reading modes (focusing, reading aloud, listening...
9mos ago
041.5K
商汤如影 - 商汤科技推出的AI数字人视频制作平台

Shangtang Ruyi - AI digital human video production platform launched by Shangtang Technology

Shangtang Ruying is an AI digital human video production platform launched by Shangtang Technology. Based on big model technology, the platform supports the creation of highly realistic digital human images and personalization, including facial features, clothing, hairstyles, and so on. The platform is equipped with sound cloning, video generation, automated data labeling, real-time interaction, and other functions...
9mos ago
041.3K
稿定AI社区 - AI创意内容设计平台,多种设计资源满足不同创作需求

Drafting AI Community - AI creative content design platform, a variety of design resources to meet different creative needs

Drafting AI Community is an online AI creative inspiration platform that provides users with a wealth of creative design resources and tools. The platform covers a variety of design fields, including image photos, e-commerce design, holiday themes, 3D illustrations, avatar design, Xiaohongshu materials, portrait design, etc., to meet the needs of different users.
10mos ago
041K
MagicTryOn - 浙大和vivo等机构推出的视频虚拟试穿框架

MagicTryOn - Video Virtual Try-On Framework from ZJU and Vivo and others

MagicTryOn is an advanced video virtual try-on framework launched by the School of Computer Science and Technology of Zhejiang University in collaboration with vivo and other organizations. The framework replaces the traditional U-Net architecture with an innovative Diffusion Transformer (DiT) architecture, combined with a fully self-attentive machine...
9mos ago
041K
gpt-realtime - OpenAI最新推出的AI语音模型

gpt-realtime - OpenAI's newest AI speech model

gpt-realtime is an advanced speech model from OpenAI that supports direct audio processing to generate natural and smooth speech. The model supports multiple languages and styles, understands non-verbal cues such as laughter, and can switch between languages.
7mos ago
040.9K
Qwen3Guard - 阿里Qwen开源的安全模型

Qwen3Guard - Ali Qwen open source security model

Qwen3Guard is a fine-tuned security protection model based on the Qwen3 base model, designed for security detection. It provides accurate security categorization of prompts and responses, provides risk levels, and supports English, Chinese, and multi-language environments.Qwen3Guard comes with two pro...
6mos ago
040.8K
Megrez-3B-Omni:端侧多模态理解模型,支持文本、图像、音频多模态理解和分析

Megrez-3B-Omni: an end-side multimodal understanding model supporting text, image, and audio multimodal understanding and analysis

Comprehensive Introduction Infini-Megrez is an edge intelligence solution developed by the unquestioned core dome (Infinigence AI), aiming to achieve efficient multimodal understanding and analysis through hardware and software co-design. At the core of the project is the Megrez-3B model, which supports graph...
1yrs ago
040.6K
羚珑 - 京东推出的AI商品图设计工具

Antelope - AI product image design tool launched by Jingdong

Antelope is an intelligent design tool launched by Jingdong, providing efficient and convenient design solutions for e-commerce merchants and individuals. Through intelligent keying, intelligent layout, intelligent color matching and other functions, it helps users to quickly generate high-quality design works to meet the main picture of the product, advertising Banner, store page and other e-commerce store...
9mos ago
040.5K
ChatGPT Agent – OpenAI推出的通用智能AI Agent

ChatGPT Agent - General Intelligence AI Agent by OpenAI

ChatGPT Agent is a general-purpose AI Agent from OpenAI that combines multiple capabilities to autonomously accomplish complex tasks. Users only need to describe their needs in natural language, and the Agent can automatically select the appropriate tools, such as browsing the web, extracting information, running code...
8mos ago
040.4K
HeyGen - AI 数字人视频创作平台,支持多语言翻译配音

HeyGen - AI Digital Human Video Creation Platform with Multi-Language Translation and Dubbing Support

HeyGen is an AI-driven digital human video creation platform that supports a streamlined video production process, allowing users to quickly generate professional-level digital human videos. The platform is based on advanced AI technology, giving users full control over the image and voice of digital people, providing a rich library of material, including diverse background...
9mos ago
040.4K
琴乐大模型 - 腾讯推出的AI音乐创作模型

Piano Music Big Model - AI Music Composition Model by Tencent

Qin Music Grand Model is an advanced AI music creation grand model jointly launched by Tencent AI Lab and Tencent TME Tianqin Lab. The model intelligently generates high-quality stereo audio or multi-track sheet music based on user-inputted keywords, descriptive statements or audio clips in English and Chinese.
9mos ago
040.3K
ViMax - 香港大学开源的多智能体视频生成框架

ViMax - Open Source Multi-intelligent Body Video Generation Framework at the University of Hong Kong

ViMax is an open source multi-intelligence body video generation framework from the Data Science Laboratory of the University of Hong Kong, which can automate the whole process from creative input to video output. Integration of script generation , scene design , shot planning and video rendering and other functions , to support users to generate coherent film and television grade video through natural language description ...
4mos ago
040.3K
问小白5 - 问小白推出的全能AI模型

Ask White 5 - All-in-One AI Model from Ask White

Ask White 5 is the flagship "All in One" model with a very high level of intelligence. The model has excellent performance in many assessments, such as the AA-Index composite assessment score of 64.7 and the STEM ability assessment score of 86, which is close to the world's leading GPT-5.
7mos ago
040.1K