Latest AI Resources

Total 3098 articles posts
靠岸妙写 - AI论文写作工具,构思到成稿一站式解决

Leaning Wonderful Writer - AI essay writing tool, one-stop solution from idea to finished paper

Leaning Wonderful Writer is an AI dissertation writing tool that provides an efficient and convenient solution for academic writing. The tool supports one-click generation of dissertation outlines, abstracts and first drafts of the body of the paper, which is applicable to different levels of academic needs such as undergraduate and master's degrees, and covers multi-disciplinary fields such as science and technology, liberal arts and social sciences.
11mos ago
050.3K
Seed Diffusion - 字节跳动最新推出的扩散语言模型

Seed Diffusion - the newest diffusion language model from ByteHopper

Seed Diffusion is an experimental diffusion language model introduced by ByteHop that handles code generation tasks. The model is based on techniques such as two-stage diffusion training, constrained sequential learning, and enhanced efficient parallel decoding, which significantly improves inference speed to 2146 tokens/s, which is faster than...
9mos ago
050.1K
Seed GR-3 - 字节跳动Seed团队推出的通用机器人模型

Seed GR-3 - Generalized Robotics Model from the Wordpress Seed Team

Seed GR-3 is a general-purpose robot model introduced by ByteDance with strong generalization ability to adapt to new environments and complex commands. The model fuses visual, verbal, and motion information, and is based on a three-in-one training method of robot data, VR human trajectory data, and publicly available graphic data to enhance the ability to respond to new objects...
9mos ago
050.1K
浙江大学免费PDF资料《大模型基础》 - 附下载链接

Free PDF of Fundamentals of Large Models from Zhejiang University - with download link

Fundamentals of Large Models provides an in-depth analysis of the core technologies and practical paths of Large Language Models (LLMs). Starting from the fundamental theory of language modeling, it systematically explains the principles of model design based on statistics, recurrent neural networks (RNN), and Transformer architecture, focusing on the three major big language model...
7mos ago
050.1K
Make - AI无代码自动化工作流搭建平台

Make - AI's no-code automated workflow building platform

Make is an AI-driven no-code automation platform that helps organizations improve efficiency and innovation based on automated processes. The platform offers more than 2,000 pre-built apps that support a variety of business scenarios, such as marketing, sales, finance, etc. Make's core features include no-code visual process creation, AI...
11mos ago
050K
妙构 - AI视频分析与生成工具,揭示爆款视频创作规律

MyoConstruct - AI Video Analysis and Generation Tool, Revealing the Laws of Explosive Video Creation

Miaojiao is a professional AI video content analysis and generation tool, based on deep learning algorithms, which analyzes the visual, audio and creative structure of the video in an all-round way, revealing the laws behind the explosive videos. Miaojiao can analyze composition, color, and lens language, assess creative uniqueness and emotional resonance, and provide trend insights and optimization suggestions...
11mos ago
050K
MuseSteamer - 百度推出的视频生成大模型

MuseSteamer - Baidu Launches Big Model for Video Generation

MuseSteamer is a large model for multimodal video generation launched by Baidu. The model can quickly generate high-quality dynamic video content based on text descriptions or images provided by the user, supporting a variety of clarity and functionality versions to meet the needs of different scenarios of creation.
10mos ago
050K
Step-Audio 2 mini - 阶跃星辰开源的语音大模型

Step-Audio 2 mini - Step-Star Open Source Speech Megamodels

Step-Audio 2 mini is an open source end-to-end speech grand model of Step-Audio. It breaks through the traditional speech model structure and adopts the true end-to-end multimodal architecture, which directly transforms the original audio input into speech response output with lower latency, and understands paralinguistic information and non-vocal signals.
8mos ago
050K
Vortn:利用AI编写与管理企业内部知识库

Vortn: Authoring and managing an in-house knowledge base with AI

Comprehensive Introduction Vortn is a platform focused on intelligent knowledge storage and management, providing users with personalized knowledge management services through AI agents and access control systems. The platform supports the use of AI chat functionality to provide intelligent responses based on context to help users better organize, access and leverage information...
1yrs ago
050K
FastVLM - 苹果公司推出的视觉语言模型

FastVLM - Visual Language Model from Apple

FastVLM (Fast Vision Language Model) is an efficient visual language model introduced by Apple Inc. With FastViTHD hybrid visual coder as the core, it incorporates convolutional and Transformer architectures to significantly reduce visual...
8mos ago
049.9K
Tizzy.ai - 百度推出的AI搜索应用

Tizzy.ai - AI search app launched by Baidu

Tizzy.ai is an AI intelligent search application launched by Baidu.Tizzy.ai is based on Baidu's big model technology, with powerful intelligent search functions, can quickly answer questions, deep thinking and assist in decision-making.Tizzy.ai has a simple interface, no ads and pop-ups, and the bottom of the guide...
10mos ago
049.8K
探饭 - 字节跳动推出的AI美食推荐工具

Scouting Rice - AI food recommendation tool launched by Wordpress

TanRice is an AI food recommendation tool launched by Jitterbug, a subsidiary of ByteDance, which relies on the Beanbag Big Model to provide users with personalized food recommendations, store scouting comparisons, food tips and other services. TanRice can accurately recommend nearby restaurants and dishes based on users' taste preferences and location, support assisted ordering, and provide group-buying and takeaway services...
10mos ago
049.7K
VoxCPM - 面壁智能联合清华开源的端到端TTS模型

VoxCPM - Faceted Intelligence and Tsinghua Open Source End-to-End TTS Model

VoxCPM is a speech generation model jointly open-sourced by Facade Intelligence and Shenzhen International Graduate School of Tsinghua University.VoxCPM adopts an end-to-end diffusion autoregressive architecture to generate continuous speech representations directly from text, breaking through the limitations of traditional discrete disambiguation. Through hierarchical language modeling and finite state quantization...
7mos ago
049.7K
MonkeyCode - 开源的企业级AI编程助手

MonkeyCode - Open Source Enterprise AI Programming Assistant

MonkeyCode is an open source, enterprise-grade, native AI programming assistant designed for privacy- and security-conscious development teams.MonkeyCode supports private deployment and offline use to ensure code data security. MonkeyCode supports private deployment and offline use to ensure the security of code data.
9mos ago
049.7K
宠TA - 京东推出的AI宠物互动产品

Pet TA - AI pet interaction product launched by Jingdong

Pet TA is an AIGC pet interactive product launched by Jingdong, which can provide a fun and cozy online interactive platform for pet lovers. It supports users to choose a variety of cute clothes and accessories for their pets, personalized dress up, and can create a digital image of their pets for rich interaction with them. The platform provides...
10mos ago
049.4K
RoboOS 2.0 - 智谱开源的跨本体具身大小脑协作框架

RoboOS 2.0 - Wisdom Spectrum's Open Source Cross-Ontology Embodied Brain-Size Collaboration Framework

RoboOS 2.0 is an open-source framework for cross ontology brain collaboration, promoting the transformation of robots from single intelligence to collaborative group intelligence. The framework realizes efficient division of labor with a "big brain" architecture, where the cloud brain is responsible for complex decision-making and collaboration, and the small brain module focuses on executing specific skills.
10mos ago
049.4K
JoyHallo - 京东开源的AI数字人模型

JoyHallo - Jingdong's open source AI digital human model

JoyHallo is an open source AI digital human model from Jingdong, designed for Mandarin, supporting the conversion of audio into realistic speaking video.JoyHallo embeds audio features based on the wav2vec2 model, using a semi-decoupled structure to improve the accuracy of lip movement prediction, and supports the generation of English video...
11mos ago
049.4K
Doppl - 谷歌推出的AI虚拟试衣应用

Doppl - AI virtual fitting app from Google

Doppl is an AI virtual fitting application launched by Google. After the user uploads a full body photo, the application supports the clothing picture or screenshot "wear" in the digital version of their own body, and can be converted from static pictures to AI-generated video, so that users can more truly feel the effect of clothing on the body.
10mos ago
049K
Gemini CLI - 谷歌开源的编程Agent

Gemini CLI - Google Open Source Programming Agent

Gemini CLI is Google's open source AI programming tool based on incorporating the Gemini Big Model into the developer's endpoint to provide developers with powerful AI capabilities. The tool understands code, manipulates files, executes commands, and dynamically troubleshoots problems to help developers efficiently write generation...
10mos ago
048.9K
ThinkSound - 阿里通义推出的音频生成模型

ThinkSound - Audio Generation Model launched by Ali Tongyi

ThinkSound is the first CoT (Chain Thinking) audio generation model introduced by Ali Tongyi's speech team. The model can generate accurately matched sound effects for video images, based on the introduction of CoT reasoning, to solve the problem of traditional technology is difficult to capture the dynamic details of the screen and spatial relationships.
10mos ago
048.8K
ChatFlow - 开源AI工作流自动化工具

ChatFlow - Open Source AI Workflow Automation Tool

ChatFlow is an open source AI workflow automation tool that supports the transformation of complex requirements into efficient workflows. Tools based on AI technology to help users quickly generate code frameworks, test cases, can assist in writing and designing software architecture.
9mos ago
048.7K
11ai - ElevenLabs推出个人AI语音助理

11ai - ElevenLabs Launches Personal AI Voice Assistant

11ai is an AI voice assistant launched by ElevenLabs, with voice interaction as the core, through natural and smooth dialogue to enhance the user's work efficiency. 11ai supports more than 5,000 voices, and users can customize the exclusive voice, the assistant is more personalized. With low-latency voice inter...
10mos ago
048.6K
Megrez-3B-Omni:端侧多模态理解模型,支持文本、图像、音频多模态理解和分析

Megrez-3B-Omni: an end-side multimodal understanding model supporting text, image, and audio multimodal understanding and analysis

Comprehensive Introduction Infini-Megrez is an edge intelligence solution developed by the unquestioned core dome (Infinigence AI), aiming to achieve efficient multimodal understanding and analysis through hardware and software co-design. At the core of the project is the Megrez-3B model, which supports graph...
1yrs ago
048.6K
Genie 3 - 谷歌推出的通用世界模型

Genie 3 - A Universal World Model from Google

Genie 3 is a next-generation universal world model from Google DeepMind that enables the generation of highly dynamic and coherent virtual worlds in real time.Genie 3 simulates physical phenomena, natural ecosystems, and supports the creation of fantasy and historical scenarios. With text prompts, users can...
9mos ago
048.5K
羚珑 - 京东推出的AI商品图设计工具

Antelope - AI product image design tool launched by Jingdong

Antelope is an intelligent design tool launched by Jingdong, providing efficient and convenient design solutions for e-commerce merchants and individuals. Through intelligent keying, intelligent layout, intelligent color matching and other functions, it helps users to quickly generate high-quality design works to meet the main picture of the product, advertising Banner, store page and other e-commerce store...
10mos ago
048.3K
MagicTryOn - 浙大和vivo等机构推出的视频虚拟试穿框架

MagicTryOn - Video Virtual Try-On Framework from ZJU and Vivo and others

MagicTryOn is an advanced video virtual try-on framework launched by the School of Computer Science and Technology of Zhejiang University in collaboration with vivo and other organizations. The framework replaces the traditional U-Net architecture with an innovative Diffusion Transformer (DiT) architecture, combined with a fully self-attentive machine...
11mos ago
048.2K
稿定AI社区 - AI创意内容设计平台,多种设计资源满足不同创作需求

Drafting AI Community - AI creative content design platform, a variety of design resources to meet different creative needs

Drafting AI Community is an online AI creative inspiration platform that provides users with a wealth of creative design resources and tools. The platform covers a variety of design fields, including image photos, e-commerce design, holiday themes, 3D illustrations, avatar design, Xiaohongshu materials, portrait design, etc., to meet the needs of different users.
11mos ago
048.1K
Qwen-Image-Edit - 阿里通义开源的图像编辑模型

Qwen-Image-Edit - Ali Tongyi open source image editing model

Qwen-Image-Edit is an all-purpose image editing model introduced by Ali Tongyi, built on the Qwen-Image architecture with 20 billion parameters. The model combines both semantic and appearance editing capabilities, and can perform low-level visual appearance editing on images (e.g., adding, deleting...
8mos ago
048.1K
CombatVLA - 淘天集团推出的高效VLA模型

CombatVLA - Efficient VLA Model by Amoy Group

CombatVLA is an innovative 3D action role-playing game (ARPG)-specific model from the Future Life Lab team of the Amoy Sky Group.CombatVLA is a visual-linguistic-action (VLA) model, built on a 3B parametric scale, that collects human player's through a motion tracker...
8mos ago
048K
RedOne - 小红书最新推出的社交大模型

RedOne - the latest social mega-model from Little Red Book

RedOne is a large language model customized for social networks introduced by Little Red Book. The model is trained through a three-stage training strategy that incorporates social and cultural knowledge, strengthens multitasking capabilities, and aligns human preferences.RedOne significantly outperforms the base model in social task performance, in harmful content detection and browsing...
9mos ago
048K
QVQ-Max - 阿里通义推出视觉推理模型

QVQ-Max - Ali Tongyi Launches Visual Reasoning Models

QVQ-Max is a state-of-the-art visual reasoning model introduced by Alitonix, which is an upgraded version of QVQ-72B-Preview. QVQ-Max is an advanced visual reasoning model that can "read" images and video content and combine the information for analysis, reasoning and problem solving.QVQ-Max's main functions include image parsing, video analysis...
11mos ago
048K
SkyReels-A3 - 昆仑万维推出的音频驱动数字人创作工具

SkyReels-A3 - Audio-Driven Digital Human Creation Tool from KunlunWangwei

SkyReels-A3 is an audio-driven digital human creation tool from Kunlun World Wide Group. SkyReels-A3 is an audio-driven digital human creation tool, which can generate high-quality dynamic video content through simple inputs (e.g., portrait images and voice), make static photos "come alive", and replace lines for existing videos with new lip-syncs that the characters will automatically...
9mos ago
048K
ChatGPT Agent – OpenAI推出的通用智能AI Agent

ChatGPT Agent - General Intelligence AI Agent by OpenAI

ChatGPT Agent is a general-purpose AI Agent from OpenAI that combines multiple capabilities to autonomously accomplish complex tasks. Users only need to describe their needs in natural language, and the Agent can automatically select the appropriate tools, such as browsing the web, extracting information, running code...
10mos ago
047.7K
MoE-TTS - 昆仑万维推出的最新语音生成框架

MoE-TTS - The Latest Speech Generation Framework from KunlunWei

MoE-TTS is a speech synthesis framework introduced by KunlunWanwei, based on the Mixed Expert (MoE) architecture, which combines pre-trained Large Language Models (LLMs) with speech expert modules.MoE-TTS retains the powerful textual reasoning by freezing the textual module parameters and updating only the speech module parameters...
9mos ago
047.7K
琴乐大模型 - 腾讯推出的AI音乐创作模型

Piano Music Big Model - AI Music Composition Model by Tencent

Qin Music Grand Model is an advanced AI music creation grand model jointly launched by Tencent AI Lab and Tencent TME Tianqin Lab. The model intelligently generates high-quality stereo audio or multi-track sheet music based on user-inputted keywords, descriptive statements or audio clips in English and Chinese.
11mos ago
047.7K
绘想 - 百度推出的AI视频生成平台

Painting Thinking - AI Video Generation Platform Launched by Baidu

Painting is an AI video generation platform launched by Baidu, based on AI technology to help users easily create personalized videos. Painting intuitive interface, powerful tools, with inspiration recommendation function, can provide creators with creative inspiration, support a key to the same operation, can quickly generate similar videos, simplify the creative process.
10mos ago
047.6K
企鹅读伴 - 腾讯推出的中小学生AI阅读助手

Penguin Reading Companion - Tencent's AI Reading Assistant for Primary and Secondary School Students

Penguin Reading Companion is an AI reading assistant designed for primary and secondary school students by Tencent. Penguin Reading Companion relies on Tencent's hybrid big model and metamachine platform, combined with the Compulsory Education Language Curriculum Program and Curriculum Standards (2022 Edition), to provide students with personalized reading recommendations, multiple reading modes (focusing, reading aloud, listening...
11mos ago
047.5K
Qwen3Guard - 阿里Qwen开源的安全模型

Qwen3Guard - Ali Qwen open source security model

Qwen3Guard is a fine-tuned security protection model based on the Qwen3 base model, designed for security detection. It provides accurate security categorization of prompts and responses, provides risk levels, and supports English, Chinese, and multi-language environments.Qwen3Guard comes with two pro...
7mos ago
047.5K
商汤如影 - 商汤科技推出的AI数字人视频制作平台

Shangtang Ruyi - AI digital human video production platform launched by Shangtang Technology

Shangtang Ruying is an AI digital human video production platform launched by Shangtang Technology. Based on big model technology, the platform supports the creation of highly realistic digital human images and personalization, including facial features, clothing, hairstyles, and so on. The platform is equipped with sound cloning, video generation, automated data labeling, real-time interaction, and other functions...
11mos ago
047.4K