Latest AI Resources

Total 2950 articles
Wenxiaobai o4 - A parallel thinking model from Wenxiaobai that opens 8 thinking paths at the same time

Wenxiaobai o4 is a parallel thinking model that opens 8 thinking paths at the same time, analyzes the problem from multiple perspectives, and automatically selects the best solution. The model incorporates Long-CoT reinforcement learning and process reward learning, giving it strong deep reasoning capabilities and good performance on complex tasks.
5mos ago
VibeVoice - Text-to-Speech Model from Microsoft

VibeVoice is a new text-to-speech (TTS) model from Microsoft. The model generates conversational audio with up to four different speakers and supports up to 90 minutes of continuous speech output, breaking the length limits of traditional TTS systems.
5mos ago
Fun-ASR - A New-Generation Speech Recognition Model Jointly Launched by DingTalk and Tongyi

Fun-ASR is a large speech recognition model jointly launched by DingTalk and Tongyi Lab. Trained on massive amounts of audio data, the model accurately recognizes terminology from many industries, such as internet, technology, and home decoration, significantly improving recognition accuracy. The model draws on DingTalk enterprise information for inference optimization to reduce hallucinations...
5mos ago
Grok 2.5 - Open Source AI Model from Musk's xAI

Grok 2.5 is an open source AI model from Elon Musk's xAI. With 269 billion parameters, it is built on a Mixture-of-Experts (MoE) architecture for strong performance and inference. The model has been tested on graduate-level scientific knowledge (GPQA), general knowledge (MMLU, MM...
5mos ago
ToonComposer - Open Source Generative AI Animation Tool from Tencent

ToonComposer is a generative AI animation tool jointly launched by The Chinese University of Hong Kong, Tencent PCG ARC Lab, and Peking University. Through generative post-keyframing technology, in-between frame generation and colorization are integrated into a single automated process, requiring only a sketch and a...
5mos ago
Seed-OSS - A New AI Model Open-Sourced by the ByteDance Team

Seed-OSS is a family of large language models open-sourced by the ByteDance Seed team, focusing on long-context and reasoning tasks. The model performs well in complex logical reasoning and multi-step reasoning, with high accuracy and efficient problem solving. Seed-OSS supports long contexts of up to 512K...
5mos ago
CombatVLA - Efficient VLA Model from Taotian Group

CombatVLA is a model built specifically for 3D action role-playing games (ARPGs) by the Future Life Lab team at Taotian Group. CombatVLA is a vision-language-action (VLA) model with 3B parameters that uses a motion tracker to collect human players'...
6mos ago
DeepSeek V3.1 - Latest Open Source AI Model from DeepSeek

DeepSeek V3.1 is a new-generation AI model introduced by DeepSeek, with important upgrades over its predecessor, V3. DeepSeek V3.1 introduces a hybrid reasoning architecture that lets the model switch flexibly between thinking and non-thinking modes, significantly improving the thinking...
6mos ago
Qwen-Image-Edit - Open Source Image Editing Model from Alibaba's Tongyi

Qwen-Image-Edit is an all-purpose image editing model introduced by Alibaba's Tongyi, built on the Qwen-Image architecture with 20 billion parameters. The model combines semantic and appearance editing capabilities and can perform low-level visual appearance editing on images (e.g., adding, deleting...
6mos ago
MoE-TTS - The Latest Speech Generation Framework from Kunlun Wanwei

MoE-TTS is a speech synthesis framework introduced by Kunlun Wanwei. Based on a Mixture-of-Experts (MoE) architecture, it combines pre-trained large language models (LLMs) with speech expert modules. MoE-TTS retains strong text reasoning by freezing the text module parameters and updating only the speech module parameters...
6mos ago
RynnEC - Open Source World Understanding Model from Alibaba DAMO Academy

RynnEC is a world understanding model introduced by Alibaba DAMO Academy, focusing on embodied intelligence tasks. Based on multimodal fusion that combines video data and natural language, the model can parse objects in a scene along multiple dimensions, supporting functions such as object understanding, spatial perception, and video object segmentation.
6mos ago
SkyReels-A3 - Audio-Driven Digital Human Creation Tool from Kunlun Wanwei

SkyReels-A3 is an audio-driven digital human creation tool from Kunlun Wanwei. It can generate high-quality dynamic video content from simple inputs (e.g., portrait images and voice), make static photos "come alive", and replace lines in existing videos with new lip-syncs that the characters will automatically...
6mos ago
Genie 3 - A General-Purpose World Model from Google

Genie 3 is a next-generation general-purpose world model from Google DeepMind that generates highly dynamic and coherent virtual worlds in real time. Genie 3 simulates physical phenomena and natural ecosystems, and supports the creation of fantasy and historical scenarios. With text prompts, users can...
6mos ago
RedOne - The Latest Social Large Model from Xiaohongshu

RedOne is a large language model customized for social networks, introduced by Xiaohongshu. The model is trained with a three-stage strategy that incorporates social and cultural knowledge, strengthens multitasking capabilities, and aligns with human preferences. RedOne significantly outperforms the base model on social tasks, in harmful content detection and browsing...
6mos ago
MindLink - Open Source Large Reasoning Model from Kunlun Wanwei

MindLink is an open source large reasoning model launched by Kunlun Wanwei. With an adaptive reasoning mechanism, it switches reasoning modes flexibly according to task complexity: simple tasks get fast answers and complex tasks get in-depth reasoning, balancing efficiency and accuracy. Its plan-driven reasoning paradigm removes the "think" label, down...
6mos ago
HYPIR - A New Large Model for Image Restoration from a Chinese Academy of Sciences Team

HYPIR is a large image restoration model introduced by Dong Chao's team at the Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences. The model combines the score prior of diffusion models with generative adversarial networks to achieve efficient, high-quality image restoration. HYPIR can quickly restore old photos and improve resolution while keeping text clear...
6mos ago
Seed Diffusion - The Newest Diffusion Language Model from ByteDance

Seed Diffusion is an experimental diffusion language model introduced by ByteDance that focuses on code generation tasks. The model relies on techniques such as two-stage diffusion training, constrained sequential learning, and enhanced efficient parallel decoding to raise inference speed to 2146 tokens/s, which is faster than...
6mos ago
1688 AI Edition - AI Business Assistant Launched by Alibaba's 1688 Platform

1688 AI Edition is an intelligent business assistant app launched by Alibaba's 1688 platform, designed for small business buyers and merchants. Drawing on the massive data of the 1688 platform, the app provides business opportunity alerts, product recommendations, idea generation, company lookup, and other functions to help users accurately grasp market dynamics, rapid...
6mos ago
StepFun Deep Research - AI Deep Research Tool from StepFun

StepFun Deep Research is an efficient AI research tool launched by StepFun that can autonomously research complex questions and generate professional reports in a short time. The tool is designed for finance, consulting, healthcare, law, and other fields, and excels at industry reviews with its in-depth search and information integration capabilities.
6mos ago
Runway Aleph - New AI Video Editing Model from Runway

Runway Aleph is an advanced AI video editing model launched by Runway. Using simple text commands, it can quickly add and remove video content, change styles, adjust environments, and optimize camera movement. Users can easily remove redundant elements and change scenes without complex operations...
6mos ago
WebShaper - Open Source AI Training Data Synthesis System from Alibaba's Tongyi

WebShaper is an AI training data synthesis system launched by Alibaba's Tongyi Lab. It uses formal modeling and an agentic expansion mechanism to generate high-quality, scalable training data that helps AI agents improve their complex information retrieval capabilities. The system introduces the concept of "knowledge projection"...
6mos ago
Intern-S1 - Open Source Scientific Multimodal Large Model from Shanghai AI Lab

Intern-S1 is a scientific multimodal large model launched by Shanghai Artificial Intelligence Laboratory. The model deeply integrates language and multimodal capabilities, with functions such as cross-modal scientific parsing, language and vision fusion, scientific data processing, scientific question answering, and experiment design and optimization.
6mos ago
Opal - AI Workflow Creation Platform from Google

Opal is an AI mini-app generation platform from Google Labs that helps users quickly create and share AI apps without writing code. Opal lets users chain prompts, model calls, and tools into a multi-step process through natural language interaction and a visual editing interface...
6mos ago
MonkeyCode - Open Source Enterprise AI Programming Assistant

MonkeyCode is an open source, enterprise-grade, native AI programming assistant designed for privacy- and security-conscious development teams. MonkeyCode supports private deployment and offline use to ensure the security of code data.
6mos ago
ChatFlow - Open Source AI Workflow Automation Tool

ChatFlow is an open source AI workflow automation tool that turns complex requirements into efficient workflows. It uses AI to help users quickly generate code frameworks and test cases, and can assist in writing and designing software architecture.
6mos ago
Seed GR-3 - General-Purpose Robot Model from the ByteDance Seed Team

Seed GR-3 is a general-purpose robot model introduced by ByteDance, with strong generalization that lets it adapt to new environments and complex commands. The model fuses vision, language, and action information and uses a three-in-one training method based on robot data, VR human trajectory data, and publicly available image-text data to enhance the ability to respond to new objects...
6mos ago
TRAE SOLO - AI Automated Development Assistant from ByteDance's TRAE

TRAE SOLO is an AI automated development assistant from TRAE, the AI programming assistant launched by ByteDance, that simplifies the software development process with AI. TRAE SOLO understands the user's needs, accepts requirements as text descriptions, voice commands, and file uploads, and automatically plans...
7mos ago
Goedel-Prover-V2 - Open Source Theorem Proving Model from Princeton with Tsinghua, NVIDIA, and Others

Goedel-Prover-V2 is an open-source theorem proving model jointly released by organizations including Princeton University, Tsinghua University, and NVIDIA. The model uses techniques such as hierarchical data synthesis, verifier-guided self-correction, and model averaging to significantly improve the performance of automated formal proofs...
7mos ago
Feishu Miaoda - AI-Native System Building Platform from Feishu

Feishu Miaoda is an enterprise-level AI-native system building platform launched by Feishu. The platform quickly turns enterprise business requirements into working applications through a multi-agent architecture, supporting the whole process from requirements analysis to functional design, application development, and bug fixing. Users use a dialog to easily build lightweight...
7mos ago