Latest AI Resources

Total 3112 articles posts

Course materials Latest AI Resources AI Knowledge Base AI News

Sorting

FactSnap - 新一代AI信息核查工具

FactSnap - Next Generation AI Information Verification Tool

FactSnap is a new generation AI information verification tool that helps users quickly verify the authenticity of web information. By integrating multiple models and search engines, it performs real-time verification of selected text while the user is browsing the web.

Latest AI Resources

1yrs ago

051.8K

商汤如影 - 商汤科技推出的AI数字人视频制作平台

Shangtang Ruyi - AI digital human video production platform launched by Shangtang Technology

Shangtang Ruying is an AI digital human video production platform launched by Shangtang Technology. Based on big model technology, the platform supports the creation of highly realistic digital human images and personalization, including facial features, clothing, hairstyles, and so on. The platform is equipped with sound cloning, video generation, automated data labeling, real-time interaction, and other functions...

Latest AI Resources

1yrs ago

051.8K

Higress MCP - 今日投资推出的MCP服务平台

Higress MCP - Invest Today Launches MCP Services Platform

Higress MCP is an innovative platform launched by Invest Today that supports the rapid transformation of traditional financial data APIs into modern MCP services.Higress MCP enables the transformation of REST APIs to MCP Server based on a simple configuration without the need to program...

Latest AI Resources

12mos ago

051.8K

Qwen3-Coder-Flash - 阿里通义推出的开源高性能编程模型

Qwen3-Coder-Flash - an open source high performance programming model from Ali Tongyi

Qwen3-Coder-Flash is a high-performance programming model introduced by Ali Tongyi Thousand Questions team, which has excellent agent-based programming and tool invocation capabilities, and is good at handling complex programming tasks. The model supports 256K tokens of long context understanding, and can scale to 1M ...

Latest AI Resources

11mos ago

051.7K

Intern-S1-mini - 上海AI Lab开源的轻量化科学多模态模型

Intern-S1-mini - Lightweight scientific multimodal model open source by Shanghai AI Lab

Intern-S1-mini is a lightweight scientific multimodal macromodel with parameter scale of 8B launched by Shanghai Artificial Intelligence Laboratory (SAL).It inherits the powerful capabilities of Intern-S1, combining both general and specialized scientific capabilities, and is suitable for rapid deployment and secondary development. In terms of performance, I...

Latest AI Resources

10mos ago

051.7K

Gemini 2.5 Flash Image - 谷歌推出的最强图像生成与编辑模型

Gemini 2.5 Flash Image - The Most Powerful Image Generation and Editing Model from Google

Gemini 2.5 Flash Image (codename nano banana) is a state-of-the-art image generation and editing model from Google that maintains the consistency of characters across different scenes and supports precise image editing through natural language, such as blurring backgrounds and removing stains.

Latest AI Resources

10mos ago

051.7K

Agentar-Fin-R1 - 蚂蚁数科推出的金融领域推理大模型

Agentar-Fin-R1 - A Grand Model for Reasoning in Finance by Anthem Digital

Agentar-Fin-R1 is a state-of-the-art large language model for the financial domain introduced by Anthem. Developed based on the powerful Qwen3 architecture, the model provides two parameter scale versions, 8B and 32B, and can accurately handle complex financial reasoning tasks, including multi-step analysis, risk assessment and war...

Latest AI Resources

11mos ago

051.6K

Magentic-UI - 微软开源的人机协作AI Agent

Magentic-UI - Microsoft Open Source AI Agent for Human-Computer Collaboration

Magentic-UI is Microsoft's open source human-computer collaboration AI Agent research tool.Magentic-UI is based on working closely with users to facilitate complex Web tasks such as Web browsing, code execution, and file handling. The tool emphasizes collaborative planning, enabling users to raise...

Latest AI Resources

1yrs ago

051.6K

TuriX-CUA - 开源AI桌面自动化工具，AI直接操作电脑桌面

TuriX-CUA - 开源AI桌面自动化工具，AI直接操作电脑桌面

TuriX-CUA 是开源的 AI 桌面自动化工具，能通过截屏、多模态模型决策和自动化操作实现电脑交互。让 AI 模型直接操作电脑桌面环境。支持 macOS 和 Windows 系统，通过先进的计算机...

Latest AI Resources

6mos ago

051.6K

Report mAIstro：生成任意自定义主题的详细报告文档，例如商业分析、年终汇报等

Report mAIstro: Generate detailed reports on any customizable topic, such as business analysis, year-end reporting, etc.

General Description Report mAIstro is a powerful tool designed to help users easily create customized reports through natural language processing technology. The tool utilizes LangChain technology to transform user-supplied topics and structures into detailed reports within...

Latest AI Resources # AI Java Open Source Projecct # Generate in-depth research report

1yrs ago

051.5K

Mistral Code - Mistral AI推出面向企业的AI编程助手

Mistral Code - Mistral AI Launches AI Programming Assistant for the Enterprise

Mistral Code is an AI programming assistant for enterprise development teams launched by Mistral AI, integrating the four major models of Codestral, Codestral Embed, Devstral and Mistral Medium, supporting...

Latest AI Resources

1yrs ago

051.5K

Qwen VLo – 通义千问推出的多模态统一理解与生成模型

Qwen VLo - A Unified Multimodal Comprehension and Generation Model by Tongyi Qianqian

Qwen VLo is a multimodal unified comprehension and generation model introduced by Tongyi Qianqian team. Qwen VLo can "understand" the world and recreate with high quality based on its understanding, realizing the leap from perception to generation. VLo can accurately understand the image content, and on the basis of this, it can carry out consistent and high-quality generation.

Latest AI Resources

12mos ago

051.5K

优雅YOYA - 中科闻歌推出的AI音视频内容创作平台

Elegant YOYA - AI Audio/Video Content Creation Platform Launched by Sinotech Winkler

Elegant YOYA is a multimodal literate video platform launched by Zhongke Wenge, the platform is based on AI multimodal technology to empower the whole chain of video content creation. Users only need to input the theme requirements, the platform can quickly generate scripts, images, videos, and can complete intelligent editing, voice synthesis and character mouth drive and other operations, the output...

Latest AI Resources

1yrs ago

051.4K

MagicTryOn - 浙大和vivo等机构推出的视频虚拟试穿框架

MagicTryOn - Video Virtual Try-On Framework from ZJU and Vivo and others

MagicTryOn is an advanced video virtual try-on framework launched by the School of Computer Science and Technology of Zhejiang University in collaboration with vivo and other organizations. The framework replaces the traditional U-Net architecture with an innovative Diffusion Transformer (DiT) architecture, combined with a fully self-attentive machine...

Latest AI Resources

1yrs ago

051.4K

gpt-oss - OpenAI推出的开源推理模型系列

gpt-oss - a family of open source inference models from OpenAI

gpt-oss is a family of open source inference models from OpenAI that enable efficient, flexible, and easy-to-deploy AI solutions for developers. gpt-oss consists of two versions, gpt-oss-120B with 117 billion parameters and support for 8...

Latest AI Resources

11mos ago

051.3K

LongCat-Video-Avatar - MeiTuan open source avatar video generation model

LongCat-Video-Avatar is an advanced audio-driven video generation model built on LongCat-Video open-sourced by Meituan, focusing on generating hyper-realistic, lip-synchronized long videos with natural dynamics and consistent identity.

Latest AI Resources

6mos ago

051.3K

Mureka V7 - 昆仑万维推出的AI音乐生成模型

Mureka V7 - AI Music Generation Models from Quintessence

Mureka V7 is a state-of-the-art AI music generation model launched by Kunlun World Wide. The model is based on MusiCoT technology, which supports planning the overall structure of the music before filling in the details to generate more coherent and artistic music works.

Latest AI Resources

11mos ago

051.3K

gpt-realtime - OpenAI最新推出的AI语音模型

gpt-realtime - OpenAI's newest AI speech model

gpt-realtime is an advanced speech model from OpenAI that supports direct audio processing to generate natural and smooth speech. The model supports multiple languages and styles, understands non-verbal cues such as laughter, and can switch between languages.

Latest AI Resources

10mos ago

051.3K

InternVLA-A1 - 上海AI Lab开源一体化操作能力的具身大模型

InternVLA-A1 - Shanghai AI Lab Open Source Integration of Operational Capabilities for Embodied Large Models

InternVLA-A1 is a large model of embodied operation open-sourced by Shanghai Artificial Intelligence Laboratory. It has the ability to understand, imagine, and execute the integration, and can accurately complete the task. The model fuses real and simulated operational data, and automates the construction of massive multimodal through large-scale virtual-real hybrid scene assets...

Latest AI Resources

9mos ago

051.3K

TRELLIS.2 - 微软开源的大型3D生成模型

TRELLIS.2 - Microsoft Open Source Large Scale 3D Generative Modeling

TRELLIS.2 is a Microsoft open source large-scale 3D generative model , with 4 billion parameters , focusing on high-fidelity image to 3D generation . Using the innovative "O-Voxel" sparse voxel structure , can efficiently handle complex topology and sharp features , to generate high-quality 3D information with full PBR material ...

Latest AI Resources

6mos ago

051.2K

全球首个量子 AI 模型问世！SECQAI 发布 QLLM 即将进入 Beta 测试

World's First Quantum AI Model! SECQAI Releases QLLM for Beta Testing!

SECQAI, a UK-based ultra-secure hardware and software company, announces the world's first Quantum Large Language Model (QLLM), which integrates quantum computing technology into traditional AI models to improve computational efficiency and problem solving capabilities. Quantum mechanics + AI = more powerful AI? ...

Latest AI Resources

1yrs ago

051.2K

OpenAI《在AI时代保持领先》PDF指南 - 附下载链接

OpenAI's PDF Guide to Staying Ahead in the Age of AI - with Download Links

Staying ahead in the age of AI is an AI leadership guide from OpenAI that helps business leaders maintain a competitive edge in the age of AI. The guide points to the rapid growth of AI, with faster model releases, lower costs, and faster enterprise adoption...

Latest AI Resources Course materials

9mos ago

051.2K

Tencent-HY-MT1.5 - 腾讯混元开源的翻译模型系列

Tencent-HY-MT1.5 - Tencent hybrid open source translation model series

Tencent-HY-MT1.5 is Tencent hybrid open source translation model version 1.5, including 1.8B and 7B two models, support for 33 international languages and 5 kinds of folk Chinese/dialect translation.1.8B model is specially optimized for cell phones and other consumer-grade devices, only 1GB of RAM can be achieved end-side...

Latest AI Resources

6mos ago

051.2K

用语音和文字控制macOS操作的开源工具

Open source tool to control macOS operations with voice and text

General Introduction MacOS LLM Controller is an open source desktop application, hosted on GitHub, that allows users to execute macOS system commands by entering natural language commands via voice or text. It is based on Llama-3.2-3B...

Latest AI Resources

1yrs ago

051.2K

Magistral - Mistral AI 推出的系列推理模型

Magistral - Series of inference models from Mistral AI

Magistral is an inference model from Mistral AI that focuses on transparent, multilingual and domain-specific reasoning capabilities. The model consists of an open source version (Magistral Small) and an enterprise version (Magistral Medium), the latter in...

Latest AI Resources

1yrs ago

051.2K

Qwen3Guard - 阿里Qwen开源的安全模型

Qwen3Guard - Ali Qwen open source security model

Qwen3Guard is a fine-tuned security protection model based on the Qwen3 base model, designed for security detection. It provides accurate security categorization of prompts and responses, provides risk levels, and supports English, Chinese, and multi-language environments.Qwen3Guard comes with two pro...

Latest AI Resources

9mos ago

051.2K

MoE-TTS - 昆仑万维推出的最新语音生成框架

MoE-TTS - The Latest Speech Generation Framework from KunlunWei

MoE-TTS is a speech synthesis framework introduced by KunlunWanwei, based on the Mixed Expert (MoE) architecture, which combines pre-trained Large Language Models (LLMs) with speech expert modules.MoE-TTS retains the powerful textual reasoning by freezing the textual module parameters and updating only the speech module parameters...

Latest AI Resources

10mos ago

051.2K

FireRed-Image-Edit - 小红书团队开源的通用图像编辑模型

FireRed-Image-Edit - 小红书团队开源的通用图像编辑模型

FireRed-Image-Edit 是小红书 Super Intelligence 团队开源的通用图像编辑模型，基于扩散 Transformer 架构，在 GEdit、ImgEdit 等多个权威评测...

Latest AI Resources

4mos ago

051.1K

SpatialGen - 群核科技推出的开源3D场景生成模型

SpatialGen - Open Source 3D Scene Generation Model by Qunar Technology

SpatialGen is an open source 3D scene generation model of Qunar Technology, based on the diffusion model architecture, supporting the generation of spatio-temporally consistent multi-view images based on textual descriptions, reference images and 3D spatial layouts, and further generating 3D Gaussian scenes and rendering roaming videos.

Latest AI Resources

10mos ago

051K

AntSK FileChunk - 免费的AI语义文档切片工具，动态切片调整

AntSK FileChunk - Free AI Semantic Document Slicing Tool, Dynamic Slicing Adjustment

AntSK FileChunk is a free intelligent document slicing tool designed for RAG (Retrieval Augmented Generation) applications. Semantic as the core, the document will be intelligently sliced into semantically complete, coherent segments , support for multi-language , can dynamically adjust the size of the slice to ensure that the context of coherence.

Latest AI Resources

9mos ago

051K

剪影专业版6.0.x，新年快乐版

Silhouette Pro 6.0.x, Happy New Year Edition

You can use all vip features without membership, unzip and use, don't upgrade! Never upgrade! Don't upgrade! Link: https://pan.quark.cn/s/a120ee707f47 Extract Code: jHDN

Latest AI Resources

1yrs ago

051K

Qwen-Image-Layered - 阿里团队开源的AI图像编辑模型

Qwen-Image-Layered - AI image editing model open-sourced by Ali team

Qwen-Image-Layered is an open source AI image editing model by Ali team, which can intelligently decompose ordinary images into independent transparent layers to achieve accurate editing similar to Photoshop. The model is open source using the Apache 2.0 protocol and supports flexible control of layers...

Latest AI Resources

6mos ago

051K

DragonV2.1 - 微软推出的零样本语音合成模型

DragonV2.1 - Zero-Sample Speech Synthesis Model from Microsoft

DragonV2.1 is an advanced zero-sample text-to-speech (TTS) model from Microsoft. Based on the Transformer architecture, the model supports multi-language and zero-sample speech cloning, and generates natural, expressive speech with only 5-90 seconds of voice prompts.

Latest AI Resources

11mos ago

051K

FireRedChat - 小红书开源的全双工语音交互系统

FireRedChat - Little Red Book's open source full-duplex voice interaction system

FireRedChat is an open source full-duplex voice interaction system for Xiaohongshu with real-time bidirectional dialog capabilities and support for controlled interruptions. Adopts a modular design , including transcription control module , interaction module and dialogue manager , etc., supports cascade and semi-cascade architecture , can be flexibly deployed .

Latest AI Resources

9mos ago

050.8K

通义DeepResearch - 阿里通义开源的深度研究智能体

Tongyi DeepResearch - Ali Tongyi Open Source Deep Research Intelligence Body

Tongyi DeepResearch (Tongyi DeepResearch) is an open source intelligent body launched by Alibaba, designed for deep information retrieval and complex task reasoning, with 30 billion parameters, supporting multiple reasoning modes, including ReAct mode and deep mode...

Latest AI Resources

9mos ago

050.8K

Skywork-SWE-32B - 昆仑万维开源的自主代码智能体基座模型

Skywork-SWE-32B - KunlunWanwei Open Source Autonomous Code Intelligent Body Base Model

Skywork-SWE-32B is an open source 32B scale software engineering (SWE) autonomous code intelligences base model introduced by Kunlun World Wide Web. The model focuses on software engineering tasks, has powerful repository-level code repair capabilities, and can perform in complex scenarios with multi-round interactions and long text processing...

Latest AI Resources

1yrs ago

050.7K

MindLink - 昆仑万维推出的开源推理大模型

MindLink - Open Source Reasoning Big Model from KunlunWei

MindLink is a large model of open source reasoning launched by Kunlun World Wide Web. With adaptive reasoning mechanism , according to the complexity of the task can be flexibly switched inference mode , simple tasks quickly generated , complex tasks in-depth reasoning , taking into account the efficiency and accuracy . Plan-driven reasoning paradigm to remove the "think" label , down ...

Latest AI Resources

11mos ago

050.6K

SkyReels-A3 - 昆仑万维推出的音频驱动数字人创作工具

SkyReels-A3 - Audio-Driven Digital Human Creation Tool from KunlunWangwei

SkyReels-A3 is an audio-driven digital human creation tool from Kunlun World Wide Group. SkyReels-A3 is an audio-driven digital human creation tool, which can generate high-quality dynamic video content through simple inputs (e.g., portrait images and voice), make static photos "come alive", and replace lines for existing videos with new lip-syncs that the characters will automatically...

Latest AI Resources

10mos ago

050.6K

Confucius3-Math - 网易有道推出专注于数学教育的开源推理模型

Confucius3-Math - NetEaseYouDao Launches Open Source Reasoning Model Focused on Mathematics Education

Confucius3-Math is the first domestic open source reasoning model focused on mathematics education open-sourced by NetEaseYouDao. With 14 billion parameters, optimized for K-12 math education scenarios, it can run efficiently on a single consumer GPU (such as RTX 4090D), with an inference performance of about...

Latest AI Resources

1yrs ago

050.5K

Logics-Parsing - 阿里开源的文档解析模型

Logics-Parsing - Ali open source document parsing model

Logics-Parsing is an open source Ali end-to-end document parsing model , based on Qwen2.5-VL-7B. Optimize document layout analysis and reading order inference through reinforcement learning , PDF images can be converted to structured HTML output to support a variety of content ...

Latest AI Resources

9mos ago

050.5K

Skywork Deep Research Agent v2 - 昆仑万维推出的深度研究智能体升级版

Skywork Deep Research Agent v2 - An Upgraded Version of Deep Research Intelligence from Kunlun

Skywork Deep Research Agent v2 is a deep research intelligent body launched by Kunlun Wave, focusing on the integration and analysis of multimodal information.Skywork Deep Research Agent v2 can process text, graph...

Latest AI Resources

10mos ago

050.4K

DeckSpeed - AI PPT制作工具，自然语言生成演示文稿

DeckSpeed - AI PPT Maker, Natural Language Generated Presentation

DeckSpeed is an AI presentation creation tool based on conversational interaction, where users express their needs based on natural language and quickly generate personalized slides without relying on traditional templates. The tool supports real-time feedback adjustment, users can modify the color, style and content of the slide at any time to ensure that the presentation is complete...

Latest AI Resources

1yrs ago

050.4K

分析 civitai 226K 得到的常用正负面提示词

Analyze civitai 226K for commonly used positive and negative cues

Resource List Top 10 1000 Most Common Tokens 1000 Most Common Negative Tokens 20 Most Common Samplers 100 Most Common Steps 100 Most Common Dimensions 50 Most Common...

Latest AI Resources # AI Image Generation Aids

2yrs ago

050.4K

Why My Wife Yelling At Me：模拟婚姻沟通的互动工具

Why My Wife Yelling At Me: An Interactive Tool to Model Marital Communication

Comprehensive Introduction "Why My Wife Yelling At Me" is a unique marriage relationship simulation website designed to help users understand their partner's emotional reactions and communication patterns through artificial intelligence. Users can input different scenarios and experience the reactions of their virtual partner, simulating real...

Latest AI Resources

1yrs ago

050.4K

Paper2Any - 北大DCAI团队开源的AI科研与演示文稿生成平台

Paper2Any - 北大DCAI团队开源的AI科研与演示文稿生成平台

Paper2Any是北京大学DCAI课题组开源的多模态辅助平台，专注于从论文PDF、图片和文本中快速生成多种科研内容。具备一键生成科研绘图的功能，能从多种输入源生成模型架构图、技术路线图和实验数据图等...

Latest AI Resources

6mos ago

050.3K

InternVLA·N1 - 上海AI Lab开源的端到端双系统导航大模型

InternVLA-N1 - Shanghai AI Lab Open Source End-to-End Dual System Navigation Large Model

InternVLA-N1 is an open source end-to-end dual-system navigation macromodel from Shanghai Artificial Intelligence Laboratory. Using a dual-system architecture, System 2 is responsible for understanding linguistic commands and planning long-range paths, while System 1 focuses on high-frequency response and agile obstacle avoidance. The model is trained entirely based on synthetic data through large-scale digital ...

Latest AI Resources

9mos ago

050.2K

FineVision - Hugging Face推出的开源视觉语言数据集

FineVision - Open Source Visual Language Dataset from Hugging Face

FineVision is Hugging Face's open source visual language dataset for training advanced visual language models. It contains 17.3 million images, 24.3 million samples, 88.9 million rounds of dialog, and 9.5 billion answer tokens. The dataset aggregates...

Latest AI Resources

10mos ago

050.2K

HeyGen - AI 数字人视频创作平台，支持多语言翻译配音

HeyGen - AI Digital Human Video Creation Platform with Multi-Language Translation and Dubbing Support

HeyGen is an AI-driven digital human video creation platform that supports a streamlined video production process, allowing users to quickly generate professional-level digital human videos. The platform is based on advanced AI technology, giving users full control over the image and voice of digital people, providing a rich library of material, including diverse background...

Latest AI Resources

1yrs ago

050.2K

VoxCPM 1.5 - 面壁智能开源的端到端文本到语音模型

VoxCPM 1.5 - Faceted Intelligence Open Source End-to-End Text-to-Speech Modeling

VoxCPM 1.5 is an open source speech generation model released by Facade Intelligence, based on text-to-speech (TTS) technology without the need for a splitter, featuring several innovations and improvements. Adopting an end-to-end diffusion autoregressive architecture, it generates continuous speech waveforms directly from text, avoiding the limitations of traditional segmentation methods...

Latest AI Resources

6mos ago

050.1K

Molmo 2 - Ai2开源的多模态视频图像理解模型系列

Molmo 2 - Ai2 open source multimodal video image understanding model series

Molmo 2 is an open source multimodal model released by the Allen Institute for AI (Ai2) to improve video and multi-image understanding. Three variants are included; Molmo 2 (8B), Molmo 2 (4B) and Molmo 2-O...

Latest AI Resources

6mos ago

050K

Step-GUI - 阶跃星辰开源的AI Agent系列模型

Step-GUI - Step-Star Open Source AI Agent Series Models

Step-GUI is Step-Star's open source AI Agent series of models, including the cloud model Step-GUI, the first MCP protocol for GUI Agents, and the industry's first open source end-side model Step-GUI Edge to support cell phone deployment.Specialized...

Latest AI Resources

6mos ago

049.7K

CRIC深度智联 - 克而瑞推出的中国房地产首个AI Agent

CRIC - The First AI Agent for Real Estate in China Launched by CRIC

CRIC Depth Intelligence is the first AI intelligent body of Chinese real estate independently developed by CRIC, based on CRIC's 20 years of experience in the real estate industry and data accumulation and multimodal big model technology, which opens up the whole chain from data integration, intelligent analysis to content generation.

Latest AI Resources

1yrs ago

049.7K

问小白5 - 问小白推出的全能AI模型

Ask White 5 - All-in-One AI Model from Ask White

Ask White 5 is the flagship "All in One" model with a very high level of intelligence. The model has excellent performance in many assessments, such as the AA-Index composite assessment score of 64.7 and the STEM ability assessment score of 86, which is close to the world's leading GPT-5.

Latest AI Resources

10mos ago

049.6K

HunyuanWorld-Voyager - 腾讯开源的超长漫游世界模型

HunyuanWorld-Voyager - Tencent open source ultra-long roaming world model

HunyuanWorld-Voyager (Hunyuan Voyager for short) is the industry's first ultra-long roaming world model released by Tencent that supports native 3D reconstruction. It is a novel video diffusion framework that generates a 3D point cloud sequence of user-defined camera paths from a single image, supporting...

Latest AI Resources

10mos ago

049.5K

Midjourney V1- Midjourney推出的首个图生视频模型

Midjourney V1- Midjourney Launches First Tupelo Video Model

Midjourney V1 is the first AI video generation model from Midjourney, which supports transforming static images into vivid and dynamic videos with the help of advanced AI technology. Users just need to upload images or images generated with Midjourney, tap...

Latest AI Resources

1yrs ago

049.5K

Xiaomi-MiMo-Audio - 小米开源的首个原生端到端语音大模型

Xiaomi-MiMo-Audio - Xiaomi Open Source's First Native End-to-End Speech Big Model

Xiaomi-MiMo-Audio is Xiaomi's open source 7-billion-parameter end-to-end speech macromodel with powerful features such as multi-language dialog, speech continuation, less-sample generalization, and audio understanding, which is able to reach the SOTA level in speech intelligence and audio understanding benchmarks, surpassing Google Gemi...

Latest AI Resources

9mos ago

049.4K

Gemini 2.5 Deep Think - 谷歌推出的AI推理模型

Gemini 2.5 Deep Think - AI inference model from Google

Gemini 2.5 Deep Think is an AI reasoning model from Google designed to solve complex tasks. It is a variant of the model that won the gold medal at the International Mathematical Olympiad (IMO) 2025, and is designed to solve complex tasks through Parallel ...

Latest AI Resources

11mos ago

049.4K

FlowAct-R1 - 字节跳动开源的实时交互数字人视频生成框架

FlowAct-R1 - 字节跳动开源的实时交互数字人视频生成框架

FlowAct-R1是字节跳动开源的实时交互数字人视频生成框架，能通过单张参考图和音频流式生成无限时长的高保真全身动态视频。核心创新在于分块流式生成技术，将视频拆解为0.5秒一小段接力处理，配合结构化...

Latest AI Resources

5mos ago

049.4K

职达AI简历 - AI简历生成与优化平台，精准分析问题、提供优化建议

Vinda AI Resume - AI Resume Generation and Optimization Platform, Precise Analysis of Problems and Optimization Suggestions

Job AI resume is an efficient and convenient intelligent resume generation and optimization platform. Based on AI technology, the platform helps users quickly generate professional and personalized resumes. Users only need to enter basic information and experience, the platform can generate high-quality resume in a short time, providing 2800+ beautiful templates, covering a variety of positions.

Latest AI Resources

1yrs ago

049.4K

万兴天幕 – 万兴科技推出AIGC视频创作平台

Wanxing Canopy - Wanxing Technology Launches AIGC Video Creation Platform

Wanxing Canopy is the AIGC video creation platform launched by Wanxing Technology, covering the three major creation fields of video, picture and audio generation, which is specially designed for media and cultural industry workers, film and television/post-production workers, art and design workers, advertising and marketing practitioners, etc. to provide one-stop professional creation solutions.

Latest AI Resources

12mos ago

049.3K

DeepSeek-OCR - DeepSeek开源的光学字符识别模型

DeepSeek-OCR - DeepSeek open source optical character recognition model

DeepSeek-OCR is an advanced optical character recognition (OCR) model open-sourced by the DeepSeek team, which converts text into images through "contextual optical compression" technology, and utilizes visual tokens for compression and decoding to achieve efficient long text processing.

Latest AI Resources

8mos ago

049.3K

Claude Sonnet 4.5 - Anthropic推出的最强AI编程模型

Claude Sonnet 4.5 - The Most Powerful AI Programming Model from Anthropic

Claude Sonnet 4.5 is an artificial intelligence model from Anthropic designed for programming, computer operations, and complex task automation. The model excels in code generation, long-duration task processing, reasoning, and mathematical computation, supporting everything from initial planning...

Latest AI Resources

9mos ago

049.2K

Seed LiveInterpret 2.0 - 字节跳动推出的同声传译模型

Seed LiveInterpret 2.0 - Simultaneous Interpretation Model Launched by ByteHopper

Seed LiveInterpret 2.0 is a state-of-the-art simultaneous interpreting model launched by the Seed team of ByteDance, supporting two-way translation between Chinese and English. The model has near real-life translation accuracy and extremely low latency, with an average speech-to-speech delay of only 2-3 seconds, which is much lower than that of...

Latest AI Resources

11mos ago

049.2K

DeepSeek-OCR 2 - DeepSeek团队开源的新一代OCR模型

DeepSeek-OCR 2 - DeepSeek团队开源的新一代OCR模型

DeepSeek-OCR 2是DeepSeek团队开源的新一代OCR模型，核心创新在于采用DeepEncoder V2架构，将传统固定栅格扫描的视觉编码方式升级为基于语义推理的动态处理。模型通过因果流...

Latest AI Resources

5mos ago

049.1K

Lumina-DiMOO - 上海AI Lab联合华为昇腾开源的多模态大模型

Lumina-DiMOO - A Multimodal Large Model Open-Sourced by Shanghai AI Lab and Huawei Ascendant

Lumina-DiMOO is a new generation of unified model for multimodal generation and understanding launched by Shanghai Artificial Intelligence Laboratory (SAL) in conjunction with Huawei Rise at the World Artificial Intelligence Conference 2025. Based on the Rise AI basic hardware and software platform and the MindSpeed MM multimodal large model suite, it accomplishes...

Latest AI Resources

9mos ago

049K

OpenReasoning-Nemotron - 英伟达推出的开源系列推理模型

OpenReasoning-Nemotron - Open Source Series of Reasoning Models from NVIDIA

OpenReasoning-Nemotron is a series of large-scale language models open-sourced by NVIDIA to support processing of reasoning tasks in math, science and code. The models are distilled based on the DeepSeek R1 0528 model with parameter scales of 1.5B...

Latest AI Resources

11mos ago

049K

Meeseeks - 美团开源的评估模型指令遵循能力的评测集

Meeseeks - Meeseeks open-source assessment set for evaluating the ability to follow model instructions

Meeseeks is an open source large model evaluation set used by the Meituan M17 team to evaluate the model's ability to follow instructions.Meeseeks uses a three-tiered evaluation framework to comprehensively measure whether the model is able to generate answers in strict accordance with the user's instructions from the macro to the micro level, without evaluating the knowledge of the content of the answers positively ...

Latest AI Resources

10mos ago

048.9K

Qwen3-Max-Preview - 通义千问推出的旗舰大语言模型

Qwen3-Max-Preview - The Flagship Big Language Model from Tongyi Qianqian

Qwen3-Max-Preview is the latest flagship large language model released by Tongyi Qianwen. It is the model with the largest number of parameters in the Qwen3 family, with a parameter size of over 1 trillion. The model has significant improvements in inference, instruction following, multi-language support and long-tail knowledge coverage...

Latest AI Resources

10mos ago

048.9K

WebWeaver - 阿里通义开源的新型双智能体框架

WebWeaver - Ali Tongyi open source new dual-intelligence body framework

WebWeaver is a new dual-intelligence body framework introduced by Alibaba Tongyi team, which is mainly used in open deep research, and can simulate the human research process, which is divided into two intelligences: planning and writing.

Latest AI Resources

9mos ago

048.9K

GLM-5 - 智谱AI推出的旗舰级开源大模型

GLM-5 - 智谱AI推出的旗舰级开源大模型

GLM-5是智谱AI推出的旗舰级开源大模型，采用744B参数规模（激活40B），专为Agentic Engineering智能体工程打造。模型在编程与Agent能力上取得开源SOTA表现，SWE-be...

Latest AI Resources

4mos ago

048.9K

文心大模型X1.1 - 百度推出的深度思考模型，理解能力更强

Wenshin Big Model X1.1 - Baidu's Deep Thinking Model for Better Understanding

Wenxin Big Model X1.1 is a deep thinking model launched by Baidu, based on a hybrid reinforcement learning framework that focuses on improving language understanding and generation. The model excels in handling complex questions, following instructions and simulating the behavior of intelligences, and can accurately provide knowledgeable answers and high-quality text content.

Latest AI Resources

9mos ago

048.8K

Hyprnote - 开源的本地优先AI会议笔记工具

Hyprnote - Open source, locally prioritized AI conference note-taking tool

Hyprnote is an open source, local-first AI meeting note-taking tool designed for professionals to protect user privacy and improve meeting efficiency. Adopting the "local first" principle, all data storage and processing is done on the user's local device to ensure data security and support offline operation.

Latest AI Resources

9mos ago

048.7K

2024年自动化流程执行创作工作的14款出色AI工具

14 Brilliant AI Tools for Automating Processes to Perform Creative Work in 2024

If you're looking to harness the power of artificial intelligence to assist with day-to-day tasks and automate workflows in your personal and work life, then you may be interested in the wide range of AI tools available. AssemblyAI has produced a five-minute video detailing the tools you can use to automate...

Latest AI Resources

2yrs ago

048.7K

NitroGen - 英伟达联合斯坦福大学、加州理工等开源的游戏AI模型

NitroGen - NVIDIA's open-source gaming AI model in conjunction with Stanford, Caltech, and others

NitroGen is an open source gaming AI model developed by NVIDIA in conjunction with Stanford University, Caltech, and other institutions, capable of playing over 1,000 different types of games. The model is based on the GROOT N1.5 architecture, and is realized by analyzing 40,000 hours of game video data (including joystick operation annotation)...

Latest AI Resources

6mos ago

048.6K

IQuest-Coder-V1 - 至知创新研究院开源的代码大模型系列

IQuest-Coder-V1 - 至知创新研究院开源的代码大模型系列

IQuest-Coder-V1是九坤投资旗下至知创新研究院研发的开源代码大模型系列，专注于代码智能领域，具备自动编程、Bug修复和代码解释等能力。模型采用创新的Code-Flow训练范式，从代码库演化...

Latest AI Resources

6mos ago

048.6K

FLUX.1 Kontext - 黑森林推出的图像生成与编辑模型

FLUX.1 Kontext - Image Generation and Editing Model from Black Forest

FLUX.1 Kontext is an image generation and editing model from Black Forest Labs that provides context-aware image processing techniques. The model understands responses to text and image cues, performs tasks such as object modification, style conversion, and background replacement, while maintaining the corner...

Latest AI Resources

1yrs ago

048.5K

OpenScreen - 开源免费的屏幕录制工具，支持Mac和Windows双系统

OpenScreen - Open source free screen recording tool for Mac and Windows.

OpenScreen is an open source and free screen recording tool that provides users with an easy to use and fully functional alternative to Screen Studio. It supports both Mac and Windows, is completely free and follows the MIT protocol, and can be used for individual...

Latest AI Resources

6mos ago

048.5K

Wide Research - Manus平台推出的多智能体协同功能

Wide Research - Multi-Intelligence Collaboration Introduced on the Manus Platform

Wide Research is a powerful feature of the Manus platform designed to handle complex and large-scale tasks. The platform supports hundreds of general-purpose intelligences working simultaneously through system-level parallel processing mechanisms and intelligence collaboration protocols.

Latest AI Resources

11mos ago

048.4K

Audio2Face - NVIDIA开源的AI 3D面部动画生成模型

Audio2Face - NVIDIA open source AI 3D facial animation generation model

Audio2Face is NVIDIA's open source AI tool capable of transforming audio input into realistic 3D facial animation. By analyzing speech features in the audio, such as phonemes and intonation, it generates precise lip synchronization and subtle emotional expressions to give vivid human expressions to virtual characters.

Latest AI Resources

9mos ago

048.2K

Gemini Robotics On-Device - 谷歌推出首个在本地运行的具身智能模型

Gemini Robotics On-Device - Google Launches First Embodied Intelligence Model to Run Locally

Gemini Robotics On-Device is a vision-language-action model from Google DeepMind that supports running locally in a robot. The model is able to perform tasks offline, performing fine actions based on natural language commands, such as folding clothes and pulling open bags...

Latest AI Resources

1yrs ago

048.2K

飞算JavaAI - AI Java开发助手，自然语言实现全流程智能化开发

Flycount JavaAI - AI Java development assistant, natural language implementation of the whole process of intelligent development

Flycount JavaAI is an intelligent Java development assistant launched by Flycount Technology. The platform supports natural language input to realize the whole process of intelligent development from requirements analysis to code generation. Developers only need to enter a description of the requirements, Flycount JavaAI can accurately understand and generate a complete engineering code framework, the platform...

Latest AI Resources

1yrs ago

048K

SoulX-Podcast - Soul AI Lab开源的对话式语音合成模型

SoulX-Podcast - Soul AI Lab's Open Source Conversational Speech Synthesis Model

SoulX-Podcast is Soul AI Lab's open source advanced multi-speaker conversational speech synthesis model designed for generating high quality podcast content. SoulX-Podcast has the ability to generate multiple rounds of conversations, which can simulate smooth conversations in real podcasting scenarios, and supports Mandarin, English, and multiple Chinese...

Latest AI Resources

8mos ago

047.9K

AntAngelMed - 蚂蚁联合浙江省卫生健康信息中心开源的医疗大模型

AntAngelMed - 蚂蚁联合浙江省卫生健康信息中心开源的医疗大模型

AntAngelMed（蚂蚁·安诊儿医疗大模型）是浙江省卫生健康信息中心、蚂蚁健康、浙江省安诊儿医学人工智能科技有限公司联合开发的开源医疗大模型。模型采用混合专家架构（MoE），总参数量达1000亿...

Latest AI Resources

5mos ago

047.8K

Youtu-GraphRAG - 腾讯优图实验室开源的图检索增强生成框架

Youtu-GraphRAG - Tencent Youtu Labs Open Source Graph Retrieval Augmentation Generation Framework

Youtu-GraphRAG is an open source graph retrieval augmentation generation framework from Tencent's Youtu Labs to help large language models handle complex Q&A tasks more accurately. By constructing a four-layer knowledge tree, the knowledge is disassembled into four levels of attributes, relationships, keywords and communities to realize the self-directed performance of cross-domain knowledge...

Latest AI Resources

9mos ago

047.8K

NeuTTS Air - 支持离线CPU运行的免费轻量级语音合成模型

NeuTTS Air - Free and Lightweight Speech Synthesis Model with Offline CPU Running Support

NeuTTS Air is open source lightweight speech synthesis model, developed by Neuphonic team, which can run in real time on local devices (e.g. cell phones, laptops, Raspberry Pi) without relying on the cloud. Using 0.5B parameter Qwen architecture and self-developed NeuCodec codec...

Latest AI Resources

8mos ago

047.7K

OneCAT - 美团联合上海交大开源的多模态模型

OneCAT - Open source multimodal modeling by Meituan and Shanghai Jiaotong University

OneCAT is a new unified multimodal model launched by Meituan in conjunction with Shanghai Jiaotong University, which adopts a pure decoder architecture and can seamlessly integrate multimodal comprehension, text-to-image generation and image editing functions. The model abandons the design of traditional multimodal models that rely on external visual coders and disambiguators through modality-specific...

Latest AI Resources

10mos ago

047.5K

MiniMax Music 1.5 - MiniMax最新推出的AI音乐生成模型

MiniMax Music 1.5 - MiniMax's latest AI music generation model

MiniMax Music 1.5 is an advanced AI music generation tool that supports generating up to 4 minutes of music based on users' natural language descriptions. The model supports a variety of music styles and mood customization, generating a natural and full vocal color, smooth transitions, richly layered arrangements...

Latest AI Resources

9mos ago

047.5K

Qwen3-Omni - 阿里通义推出的全模态AI模型

Qwen3-Omni - Omnimodal AI model launched by Ali Tongyi

Qwen3-Omni is a fully modal AI model introduced by the Ali Tongyi team that can handle multiple data types such as text, images, audio and video, and supports text interaction in 119 languages with low latency and high controllability.

Latest AI Resources

9mos ago

047.4K

Klear-Reasoner - 快手推出的全新推理模型

Klear-Reasoner - The New Reasoning Model Introduced by Racer

Klear-Reasoner is a high-performance inference model from Racer, based on Qwen3-8B-Base. The model is trained by long thought chain supervised fine-tuning and reinforcement learning to perform well in mathematical and code reasoning.Klear-Reasoner...

Latest AI Resources

10mos ago

047.4K

FLM-Audio - 智源联合南洋理工开源的全双工音频对话模型

FLM-Audio - Wisdom Source and Nanyang Polytechnic Open Source Full-Duplex Audio Dialog Modeling

FLM-Audio is a native full-duplex audio dialog grand model released by Beijing Zhiyuan Artificial Intelligence Research Institute in conjunction with Spin Matrix and Nanyang Technological University of Singapore, supporting both Chinese and English. Adopting native full-duplex architecture, it can merge listening, speaking and monologue at each time step...

Latest AI Resources

9mos ago

047.4K

Kimi Linear - 月之暗面开源的新型混合线性注意力架构

Kimi Linear - A New Hybrid Linear Attention Architecture Open-Sourced by Dark Side of the Moon

Kimi Linear is a new hybrid linear attention architecture open-sourced by Dark Side of the Moon, with Kimi Delta Attention (KDA) as the core, optimizing the traditional attention model through a finer-grained gating mechanism, which significantly improves the hardware efficiency and memory control ability ...

Latest AI Resources

8mos ago

047.2K

PaCoRe - 阶跃星辰开源的并行协同AI推理框架

PaCoRe - Step Star's open source parallel collaborative AI reasoning framework

PaCoRe (Parallel Coordinated Reasoning) is StepFun's open source innovative parallel collaborative reasoning framework, through a massively parallel thinking mechanism, from multiple perspectives to simultaneously explore the problem solution, breaking through the traditional...

Latest AI Resources

6mos ago

047.2K

InfinityHuman - 字节联合浙大推出的长视频数字人生成模型

InfinityHuman - Long video digital human generation model launched by Bytes in collaboration with ZJU

InfinityHuman is a commercial-grade long time-series audio-driven character video generation model jointly launched by ByteDance and Zhejiang University. The model is audio-driven and can generate high-resolution, long duration and visually consistent character videos.

Latest AI Resources

10mos ago

047.1K

json-render - Vercel Labs开源的AI生成UI的工具

json-render - Vercel Labs开源的AI生成UI的工具

json-render是Vercel Labs开源的AI生成UI的工具，通过“AI → JSON → UI”的流程实现结构化、可控的界面生成。要求AI仅输出符合预定义Schema的JSON数据，前端再...

Latest AI Resources

5mos ago

047.1K

Code2Video - Show Lab开源的AI教学视频生成框架

Code2Video - Show Lab open source AI teaching video generation framework

Code2Video is innovative open source project that automatically converts code snippets into high quality video content (mp4 format). The project through a unique code-centric paradigm , the use of carbon-now-cli tools to generate code into beautiful images , the use of ffmpeg will be these ...

Latest AI Resources

9mos ago

046.9K

ClawFeed - 开源AI新闻摘要工具，一站式聚合任意网站内容

ClawFeed - 开源AI新闻摘要工具，一站式聚合任意网站内容

ClawFeed是开发者Kevin He推出的开源AI新闻摘要工具，解决信息过载问题。通过聚合Twitter、RSS、GitHub等多平台信息源，利用AI自动生成4小时、每日、每周和每月的结构化摘要...

Latest AI Resources

4mos ago

046.9K

ZeroSearch - 阿里通义推出的开源大模型搜索引擎框架

ZeroSearch - Ali Tongyi launched the open source large model search engine framework

ZeroSearch is Alibaba Tongyi Lab open source innovative large model search engine framework. The framework does not need to interact with real search engines , based on the simulation of the search engine , with a large model of its own pre-training knowledge to generate relevant or noise documents , significantly reducing the training cost ( reduce 80% or more ...

Latest AI Resources

1yrs ago

046.7K

Stand-In - 腾讯微信视觉开源的轻量级视频生成框架

Stand-In - Tencent WeChat Visual Open Source Lightweight Video Generation Framework

Stand-In is a lightweight, plug-and-play identity-preserving video generation framework from Tencent's WeChat Vision team. Focusing on preserving specific identity features in video generation, it only needs to train the additional parameters of the base model 1%, and can achieve excellent results in face similarity and naturalness.

Latest AI Resources

9mos ago

046.6K

阶跃深研 - 阶跃星辰推出的AI深入研究工具

Steps Deep Research - AI Deep Research Tool by Steps Star

Steps Deep Research is an efficient AI research tool launched by Steps Star, which can autonomously complete research on complex issues and generate professional reports in a short period of time. The tool is designed for finance, consulting, healthcare, law and other fields, and excels in industry reviews with its in-depth search and information integration capabilities.

Latest AI Resources

11mos ago

046.4K