Latest AI Resources

2,830 articles in total
imgAK - AI image processing tool with automatic colorization of black-and-white photos

imgAK is a one-stop AI image processing tool that uses deep learning to provide a range of image editing features. It repairs scratches, tears, and fading in old photos, and can automatically colorize black-and-white photos so they look new again.
6mos ago
034.6K
Genspark AI - AI browser from Genspark

Genspark AI is an AI browser from Genspark, Inc. Its built-in intelligent assistant helps users find better deals, compare product prices, and analyze user reviews while shopping, so they can make smarter purchasing decisions...
6mos ago
035.2K
爱扒谱 - AI music processing platform that converts audio files to staff notation in one click

爱扒谱 is an AI-based music processing platform aimed mainly at music creators, teachers, students, and music lovers. The platform supports one-click conversion of audio files into staff notation, fast separation of vocal and accompaniment tracks, automatic generation of complete musical works from a melody or chords entered by the user, and support for MP3 ...
6mos ago
030.1K
Music Muse - AI music creation platform that generates music from a simple description

Music Muse is an AI-based music creation platform. Users enter a simple description, such as musical style, mood, or rhythm, and the AI quickly generates a matching piece of music, with no specialized music knowledge required. The platform supports a variety of styles such as pop, rock, and classical, and can generate music according to the mood of...
6mos ago
028.6K
o3-pro - Upgraded o3 reasoning model from OpenAI

o3-pro is an upgraded version of OpenAI's o3 reasoning model, designed to handle complex questions and provide accurate answers. The model can invoke ChatGPT's full suite of tools, such as web search, file analysis, image reasoning, and Python programming, demonstrating powerful execution...
6mos ago
027.3K
AIFlowy - Open source enterprise AI application development platform

AIFlowy is an open source, enterprise-level AI application development platform built in Java and positioned against products such as ByteDance's Coze, Tencent Yuanqi, and Dify. It supports intelligent chatbots, private knowledge base construction, AI workflow orchestration, and large model management, and provides a complete system management model...
6mos ago
027.2K
优雅YOYA - AI audio and video content creation platform launched by Zhongke Wenge

优雅YOYA is a multimodal text-to-video platform launched by Zhongke Wenge (中科闻歌). The platform uses multimodal AI technology to support the whole chain of video content creation. Users only need to enter a theme and requirements, and the platform quickly generates scripts, images, and videos, and can complete intelligent editing, voice synthesis, character lip-sync driving, and other operations, the output...
6mos ago
025.1K
Uthana - AI 3D character animation platform that generates realistic animation from text descriptions or reference videos

Uthana is an AI 3D character animation generation platform. Users can enter text descriptions, upload reference videos, or search motion libraries, and the AI quickly generates realistic animations that can be adapted to models with any skeleton structure. The platform offers a variety of features such as style transfer, API integration, customization...
6mos ago
029.8K
企鹅读伴 (Penguin Reading Companion) - Tencent's AI reading assistant for primary and secondary school students

企鹅读伴 is an AI reading assistant designed by Tencent for primary and secondary school students. It relies on Tencent's Hunyuan large model and Yuanqi platform, combined with the Compulsory Education Chinese Language Curriculum Plan and Curriculum Standards (2022 Edition), to provide students with personalized reading recommendations, multiple reading modes (focused reading, reading aloud, listening...
6mos ago
026.7K
Rowboat - Open source agent development framework

Rowboat is an open source, low-code AI IDE for building multi-agent assistants. It combines a visual interface with AI-assisted development features to help users quickly design, configure, and test agent workflows. Supporting users to describe requirements in natural language, Row...
6mos ago
028.3K
SenseTime Ruying (商汤如影) - AI digital human video production platform launched by SenseTime

SenseTime Ruying is an AI digital human video production platform launched by SenseTime. Based on large model technology, the platform supports the creation and personalized customization of highly realistic digital human avatars, including facial features, clothing, hairstyles, and so on. The platform offers voice cloning, video generation, automated data labeling, real-time interaction, and other functions...
6mos ago
026.6K
JoyHallo - JD.com's open source AI digital human model

JoyHallo is an open source AI digital human model from JD.com, designed for Mandarin, that converts audio into realistic talking video. JoyHallo embeds audio features with the wav2vec2 model, uses a semi-decoupled structure to improve the accuracy of lip movement prediction, and supports the generation of English video...
6mos ago
028.7K
必火AI - AI digital human generation platform with Chinese and English voice cloning

必火AI is a Chinese AI digital human generation platform for short video creators. Users can upload a 3-minute video of a real person to quickly generate a highly realistic digital human avatar, with a stated micro-expression accuracy of 0.1 millimeters. The platform supports voice synthesis and recording of voice samples to generate AI voice models comparable to real people...
6mos ago
028.8K
Zhipu CoCo (智谱CoCo) - Zhipu's enterprise-level super assistant Agent

Zhipu CoCo is the first enterprise-level super assistant Agent launched by Zhipu's AICO platform. It has three core features: delivery orientation, a memory mechanism, and seamless embedding. In the field of government affairs, CoCo can interpret policies, customize solutions, and track implementation results, helping policies to be implemented efficiently.
6mos ago
028K
ScienceOne - Intelligent research platform launched by the Institute of Automation, Chinese Academy of Sciences, and other institutions

ScienceOne is an intelligent scientific research platform jointly launched by the Institute of Automation, Chinese Academy of Sciences, and other institutions. The platform is built on scientific foundation models and promotes a new paradigm of intelligent research with multidisciplinary collaboration, supporting the whole research process. The core products of ScienceOne include S1...
6mos ago
027.1K
QBot - AI browser from Tencent QQ Browser

QBot is a smart browser with integrated AI features launched by Tencent QQ Browser. It offers a variety of practical functions, such as AI search, which supports text, voice, and image queries and returns answers quickly and accurately. The AI browsing function quickly interprets web content and generates mind maps.
6mos ago
028.7K
元镜 - AI video creation tool with automatic script generation

元镜 is an AI video creation tool built on a human-computer symbiosis engine that supports efficient creation from initial idea to finished video. The tool offers automated script generation, consistent character styling, multimodal fusion, and intelligent workflows. It can quickly generate creative video scripts and multimodal storyboard designs, and synthesize the complete video in one click...
6mos ago
027.3K
Zhuque AI Detection (朱雀AI检测) - AI image and text detection platform launched by Tencent

Zhuque AI Detection is an AI detection platform launched by Zhuque Lab, part of Tencent's Hunyuan security team, to help users identify AI-generated images and text. It analyzes hidden features of images, content that defies common-sense logic, and "watermark" marks to quickly determine whether an image was generated by AI.
6mos ago
032.9K
琴乐大模型 - AI music creation model from Tencent

琴乐大模型 is an AI music creation large model jointly launched by Tencent AI Lab and Tencent Music Entertainment (TME) Tianqin Lab. The model generates high-quality stereo audio or multi-track sheet music from user-supplied keywords, descriptive sentences, or audio clips in English and Chinese.
6mos ago
025.5K
拍我AI - Chinese-market version of the PixVerse AI video generation platform, launched by Aishi Technology

拍我AI is an AI video generation platform launched by Aishi Technology. It is tailored to the Chinese market and is the domestic version of PixVerse. The platform quickly generates high-quality dynamic video content from simple text prompts or uploaded images. The latest V4.5 version of the platform has improved video quality, animation smoothness...
6mos ago
030.6K
DingTalk Yida (钉钉宜搭) - Alibaba's low-code application development platform

DingTalk Yida is a low-code application development platform launched by Alibaba to help enterprises quickly build digital business applications. With visual drag-and-drop and configuration, business staff who cannot code can develop applications that meet their needs, greatly lowering the barrier and cost of development.
6mos ago
028.1K
Seed-Music - AI music generation model launched by ByteDance

Seed-Music is an AI music generation large model launched by ByteDance that can turn 10 seconds of user-recorded audio into a complete musical composition. Using autoregressive language modeling and diffusion methods, it works from multimodal user inputs (e.g., style descriptions, audio references, scores, and voice prompts) to generate high...
6mos ago
029.1K
音控 - AI music creation platform that generates songs from lyrics or melody snippets

音控 is an AI music creation platform that provides comprehensive support for music creators. It offers AI lyric writing, composition, accompaniment generation, professional recording, and other functions. Users only need to enter simple lyrics or a melody snippet, and the AI quickly generates a complete song, covering rock, rap, ballad, and many other...
6mos ago
028.9K
反谱 - AI music transcription platform that converts audio files to staff notation and numbered notation

反谱 is an online AI music transcription platform that converts audio files (such as MP3, FLAC, etc.) into staff notation and numbered notation (jianpu). It has a vocal separation function that splits vocals from the accompaniment, which is convenient for music production and mixing. 反谱 supports converting MIDI files...
6mos ago
037.5K
HeyGen - AI digital human video creation platform with multi-language translation and dubbing

HeyGen is an AI-driven digital human video creation platform with a streamlined production process that lets users quickly generate professional-level digital human videos. The platform gives users full control over the appearance and voice of digital humans and provides a rich library of materials, including diverse background...
6mos ago
025.9K
Make - AI-powered no-code platform for building automated workflows

Make is an AI-driven no-code automation platform that helps organizations improve efficiency and innovation through automated processes. The platform offers more than 2,000 pre-built apps that support a variety of business scenarios, such as marketing, sales, finance, etc. Make's core features include no-code visual process creation, AI...
6mos ago
026K
MiMo-VL - Xiaomi's open source multimodal model

MiMo-VL is Xiaomi's open source multimodal large model, consisting of a visual encoder, a cross-modal projection layer, and a language model. The visual encoder is based on Qwen2.5-ViT, which supports native-resolution inputs and preserves more detail; the language model is Xiaomi's self-developed MiMo-7B, which is designed for complex reasoning...
6mos ago
026.4K
Fish Audio - AI speech synthesis and voice cloning tool

Fish Audio is a generative AI speech synthesis tool that supports text-to-speech (TTS) and voice cloning. Users only need to enter text, and the tool converts it to natural, fluent speech. The platform provides multiple languages and voice styles to choose from, to meet different scenarios and user...
6mos ago
035.3K
SignGemma - Sign language translation model from Google DeepMind

SignGemma is a sign language translation AI model introduced by Google DeepMind and billed as the world's most powerful, supporting the accurate translation of American Sign Language (ASL) into English text. The model is based on multimodal training, combining visual and textual data to capture sign language gestures in real time and quickly translate them into text...
6mos ago
029.2K
CRIC深度智联 - The first AI Agent for Chinese real estate, launched by CRIC

CRIC深度智联 is the first AI Agent for Chinese real estate, independently developed by CRIC. It draws on CRIC's 20 years of real estate industry experience and accumulated data, together with multimodal large model technology, to connect the whole chain from data integration and intelligent analysis to content generation.
6mos ago
025.2K
WebAgent - Open source autonomous search AI Agent from Alibaba Tongyi

WebAgent is an open source autonomous search AI Agent from Alibaba's Tongyi Lab, with end-to-end autonomous information retrieval and multi-step reasoning capabilities. WebAgent can actively perceive, decide, and act in a web environment much as a human would, and is widely used in academic research, business decision...
6mos ago
029.6K
Lingma IDE (灵码 IDE) - AI-native development environment from Tongyi Lingma

Lingma IDE is the AI-native integrated development environment (IDE) launched by Tongyi Lingma. It is deeply adapted to the Qwen3 large model and has a programming agent mode that can autonomously complete tasks such as project awareness, code retrieval, and terminal operations. It supports MCP tools and integrates ModelScope MCP Square's 3...
6mos ago
024.5K
BAGEL - Open source multimodal foundation model launched by ByteDance

BAGEL is a multimodal foundation model open-sourced by ByteDance with 14 billion parameters, of which 7 billion are active. The model uses a Mixture-of-Transformer-Experts (MoT) architecture and captures pixel-level and semantic-level features of an image with two independent encoders, respectively, to support efficient processing of images, text, video...
6mos ago
027K
Kling 2.1 (可灵 2.1) - AI video generation model launched by Kuaishou

Kling 2.1 is an AI video generation model launched by Kuaishou and now available on the Kling AI video platform. The model comes in three versions: standard, high quality, and master, providing 720P, 1080P, and film-and-television-grade output to meet different creative needs. The standard version generates quickly and is suitable for rapid production...
6mos ago
029.4K
小云雀 - Intelligent creation Agent from Jianying

小云雀 is an intelligent creation Agent launched by Jianying (剪映) that uses AI to make content creation simpler, more efficient, and more fun. It supports zero-barrier creation of videos, digital human narration videos, design graphics, and background images. Users only need to enter a command, and the AI efficiently completes...
6mos ago
042.1K
Gaoding AI Community (稿定AI社区) - AI creative content design platform with design resources for different creative needs

Gaoding AI Community is an online AI creative inspiration platform that provides users with a wealth of creative design resources and tools. The platform covers a variety of design fields, including photos, e-commerce design, holiday themes, 3D illustrations, avatar design, Xiaohongshu materials, and portrait design, to meet the needs of different users.
6mos ago
026.4K
NoCode - Zero-code AI development platform launched by Meituan

NoCode is a zero-code AI development platform launched by Meituan. Users need no programming experience: they describe their requirements in natural language to quickly generate website pages, utilities, small games, event pages, and other applications. NoCode supports one-second generation of 200...
6mos ago
037K
Sim Studio: open source workflow builder for AI agents

Sim Studio is an open source AI agent workflow building platform focused on helping users quickly design, test, and deploy large language model (LLM) workflows through a lightweight, intuitive visual interface. Users can create complex workflows without deep programming by dragging and dropping...
6mos ago
050.9K
RealtimeVoiceChat: low-latency natural spoken conversation with AI

RealtimeVoiceChat is an open source project focused on real-time, natural voice conversations with artificial intelligence. Users speak into a microphone, and the system captures the audio through a browser, quickly converts it to text, and a large language model (LLM) generates back...
7mos ago
039.1K
Cooragent: build a multi-agent task collaboration tool from one sentence

Cooragent is an open source AI agent collaboration framework developed by LeapLab at Tsinghua University and hosted on GitHub. It allows users to create AI agents from a one-sentence description and supports multiple agents collaborating on complex tasks. The framework provides two...
7mos ago
033.1K
An MCP server for generating in-depth research reports with Claude

MCP Server Deep Research is an open source tool that automatically generates structured research reports on complex problems using artificial intelligence and web search. Users enter a research question, and the tool breaks down the question, searches for authoritative information, assesses source credibility...
7mos ago
031.8K
Deep Recall: an open source tool that provides an enterprise-class memory framework for large models

Deep Recall is an open source, enterprise-class memory framework designed for large language models (LLMs). It delivers hyper-personalized responses through efficient contextual retrieval and integration. The framework uses a three-tier architecture, including a memory service, a reasoning service, and a coordinator, supporting...
7mos ago
038.7K