Latest AI Resources

Total 2972 articles
Step-Audio: a multimodal voice interaction framework that recognizes speech and converses using cloned voices, among other features

Comprehensive Introduction Step-Audio is an open source intelligent speech interaction framework designed to provide out-of-the-box speech understanding and generation capabilities for production environments. The framework supports multi-language dialog (e.g., Chinese, English, Japanese), emotional speech (e.g., happy, sad), regional dialects (e.g., Cantonese, Szechuan ...
1yrs ago
067.8K
Galaxy.ai: a multifunctional platform integrating a library of 1700+ AI tools, for exploring the generative AI tools on the market (paid)

Comprehensive Introduction Galaxy.ai is a platform that integrates a wide range of AI tools designed to provide users with comprehensive AI solutions. Whether it's text generation, image processing, video production or speech synthesis, Galaxy.ai is able to satisfy a wide range of user needs. The platform offers...
1yrs ago
067.6K
Zidong Taichu: a multimodal large model platform supporting text creation, image generation, 3D understanding, signal analysis, and other tasks

Comprehensive Introduction Zidong Taichu is a new-generation multimodal large model platform launched by the Institute of Automation of the Chinese Academy of Sciences and the Wuhan Institute of Artificial Intelligence. The platform supports multiple tasks such as multi-turn question answering, text creation, image generation, 3D understanding and signal analysis, with powerful cognitive, understanding and creation capabilities. Zidong ...
1yrs ago
067.6K
WeClone: train a digital avatar from WeChat chat logs and voice messages

Comprehensive Introduction WeClone is an open source project that uses WeChat chat logs and voice messages, combined with large language models and speech synthesis technology, to allow users to create personalized digital avatars. The project can analyze the user's chat habits to train the model, and can also use a small number of voice samples to generate realistic sound...
11mos ago
067.6K
LinkAI: a one-stop AI agent platform for quickly connecting customer service agents to websites and WeChat Official Accounts

Comprehensive Introduction LinkAI is a one-stop platform for building AI agents, launched by Shenzhen Minimalist Future Technology Co. The platform aggregates multimodal models for text, voice, images, and more, provides enhanced capabilities such as knowledge bases, plug-ins, and workflows, and supports zero-code access to Enterprise WeChat, WeChat Official Accounts, WeChat...
1yrs ago
067.6K
MOKI: Meitu's AI short film creation tool for animated shorts, web-novel short dramas, and children's picture books

Comprehensive Introduction MOKI is an AI short film creation tool launched by Meitu, focusing on providing users with a convenient and efficient short film production experience. The tool covers a wide range of video content types such as animated short films, online short dramas, illustrated story books and music videos. Users can input a story synopsis or import existing...
1yrs ago
067.4K
tldraw: an open-source infinite-canvas whiteboard SDK, with AI generation of minimalist wireframes and UML diagrams

General Description tldraw is a free, real-time collaborative drawing tool that provides an infinite canvas where users can quickly draw shapes, write text and collaborate in real time. Featuring an intuitive interface and excellent performance, it is suitable for team collaboration and remote work. Supported through the open source community, tldr...
1yrs ago
067.3K
Glama: a versatile AI chat tool integrating 1000+ MCP services

General Introduction Glama is a powerful and easy-to-use AI chat tool. It not only supports conversations with a wide range of AI models, but also lets users upload files, search the web for information, and even generate professional charts. The website is geared towards users who need to process information and tasks efficiently, such as corporate teams, developers or individual users...
12mos ago
067.3K
Depth AI: an AI assistant that builds a comprehensive code knowledge graph to deeply understand codebases

Comprehensive Introduction Depth AI is an artificial intelligence assistant designed for developers to deeply understand and analyze code bases. By building a comprehensive code knowledge graph, Depth AI can answer complex technical questions and help developers manage and optimize their code more efficiently. Whether...
1yrs ago
067.3K
Sonic: audio-driven portrait animation that generates digital-human talking videos with vivid facial expressions

General Introduction Sonic is an innovative platform focused on global audio perception, designed to generate vivid portrait animations driven by audio. Developed by a team of researchers from Tencent and Zhejiang University, the platform utilizes audio information to control facial expressions and head movements to generate natural and smooth animated videos. S...
11mos ago
067.1K
MagicQuill: an intelligent interactive image editing system for precise local edits via scribbles

General Introduction MagicQuill is an open-source AI interactive image editing tool jointly launched by the Hong Kong University of Science and Technology (HKUST), Ant Group, Zhejiang University and the University of Hong Kong. The tool aims to achieve accurate localized editing of images in an intelligent and interactive way. MagicQuill...
1yrs ago
067.1K
Tencent Hunyuan3D: generate high-resolution 3D assets, with multiple 3D asset generation workflows

Comprehensive Introduction Tencent Hunyuan3D (Hunyuan3D 2.0) is an advanced large-scale 3D synthesis system from Tencent designed to generate high-resolution textured 3D assets. The system consists of two core components: Hunyuan3D-DiT, a large-scale shape generation model, and Hunyuan3D-Paint, a large-scale texture...
1yrs ago
067K
Browser-Use: build intelligent web automation tools that let AI agents easily operate browsers

Comprehensive Introduction Browser-Use is an innovative open source web automation tool specifically designed to enable large language models (LLMs) to naturally interact with websites. It provides a powerful and flexible framework that supports a wide range of mainstream language models, including GPT-4, Claud...
1yrs ago
067K
InstantIR: an open-source project for damaged image restoration and high-definition upscaling, requiring at least 16 GB of VRAM

General Description InstantIR is an innovative single-image restoration model developed by the InstantX team, designed to restore damaged images with high-quality, realistic details. The tool not only restores the details of the image...
1yrs ago
067K
Refly: an AI writing platform built on workflow orchestration on a free-form canvas, for automated article generation

Comprehensive Introduction Refly is an AI-native creation engine built on a free-form canvas, designed to help users turn ideas into high-quality content through multi-threaded conversations, knowledge base integration, contextual memory and intelligent search technology. The platform covers over 20 professional scenario templates, including learning...
1yrs ago
066.8K
Agent TARS: an open-source agent that operates computers using vision and commands

Comprehensive Introduction Agent TARS is a multimodal AI agent open-sourced by ByteDance. Its core feature is to visually understand web content and combine command line and file system operations to help users complete complex computer tasks. Instead of requiring manual operations like traditional tools, it can self...
12mos ago
066.6K
Tongyi Wanxiang: AI Creative Painting | Text-to-Image | Image-to-Image | Virtual Models | Personal Portraits | Doodle Painting

Comprehensive Introduction Tongyi Wanxiang is an AI creative painting platform from Alibaba Cloud, providing a variety of AI art creation functions. Users can create in a variety of ways such as text-to-image, image-to-image, doodle painting, virtual models and personal portraits. The platform is based on the self-developed Composer combination of generating...
1yrs ago
066.5K
AI investment system: an automated A-share investment decision system that uses a multi-agent system to analyze market data

Comprehensive Introduction A_Share_investment_Agent is an A-share investment decision aid based on a multi-agent system. The system is designed to analyze market data, market sentiment, and fundamental data, and to calculate the intrinsic value of stocks, through multiple collaborating agents to...
1yrs ago
066.4K
AI reads books: AI reads PDF books page by page, automatically extracts key knowledge points, and generates summaries

Comprehensive Introduction AI-reads-books-page-by-page is an intelligent PDF book analysis tool written in Python that analyzes PDF books page by page, extracts key knowledge points, and, after a specified page interval, generates stage...
1yrs ago
066.3K
DeOldify: the classic open-source tool for colorizing black-and-white photos and videos using AI technology

Comprehensive Introduction DeOldify is an open source project based on deep learning technology, specifically designed for intelligent colorization and restoration of black and white photos and videos. The project uses an innovative NoGAN training method to successfully solve the common defects of traditional GAN networks in the image coloring process...
1yrs ago
066.2K
Sana Labs: AI tools for enterprise knowledge management and employee training

General Introduction Sana Labs is a company dedicated to improving the efficiency of knowledge acquisition and learning in organizations through AI technology. Headquartered in Stockholm, Sweden, Sana offers a range of products including a Learning Management System (LMS), a Learning Experience Platform (LXP), an AI assistant, and more...
1yrs ago
066K
Fish Agent: end-to-end AI voice cloning assistant, real-time voice conversation assistant, Fish Speech spin-off project

Comprehensive Introduction Fish Agent, a Fish Speech derivative project, is an end-to-end AI voice cloning system developed based on the V0.1 3B model architecture. As a fully end-to-end voice cloning system, its most important feature is the use of innovative speechless...
1yrs ago
066K
DreamTalk: generate expressive talking videos from a single portrait image

Comprehensive Introduction DreamTalk is a diffusion model-driven expressive talking head generation framework, jointly developed by Tsinghua University, Alibaba Group and Huazhong University of Science and Technology. It mainly consists of three parts: a denoising network, a style-aware lip expert and a style predictor, and can be based on...
1yrs ago
066K
Hunyuan text-to-video: generates high-quality video with a realistic, cinematic look; Tencent's open-source large video generation model

Comprehensive Introduction Tencent Hunyuan text-to-video (available in the Yuanbao app) is a video generation platform based on AI technology launched by Tencent. The platform utilizes the Tencent Hunyuan large model, with powerful cross-domain knowledge and natural language understanding, to generate high-quality videos based on users' text descriptions...
1yrs ago
065.5K