Latest AI Resources

3094 articles in total
Waifu2x Extension GUI:深度学习技术放大、修复图像与视频插帧(Windows x64)

Waifu2x Extension GUI: deep-learning upscaling and restoration for images, plus video frame interpolation (Windows x64)

Comprehensive Introduction Waifu2x-Extension-GUI is a powerful image and video processing tool that uses deep convolutional neural networks to perform super-resolution upscaling and frame interpolation on images, GIFs and videos. The tool supports multiple algorithms and engines, including Wai...
1yrs ago
079.8K
tldraw:开源无限画布白板SDK,AI生成简约线框图和UML图

tldraw: open-source infinite-canvas whiteboard SDK with AI generation of minimalist wireframes and UML diagrams

General Description tldraw is a free, real-time collaborative drawing tool that provides an infinite canvas where users can quickly draw graphics, write text and collaborate live. Featuring an intuitive interface and excellent performance, it is suitable for team collaboration and remote work. Supported through the open source community, tldr...
1yrs ago
079.8K
Step-Audio:多模态语音交互框架,识别语音并使用克隆语音交流等功能

Step-Audio: a multimodal voice interaction framework that recognizes speech and communicates using cloned speech, among other features

Comprehensive Introduction Step-Audio is an open source intelligent speech interaction framework designed to provide out-of-the-box speech understanding and generation capabilities for production environments. The framework supports multi-language dialog (e.g., Chinese, English, Japanese), emotional speech (e.g., happy, sad), regional dialects (e.g., Cantonese, Szechuan ...
1yrs ago
079.7K
Agent TARS:使用视觉和命令操作电脑的开源智能体

Agent TARS: an open-source agent that operates computers using vision and commands

Comprehensive Introduction Agent TARS is a multimodal AI agent open-sourced by ByteDance. Its core feature is visually understanding web content and combining that with command-line and file-system operations to help users complete complex computer tasks. Instead of requiring manual operations like traditional tools, it can self...
1yrs ago
079.3K
Pinokio:一键本地部署各类AI开源项目,小白全自动部署

Pinokio: one-click local deployment of all kinds of open-source AI projects, fully automated for beginners

Pinokio General Introduction Pinokio is a deployment tool for open-source AI projects that lets users install, run, and programmatically control a wide range of large-model applications with a single click. It is supported across multiple platforms and provides a community scripting library that covers most popular A...
2yrs ago
079.2K
MOKI:美图公司AI短片创作工具,适合动画短片, 网文短剧, 儿童故事绘本

MOKI: Meitu's AI short film creation tool for animated shorts, web-novel short dramas and children's picture books

Comprehensive Introduction MOKI is an AI short film creation tool launched by Meitu, focusing on providing users with a convenient and efficient short film production experience. The tool covers a wide range of video content production types such as animated short films, online short dramas, story illustrated books and MVs. Users can input story synopsis or import existing...
2yrs ago
079.1K
AI投资系统:自动化A股投资决策系统,利用多智能体系统分析市场数据

AI investment system: automated A-share investment decision-making system that uses a multi-agent system to analyze market data

Comprehensive Introduction A_Share_investment_Agent is an A-share investment decision aid based on a multi-agent system. The system uses multiple collaborating agents to analyze market data, calculate the intrinsic value of stocks, and assess market sentiment and fundamental data to...
1yrs ago
079K
法行宝:AI法律顾问,人工智能法律咨询,百度AI法律平台

Fa Xing Bao: AI Legal Advisor, Artificial Intelligence Legal Consultation, Baidu AI Legal Platform

Comprehensive Introduction Fa Xing Bao is an intelligent legal service platform launched by Baidu, which integrates advanced artificial intelligence technology with a professional legal knowledge base. The platform is dedicated to providing users with convenient, professional intelligent legal services, including legal Q&A, case analysis, contract review and other functions. Through deep learning...
1yrs ago
078.8K
TRELLIS:Microsoft开发的3D资产生成模型,支持多种格式和灵活编辑

TRELLIS: Microsoft-developed 3D asset generation model with multiple format support and flexible editing

General Introduction TRELLIS is a large-scale 3D asset generation model developed by Microsoft. It accepts text or image prompts and generates high-quality 3D assets in a variety of formats, such as radiance fields, 3D Gaussians, and meshes. At the heart of TRELLIS is a unified structured latent...
1yrs ago
078.7K
问小白:提供工作和生活帮助的全能AI助手,集成满血DeepSeek-R1

Wen Xiaobai: an all-around AI assistant for work and everyday life, with the full-scale DeepSeek-R1 integrated

Comprehensive Introduction Wen Xiaobai is an AI assistant (available on the web and as an app) developed by Yuanshi Technology. It is based on the company's self-developed Yuanshi large model and currently integrates the latest DeepSeek-R1 model, aiming to help users through quick Q&A, intelligent search, text creation, and other...
11mos ago
078.7K
Sana Labs:企业知识管理和员工培训学的AI工具

Sana Labs: AI tools for enterprise knowledge management and employee training

General Introduction Sana Labs is a company dedicated to improving the efficiency of knowledge acquisition and learning in organizations through AI technology. Headquartered in Stockholm, Sweden, Sana offers a range of products including a Learning Management System (LMS), a Learning Experience Platform (LXP), an AI assistant, and more...
1yrs ago
078.7K
紫东太初:多模态大模型平台,支持文本创作、图像生成、3D理解、信号分析等任务

Zidong Taichu: Multi-modal large model platform supporting text creation, image generation, 3D understanding, signal analysis and other tasks

Comprehensive Introduction Zidong Taichu is a new-generation multimodal large model platform launched by the Institute of Automation of the Chinese Academy of Sciences and the Wuhan Institute of Artificial Intelligence. The platform supports tasks such as multi-turn Q&A, text creation, image generation, 3D understanding and signal analysis, with strong cognition, comprehension and creation capabilities. Zidong ...
2yrs ago
078.6K
WeClone:用微信聊天记录和语音训练数字分身

WeClone: train a digital avatar from WeChat chat logs and voice messages

Comprehensive introduction WeClone is an open source project that uses WeChat chat logs and voice messages, combined with large language models and speech synthesis technology, to let users create personalized digital avatars. The project can train the model on the user's chat habits, and can also use a small number of voice samples to generate realistic sound...
1yrs ago
078.6K
Sonic:音频驱动肖像图片生成面部表情生动的数字人口播视频

Sonic: audio-driven generation of digital-human presenter videos with vivid facial expressions from portrait images

General Introduction Sonic is a platform focused on global audio perception, designed to generate vivid portrait animations driven by audio. Developed by a team of researchers from Tencent and Zhejiang University, the platform uses audio information to control facial expressions and head movements, producing natural and smooth animated videos. S...
1yrs ago
078.6K
InstantIR:受损图像修复与图像高清放大开源项目,最低16G显存

InstantIR: open-source project for damaged-image restoration and HD image upscaling, minimum 16 GB of VRAM

General Description InstantIR is a single-image restoration model developed by the InstantX team, designed to restore damaged images with high-quality, realistic details. The tool not only restores the details of the image...
1yrs ago
078.6K
Depth AI:构建全面的代码知识图谱,深度理解代码库的AI助手

Depth AI: An AI assistant for building a comprehensive code knowledge graph and deep understanding of the code base

Comprehensive Introduction Depth AI is an artificial intelligence assistant designed for developers to deeply understand and analyze code bases. By building a comprehensive code knowledge graph, Depth AI can answer complex technical questions and help developers manage and optimize their code more efficiently. Whether...
1yrs ago
078.5K
Refly:基于自由画布上流程编排的AI写作平台,自动化生成文章

Refly: an AI writing platform based on process orchestration on a free canvas for automated article generation

Comprehensive Introduction Refly is an AI-native creation engine built on a free-form canvas, designed to help users turn ideas into high-quality content through multi-threaded conversations, knowledge base integration, contextual memory and intelligent search technology. The platform offers over 20 professional scenario templates, including learning...
1yrs ago
078.4K
A2A:谷歌发布AI智能间通信的开放协议

A2A: Google releases an open protocol for communication between AI agents

General Introduction A2A (Agent2Agent) is an open source protocol developed by Google to allow AI agents built with different frameworks or by different vendors to communicate and collaborate with each other. It provides a standardized set of methods for agents to discover each other's capabilities, share tasks, and complete work...
1yrs ago
078.4K
InvSR:开源图像超分辨率项目,提升图像分辨率质量

InvSR: Open source image super-resolution project to improve the quality of image resolution

General Introduction InvSR is an open-source image super-resolution project based on diffusion inversion, capable of converting low-resolution images into high-quality, high-resolution images. The project uses the rich image prior knowledge embedded in pre-trained large-scale diffusion models to support, through a flexible sampling mechanism, the...
1yrs ago
078.2K
Glama:集成1000+MCP服务的多功能AI聊天工具

Glama: a versatile AI chat tool integrating 1000+ MCP services

General Introduction Glama is a powerful and easy-to-use AI chat tool. It not only supports conversations with a wide range of AI models, but also lets users upload files, search the web for information, and even generate professional charts. The website is geared towards users who need to process information and tasks efficiently, such as corporate teams, developers or individual users...
1yrs ago
078K
Fish Agent:端到端AI语音克隆助手,实时语音对话助理,Fish Speech衍生项目

Fish Agent: end-to-end AI voice cloning assistant, real-time voice conversation assistant, Fish Speech spin-off project

Comprehensive Introduction Fish Agent, a Fish Speech spin-off project, is an end-to-end AI voice cloning system built on the V0.1 3B model architecture. As a fully end-to-end voice cloning system, its most important feature is the use of innovative speechless...
1yrs ago
077.4K
腾讯混元3D(Hunyuan3D):生成高分辨率3D资产,多种3D素材生成工作流

Tencent Hunyuan 3D (Hunyuan3D): generate high-resolution 3D assets, with multiple 3D material generation workflows

Comprehensive Introduction Tencent Hunyuan3D (Hunyuan3D 2.0) is an advanced large-scale 3D synthesis system from Tencent designed to generate high-resolution textured 3D assets. The system consists of two core components: Hunyuan3D-DiT, a large-scale shape generation model, and Hunyuan3D-Paint, a large-scale texture...
1yrs ago
077.3K
TRAE SOLO - 字节跳动TRAE推出的AI自动开发助手

TRAE SOLO - AI automated development assistant from ByteDance's TRAE

TRAE SOLO is an AI automated development assistant introduced by TRAE, the AI programming assistant launched by ByteDance, to simplify the software development process with AI technology. TRAE SOLO understands the user's needs, accepts requirements as text descriptions, voice commands, and file uploads, and automatically plans...
9mos ago
077.2K
SciSpace:一站式学术研究与论文写作平台,为学生和研究人员提供一体化 AI 工具

SciSpace: A one-stop academic research and paper writing platform with integrated AI tools for students and researchers

General Introduction SciSpace (formerly Typeset.io) is an AI-powered platform designed for academic research and writing. It provides a wealth of tools and resources to help researchers and students find, understand and write about literature more efficiently. The platform integrates literature management, automatic gr...
1yrs ago
077.2K
YouMind:专业创作者辅助工具,摘录各类材料并存入知识库辅助写作

YouMind: a professional creator's aid that excerpts all kinds of material and deposits it in a knowledge base to aid in writing.

General Introduction YouMind is an AI authoring system powered by top-notch Large Language Models (LLMs) designed to help users extract and preserve important content from a wide range of materials, focusing on creation rather than simple collection. Whether browsing the web, watching YouTube videos, listening to podcasts...
1yrs ago
077.1K
99AI:集成多模态AI服务的商业化Web应用(免费开源)

99AI: A commercialized web application integrating multimodal AI services (free and open source)

Comprehensive Introduction 99AI is an open source AI web application project that aims to provide an easy-to-deploy, low-barrier integrated AI service platform. The project supports intelligent dialog, multimodal models, an app marketplace, and web-connected search, and integrates AI painting, music and video...
1yrs ago
077K
FinRobot:提升金融数据分析效率和投资研究的的智能体

FinRobot: an AI agent for more efficient financial data analysis and investment research

Comprehensive Introduction FinRobot is an open source AI agent platform developed by AI4Finance Foundation and designed for financial analytics. It not only covers traditional language models, but also incorporates a variety of AI technologies, aiming to provide a comprehensive solution for the financial industry. F...
1yrs ago
077K