Latest AI Resources

Total 2710 articles posts
Genesis:开源生成式物理引擎,实现基于真实物理的4D动态世界模拟

Genesis: open source generative physics engine for real physics-based 4D dynamic world simulation

General Introduction Genesis is a generative physics world designed for general purpose robotics and embodied AI learning. It provides a unified simulation platform that supports the simulation of a wide range of materials and physical phenomena.Genesis aims to unlock generative AI and physics simulation by combining...
9mos ago
019.2K
可灵 AI:快手推出的生成创意图片和视频的AI工具

Keling AI: AI tool for generating creative images and videos launched by Shutterstock

Comprehensive Introduction Kling AI (Kling AI) is a new-generation AI creative productivity platform launched by Shutterstock, aiming to help users easily create high-quality image and video content through advanced generative AI technology. The platform is based on the Kolto Big Model and Kling Big Model (Kol...
9mos ago
025.8K
Kolors:生成高质量图像的文本到图像模型,支持生成中文海报

Kolors: text-to-image model for generating high-quality images, support for generating Chinese posters

Comprehensive Introduction Kolors is a large-scale text-to-image generation model developed by the Racer team, based on potential diffusion techniques. The model is trained on billions of text-image data pairs, and is capable of generating high-quality, complex semantically accurate images with support for both Chinese and English input.Kolors in visual quality...
9mos ago
023.5K
ColorFlow:漫画着色,黑白图像自动着色,提升图像色彩一致性和质量

ColorFlow: Comic book coloring, automatic coloring of black and white images to improve image color consistency and quality

Comprehensive Introduction ColorFlow is an image sequence auto-coloring tool developed by Tencent's ARC team to solve the problem of auto-coloring black and white image sequences. The tool utilizes a retrieval-enhanced coloring pipeline to accurately generate the colors of various elements through a pool of reference images, including the character's hair color and service...
9mos ago
017.8K
即梦AI:一站式AI创作平台, 图像生成, 智能画布, 视频生成, 音乐生成

Instant Dream AI: One-stop AI creation platform, image generation, smart canvas, video generation, music generation

Comprehensive Introduction Instant Dream AI is a one-stop AI creation platform designed to provide users with versatile and powerful creation tools. Whether it's image generation, smart canvas, video generation or music generation, Instant Dream AI can help users easily realize their creativity. The platform supports multiple creation modes, including AI drawing...
8mos ago
026.4K
Class Companion: K12教师设计的课后作业管理系统,为学生提供AI辅导和作业批改

Class Companion: an after-school homework management system designed by K12 teachers to provide AI tutoring and homework correction for students

General Description Class Companion is an online education platform designed for teachers and students that uses artificial intelligence technology to provide instant feedback and personalized tutoring. The platform supports a wide range of subjects and grade levels, helping teachers save time, improve teaching efficiency, and provide students with more practic...
9mos ago
019.8K
Gauth(Gauthmath):使用AI解决作业问题,提供详细解答,字节旗下海外作业辅导APP

Gauth (Gauthmath): uses AI to solve homework problems and provide detailed answers, Byte's overseas homework tutoring app

General Introduction Gauth (formerly known as Gauthmath) is an AI homework helper website designed for students. It utilizes advanced AI technology and a team of professional tutors to provide homework answering services in a variety of subjects from math to chemistry. Users can upload an image or type in a question to quickly get...
3mos ago
022.9K
Waifu2x Extension GUI:深度学习技术放大、修复图像与视频插帧(Windows x64)

Waifu2x Extension GUI: Deep Learning Techniques to Enlarge, Repair Image and Video Interpolation (Windows x64)

Comprehensive Introduction Waifu2x-Extension-GUI is a powerful image and video processing tool that utilizes deep convolutional neural network techniques to achieve super-resolution zoom and video frame interpolation for images, GIFs and videos. The tool supports multiple algorithms and engines, including Wai...
9mos ago
018.8K
R2R:多模态内容解析并结合知识图谱与混合搜索的先进AI检索(RAG)系统

R2R: An Advanced AI Retrieval (RAG) System for Multimodal Content Parsing and Combining Knowledge Graph with Hybrid Search

Comprehensive Introduction R2R (RAG to Riches) is an advanced AI retrieval system supporting Retrieval Augmented Generation (RAG) functionality with production-ready features. Built on a containerized RESTful API, the system provides multimodal content parsing, hybrid search functionality...
9mos ago
019.7K
Megrez-3B-Omni:端侧多模态理解模型,支持文本、图像、音频多模态理解和分析

Megrez-3B-Omni: an end-side multimodal understanding model supporting text, image, and audio multimodal understanding and analysis

Comprehensive Introduction Infini-Megrez is an edge intelligence solution developed by the unquestioned core dome (Infinigence AI), aiming to achieve efficient multimodal understanding and analysis through hardware and software co-design. At the core of the project is the Megrez-3B model, which supports graph...
8mos ago
014.7K
RAGFlow:基于深度文档理解的开源RAG引擎,提供高效的检索增强生成工作流

RAGFlow: an open source RAG engine based on deep document understanding, providing efficient retrieval-enhanced generation workflows

Comprehensive Introduction RAGFlow is an open source Retrieval Augmented Generation (RAG) engine based on deep document understanding technology. It provides an efficient RAG workflow for organizations of all sizes, incorporating a large-scale language model (LLM) capable of delivering data in complex formats based on real...
8mos ago
024.5K
Depth AI:构建全面的代码知识图谱,深度理解代码库的AI助手

Depth AI: An AI assistant for building a comprehensive code knowledge graph and deep understanding of the code base

Comprehensive Introduction Depth AI is an artificial intelligence assistant designed for developers to deeply understand and analyze code bases. By building a comprehensive code knowledge graph, Depth AI can answer complex technical questions and help developers manage and optimize their code more efficiently. Whether...
9mos ago
017.2K
SystoByte:编程系统设计练习平台,提供实时AI反馈,提升面试技能

SystoByte: a programming system design practice platform that provides real-time AI feedback to improve interview skills

General Introduction SystoByte is a platform built for system design practice, designed to help users improve their system design skills, especially in interview preparation. The platform provides a rich library of system design questions that users can design through an intuitive interface and get instant access to AI-generated...
9mos ago
017.2K
FindPicLocation:使用AI技术定位照片拍摄地点,快速获取片GPS定位

FindPicLocation: Use AI technology to locate the location where the photo was taken, and quickly get the GPS location of the photo.

Comprehensive Introduction FindPicLocation is a website that utilizes artificial intelligence technology to help users locate where their photos were taken. Users just need to upload photos, and the system will automatically analyze the EXIF data in the photos, extract the GPS coordinates, and display the exact location on the map. The site aims to...
9mos ago
023.7K
CrewAI:多角色扮演协作智能框架,简化复杂任务

CrewAI: A Multi-Roleplay Collaborative Intelligence Framework to Simplify Complex Tasks

Comprehensive Introduction CrewAI is an advanced framework designed to orchestrate collaboration between role-playing and autonomous AI agents. By facilitating collaborative intelligence, CrewAI enables agents to work together seamlessly to solve complex tasks. Whether you're building an intelligent assistant platform, automating customer service teams, or multi-agent...
9mos ago
022.7K
Leffa:高保真模特虚拟试穿与人物姿势调整,Meta开源的可控人物图像生成模型

Leffa: High-fidelity model virtual fitting and character pose adjustment, Meta open source controllable character image generation model

Comprehensive Introduction Leffa is a unified framework for generating controllable character images, enabling precise manipulation of character appearance (e.g., virtual fitting) and pose (e.g., pose transfer). The framework significantly reduces distortion of fine-grained details by directing the target query to focus on the correct reference key in the attention layer, with ...
9mos ago
021.1K
MMAudio:为视频画面生成同步音效与配乐,视频到音频的多模态联合训练工具

MMAudio: generating synchronized sound effects and soundtracks for video footage, video-to-audio multimodal co-training tool

General Introduction MMAudio is an open-source project aiming to generate high-quality synchronized audio through joint multimodal training. Developed by Ho Kei Cheng et al. at the Chinese University of Hong Kong, the project's main function is to generate synchronized audio based on video and/or text input.MM...
9mos ago
021.3K
AutoGPT:工作流自动化与自主执行任务的智能体构建平台

AutoGPT: Intelligent Body Building Platform for Workflow Automation and Autonomous Task Execution

General Description AutoGPT is a powerful platform designed to help users create, deploy and manage continuously running AI agents and automate complex workflows. Developed by Significant Gravitas, the platform offers a wide range of tools and features that enable users to focus...
9mos ago
019.5K
YOO简历:智能简历生成工具,在线制作大厂简历范文,提升求职成功率

YOO Resume: intelligent resume generation tool, online production of large factory resume sample, enhance the success rate of job hunting

Comprehensive Introduction YOO Resume is an intelligent resume generation tool launched by Zhuhai Biyou Technology Co. Ltd, aiming to help users create professional resumes quickly and efficiently through artificial intelligence technology. Whether you are a new student or an experienced job seeker, YOO Resume provides personalized resume templates and...
9mos ago
016.6K
瑞达写作:一键生成论文,免费选题生成论文大纲, 论文润色,引用文献数据

Rida Writing: One-click essay generation, free topic selection to generate an essay outline, thesis touch-up, citation of literature data

Comprehensive Introduction Rida Writing is an AI platform that focuses on academic paper writing, aiming to help users efficiently complete their paper writing tasks. By entering a dissertation title, users can generate complete dissertation content with up to 50,000 words in one click. The platform offers a variety of features, including free topic selection, idea outline...
9mos ago
018.4K
Qwen-Agent:基于Qwen的智能代理应用框架,包括工具调用、代码解释器、RAG和Chrome扩展。

Qwen-Agent: Qwen-based framework for intelligent agent applications, including tool calls, code interpreters, RAGs and Chrome extensions.

Comprehensive Introduction Qwen-Agent is an intelligent agent application framework developed based on Qwen 2.0 and above, with capabilities such as command following, tool usage, planning and memorization. The framework provides a variety of sample applications such as browser assistants, code interpreters and custom assistants...
9mos ago
020.5K
Mini-Cover:在线封面制作,专为博客、短视频、社交媒体等生成个性化封面

Mini-Cover: online cover creation, designed to generate personalized covers for blogs, short videos, social media and more

General Introduction Mini-Cover is an open source online cover generation tool designed to generate personalized covers for platforms such as blogs, short videos and social media. Developed by JLinMr, the tool aims to provide a simple and efficient solution to help users quickly generate covers that meet their needs...
9mos ago
016.7K
Swarms:多智能体编排框架,企业级生产工具

Swarms: Multi-intelligent Orchestration Framework, Enterprise Production Tool

General Introduction Swarms is an enterprise-grade production-ready multi-agent orchestration framework designed to boost business productivity through efficient agent management and task processing. With support for multiple models, multiple memory systems and custom agent creation, the framework provides a modular design and comprehensive logging capabilities to ensure that the system...
9mos ago
016.9K
算了么:共享你电脑闲置 GPU 显卡算力赚钱,支持科学研究

Forget it: Share your computer's unused GPUs and graphics cards to earn money and support scientific research!

Comprehensive Introduction Nevermind is a platform that utilizes the arithmetic power of idle graphics cards to perform scientific calculations and earn revenue. Users can share their computer's idle GPU resources to support scientific research and technological progress, while earning a certain financial return. The platform aims to promote scientific progress and solve important scientific research problems...
9mos ago
020K
Sonic:音频驱动肖像图片生成面部表情生动的数字人口播视频

Sonic: Audio-driven portrait images generate digital demo videos with vivid facial expressions

General Introduction Sonic is an innovative platform focusing on global audio perception designed to generate vivid portrait animations driven by audio. Developed by a team of researchers from Tencent and Zhejiang University, the platform utilizes audio information to control facial expressions and head movements to generate natural and smooth animated videos.S...
6mos ago
021.6K
Ultravox:实时端到端语音对话的音频多模态大模型,GPT-4o语音交互的开源实现

Ultravox: an audio multimodal macromodel for real-time end-to-end voice dialog, an open source implementation of GPT-4o voice interaction

Comprehensive Introduction Ultravox is an innovative multimodal Large Language Model (LLM) designed for real-time speech processing. Unlike traditional speech recognition systems, Ultravox eliminates the need for a separate Audio Speech Recognition (ASR) stage, and is able to directly convert audio into high-dimensional space in...
9mos ago
018.8K
Research Rabbit:使用本地LLM进行网页研究和报告撰写,自动深入用户指定主题并生成总结。

Research Rabbit: Web research and report writing using native LLM, automatically drilling down into user-specified topics and generating summaries.

General Introduction Research Rabbit is a native LLM (Large Language Model) based web research and summarization assistant. After the user provides a research topic, Research Rabbit generates a search query, obtains relevant web results, and summarizes those results...
6mos ago
017.2K
AgentClientDemo:演示智能体运行过程的Python客户端,提供直观的图形用户界面

AgentClientDemo: a Python client that demonstrates the process of running an intelligent body, providing an intuitive graphical user interface

Comprehensive Introduction AgentClientDemo is a comprehensive Python project that integrates intelligent (Agent) and client (Client) functionality. The project is based on the PyQt framework and provides an intuitive and easy-to-use graphical user interface (G...
9mos ago
016.5K
佐糖:在线图片处理工具,一键抠图、去水印、照片修复、人像编辑

Zosugar: online photo processing tools, one-click keying, watermark removal, photo restoration, portrait editing

Comprehensive Introduction ZuoSugar (PicWish) is an intelligent AI image processing platform, providing a wealth of online photo editing tools, supporting the use of all platforms. Users can easily complete one-click keying, watermark removal, blurry photos become clear, lossless zoom, image cropping, image compression and black and white photo...
9mos ago
017.3K
Wasitai:检查图像是否由AI生成的简单工具,提供图像检测API

Wasitai: a simple tool to check if an image is generated by AI, providing an image detection API

General Introduction Wasitai is a powerful and handy tool that helps users easily detect whether an image is generated by AI or not. With the advancement of AI in the field of image generation, many tools and platforms are available to create realistic, high-quality images from text, sketches, or other images. However, not all...
9mos ago
019.5K
ChatFree(ChatAnywhere-2):使用GPT API创建的本地Copilot,支持任意窗口中补全对话

ChatFree (ChatAnywhere-2): Native Copilot created using the GPT API to support complementary conversations in any window.

General Introduction ChatFree is an open source project that aims to free users' AI apps from the constraints of browsers to run locally. Created using GPT API, Copilot is designed to support a wide range of office software such as Office, Word, WPS, and more. The project was developed by ...
9mos ago
020K
Sketch-Gen:生成高质量线稿和草图,反推图像提示词,一键安装包

Sketch-Gen: Generate high-quality line drawings and sketches, backpropagate image cue words, one-click package installation

General Introduction Sketch-Gen is an AI technology-based line drawing and sketch generation tool designed to help artists and designers quickly generate high-quality line drawings and sketches. The tool is derived from the Paints-UNDO project and utilizes advanced machine learning models that can...
9mos ago
016.7K
混元文生视频:生成写实镜头感的高质量视频,腾讯开源视频生成大模型

Hybrid Vincennes video: generating realistic footage sense of high-quality video, Tencent open source video generation large model

Comprehensive Introduction Tencent Mixed Yuan Text Generation Video (available in Yuanbao APP) is a video generation platform based on AI technology launched by Tencent. The platform utilizes the Tencent Mixed Yuan Big Model with powerful cross-domain knowledge and natural language understanding to generate high-quality videos based on users' text descriptions...
8mos ago
019.5K
Director:智能视频代理框架,用自然语言描述执行视频搜索、编辑和生成工作流

Director: Intelligent Video Agent Framework for Performing Video Search, Editing, and Generation Workflows with Natural Language Descriptions

General Introduction Director is an open source framework designed to simplify and optimize video interactions and workflows by building intelligent video agents. The framework is based on VideoDB's "video-as-data" infrastructure and is capable of handling complex video tasks such as searching, editing, compiling and generating...
9mos ago
018.4K