Latest AI Resources

Total 2759 articles posts
Qwen-Agent:基于Qwen的智能代理应用框架,包括工具调用、代码解释器、RAG和Chrome扩展。

Qwen-Agent: Qwen-based framework for intelligent agent applications, including tool calls, code interpreters, RAGs and Chrome extensions.

Comprehensive Introduction Qwen-Agent is an intelligent agent application framework developed based on Qwen 2.0 and above, with capabilities such as command following, tool usage, planning and memorization. The framework provides a variety of sample applications such as browser assistants, code interpreters and custom assistants...
10mos ago
028.2K
Mini-Cover:在线封面制作,专为博客、短视频、社交媒体等生成个性化封面

Mini-Cover: online cover creation, designed to generate personalized covers for blogs, short videos, social media and more

General Introduction Mini-Cover is an open source online cover generation tool designed to generate personalized covers for platforms such as blogs, short videos and social media. Developed by JLinMr, the tool aims to provide a simple and efficient solution to help users quickly generate covers that meet their needs...
10mos ago
023K
Swarms:多智能体编排框架,企业级生产工具

Swarms: Multi-intelligent Orchestration Framework, Enterprise Production Tool

General Introduction Swarms is an enterprise-grade production-ready multi-agent orchestration framework designed to boost business productivity through efficient agent management and task processing. With support for multiple models, multiple memory systems and custom agent creation, the framework provides a modular design and comprehensive logging capabilities to ensure that the system...
10mos ago
021.5K
算了么:共享你电脑闲置 GPU 显卡算力赚钱,支持科学研究

Forget it: Share your computer's unused GPUs and graphics cards to earn money and support scientific research!

Comprehensive Introduction Nevermind is a platform that utilizes the arithmetic power of idle graphics cards to perform scientific calculations and earn revenue. Users can share their computer's idle GPU resources to support scientific research and technological progress, while earning a certain financial return. The platform aims to promote scientific progress and solve important scientific research problems...
10mos ago
027.6K
Sonic:音频驱动肖像图片生成面部表情生动的数字人口播视频

Sonic: Audio-driven portrait images generate digital demo videos with vivid facial expressions

General Introduction Sonic is an innovative platform focusing on global audio perception designed to generate vivid portrait animations driven by audio. Developed by a team of researchers from Tencent and Zhejiang University, the platform utilizes audio information to control facial expressions and head movements to generate natural and smooth animated videos.S...
7mos ago
028.4K
Ultravox:实时端到端语音对话的音频多模态大模型,GPT-4o语音交互的开源实现

Ultravox: an audio multimodal macromodel for real-time end-to-end voice dialog, an open source implementation of GPT-4o voice interaction

Comprehensive Introduction Ultravox is an innovative multimodal Large Language Model (LLM) designed for real-time speech processing. Unlike traditional speech recognition systems, Ultravox eliminates the need for a separate Audio Speech Recognition (ASR) stage, and is able to directly convert audio into high-dimensional space in...
10mos ago
025.9K
Research Rabbit:使用本地LLM进行网页研究和报告撰写,自动深入用户指定主题并生成总结。

Research Rabbit: Web research and report writing using native LLM, automatically drilling down into user-specified topics and generating summaries.

General Introduction Research Rabbit is a native LLM (Large Language Model) based web research and summarization assistant. After the user provides a research topic, Research Rabbit generates a search query, obtains relevant web results, and summarizes those results...
7mos ago
023.6K
AgentClientDemo:演示智能体运行过程的Python客户端,提供直观的图形用户界面

AgentClientDemo: a Python client that demonstrates the process of running an intelligent body, providing an intuitive graphical user interface

Comprehensive Introduction AgentClientDemo is a comprehensive Python project that integrates intelligent (Agent) and client (Client) functionality. The project is based on the PyQt framework and provides an intuitive and easy-to-use graphical user interface (G...
10mos ago
021.5K
佐糖:在线图片处理工具,一键抠图、去水印、照片修复、人像编辑

Zosugar: online photo processing tools, one-click keying, watermark removal, photo restoration, portrait editing

Comprehensive Introduction ZuoSugar (PicWish) is an intelligent AI image processing platform, providing a wealth of online photo editing tools, supporting the use of all platforms. Users can easily complete one-click keying, watermark removal, blurry photos become clear, lossless zoom, image cropping, image compression and black and white photo...
10mos ago
022.9K
Wasitai:检查图像是否由AI生成的简单工具,提供图像检测API

Wasitai: a simple tool to check if an image is generated by AI, providing an image detection API

General Introduction Wasitai is a powerful and handy tool that helps users easily detect whether an image is generated by AI or not. With the advancement of AI in the field of image generation, many tools and platforms are available to create realistic, high-quality images from text, sketches, or other images. However, not all...
10mos ago
026.3K
ChatFree(ChatAnywhere-2):使用GPT API创建的本地Copilot,支持任意窗口中补全对话

ChatFree (ChatAnywhere-2): Native Copilot created using the GPT API to support complementary conversations in any window.

General Introduction ChatFree is an open source project that aims to free users' AI apps from the constraints of browsers to run locally. Created using GPT API, Copilot is designed to support a wide range of office software such as Office, Word, WPS, and more. The project was developed by ...
10mos ago
027.9K
Sketch-Gen:生成高质量线稿和草图,反推图像提示词,一键安装包

Sketch-Gen: Generate high-quality line drawings and sketches, backpropagate image cue words, one-click package installation

General Introduction Sketch-Gen is an AI technology-based line drawing and sketch generation tool designed to help artists and designers quickly generate high-quality line drawings and sketches. The tool is derived from the Paints-UNDO project and utilizes advanced machine learning models that can...
10mos ago
023.5K
混元文生视频:生成写实镜头感的高质量视频,腾讯开源视频生成大模型

Hybrid Vincennes video: generating realistic footage sense of high-quality video, Tencent open source video generation large model

Comprehensive Introduction Tencent Mixed Yuan Text Generation Video (available in Yuanbao APP) is a video generation platform based on AI technology launched by Tencent. The platform utilizes the Tencent Mixed Yuan Big Model with powerful cross-domain knowledge and natural language understanding to generate high-quality videos based on users' text descriptions...
9mos ago
025.2K
Director:智能视频代理框架,用自然语言描述执行视频搜索、编辑和生成工作流

Director: Intelligent Video Agent Framework for Performing Video Search, Editing, and Generation Workflows with Natural Language Descriptions

General Introduction Director is an open source framework designed to simplify and optimize video interactions and workflows by building intelligent video agents. The framework is based on VideoDB's "video-as-data" infrastructure and is capable of handling complex video tasks such as searching, editing, compiling and generating...
10mos ago
023.8K
识典古籍:免费在线阅读和检索古籍资源,AI助手白话解释古籍原文

Knowledge of ancient books: free online reading and retrieval of ancient resources, AI assistant vernacular interpretation of the original text of ancient books

Comprehensive Introduction Ludian Ancient Books is a digitization platform for ancient books jointly launched by Peking University and ByteDance Public Welfare, aiming to provide free online reading and retrieval services of ancient books for the public. The platform gathers more than 2,200 ancient books resources, including classic literature such as Zhou Yi, Zuo Zhuan and Li Ji, and provides high-definition...
10mos ago
023.3K
MoneyPrinterTurbo:输入视频主题一键生成视频文案和高清短视频

MoneyPrinterTurbo: Generate video copy and short HD videos in one click by entering a video theme

Comprehensive Introduction MoneyPrinterTurbo is an open source project that utilizes advanced AI big model technology to achieve the function of generating short HD videos with one click. Users only need to provide a video theme or keywords, the system will automatically generate video copy, video clips, video subtitles and...
7mos ago
026.5K
AIMedia:全自动托管AI媒体软件,自动抓取热点,自动生成新闻,自动发布各大平台。

AIMedia: Fully automated hosted AI media software that automatically grabs hotspots, automatically generates news, and automatically publishes on all major platforms.

Comprehensive Introduction AIMedia is an integrated software designed to automatically capture hot news, AI-created articles and automatically publish them to major platforms. The software supports a variety of platforms, including Today's headlines, Xiaohongshu, WeChat public number, etc. AIMedia is able to automatically get the major platforms' hot...
10mos ago
028.6K
Rubbrband:对话方式生成和编辑图像与视频的多功能平台

Rubbrband: a versatile platform for dialogically generating and editing images and videos

General Introduction Rubbrband is a versatile media generation platform that specializes in image and video generation and editing. The platform utilizes advanced AI technology to provide a variety of features such as text-to-image conversion, conceptual model training, and more to help users easily create high-quality visual content. No...
10mos ago
022.3K
FliFlik:AI图片处理客户端,一键图像高清化、放大、降噪与水印去除

FliFlik: AI image processing client, one-click image high-definition, enlargement, noise reduction and watermark removal

General Introduction FliFlik is a multimedia solution platform focused on providing efficient and convenient digital processing services. Whether it's photos, audio or video, FliFlik can optimize and enhance them with its advanced AI technology. The platform supports Windows...
10mos ago
025.6K
BISHENG(文擎毕昇):构建企业级AI应用的开源LLM DevOps平台

BISHENG: Open Source LLM DevOps Platform for Building Enterprise AI Applications

Comprehensive Introduction BISHENG is an open source LLM (Large Language Model) DevOps platform designed for next generation enterprise AI applications. The platform provides powerful and comprehensive features including generative AI workflows, RAG (Retrieval Augmented Generation), intelligent agents, unified model management...
10mos ago
029.1K
GLM-PC(智谱牛牛)正式发布内测下载,真正可以控制电脑的AI

GLM-PC (Smart Spectrum Bull) officially released for internal download, the real AI that can control the computer

GLM-PC (Bull) Introduction GLM-PC is a desktop application based on the CogAgent model, which is able to perform complex tasks quickly through natural language commands. It has the ability of task planning and interface understanding, and can autonomously complete various computer operations according to user instructions. Notes for use...
10mos ago
024.4K
PSHuman:生成逼真3D人像模型,使用一张照片生成3D人建模

PSHuman: Generate realistic 3D portrait models, use a photo to generate 3D human modeling

Comprehensive Introduction PSHuman is a single-image 3D portrait reconstruction tool based on multi-view diffusion technology. The tool is capable of generating detailed geometric structures and realistic 3D portrait models from a single photo of a clothed person.PSHuman's core technology includes cross-scale multi-view diffusion, which is capable of...
10mos ago
026.4K
TRELLIS:Microsoft开发的3D资产生成模型,支持多种格式和灵活编辑

TRELLIS: Microsoft-developed 3D asset generation model with multiple format support and flexible editing

General Introduction TRELLIS is a large-scale 3D asset generation model developed by Microsoft. It is capable of receiving text or image prompts and generating high-quality 3D assets in a variety of formats, such as radial fields, 3D Gaussians, and meshes.At the heart of TRELLIS is a unified structured latent...
10mos ago
030.5K
Bambo:轻量灵活的智能体框架,简单配置角色和工具,处理多种负载任务

Bambo: a lightweight and flexible framework for intelligent bodies, with simple configuration of roles and tools to handle multiple loads of tasks

Comprehensive Introduction Bambo is a new type of proxy framework, which is lighter and more flexible than the mainstream frameworks and can handle a variety of load tasks.Bambo achieves efficient proxy functionality by defining all the tools in the tool catalog and using asynchronous custom functions. Users can use the llm_c...
10mos ago
023.5K
Marco-o1:基于Qwen2-7B-Instruct微调的开源版OpenAI o1模型,探索开放式推理模型,解决复杂问题

Marco-o1: An Open Source Version of the OpenAI o1 Model Based on Qwen2-7B-Instruct Fine-Tuning to Explore Open Inference Models for Solving Complex Problems

Comprehensive Introduction Marco-o1 is an open reasoning model developed by Alibaba International Digital Commerce Group (AIDC-AI) to solve complex real-world problems. The model combines Chain of Thought (CoT) fine-tuning, Monte Carlo Tree Search (MCTS), and innovative reasoning strategies...
10mos ago
023K
Flow(Laminar):构建智能体的轻量级任务引擎,简化并灵活管理任务

Flow (Laminar): a lightweight task engine for building intelligences that simplifies and flexibly manages tasks

Comprehensive Introduction Flow is a lightweight task engine designed for building AI agents, emphasizing simplicity and flexibility. Unlike traditional node- and edge-based workflows, Flow uses a dynamic task queuing system that supports parallel execution, dynamic scheduling, and intelligent dependency management. Its core concept is ...
10mos ago
025.2K
MagicQuill:智能交互式图像涂鸦编辑系统,精准局部涂鸦编辑

MagicQuill: Intelligent Interactive Image Graffiti Editing System, Precise Localized Graffiti Editing

General Introduction MagicQuill is an open-source AI interactive image editing tool jointly launched by Hong Kong University of Science and Technology (HKUST), Ant Group, Zhejiang University and University of Hong Kong. The tool aims to achieve accurate localized editing of images in an intelligent and interactive way.MagicQuill...
11mos ago
031.2K
MegaParse:解析各类型文档为LLM可用数据,完整保留文档中的表格、图片等所有信息

MegaParse: parses all types of documents into LLM-available data, preserving all information in the document such as tables, pictures, etc. in its entirety

Comprehensive Introduction MegaParse is a powerful and versatile document parsing tool designed to optimize data processing for the Large Language Model (LLM). Whether you are working with text, PDF, PowerPoint presentations or Word documents, MegaParse...
11mos ago
025.4K
析言GBI(XiYan-SQL):Text-to-SQL智能数据分析,轻松实现ChatBI

Analytics GBI (XiYan-SQL): Text-to-SQL Intelligent Data Analytics for ChatBI with Ease

Comprehensive Introduction Analyzing Words GBI is an intelligent data analysis product based on big models launched by AliCloud Hundred Refine. The product utilizes advanced natural language processing technology to help users query and analyze data through natural language without having to master complex SQL syntax. Analytics GBI supports multiple data sources, including...
10mos ago
025.7K
RMBG-2-Studio:批量移除图像和视频背景的开源程序,基于RMBG 2.0优化

RMBG-2-Studio: open source program for batch removal of image and video backgrounds, optimized for RMBG 2.0

General Introduction RMBG-2-Studio is an enhanced background removal and replacement application developed based on the BRIA-RMBG-2.0 model. The application is designed to provide users with efficient and accurate image background processing capabilities for a variety of image types, including e-commerce, gaming and...
11mos ago
028.2K
OpenAlternative:精选常用SaaS产品的开源软件替代方案,寻找最佳开源替代方案

OpenAlternative: a selection of open source software alternatives to commonly used SaaS products, finding the best open source alternatives

General Introduction OpenAlternative is a platform focused on providing open source software alternatives, aiming to help users find suitable open source tools to replace the commercial SaaS products they use on a daily basis. The site helps users save money and improve through a carefully curated collection of open source tools...
11mos ago
021.2K
TextDistiller:一键总结一整本书,高效提炼书籍内容,快速掌握核心思想

TextDistiller: summarize an entire book in one click, efficiently distill the content of the book, quickly grasp the core ideas

Comprehensive Introduction TextDistiller is an advanced AI-driven tool designed to summarize books chapter-by-chapter or as a whole, providing a concise yet comprehensive overview. By using TextDistiller, users are able to quickly grasp the core ideas and key points of any book...
11mos ago
022.5K
ChainForge:测试和评估大型语言模型提示效果的开源可视化编程环境

ChainForge: An Open Source Visual Programming Environment for Testing and Evaluating the Effectiveness of Large Language Model Hints

Comprehensive Introduction ChainForge is an open source visual programming environment designed for testing and evaluating the effectiveness of Large Language Model (LLM) cues. It provides a data flow cueing engineering environment through which users can quickly explore and analyze the quality of different cues on LLM response...
11mos ago
023.5K