AI open source project

Total 1020 articles posts
InstantIR:受损图像修复与图像高清放大开源项目,最低16G显存

InstantIR: damaged image repair and image high-definition zoom open source project, minimum 16G video memory

General Description InstantIR is an innovative single-image restoration model developed by the InstantX team, designed to resurrect your damaged images with extremely high-quality and realistic details, capable of high-quality restoration of damaged images. The tool not only restores the details of the image...
1yrs ago
041.8K
Fish Agent:端到端AI语音克隆助手,实时语音对话助理,Fish Speech衍生项目

Fish Agent: end-to-end AI voice cloning assistant, real-time voice conversation assistant, Fish Speech spin-off project

Comprehensive Introduction Fish Speech Derivative Project Fish Agent is a revolutionary end-to-end AI speech cloning system developed based on the V0.1 3B model architecture. As a fully end-to-end speech clone processing system, its most important feature is the use of innovative speechless...
11mos ago
041.6K
AI Hedge Fund:开源自动化交易系统,利用多智能体进行复杂对冲基金交易决策

AI Hedge Fund: open-source automated trading system utilizing multiple intelligences for complex hedge fund trading decisions

General Introduction AI Hedge Fund is an artificial intelligence hedge fund that utilizes a multi-agent system for trading decisions. The system works in concert with multiple specialized agents, including market data agents, quantitative agents, risk management agents, and portfolio management agents, to achieve complex trading...
10mos ago
041.6K
AsrTools:语音转字幕工具,内置剪映、快手、必剪接口的轻量客户端

AsrTools: speech-to-subtitle tool, lightweight client with built-in interfaces to Cutscene, Racer, and Must-Cut

Comprehensive Introduction AsrTools is an intelligent speech-to-text tool with built-in interfaces from big players such as Cutscene, Racer, Must Cut, etc. It does not require GPU or cumbersome configuration, and supports efficient multi-threaded batch processing. It is based on PyQt5 development, beautiful and user-friendly interface, able to output SRT and TXT format words...
1yrs ago
041.5K
ChatFree(ChatAnywhere-2):使用GPT API创建的本地Copilot,支持任意窗口中补全对话

ChatFree (ChatAnywhere-2): Native Copilot created using the GPT API to support complementary conversations in any window.

General Introduction ChatFree is an open source project that aims to free users' AI apps from the constraints of browsers to run locally. Created using GPT API, Copilot is designed to support a wide range of office software such as Office, Word, WPS, and more. The project was developed by ...
12mos ago
041.5K
NeoAI:让AI接管电脑远程操作,使用自然语言控制电脑的开源项目

NeoAI: Open source project that lets AI take over remote operation of computers and control them using natural language

General Introduction NeoAI is an innovative open source AI assistant tool that allows users to easily control and manage their computers through natural language conversations. Without writing any code, users can simply use everyday conversations to find files, automate tasks, manage devices, etc.NeoAI...
11mos ago
041.5K
Step-Audio:多模态语音交互框架,识别语音并使用克隆语音交流等功能

Step-Audio: a multimodal voice interaction framework that recognizes speech and communicates using cloned speech, among other features

Comprehensive Introduction Step-Audio is an open source intelligent speech interaction framework designed to provide out-of-the-box speech understanding and generation capabilities for production environments. The framework supports multi-language dialog (e.g., Chinese, English, Japanese), emotional speech (e.g., happy, sad), regional dialects (e.g., Cantonese, Szechuan ...
9mos ago
041.4K
FinRobot:提升金融数据分析效率和投资研究的的智能体

FinRobot: An Intelligent Body to Improve Financial Data Analysis Efficiency and Investment Research

Comprehensive Introduction FinRobot is an open source AI intelligence platform developed by AI4Finance Foundation and designed for financial analytics. It not only covers traditional language models, but also incorporates a variety of AI technologies, aiming to provide a comprehensive solution for the financial industry.F...
10mos ago
041.2K
KrillinAI:一键翻译和配音的视频多语言全球化工具

KrillinAI: Multilingual Globalization Tool for Video with One-Click Translation and Dubbing

Comprehensive Introduction KrillinAI is an open-source video processing tool focused on using artificial intelligence to help users translate videos and automatically dub them. It can start from the video download, all the way to generating the finished product adapted to different platforms, the whole process is just a few clicks. The developers are available on GitHub...
6mos ago
041.2K
Infinity:生成高分辨率图像的比特自回归建模,实现无限制高分辨率图像生成

Infinity: bitwise autoregressive modeling for generating high-resolution images for unlimited high-resolution image generation

General Introduction Infinity is a groundbreaking high-resolution image generation framework developed by the FoundationVision team. The project breaks through the limitations of traditional image generation models through an innovative bit-level visual autoregressive modeling approach.The core features of Infinity...
11mos ago
041K
Llasa 1~8B:高品质语音生成和克隆的开源文本转语音模型

Llasa 1~8B: an open source text-to-speech model for high quality speech generation and cloning

General Introduction Llasa-3B is an open source text-to-speech (TTS) model developed by the Audio Lab of the Hong Kong University of Science and Technology (HKUST Audio). The model is based on the Llama 3.2B architecture, which has been carefully tuned to provide high-quality speech generation that not only supports multiple...
10mos ago
040.9K
DeOldify:使用AI技术为黑白照片和视频上色的经典开源工具

DeOldify: the classic open-source tool for colorizing black-and-white photos and videos using AI technology

Comprehensive Introduction DeOldify is an open source project based on deep learning technology, specifically designed for intelligent colorization and restoration of black and white photos and videos. The project uses an innovative NoGAN training method to successfully solve the common defects of traditional GAN networks in the image coloring process...
11mos ago
040.8K
Sonic:音频驱动肖像图片生成面部表情生动的数字人口播视频

Sonic: Audio-driven portrait images generate digital demo videos with vivid facial expressions

General Introduction Sonic is an innovative platform focusing on global audio perception designed to generate vivid portrait animations driven by audio. Developed by a team of researchers from Tencent and Zhejiang University, the platform utilizes audio information to control facial expressions and head movements to generate natural and smooth animated videos.S...
8mos ago
040.4K
Kolors:生成高质量图像的文本到图像模型,支持生成中文海报

Kolors: text-to-image model for generating high-quality images, support for generating Chinese posters

Comprehensive Introduction Kolors is a large-scale text-to-image generation model developed by the Racer team, based on potential diffusion techniques. The model is trained on billions of text-image data pairs, and is capable of generating high-quality, complex semantically accurate images with support for both Chinese and English input.Kolors in visual quality...
11mos ago
040.4K
InstantID:上传一张图片,迁移人像特征来生成不同风格图片

InstantID: upload an image and migrate the portrait features to generate different styles of images

Comprehensive Introduction InstantID is an advanced technology focused on generating images with personalized styles or poses in seconds while ensuring a high level of fidelity using a single reference ID picture. The technology employs a diffusion model-based solution by integrating facial images, landmark maps...
1yrs ago
040.3K
MakeSense:免费使用的图像标注工具,提升计算机视觉项目效率

MakeSense: a free-to-use image annotation tool to improve computer vision project efficiency

General Introduction Make Sense is a free online image annotation tool designed to help users quickly prepare datasets for computer vision projects. It requires no complicated installation, just open a browser access to use it, supports multiple operating systems, and is perfect for small deep learning projects. Users can...
9mos ago
040.1K
DreamTalk:使用一张头像图片即可生成表情丰富的说话视频

DreamTalk: Generate expressive talking videos with a single avatar image!

DreamTalk Comprehensive Introduction DreamTalk is a diffusion model-driven expression talking head generation framework, jointly developed by Tsinghua University, Alibaba Group and Huazhong University of Science and Technology. It mainly consists of three parts: a noise reduction network, a style-aware lip expert and a style predictor, and can be based on...
12mos ago
040K
RD-Agent:自动化数据驱动研发工具,通过AI技术推动以数据为导向的研发过程

RD-Agent: an automated data-driven R&D tool to drive data-driven R&D processes through AI technology

Comprehensive Introduction RD-Agent is an open source tool from Microsoft designed to automate and optimize the research and development (R&D) process. The tool focuses on data-driven scenarios to improve the efficiency of model and data development through artificial intelligence techniques.RD-Agent integrates research...
9mos ago
039.8K
LazyLLM:商汤开源构建多智能体应用的低代码开发工具

LazyLLM: Shangtang's open source low-code development tool for building multi-intelligence body applications

Comprehensive Introduction LazyLLM is an open source tool developed by the LazyAGI team, focusing on simplifying the development process of multi-intelligence large model applications. It helps developers quickly build complex AI applications through one-click deployment and lightweight gateway mechanisms, saving tedious engineering configuration...
9mos ago
039.8K
Genesis:开源生成式物理引擎,实现基于真实物理的4D动态世界模拟

Genesis: open source generative physics engine for real physics-based 4D dynamic world simulation

General Introduction Genesis is a generative physics world designed for general purpose robotics and embodied AI learning. It provides a unified simulation platform that supports the simulation of a wide range of materials and physical phenomena.Genesis aims to unlock generative AI and physics simulation by combining...
11mos ago
039.6K
Refly:基于自由画布上流程编排的AI写作平台,自动化生成文章

Refly: an AI writing platform based on process orchestration on a free canvas for automated article generation

Comprehensive Introduction Refly is a free canvas-based AI native authoring engine designed to help users turn ideas into high-quality content through multi-threaded conversations, knowledge base integration, contextual memory and intelligent search technology. The platform covers over 20 professional scenario templates, including learning...
10mos ago
039.5K
Perplexica:1比1复刻 Perplexity AI 功能和界面的开源AI搜索引擎

Perplexica: an open source AI search engine that replicates Perplexity AI's features and interface 1 to 1

Comprehensive Introduction Perplexica is an open source AI-driven search engine designed to provide answers that delve deep into the Internet. It uses advanced machine learning algorithms, such as similarity search and embedding techniques, to optimize search results and provide clear answers with cited sources.Perple...
1yrs ago
039.5K
AI reads books:AI逐页阅读PDF书籍,自动提取知识要点并生成总结

AI reads books: AI reads PDF books page by page, automatically extracts the main points of knowledge and generates summaries.

Comprehensive Introduction AI-reads-books-page-by-page is a Python-based development of intelligent PDF book analysis tool, which can automate the page-by-page analysis of PDF books, extract the key knowledge points, and after the specified page interval to generate stage...
11mos ago
039.4K
Qwen-Agent:基于Qwen的智能代理应用框架,包括工具调用、代码解释器、RAG和Chrome扩展。

Qwen-Agent: Qwen-based framework for intelligent agent applications, including tool calls, code interpreters, RAGs and Chrome extensions.

Comprehensive Introduction Qwen-Agent is an intelligent agent application framework developed based on Qwen 2.0 and above, with capabilities such as command following, tool usage, planning and memorization. The framework provides a variety of sample applications such as browser assistants, code interpreters and custom assistants...
12mos ago
039.4K
Browser-Use:构建智能网页自动化工具,让AI智能体轻松操作浏览器

Browser-Use: Building Intelligent Web Automation Tools for AI Intelligents to Easily Operate Browsers

Comprehensive Introduction Browser-Use is an innovative open source web automation tool specifically designed to enable Language Models (LLMs) to naturally interact with websites. It provides a powerful and flexible framework that supports a wide range of mainstream language models, including GPT-4, Claud...
11mos ago
039.4K
Activepieces:AI工作流程自动化,适合非技术用户的任务编排工具,开源Zapier替代品

Activepieces: AI workflow automation, task scheduling tool for non-technical users, open source Zapier replacement

General Introduction Activepieces is an open source, all-in-one automation workflow platform focused on providing intuitive and powerful automation solutions for businesses and individual users. Developed in TypeScript, the platform is extremely scalable and supports more than 200 integrated services...
11mos ago
039.3K
阿布量化交易系统:基于Python的开源量化交易平台

Abu quantitative trading system: Python based open source quantitative trading platform

Comprehensive introduction Abu quantitative trading system is an open source platform based on Python development. It was created by user "bbfamily" to help investors realize quantitative trading strategies through code. The system supports backtesting and trading of various financial products such as stocks, options, futures and bitcoin. It...
8mos ago
039.2K