Tags: AI open source project
General Introduction: MuseV is a public project on GitHub that aims to generate avatar videos of unlimited length and high fidelity. It is based on diffusion technology and offers Image2Video, Text2Image2Video, Video2Video, and many other features. It provides model structure, use cases, quick start...
Comprehensive Introduction: Unstructured-IO provides a range of open source components for processing and preprocessing images and text documents such as PDF, HTML, and Word files. Its main goal is to simplify and optimize data processing workflows, especially in support of large language model (LLM) applications. Unstructured...
General Introduction: magic-html is a Python library designed to simplify extracting the main body content from HTML. Whether dealing with complex HTML structures or simple web pages, the library aims to provide a convenient and efficient interface. It supports multimodal extraction, multiple layout extracto...
WebPilot General Introduction: WebPilot is a free and open source "web assistant" that lets you interact freely with any web page or perform automated tasks. Instead of switching pages or copying and pasting, just select text or enter commands, and WebPilot will provide you with real-time information and smart...
Comprehensive Introduction: DB-GPT is an open source, AI-native data application development framework built using AWEL (Agentic Workflow Expression Language) and agent technologies. The project aims to build infrastructure for the large model field by developing several technical capabilities, including a multi-model management system (SMMF),...
DreamTalk Comprehensive Introduction: DreamTalk is a diffusion model-driven framework for generating expressive talking heads, jointly developed by Tsinghua University, Alibaba Group, and Huazhong University of Science and Technology. It consists of three main parts: a denoising network, a style-aware lip expert, and a style predictor, and is able to generate a variety of audio input based on...
General Introduction: GPT Crawler is an open source tool that lets users generate knowledge files by crawling the content of a specific website; those files can then be used to create custom GPTs. It is mainly used for crawling and organizing web information, and supports running via API and local deployment. Users can flexibly configure the crawler to fit...
Comprehensive Introduction: InstantID is a technique for generating images in personalized styles or poses within seconds from a single reference ID image, while maintaining a high level of fidelity. It employs a diffusion model-based solution by integrating facial images, landmark images with...
General Introduction: ComfyUI Portrait Master Chinese version is a portrait prompt generation tool designed for AI image creators. The tool helps users generate high-quality portraits by optimizing the prompts. Users can choose different lens types, gender, nationality, facial expression...
General Introduction: IOPaint is a free and open source AI image processing tool that supports erasing, repairing, and expanding images. It uses state-of-the-art AI models to help users easily remove unwanted objects from an image, repair blemishes, add new content, and even expand an image. IOPaint is fully self-hosted...