General Introduction Kotaemon is an open source document Q&A tool designed to provide end-users and developers with Q&A capabilities based on Retrieval Augmented Generation (RAG). Developed by Cinnamon, the project supports a variety of LLM API providers (e.g. OpenAI, AzureOpenAI, Cohere, etc.) as well as native...
Comprehensive introduction HivisionIDPhotos is an open source lightweight AI document photo production tools, can intelligently identify the user photo scene and keying, to generate a standard document photo in line with a variety of specifications. The tool supports custom background colors and sizes, and in the future will also launch the beauty and intelligent change of formal dress function. With...
Enable Builder Smart Programming Mode, unlimited use of DeepSeek-R1 and DeepSeek-V3, smoother experience than the overseas version. Just enter the Chinese commands, even a novice programmer can write his own apps with zero threshold.
General Introduction Marker is a deep learning based document processing tool designed to convert PDF files to Markdown format quickly and accurately. It supports a wide range of document types and is especially optimized for conversion of books and scientific papers.Marker is able to remove redundant content such as headers and footers, format tables and...
General Introduction SadTalker is an open source tool that combines single still portrait photos and audio files to create realistic talking head videos for a wide range of scenarios such as personalized messages, educational content, and more. The revolutionary use of 3D modeling technologies such as ExpNet and PoseVAE excel in capturing the subtle facets...
General Introduction VideoReTalking is an innovative system that allows users to generate lip-synchronized facial videos based on input audio, producing high-quality and lip-synchronized output videos even with different emotions. The system breaks down this goal into three successive tasks: facial video generation with typical expressions...
General Introduction MuseV is a public project on GitHub that aims to enable the generation of avatar videos of unlimited length and high fidelity. It is based on diffusion technology and offers Image2Video, Text2Image2Video, Video2Video and many other features. Provides model structure, use cases, quick start...
Comprehensive Introduction Unstructured-IO provides a range of open source components for processing and preprocessing images and text documents such as PDF, HTML, Word documents, etc. Its main goal is to simplify and optimize data processing workflow , especially for large language model (LLM) applications to provide support.Unstructured...
General Introduction magic-html is a Python library designed to simplify the process of extracting body region content from HTML. Whether dealing with complex HTML structures or simple web pages, this library aims to provide a convenient and efficient interface for users. It supports multimodal extraction, multiple layout extracto...
WebPilot General Introduction Webpilot is a free and open source "web assistant" that allows you to communicate freely with any web page or perform automated tasks. Instead of switching pages or copying and pasting, just select text or enter commands, and webpilot will provide you with real-time information and smart...
Comprehensive Introduction DB-GPT is an open source AI native data application development framework built using AWEL (Agentic Workflow Expression Language) and intelligent body technologies. The project aims to build infrastructure in the field of large models by developing several technical capabilities, including a multi-model management system (SMMF),...
DreamTalk Comprehensive Introduction DreamTalk is a diffusion model-driven expression talking head generation framework, jointly developed by Tsinghua University, Alibaba Group and Huazhong University of Science and Technology. It is mainly composed of three parts: a noise reduction network, a style-aware lip expert and a style predictor, and is able to generate a variety of audio input based on...
General Introduction GPT Crawler is an open source tool that allows users to generate knowledge files by crawling the content of a specific website, which in turn creates customized GPT models. It is mainly used for crawling and organizing web information and supports running via API and local deployment. Users can flexibly configure the crawler to fit...
Comprehensive Introduction InstantID is an advanced technology focused on generating images with personalized styles or poses in seconds while ensuring a high level of fidelity using a single reference ID image. The technology employs a diffusion model-based solution by integrating facial images, landmark images with...
General Introduction ComfyUI Portrait Master Chinese version is a portrait cue word generation tool designed for AI image creators. The tool helps users generate high-quality portraits by optimizing the cue words. Users can choose different lens types, gender, nationality, facial expression...
General Introduction IOPaint is a free and open source AI image processing tool that supports image erasing, repairing and expanding. It uses state-of-the-art AI models to help users easily remove unwanted objects from an image, repair blemishes, add new content, and even expand an image.IOPaint is fully self-hosted...
Comprehensive Introduction GPT Academic is a large language model interaction platform optimized for academic research, providing tools for pragmatic interaction interfaces for large language models such as GPT/GLM, especially optimized for paper translation, paper reading, touch-up and writing experience. It is modular in design and supports customized shortcut press...
General Introduction gpt-prompt-engineer is an open source project on GitHub that focuses on prompt engineering for the GPT model. Users can enter task descriptions and test cases, and this tool is able to generate, test, and rank different prompts to find the best performer. The project utilizes the GPT-4 and GPT-3.5-T...
General Introduction STORM is a knowledge integration and article generation system developed by the Oval team at Stanford University. It focuses on generating exhaustive Wikipedia-like articles (systematic papers) from scratch. The system utilizes large-scale language models for topic research, preparing synopses and simulating actual Internet sources of...
General Introduction XHS-Downloader is an open source tool designed for Xiaohongshu users to support extracting and downloading watermark-free images and video works on Xiaohongshu. The tool provides a variety of features, including getting cookies from browsers, support for command line operations, batch downloads, breakpoints, and so on. Users can...
Chief AI Sharing Circle specializes in AI learning, providing comprehensive AI learning content, AI tools and hands-on guidance. Our goal is to help users master AI technology and explore the unlimited potential of AI together through high-quality content and practical experience sharing. Whether you are an AI beginner or a senior expert, this is the ideal place for you to gain knowledge, improve your skills and realize innovation.