Kotaemon: simple to deploy open source multimodal document quiz tool
General Introduction Kotaemon is an open source document Q&A tool designed to provide end-users and developers with Q&A functionality based on Retrieval Augmented Generation (RAG). The project is developed by Cinnamon and supports a variety of LLM API providers (e.g. OpenA...
HivisionIDPhotos: open source intelligent AI photo ID creation tool
Comprehensive introduction HivisionIDPhotos is an open source lightweight AI document photo production tool, can intelligently identify the user photo scene and keying, to generate a standard document photo in line with a variety of specifications. The tool supports custom background color and size, the future will also introduce beauty and...
Marker: quickly convert PDF to Markdown open source tools
General Introduction Marker is a deep learning based document processing tool designed to convert PDF files to Markdown format quickly and accurately. It supports a wide range of document types and is especially optimized for conversion of books and scientific papers.Marker is able to remove headers...
Configuring the Python Programming Prompt Word Directive for Cursor
This directive provides a comprehensive guide to developing high-quality Python code, especially when using the FastAPI, Flask, and Django frameworks for web application and API development, as well as for data analysis and deep learning tasks. Here are the main points of the directive...
Mathpix: PDF and image documents structured conversion software, support for multi-terminal
General Description Mathpix is a powerful AI-driven document automation tool designed for researchers, developers and enterprises. It quickly and accurately converts PDFs and images into searchable, exportable and machine-readable text.Mathpix offers a wide range of features...
ChatWiki: lightweight open source enterprise knowledge base AI Q&A system
Comprehensive Introduction ChatWiki is an open source knowledge base AI Q&A system officially launched by Sesame Small Customer Service, built on Large Language Modeling (LLM) and Retrieval Augmented Generation (RAG) technology. It provides out-of-the-box data processing and model calling capabilities to help companies quickly build their own knowledge...
SadTalker: Make Photos Talk | Mouth Synchronized Audio | Synthesized Mouth Synchronized Video | Free Digital People
General Introduction SadTalker is an open source tool that combines a single still portrait photo with an audio file to create realistic talking avatar videos for a variety of scenarios such as personalized messages, educational content, and more. The revolutionary use of 3D modeling technologies such as ExpNet and PoseVA...
VideoReTalking: Audio-Driven Lip Synchronization and Video Editing System
General Introduction VideoReTalking is an innovative system that allows users to generate lip-synchronized facial videos based on the input audio, producing high-quality and lip-synchronized output videos even with different emotions. The system breaks down this goal into three consecutive tasks: with typical expressions...
Musicfy: voice song generator, convert song singing style
General Introduction Musicfy.lol is an AI-based music creation platform that allows users to transform their voice or other sounds into music through AI technology. The platform provides a variety of innovative features, such as AI sound artist, track separation, AI text to music, etc., to help users lightly...
Chatbox: multi-platform client AI desktop assistant
Chatbox General Introduction Chatbox is a desktop software that supports several of the world's most advanced AI big modeling services, including but not limited to ChatGPT.It is designed to enhance the efficiency of user's work and learning, and is highly regarded by professionals worldwide.Chatbo...









