General Introduction Unigraph is a local-first universal knowledge graph and personal search engine designed to provide users with an integrated workspace to help manage and search for a wide variety of data in their personal lives. With Unigraph, users can integrate data from different sources into a unified knowledge graph...
General Introduction ComfyUI-disty-Flow is a custom node that provides a user-friendly interface for ComfyUI. It is intended to simplify the running of workflows by providing alternative user interfaces, rather than replacing the creation of workflows.ComfyUI-disty-Flow is currently in the early stages of development, so...
Enable Builder Smart Programming Mode, unlimited use of DeepSeek-R1 and DeepSeek-V3, smoother experience than the overseas version. Just enter the Chinese commands, even a novice programmer can write his own apps with zero threshold.
General Introduction Shortest is an AI-powered natural language end-to-end testing framework developed by the Anti-Work team. It is built on Playwright and supports GitHub integration and two-factor authentication (2FA).Shortest's main feature is to write test cases through natural language and utilize Anthropic Cl...
General Introduction Midscene.js is an AI-powered browser automation tool that controls web pages, performs assertions and extracts data through natural language commands. It supports Chrome extensions, JavaScript SDKs and YAML scripts, simplifying the process of writing and maintaining UI tests. By utilizing multimodal large ...
Comprehensive Introduction Video Analyzer is a comprehensive video analysis tool that combines computer vision, audio transcription, and natural language processing techniques to generate detailed video content descriptions. The tool does this by extracting key frames from the video, transcribing audio content, and generating natural language...
General Introduction Unsloth is an open source project designed to provide efficient tools for fine-tuning and training large language models (LLMs). The project supports a wide range of well-known models, including Llama, Mistral, Phi, and Gemma, etc. Unsloth's main features are the ability to significantly reduce memory usage and speed up training...
Comprehensive Introduction MaxKB (Max Knowledge Base) is an open source knowledge base Q&A system based on large language modeling and RAG (Retrieval Augmented Generation). The system is widely used in intelligent customer service, enterprise internal knowledge base, academic research and education and other scenarios.MaxKB supports direct upload documents or automatically crawl in...
Comprehensive Introduction OmniThink is an innovative machine writing framework designed to generate high-quality, long-form articles by mimicking the iterative expansion and reflection of human cognitive processes. The framework focuses on extending the boundaries of knowledge and generating information that is rich and deep.OmniThink generates articles by building outlines and...
General Introduction OpenAI Realtime Agents is an open source project that aims to show how OpenAI's real-time API can be utilized to build multi-intelligent body speech applications. It provides a high-level intelligent body model (borrowed from OpenAI Swarm) that allows developers to build complex multi-intelligent body speech systems in a short time...
General Introduction DeepFace is a lightweight Python library for facial recognition and facial attribute analysis (including age, gender, emotion and ethnicity). It integrates several advanced facial recognition models such as VGG-Face, FaceNet, OpenFace, DeepFace, DeepID, ArcFace, Dlib, SFace...
Comprehensive Introduction SynthLight is a portrait relighting tool based on a diffusion model. It learns to re-render synthetic face images to achieve lighting effect adjustments to real portrait photos. The tool uses a physical rendering engine to generate datasets that simulate lighting transformations under different lighting conditions.SynthLigh...
General Introduction 1-2-1-MNVTON is a GitHub-based open source project that aims to achieve efficient virtual try-on through the "Modality-specific Normalization for Virtual Try-On" (MNVTON) technology. The project solves the problem of high computational cost in traditional virtual try-on techniques by providing ...
General Introduction Kokoro-ONNX is an open source text-to-speech (TTS) tool based on ONNX runtime. Developed by thewh1teagle, the project aims to provide efficient and fast speech synthesis solutions.Kokoro-ONNX supports multiple languages, including English, and plans to support French, Japanese, Korean...
Comprehensive introduction Zerox is an open source project designed to convert PDF, DOCX, images and other documents to Markdown format through visual modeling . The project is developed by getomni-ai team , provides a simple and efficient OCR (Optical Character Recognition) solution.Zerox supports Node and Python programming languages, ...
General Description Charla is an endpoint-based chat application designed to have conversations with native language models. The application integrates with the Ollama backend, supports context-aware conversations, and saves chat sessions as Markdown files. Users can launch and enable it through simple command line operations...
Comprehensive Introduction MiniRAG is an extremely simple Retrieval Augmented Generation (RAG) framework that aims to enable good RAG performance even for small models through heterogeneous graph indexing and lightweight topology-enhanced retrieval. It is developed by the Hong Kong University Data Science Laboratory (HKUDS) and focuses on solving the Small Language Model (SLM...
Comprehensive Introduction Omni-RGPT is a multimodal large language model designed to enable region-level understanding of images and videos. By introducing the Token Mark technique, Omni-RGPT is able to highlight target regions in the visual feature space and embed these tokens directly through region cues (e.g., boxes or masks), while placing...
Comprehensive Introduction Bailing (Bailing) is an open source voice conversation assistant designed to engage in natural conversations with users through speech. The project combines speech recognition (ASR), voice activity detection (VAD), large language modeling (LLM) and speech synthesis (TTS) technologies to achieve a GPT-4o-like speech...
Comprehensive Introduction Metaverse AI (open source version) is a project hosted on GitHub, developed by libn-net team. It can clone digital human images and voices through AI technology to generate short videos, and also supports dubbing and subtitling. The tool is available for Windows, Web, H5 and small...