General Introduction Qwen4Mac is an open source project designed to integrate the Qwen Large Language Model (LLM) into the Mac's menu bar, making it easy for users to call and use it at any time. The project is developed and maintained by andreaturchet and provides an easy way for users to directly access and use Qw...
General Introduction Pocket AI (PocketPal AI Chinese version) is a powerful offline AI assistant designed to let users talk to AI anytime, anywhere. It is based on Small Language Models (SLMs), runs on mobile phones without an internet connection, and is specially adapted for Chinese-speaking users. Pocket AI supports a variety of small language...
General Introduction Kokoro WebGPU is the WebGPU version of the Kokoro text-to-speech (TTS) model, provided by WebML Community on the Hugging Face platform. The project utilizes WebGPU technology to enable users to run efficient text-to-speech conversions locally in their browsers. WebGPU is a modern...
General Introduction OpenHealthForAll is an open source project designed to help users manage and understand their personal health data. By leveraging artificial intelligence technology, OpenHealthForAll provides a locally run health assistant to help users better manage and analyze their health information. The project supports...
General Introduction OpenPilot is an open source autonomous driving system developed by comma.ai to enhance the driving experience and safety of existing vehicles with advanced driver assistance features. Since its first release in 2016, OpenPilot has supported over 275 vehicle models and is constantly updating and optimizing its functionality....
General Introduction Agentic Security is an open source LLM (Large Language Model) vulnerability scanning tool designed to provide developers and security professionals with comprehensive fuzz testing and attack techniques. The tool supports customized rulesets or agent-based attacks, is able to integrate LLM APIs for stress testing, and provides wide...
General Introduction CogVLM2 is an open source multimodal model developed by the Tsinghua University Data Mining Research Group (THUDM), based on the Llama3-8B architecture, and designed to provide performance comparable to or even better than GPT-4V. The model supports image understanding, multi-turn dialogue, and video understanding, and is capable of handling content up to 8K long...
General Introduction VisoMaster is a powerful and easy-to-use video face-swapping and editing tool that utilizes artificial intelligence technology to achieve natural and realistic face-swapping effects. Whether it's an image or a video, VisoMaster generates high-quality face swap results with simple operations, suitable for both general users and professionals....
Comprehensive Introduction LLM-RAG-Longevity-Coach is a chatbot based on Large Language Model (LLM) and Retrieval Augmented Generation (RAG) technologies, designed to provide users with personalized health and longevity advice. Developed by Tyler Burleigh, the project utilizes Streamlit to build the user interface,...
Comprehensive Introduction Maestro is a tool developed by Roboflow to simplify and accelerate the process of fine-tuning multimodal models, so that everyone can train their own large vision models. It provides ready-made recipes for fine-tuning popular vision-language models (VLMs) such as Florence-2, PaliGemma ...
Comprehensive Introduction One-Prompt-One-Story (1Prompt1Story) is an innovative text-to-image generation tool designed to enable consistent image generation from a single prompt. The project, presented by Tao Liu et al. at ICLR 2025, employs a training-free approach that is able to maintain character identity while...
Comprehensive Introduction The Upstash RAG Chat Component is a React component designed for Next.js applications to provide an AI chat interface based on RAG (Retrieval Augmented Generation) technology. The component combines Upstash Vector for similarity search, Together AI for large language modeling (LL...
AudioNotes is a system, based on FunASR and Qwen2, that converts audio and video into structured notes. It can quickly extract audio/video content and call a large model to organize it into structured Markdown notes, making it convenient for users to read and find information quickly. The system supports multiple ...
Comprehensive Introduction Bilingual Book Maker is an open source project designed to help users create multilingual versions of eBooks using AI technology. The tool mainly uses ChatGPT for translation and supports a variety of file formats, including epub, txt and srt. Bilingual Book Maker is designed for translating eBooks that have entered...
Comprehensive Introduction Rowfill is an open source document processing platform designed for knowledge workers. It utilizes advanced AI technologies to extract, analyze and process data from complex documents, images and PDFs. Rowfill supports native Large Language Models (LLM) and OpenAI Visual Models to ensure that data is hidden...
Comprehensive Introduction PRAG (Parametric Retrieval-Augmented Generation) is an innovative retrieval-augmented generation tool that aims to improve generation quality by embedding external knowledge directly into the parameter space of a Large Language Model (LLM). The tool overcomes the traditional contextual retrieval-augmented generation method of ...
Comprehensive Introduction GPT Researcher is an autonomous agent tool based on the Large Language Model (LLM) designed to perform local and web research and generate detailed research reports. The tool provides stable performance and faster speed by parallelizing agent work, ensuring accurate and unbiased information. GP...
Comprehensive Introduction Linly-Talker is an innovative digital human dialog system that combines Large Language Models (LLMs) with visual models to create a novel approach to human-computer interaction. The system integrates multiple technologies such as Whisper, Linly, Microsoft Speech Services and SadTalker ...
General Introduction Airweave is an open source tool designed to make any application searchable by synchronizing a user's application data, APIs, databases, and websites to graph and vector databases. Airweave simplifies the process of making data searchable, whether it is structured or unstructured,...