General Introduction ColiVara is a document storage and retrieval service based on visual embedding technology. It eliminates the need for Optical Character Recognition (OCR) or text extraction and avoids the problem of broken forms or lost images. ColiVara supports over 100 file formats including PDF, DOCX, PPTX, etc. and is able to automatically...
General Description Cursor Reset is a PowerShell scripting tool for resetting the Cursor IDE device identifiers, supporting Cursor version 0.45.x. The tool is designed to help users reset the device identifier in Cursor IDE in order to log in with a new account. The project is mainly used to learn and study Cursor ...
Enable Builder smart programming mode for unlimited use of DeepSeek-R1 and DeepSeek-V3, with a smoother experience than the overseas version. Just enter commands in Chinese, and even a novice programmer can build their own apps with no barrier to entry.
Comprehensive Introduction The n8n Self-Hosted AI Starter Kit is an open source Docker Compose template designed to quickly initialize a comprehensive local AI and low-code development environment. Crafted by the n8n team, the kit combines the self-hosted n8n platform with a range of compatible AI products and components to help users quickly conceptualize...
General Introduction Gemini Teacher is an English speaking practice assistant based on Google Gemini AI. It recognizes the user's English pronunciation in real-time and provides instant feedback and correction suggestions. The tool is designed to help users improve their English speaking skills through AI-driven pronunciation assessment and grammar correction...
Comprehensive Introduction bilive is a tool designed for recording Bilibili live streams, providing extremely fast live recording, auto-slicing, danmaku (bullet comment) rendering and subtitle generation. The tool runs on very low-spec machines, supports unattended 24/7 recording, automatically recognizes and renders danmaku and subtitles, automatically slices and uploads them to B...
Comprehensive Introduction R1-V is an open source project that aims to achieve breakthroughs in vision-language models (VLMs) through low-cost reinforcement learning (RL). The project uses a verifiable reward mechanism to motivate VLMs to learn generalized counting abilities. Notably, R1-V's 2B model is able to learn the counting ability in only 100 training steps...
General Introduction PPTX2MD is an open source tool designed to convert PowerPoint PPTX files to Markdown format. Developed by GitHub user ssine, the tool supports retaining headings, lists, text formatting (such as bold, italic, color, and hyperlinks), images, and tables in a variety of formats. PPTX2MD...
General Introduction The DSPy Example Codebase is a GitHub codebase maintained by the Langtrace AI team that showcases a variety of AI program examples built using DSPy. The codebase is designed to help developers better understand and apply DSPy for AI program development by demonstrating the many features of DSPy through real-world examples. Code ...
Comprehensive Introduction Go-Proxy is a high-performance proxy server developed using the Go language, mainly used to provide proxy services in different network environments. It supports a variety of protocols, including HTTP, HTTPS, SOCKS5, WebSocket, TCP and UDP, to meet a variety of proxy needs. Go-Proxy's design goal ...
CoT-Lab (Collaborative Thinking Laboratory) is an experimental interface for exploring new paradigms in human-computer collaboration. Based on Cognitive Load Theory and Active Learning Principles, CoT-Lab facilitates deep cognitive alignment between humans and Artificial Intelligence (AI) through the creation of "Thinking Partners". The program is designed to slowly output...
General Description DeepSeek Diagrams Extension is a Chrome extension designed to help users render diagrams inline on the DeepSeek website. The extension is based on the Mermaid.js library and is able to convert text descriptions of diagrams directly into rendered diagrams, enhancing the use of...
General Description Orate is an AI toolkit focused on speech generation and transcription. It provides a unified API that seamlessly integrates with leading AI providers such as OpenAI, ElevenLabs, and AssemblyAI to help users create realistic, human-like speech and transcribe audio into text. Ora...
Comprehensive Introduction Reflex LLM Examples is an open source project created by the Reflex development team to demonstrate real-world applications of large language models (LLMs). The project brings together several AI applications built on Reflex, showcasing large language models from providers such as Google, Anthropic, OpenAI...
Comprehensive Introduction DeepClaude is a high-performance Large Language Model (LLM) inference API and chat interface that integrates the Chain-of-Thought (CoT) reasoning capabilities of DeepSeek R1 with the creativity and code generation capabilities of the Anthropic Claude model. This project significantly outperforms OpenAI o1, DeepSeek R1 ...
Comprehensive Introduction BEN2 (Background Erase Network 2) is a deep learning model developed by Prama LLC that specializes in automatically removing the background from an image and generating a foreground image. The model employs an innovative Confidence Guided Matting (CGM) pipeline through a refinement...
General Introduction AI Web Operator is an open source AI browser operator tool designed to simplify the user experience in the browser by integrating multiple AI technologies and SDKs. Built on Browserbase and the Vercel AI SDK, the tool supports a variety of Large Language Models (LLMs) such as...
Comprehensive Introduction Exa & Deepseek Chat App is an open source intelligent chat application whose main features include real-time web searching using Exa's APIs and intelligent reasoning using the Deepseek R1 language model. Developed by Exa Labs, the app aims to provide an efficient,...
Comprehensive Introduction LLM API Engine is an open source project designed to help developers rapidly build and deploy AI-powered APIs. The project leverages large language models (LLMs) and intelligent web crawling technology to allow users to create custom APIs through natural language descriptions. Its key features include automatic data knot generation...
Comprehensive Introduction PengChengStarling (PengCheng Labs) is a multilingual Automatic Speech Recognition (ASR) tool capable of converting speech in different languages into corresponding text. This toolkit is developed based on the icefall project and provides a complete speech recognition process, including data processing, model training,...