General Introduction DiffBIR (Blind Image Restoration with Generative Diffusion Prior) is an image restoration tool developed by XPixelGroup, designed to generate...
General Introduction TankWork is an open source desktop agent framework designed to enable AI to perceive and control your computer through computer vision and system-level interaction. The framework allows agents to directly control computers through voice and text commands, process real-time screen content, and provide continuous audio visual...
General Description AI Auto Free is a powerful automation tool designed to help users make unlimited use of AI-driven Integrated Development Environments (IDEs) such as Cursor and Windsurf. The program offers cross-platform support and includes multiple language capabilities...
Quantum Swarm is an open source artificial intelligence framework focused on developing and researching AI population intelligence. The project is maintained by the Quarm AI team on GitHub and aims to provide a flexible and efficient platform for building and testing multi-intelligence systems.Quan...
Comprehensive Introduction XRAG (eXamining the Core) is a benchmarking framework designed for evaluating the underlying components of advanced retrieval augmentation generation (RAG) systems. By profiling and analyzing each core module, XRAG provides information on how different configurations and components affect RAG...
Comprehensive Introduction WenYan is a tool designed for Markdown article typesetting and beautification, supporting the conversion of edited Markdown articles into a format suitable for WeChat, Zhihu, Today's headlines and other platforms. Users can copy the article directly by one click...
Comprehensive Introduction CHRONOS is a news timeline summarization tool developed by Alibaba NLP team. The tool generates timeline summaries of news events through iterative self-questioning.CHRONOS is not only capable of handling open-domain timeline summarization tasks, but also in terms of efficiency and scalability...
General Introduction DeepSeek-R1 WebGPU is a cutting-edge AI inference model provided by webml-community on the Hugging Face Spaces platform, which utilizes WebGPU technology to allow users to directly...
General Introduction Go-with-the-Flow is an open source project developed by the Netflix Eyeline Studios research team to control the motion patterns of video diffusion models by distorting noise. The project allows the user to determine how the camera in the scene and...
Comprehensive Introduction X-Dyna is an open source project developed by ByteDance to generate dynamic portrait animations using zero-sample diffusion techniques. The project utilizes facial expressions and body movements in drive video to animate individual portrait images, generating realistic and context-aware motion effects.X-D...
Comprehensive Introduction Tencent Hunyuan3D (Hunyuan3D 2.0) is an advanced large-scale 3D synthesis system from Tencent designed to generate high-resolution textured 3D assets. The system consists of two core components: Hunyuan3D-DiT, a large-scale shape generation model, and Hunyuan3D-DiT, a large-scale texture...
Comprehensive Introduction RAG Web UI is an intelligent dialog system based on RAG (Retrieval Augmented Generation) technology. It helps organizations and individuals build intelligent Q&A systems based on their own knowledge base. By combining document retrieval and large language modeling, RAG Web UI provides accurate and reliable...
General Introduction UI-TARS Desktop is a graphical interface agent application based on UI-TARS (Visual Language Model) developed by ByteDance. The application allows users to control computers through natural language for more intuitive and efficient human-computer interaction.UI-TAR...
General Introduction Narrify is an innovative platform designed to transform books into concise, engaging audio summaries. With Narrify, users can quickly access key content and insights from books, making it easy to listen to book highlights whether on their commute or in their leisure...
General Introduction Devin Cursor Rules is an open source project designed to enhance the Cursor and Windsurf Integrated Development Environment (IDE) with configuration files and tools to enable advanced AI capabilities similar to Devin. The project provides over ...
General Introduction Repomix (formerly known as Repopack) is an open source tool designed to package an entire codebase into a single, AI-friendly file. This tool allows developers to easily make their codebase available to large language models (such as Claude, Chat...
General Introduction Yek is a fast Rust-based tool for reading text files from repositories or directories, chunking them, and serializing them for use in Large Language Models (LLMs). The tool uses the .gitignore rule by default to skip unwanted files and utilizes...
Comprehensive Introduction Kheish is an open source multi-role agent designed for Large Language Model (LLM) tasks that require structured, step-by-step collaboration.Kheish is more than just a simple coordinator, it is an intelligent agent in its own right, requesting modules on demand, integrating user-reversal...
General Introduction AI ContentCraft is a versatile content creation tool that integrates text generation, speech synthesis, image generation and more. It helps creators quickly generate stories, podcast scripts, and accompanying audio and video content. The tool supports multiple language conversions and can batch...
Comprehensive Introduction Unigraph is a local-first general-purpose knowledge graph and personal search engine designed to provide users with an integrated workspace to help manage and search for a wide variety of data in their personal lives. With Unigraph, users can integrate data from different sources into a...
General Introduction ComfyUI-disty-Flow is a custom node that provides a user-friendly interface to ComfyUI. It is intended to simplify the running of workflows by providing an alternative user interface to the creation of workflows.ComfyUI-disty...
General Introduction Shortest is an AI-powered natural language end-to-end testing framework developed by the Anti-Work team. It is built on Playwright and supports GitHub integration and two-factor authentication (2FA).Shortest's main features are...
General Introduction Midscene.js is an AI-powered browser automation tool that controls web pages, performs assertions and extracts data through natural language commands. It supports Chrome extensions, JavaScript SDKs and YAML scripts, simplifying UI measurement...
General Introduction ReadKidz is an innovative platform that uses artificial intelligence technology to help users create personalized children's storybooks and animations. Whether you're a parent, teacher or aspiring author creating children's books, ReadKidz makes it easy to generate high-quality story content...
Comprehensive Introduction Video Analyzer (Video Analyzer) is a comprehensive video analysis tool that combines computer vision, audio transcription and natural language processing techniques to generate detailed video content descriptions. The tool transcribes audio content by extracting key frames in the video...
Comprehensive Introduction Trae is a free AI programming tool from ByteDance, designed as an integrated development environment (IDE) for Chinese developers. It helps developers quickly generate, optimize, and debug code by leveraging advanced AI models such as Claude 3.5 and GPT-4o.T...
Comprehensive Introduction Unsloth is an open source project designed to provide efficient tools for fine-tuning and training large language models (LLMs). The project supports a variety of well-known models, including Llama, Mistral, Phi, and Gemma.Unsloth's...
Comprehensive Introduction LlamaParse is a powerful document parsing tool that can process complex documents such as PDF, PowerPoint, Word documents and spreadsheets and convert them into structured data.LlamaParse offers a variety of ways to use...
Comprehensive Introduction JENOVA is a leading global AI platform designed to provide users with powerful AI integration services. By integrating state-of-the-art AI models (e.g. GPT-4o, Claude 3.5, Gemini 2), JENOVA is able to tailor users' needs to...
General Introduction Traycer is an AI programming assistant for developers designed to significantly improve the efficiency and quality of software development by analyzing context-sensitive code and reviewing it in real-time. It is integrated into Visual Studio Code and is able to automatically plan tasks...
Comprehensive Introduction MaxKB (Max Knowledge Base) is an open source knowledge base Q&A system based on large language modeling and RAG (Retrieval Augmented Generation). The system is widely used in intelligent customer service, enterprise internal knowledge base, academic research and education and other scenarios.MaxKB...
Comprehensive Introduction UnDatas.IO is a platform focused on parsing and processing unstructured data. It utilizes advanced technology to automatically recognize document layouts and categorize tables, images, formulas and text, greatly simplifying the data processing process. The platform not only saves a lot of time in organizing data...
General Introduction NoteGen is a cross-end AI note-taking app focused on recording and writing, based on Tauri. It supports multiple platforms such as Mac, Windows, Linux, and will support iOS and Android in the future. not...
Comprehensive Introduction OmniThink is an innovative machine writing framework designed to generate high-quality, long-form essays by mimicking the iterative expansion and reflection of human cognitive processes. The framework focuses on extending the boundaries of knowledge and generating information that is rich and deep.OmniThink does this by constructing...
General Introduction OpenAI Realtime Agents is an open source project that aims to show how OpenAI's realtime API can be utilized to build multi-intelligent body speech applications. It provides a high-level intelligent body model (borrowed from OpenAI Swarm) that allows...
General Description Klap is an AI-based video editing tool designed for content creators to transform long videos into short videos suitable for social media platforms such as TikTok, Instagram Reels and YouTube Shorts...
General Introduction DeepFace is a lightweight Python library for facial recognition and facial attribute analysis (including age, gender, emotion and ethnicity). It integrates a variety of advanced facial recognition models such as VGG-Face, FaceNet, OpenFace, De...
Comprehensive Introduction SynthLight is a portrait relighting tool based on a diffusion model. It learns to re-render synthetic face images to achieve lighting effect adjustments to real portrait photos. The tool uses a physical rendering engine to generate datasets that simulate lighting transformations under different lighting conditions...
General Introduction 1-2-1-MNVTON is a GitHub-based open source project that aims to provide "Modality-specific Normalization for Virtual Try-On" (MNVTON) technology through...
General Introduction Kokoro-ONNX is an open source text-to-speech (TTS) tool based on ONNX runtime. Developed by thewh1teagle, the project aims to provide efficient and fast speech synthesis solutions.Kokoro-ONNX supports ...
Comprehensive introduction Zerox is an open source project designed to convert PDF, DOCX, images and other documents to Markdown format through visual modeling. The project is developed by getomni-ai team , provides a simple and efficient OCR (Optical Character Recognition) solution.Ze...
Comprehensive Introduction AIVLOG is an AI video editing tool designed for Vlog creators. It can automatically analyze video content and intelligently edit out the highlights, saving users 95% editing time. Whether it's daily life, travel records or conversation videos, AIVLOG can easily...
General Description Charla is an endpoint-based chat application designed to have conversations with native language models. The application integrates with the Ollama backend, supports context-aware conversations, and saves chat sessions as Markdown files. Users can simply...
Comprehensive Introduction MiniRAG is an extremely simple Retrieval Augmented Generation (RAG) framework that aims to enable good RAG performance even for small models through heterogeneous graph indexing and lightweight topology-enhanced retrieval. It is developed by the Data Science Laboratory of the University of Hong Kong (HKUDS) to address ...
Comprehensive Introduction Omni-RGPT is a multimodal large language model designed to enable region-level understanding of images and videos. By introducing the Token Mark technique, Omni-RGPT is able to highlight target regions in the visual feature space with region cues (e.g., boxes or...
Comprehensive Introduction Bailing (Bailing) is an open source voice conversation assistant designed to engage in natural conversations with users through speech. The project combines speech recognition (ASR), voice activity detection (VAD), large language modeling (LLM) and speech synthesis (TTS) technologies to achieve...
Comprehensive Introduction Metaverse AI (open source version) is a project hosted on GitHub, developed by libn-net team. It can clone digital human images and voices through AI technology to generate short videos, and also supports dubbing and subtitling. This tool provides Windo...
General Introduction WikiChat is an experimental chatbot developed at Stanford University that aims to improve the factoring of large language models by retrieving data from Wikipedia. Large language models (such as ChatGPT and GPT-4) tend to process up-to-date information or less popular topics when...
General Introduction Entretien AI is an online platform focused on helping job seekers improve their interviewing skills. It utilizes artificial intelligence technology to simulate real interview scenarios, providing instant feedback and expert guidance. Users can use this platform for targeted practice to optimize their answering strategies and communication...
General Introduction UGC Generator is a platform that utilizes artificial intelligence technology to quickly generate user-generated content (UGC) video ads. Users can generate high-quality UGC-style video ads in minutes by simply uploading product links. The platform offers a clean interface and strong...
General Introduction OpenAI Edge TTS is an open source project that provides an OpenAI-compatible native text-to-speech (TTS) API.The project uses Microsoft Edge's online text-to-speech service to allow users to generate high-quality...
General Description Charts Not Chapters is an AI-based tool focused on converting text and data into compelling infographics. It is unique in that it does not rely on templates, but instead generates each chart from scratch through AI, offering a high degree of customizability...
General Introduction Cure AI is an online platform designed for medical researchers to optimize the scientific process through artificial intelligence technology. The platform provides access to over 26 million PubMed scientific articles and ranks evidence based on the relevance and quality of user queries.C...
General Introduction AIEvo is Ant Group's open source multi-agent framework designed to efficiently create multi-agent applications. The framework strictly follows the SOP task graph to improve the execution success rate of complex tasks , and through feedback and monitoring mechanisms to ensure high flexibility and scalability.AIEvo has been produced within Ant Group ...
Comprehensive Introduction Allwyse is an intelligent platform designed for advisor practices to help advisors optimize client management and scheduling by integrating multiple tools and features. The platform offers automated scheduling, client data management, AI assistants, real-time analytics, and more to help advisors improve...
General Introduction Bakery is a platform designed for AI startups, machine learning engineers and researchers to provide simple and efficient AI model fine-tuning and monetization services. Users can access community-driven datasets through Bakery, create or upload their own datasets, fine-tune models...
Comprehensive Introduction Ragie.ai is a fully managed RAG (Retrieval-Augmented Generation) service platform designed for developers. With Ragie.ai, developers can easily connect applications with user data...
General Introduction PPTAgent is an innovative system designed to automatically generate presentations from documents. The system draws on the human approach to creating presentations, using a two-step process to ensure content quality and visualization. In addition, PPTAgent introduces PPTEval, a comprehensive...
General Introduction FlowiseAI is an open source, low-code tool designed to help developers build custom LLM (Large Language Model) applications and AI agents. With a simple drag-and-drop interface, users can quickly create and iterate on LLM applications, making the process from testing to production more efficient...
Comprehensive Introduction Big Model Detection is an AI-generated content detection tool developed by Tencent's hybrid security team, Jubilee Labs. The tool can quickly identify text and images generated by AI and help users distinguish between manually created and AI-generated content. By capturing the differences between AI-generated content and real content...
Comprehensive Introduction Orange AI is an intelligent creation tool launched by Baidu, designed to help users quickly generate documents, PPTs, charts and other content. The tool integrates a variety of AI features, including intelligent generation, academic search, correction and touch-up, etc., which greatly improves the efficiency and quality of document creation. Orange AI is not ...
Comprehensive Introduction ClipTurbo is an AI-powered short video generation tool designed to help users easily create high-quality marketing videos. By utilizing AI technology, ClipTurbo can automatically process copy, translation, icon matching and TTS voice synthesis using m...
Comprehensive Introduction SemHash is a lightweight and flexible tool for de-duplicating datasets by semantic similarity. It combines the fast embedding generation of Model2Vec with the efficient ANN (approximate nearest neighbor) similarity search of Vicinity.SemHa...
General Introduction socra is a collaborative intelligence platform designed to help users build knowledge, solve challenges, and realize ambitions through the collaboration of humans and AI. The platform provides a wealth of resources and tools to support users in innovation and research across multiple domains. socra is not only a knowledge-sharing...
Comprehensive Introduction Narrative BI is a platform focused on automated data analytics that utilizes artificial intelligence technology to provide users with natural language-generated business insights. Its core product, AI Data Analyzer, automatically extracts meaningful conclusions from data without requiring users to have sophisticated technical...
General Description Project Ambience is a neuroscience-based online platform designed to improve user focus and productivity by creating customized ambient sound spaces. The site offers a variety of audio options, including natural sounds, white noise, and other soothing sounds to help users in...
General Introduction Jellypod is a powerful AI podcast studio designed to help users easily create, edit and publish high-quality AI podcasts. With Jellypod, users can design personalized podcast hosts, refine scripts, and publish podcasts to ...
General Description Sonauto is an artificial intelligence-based music creation platform that allows users to generate complete musical compositions by simply typing cues, lyrics or melodies. The platform is known for its high-quality music production model and easy-to-use interface for beginners to professional music...
General Introduction Wegic AI is a revolutionary AI website design and development tool that allows users to easily create, modify and manage websites through a natural language dialog interface. The tool uses the latest GPT-4o model to simplify the website building process and does not require users to have any programming skills...
Comprehensive Introduction vLLM is a high-throughput and memory-efficient reasoning and service engine designed for Large Language Modeling (LLM). Originally developed by the Sky Computing Lab at the University of California, Berkeley, it has become an academic and industry-driven...
Comprehensive Introduction Cognita is an open source framework developed by TrueFoundry to simplify the development of RAG (Retrieval-Augmented Generation) based applications. The framework provides a structured, mod...
Comprehensive Introduction BotSharp is an open source project based on .NET Core dedicated to providing a comprehensive AI chatbot platform building tool. It uses C# programming, supports cross-platform operation, and aims to simplify the application of machine learning algorithms, enabling enterprise-level developers to efficiently ...
General Introduction Weebo is an open source real-time voice chatbot that utilizes Whisper Small for speech recognition, Llama 3.2 for natural language generation, and Kokoro-82M for speech synthesis. The project was developed by Aman...
General Introduction Hyper3D (Shadow Eyes Technology) is a technology company focusing on 3D modeling and asset generation with the launch of two tools, Rodin (updated Rodin 1.5) and ChatAvatar.Rodin uses AI technology to generate high quality from images or text...
Comprehensive Introduction OmAgent is a multimodal intelligent body framework developed by Om AI Lab, aiming to provide powerful AI-powered features for smart devices. By integrating state-of-the-art multimodal base models and intelligent body algorithms, the project enables developers to create efficient smart devices on a variety of...
Comprehensive Introduction RAIN (Real-time Animation Of Infinite Video Stream) is an open source project designed to achieve real-time generation of animation effects for infinite video streams. The project was developed by Pscgylotti, ti...
Comprehensive Introduction The AI Agent Service Toolkit is a complete toolset built on LangGraph, FastAPI, and Streamlit, designed to help developers quickly build and run AI agent services. The toolkit provides a ...
General Introduction SyncStudy is an innovative AI-driven learning tool designed to improve learning efficiency by instantly generating quizzes. Users can upload learning materials, and the system will automatically analyze and generate personalized quizzes to help users better master their knowledge.SyncStudy ...
General Introduction Parseur is a leading AI data extraction software designed to help users automatically extract text data from PDFs, emails and other documents. With Parseur, users can easily convert unstructured data into structured data and send it to various applications...
General Description ResumeBoostAI is an AI-based online resume builder designed to help job seekers create professional resumes quickly. The site offers a wide range of free resume templates and uses AI technology to generate resume content and optimize resumes to pass ATS (Application Tracking System...
General Introduction Memora is an agent designed to replicate human memories for each personalized AI. It helps AIs remember details of past interactions, emotions, and shared experiences just like humans do through features like timestamped memories, emotion markers, and multimodal memories.Memora supports multi-tenancy and is capable of handling...
General Introduction Tough Tongue AI is an artificial intelligence platform designed for practicing tough conversations. Users can simulate a variety of complex conversational situations, such as job interviews, salary negotiations, sales presentations, etc. by selecting preset scenarios or creating custom scenarios. The platform provides video and...
General Introduction ForgeCAD is a 3D design and manufacturing platform that utilizes artificial intelligence technology and aims to simplify and accelerate CAD workflows with AI-powered design tools. Users can generate detailed 3D models in seconds with simple image and text prompts.ForgeC...
Comprehensive Introduction Athina AI is a collaborative AI development platform designed to help teams rapidly build, test, and monitor AI features. The platform provides a rich set of tools and features including dataset evaluation, prompt management, data labeling, and experiment management.Athina AI supports technical and...
Comprehensive Introduction Weco AI Functions is a powerful platform designed to help users rapidly build and deploy AI functions. By simply describing tasks, users can generate structured output patterns with A/B testing and observational monitoring. The platform supports no-code prototyping...
General Introduction Stagehand is an AI web browsing framework focused on simplicity and extensibility. It is fully compatible with Playwright and provides three simple AI APIs (act, extract, and observe) that are built on the base...
General Introduction Micro-Agent is an open source AI coding assistant developed by Builder.io, designed to provide developers with the ability to automatically generate and test code. It generates test cases by understanding natural language descriptions and iterates the code until all tests pass, thus reducing open...
General Introduction sherpa-onnx is an open source project developed by the Next-gen Kaldi team to provide efficient offline speech recognition and speech synthesis solutions. It supports multiple platforms including Android, iOS, Raspber...
General Introduction Zep is a platform designed to provide long-lasting memory solutions for AI applications.Zep helps AI assistants continuously learn and memorize user interactions to build the user's knowledge graph.Zep supports multiple programming languages and frameworks, including Python, TypeScrip...
General Introduction Sketch-to-3D is an AI tool on Hugging Face Spaces developed by Linoy Tsaban that specializes in converting hand-drawn sketches into high-quality 3D models. It utilizes TRELLIS and SDXL technology...
General Introduction Dreamface is a powerful AI tool designed to help users easily create high-quality videos and images. With simple operations, users can generate personalized animated avatar videos, repair old photos, remove photo backgrounds, and more. The site offers a wide range of AI-driven featur...
General Introduction Eko is a production-grade JavaScript framework designed to build efficient intelligent agent workflows through natural language descriptions. It is designed to enable developers to automate everyday tasks using AI technologies without deep programming.Eko provides a uni...
General Introduction Agent Inbox is an open source project developed by the LangChain team to provide a new user experience for interacting with AI intelligences. The project allows users to manage and optimize interactions with multiple AI intelligences through a centralized interface.Ag...
General Introduction Social Media Agent (Social Media Agent) is an open source project that manages interaction information by the new Agent Inbox, designed to help users automate the generation and management of social media content. The project is developed by the LangChain team...
General Introduction Executive AI Assistant (EAIA) is an AI-based assistant tool designed to help users automate and manage their daily tasks. Developed by LangChain, the tool is capable of handling emails, scheduling, managing tasks and other...
Comprehensive Introduction MangaNinjia is an open source project developed by Alibaba Tongyi Visual Intelligence Lab (Ali-Vilab), focusing on the automated processing of line coloring. This tool achieves accurate color matching of reference images through deep learning techniques, greatly improving...
General Introduction Audiblez is an open source project designed to convert eBooks (e.g. .epub format) into audiobooks (e.g. .m4b format). The project utilizes Kokoro's high-quality speech synthesis technology to support multiple languages and multiple voices. Users can simply...
General Introduction Dessix.io is an all-in-one note-taking tool with integrated AI collaboration features designed to help users capture inspiration, organize their thoughts and create efficiently. With Dessix, users can easily collect web content or text snippets, utilize AI to automatically generate summaries and keywords, and simplify letter...
General Introduction Kats is an open source toolkit developed by a team of researchers at Meta (formerly Facebook) designed for time series analysis.Kats provides a lightweight, easy-to-use framework that covers everything from basic statistical analysis to sophisticated predictive modeling, anomaly detection, and special...