General Introduction n8n-mcp-server is an open source project hosted on GitHub and developed by Leonard Sellem. It is an MCP (Model Context Protocol) service tool specialized...
General Introduction InvSR is an innovative open-source image super-resolution project based on diffusion inversion that converts low-resolution images into high-quality, high-resolution images. The project draws on the rich image priors embedded in pre-trained large-scale diffusion models to support, through a flexible sampling mechanism, the...
General Description AI Auto Free is a powerful automation tool designed to help users make unlimited use of AI-driven Integrated Development Environments (IDEs) such as Cursor and Windsurf. The program offers cross-platform support and includes multiple language capabilities...
Comprehensive Introduction WeChat Markdown Editor is a minimalist formatting tool for WeChat articles, designed to help users easily create attractive WeChat posts. The editor supports all basic Markdown ...
General Introduction MediaCrawler is a social media content crawler tool designed for developers. By providing a powerful crawler function, it can quickly grab videos, images, comments, likes, reposts and other data from social platforms such as Xiaohongshu, Douyin, Kuaishou, Bilibili, Weibo and other...
General Introduction TreeGPT is an open source chat application developed based on Next.js, focusing on visualizing conversations with large language models (LLMs, e.g., GPTs) through tree graph structures (directed acyclic graphs, DAGs), replacing the traditional linear chatting approach to improve the speed and...
Comprehensive Introduction WeChatAI is a Python-based intelligent assistant for WeChat group and personal chats. It supports a variety of large language models (such as DeepSeek, Gemini, Tongyi Qianwen) and provides intelligent conversations, auto-replies and other functions. The project uses modern ...
Comprehensive Introduction Refly is a free canvas-based AI native authoring engine designed to help users turn ideas into high-quality content through multi-threaded conversations, knowledge base integration, contextual memory and intelligent search technology. The platform covers over 20 professional scenario templates, including learning...
General Description BlinkShot is an open source, real-time AI image generator that utilizes Together AI and Flux Schnell technology to allow users to generate high-quality images as they enter prompts. The platform is completely free and supports user customization and secondary open...
Comprehensive Introduction Bailing is an open source voice conversation assistant designed to engage in natural conversations with users through speech. The project combines automatic speech recognition (ASR), voice activity detection (VAD), a large language model (LLM) and text-to-speech synthesis (TTS) to achieve...
General Introduction multi-model-bolt.new is a modified version of Bolt.new that allows the use of TogetherAI models, supporting features such as deployment, responsive mobile layout and voice input. Users can enter prompts directly in the browser, run...
General Introduction Gemini Teacher is an English speaking practice assistant based on Google Gemini AI. It recognizes the user's English pronunciation in real time and provides instant feedback and correction suggestions. The tool is designed to help users improve their English speaking skills through...
General Introduction DUIX (Dialogue User Interface System) is an AI-powered digital human interaction platform created by Silicon Intelligence. With open source digital human interaction capabilities, developers can easily integrate large-scale models, automatic speech recognition (ASR...
Comprehensive Introduction Chonkie is a lightweight and efficient RAG (Retrieval-Augmented Generation) text chunking library designed to help developers quickly and easily chunk text. The library supports a variety of chunking methods, including ...
Comprehensive Introduction AI-reads-books-page-by-page is a Python-based intelligent PDF book analysis tool. It automates page-by-page analysis of PDF books, extracts the key knowledge points, and, after each specified page interval, generates stage...
General Introduction Emigo is an open source AI programming assistant designed for Emacs, developed by MatthewZMD on GitHub. It helps programmers complete code analysis in Emacs by integrating a large-scale language model (LLM)...
General Introduction Voice-Pro is a multifunctional tool based on Gradio WebUI that supports speech-to-text, text-to-speech, real-time translation, YouTube video downloads and vocal separation. It integrates Whisper, Faster-Wh...
Comprehensive Introduction Tencent Hunyuan3D (Hunyuan3D 2.0) is an advanced large-scale 3D synthesis system from Tencent designed to generate high-resolution textured 3D assets. The system consists of two core components: Hunyuan3D-DiT, a large-scale shape generation model, and Hunyuan3D-Paint, a large-scale texture...
General Introduction AI RSS is an innovative tool that converts web content into RSS feeds through AI technology. It consists of two main parts: a browser plugin and a server side. The browser plugin allows users to select lists from web pages and generate structured data description (SDD) files...
General Introduction TinyZero is a veRL-based reinforcement learning model designed to replicate the performance of DeepSeek R1 Zero in countdown and multiplication tasks. Surprisingly, the project costs only $30 to run (using 2xH2...
Comprehensive Introduction Midjourney Proxy is an open source project designed to provide proxy services for Midjourney's Discord channel to convert AI drawing functions into API form. The project is completely free and open source, supports one-click face swap, image blending, graph generation ...
Comprehensive Introduction MiniPerplx (renamed Scira) is a minimalist AI-driven search engine that integrates a variety of useful features to provide users with a full range of information retrieval services. The project uses a modern technology stack including Next.js, Tailwi...
General Introduction MCP Containers is an open source project, hosted on GitHub, focused on providing containerized solutions for Model Context Protocol (MCP) servers. It simplifies through Docker containers...
General Introduction MTEB (Massive Text Embedding Benchmark) is an open source project developed by the embeddings-benchmark team and hosted on GitHub, aiming to model text embedding...
Comprehensive Introduction FinRobot is an open source AI agent platform developed by AI4Finance Foundation and designed for financial analytics. It not only covers traditional language models, but also incorporates a variety of AI technologies, aiming to provide a comprehensive solution for the financial industry. F...
General Introduction Spark-TTS is an open source Text-to-Speech (TTS) tool developed by the SparkAudio team, hosted on GitHub, designed to help users efficiently convert text into natural and fluent speech...
General Introduction YOLOE is an open source project developed by the Multimedia Intelligence Group (THU-MIG) at Tsinghua University School of Software, with the full name "You Only Look Once Eye". It is built on the PyTorch framework and extends the YOLO series ...
Comprehensive Introduction NarratoAI is a fully automated tool that integrates movie and TV narration, automated editing, dubbing and subtitle generation. It relies on large language model (LLM) technology to automatically generate copy and automatically edit videos with corresponding voiceovers and subtitles, providing users with a one-stop...
General Introduction Open Codex is an open source command line AI tool designed for developers to convert natural language instructions into precise shell commands. It uses a local language model (e.g. phi-4-mini), requires no network connection or API keys, and all operations in...
Comprehensive Introduction Open Deep Research is a web-based research assistant capable of generating comprehensive research reports on any topic. The system uses a plan-and-execute workflow that allows the user to plan and review the report structure before moving on to the time-consuming research phase...
Comprehensive Introduction Tencent Hunyuan Text-to-Video (available in the Yuanbao app) is an AI-based video generation platform launched by Tencent. The platform uses the Tencent Hunyuan large model, with its strong cross-domain knowledge and natural language understanding, to generate high-quality videos based on users' text descriptions...
General Introduction Zola is a free and open source AI chat application developed by developer Julien Thibeaut (GitHub username ibelick) and hosted on GitHub. Its best feature is that it supports multiple AI modes...
General Introduction serverless-qrcode-hub is an open source tool designed to solve the problem of QR codes for WeChat group chats frequently expiring. It is based on Cloudflare Workers and the D1 database, without the need for traditional servers to run ...
Comprehensive Introduction Fish Agent, a project derived from Fish Speech, is a revolutionary end-to-end AI voice cloning system developed based on the V0.1 3B model architecture. As a fully end-to-end voice cloning system, its most important feature is the use of innovative speechless...
General Introduction Anubis is an open source tool developed by the TecharoHQ team to protect websites from AI crawlers. It adds a SHA256 Proof-of-Work challenge to the HTTP request...
Comprehensive Introduction SemHash is a lightweight and flexible tool for de-duplicating datasets by semantic similarity. It combines the fast embedding generation of Model2Vec with the efficient ANN (approximate nearest neighbor) similarity search of Vicinity. SemHa...
Comprehensive Introduction MoneyPrinterTurbo is an open source project that uses large AI models to generate high-definition short videos with one click. Users only need to provide a video theme or keywords, and the system will automatically generate video copy, video clips, video subtitles and...
General Introduction CoAI.Dev (formerly Chat Nio) is a chat platform that integrates multiple AI models and supports distributed streaming, image generation, cross-device conversation synchronization and sharing. It implements a subscription and token billing system, an API key relay service and multi...
General Introduction LangGraph CodeAct is a framework open-sourced on GitHub by the LangChain AI team, based on the CodeAct architecture (see paper arXiv:2402.01030 for details). It does this by generating...
Comprehensive Introduction WeClone is an open source project that uses WeChat chat logs and voice messages, combined with large language models and speech synthesis technology, to allow users to create personalized digital doppelgangers. The project can train a model on the user's chat habits, and can also use a small number of voice samples to generate realistic sound...
Comprehensive Introduction Muyan-TTS is an open source text-to-speech (TTS) model designed for podcasting scenarios. It is pre-trained with over 100,000 hours of podcast audio data and supports zero-shot speech synthesis to generate high-quality natural speech. The model is based on Llama-3.2-3...
Comprehensive Introduction promptfoo is an open source command line tool and library dedicated to evaluating and red-teaming Large Language Model (LLM) applications. It provides developers with a complete set of tools for building reliable prompts, models, and retrieval-augmented generation (RAG) pipelines with self...
Second Me is an open source project developed by the Mindverse team that allows you to create an AI on your computer that acts like a "digital doppelganger", learning your speech patterns and habits through your words and memories, and becoming a smart person who understands your...
General Introduction Kokoro 82M is an efficient speech synthesis model hosted on Hugging Face, designed to generate high quality speech with fewer parameters and data. The model has 82 million parameters and is licensed under Apache 2.0...
General Introduction Kotaemon is an open source document Q&A tool designed to provide end-users and developers with Q&A functionality based on Retrieval Augmented Generation (RAG). The project is developed by Cinnamon and supports a variety of LLM API providers (e.g. OpenA...
General Introduction codemcp is an open source tool designed for Claude Desktop users, developed by Edward Z. Yang on GitHub. It makes Claude Desktop a useful...
General Introduction YTSage is a modern YouTube downloader with a clean PyQt6 interface. Users can use YTSage to download videos of any quality, extract audio, get subtitles (including auto-generated subtitles), and view the video's meta...
General Introduction Activepieces is an open source, all-in-one automation workflow platform focused on providing intuitive and powerful automation solutions for businesses and individual users. Developed in TypeScript, the platform is extremely scalable and supports more than 200 integrated services...
General Introduction DiffSynth-Engine is an open source project launched by ModelScope, hosted on GitHub. It is based on diffusion model technology, focusing on efficiently generating images and videos, suitable for developers to deploy AI models in production environments ...
General Introduction ChatFree is an open source project that aims to free users' AI apps from the constraints of browsers to run locally. Created using GPT API, Copilot is designed to support a wide range of office software such as Office, Word, WPS, and more. The project was developed by ...
General Introduction Trackers is an open source Python tool library focused on multi-object tracking in video. It integrates several leading tracking algorithms, such as SORT and DeepSORT, and allows users to combine different object detection models (such as YOLO...
Comprehensive Introduction DeOldify is an open source project based on deep learning technology, specifically designed for intelligent colorization and restoration of black and white photos and videos. The project uses an innovative NoGAN training method to successfully solve the common defects of traditional GAN networks in the image coloring process...
Comprehensive Introduction Qwen2.5-Omni is an open source multimodal AI model developed by Alibaba Cloud Qwen team. It can process multiple inputs such as text, images, audio and video, and generate text or natural speech responses in real time. The model was released in 2025 on 3 ...
General Description CrisperWhisper is an advanced speech recognition tool based on OpenAI Whisper that focuses on fast, accurate and verbatim speech transcription. It provides accurate word-level timestamps, even around filler words and pauses...
General Introduction FlowDown-App is a lightweight and efficient AI conversation client, developed by a team of developers using Swift and UIKit, aiming to provide users with a fast and smooth intelligent conversation experience. The app is divided into a standard version (FlowDown...
Comprehensive Introduction 99AI is an open source AI web application project that aims to provide an easy-to-deploy, low-threshold integrated AI service platform. The project supports intelligent dialog, multimodal models, an app marketplace, web search, and integrates AI painting, music and video...
General Description AnkiAIUtils is a set of AI-enhanced tools designed for the Anki flashcard learning system. Developed by a medical student, the tool is designed to automatically improve cards that users are struggling with during the learning process through AI technology. It can intelligently provide users with personalized...
Comprehensive Introduction MoneyPrinterPlus is an open source project aimed at generating and mixing all kinds of short videos with one click through AI technology, and automatically publishing them to multiple video platforms, such as Douyin, Kuaishou, Xiaohongshu, and WeChat Channels. The tool supports local and cloud-based voice models, including chat...
Comprehensive Introduction NVIDIA Garak is an open source tool that specializes in detecting vulnerabilities in Large Language Models (LLMs). It checks the model for multiple weaknesses such as hallucinations, data leakage, prompt injection, misinformation generation, harmful content generation, etc. through static, dynamic and adaptive probing...
Comprehensive Introduction Kolors is a large-scale text-to-image generation model developed by the Kuaishou Kolors team, based on latent diffusion. The model is trained on billions of text-image data pairs, and is capable of generating high-quality, semantically accurate images of complex scenes, with support for both Chinese and English input. Kolors in visual quality...
General Introduction Cua (pronounced "koo-ah"), short for Computer-Use Agent, is an open source project. It is designed for Apple Silicon devices and can create and run high-performance macOS ...
General Introduction Screenshot-to-Code is an open source tool that utilizes artificial intelligence to convert screenshots, design drafts, and Figma designs into clean, functional code. The tool supports multiple front-end technology stacks, including HTML, Tailwind CS...
Comprehensive Introduction Baichuan-Audio is an open source project developed by Baichuan Intelligence (baichuan-inc), hosted on GitHub, focusing on end-to-end voice interaction technology. The project provides a complete audio processing framework that enables speech ...
DreamTalk Comprehensive Introduction DreamTalk is a diffusion model-driven expressive talking head generation framework, jointly developed by Tsinghua University, Alibaba Group and Huazhong University of Science and Technology. It mainly consists of three parts: a denoising network, a style-aware lip expert and a style predictor, and can be based on...
General Introduction SadTalker is an open source tool that combines a single still portrait photo with an audio file to create realistic talking avatar videos for a variety of scenarios such as personalized messages, educational content, and more. The revolutionary use of 3D modeling technologies such as ExpNet and PoseVA...
Comprehensive Introduction AIstudioProxyAPI is an open source project that uses Node.js and Playwright technology to emulate the OpenAI API by mimicking the Google AI Studio web version of...
LangBot is an instant messaging bot platform built on large models that supports multiple messaging platforms and large models. The platform adapts to QQ, WeChat (enterprise WeChat, personal WeChat), Feishu, Discord, OneBot and other messaging platforms, and supports Open...
Comprehensive Introduction Dify-Plus is an AI application development platform built as a secondary development of the Dify open source project. It adds a new management center on top of Dify and optimizes the functionality for enterprise scenarios. The project was initially intended for internal enterprise use; after the developers found that the community had similar needs, it...
Comprehensive Introduction CFG-Zero-star is an open source project developed by Weichen Fan and the S-Lab team at Nanyang Technological University. It focuses on improving the Classifier-Free Guidance (CFG) technique in flow matching models by optimizing the guidance strategy and zero-initial ...
General Introduction Vercel AI SDK is an open source tool developed by the Vercel team to help developers build AI applications using frameworks such as React, Svelte, Vue and Solid. It supports multiple language model providers...
General Introduction TheoremExplainAgent is an innovative project developed by TIGER AI Lab to transform complex mathematical and scientific theorems into easy-to-understand video animations using artificial intelligence techniques. The tool is based on the Large Language Model (LLM...
With the rapid development of the Internet, download tools play an indispensable role as an important means for users to obtain information and resources. This article will systematically analyze five open source download tools: AB Download Manager, XDM (Xtreme Download ...
Comprehensive Introduction PocketFlow is a lightweight AI application development framework with only 100 lines of code, developed by The-Pocket team and open-sourced on GitHub. It pursues a minimalist design, keeps the core code within 100 lines, and has no external dependencies ...
General Introduction Research Rabbit is a web research and summarization assistant based on a local LLM (Large Language Model). After the user provides a research topic, Research Rabbit generates a search query, obtains relevant web results, and summarizes those results...
Comprehensive Introduction OpenAOE is an open source large model group chat framework that addresses the current lack of chat frameworks in which multiple models respond in parallel. With OpenAOE, users can talk to multiple Large Language Models (LLMs) at the same time and get parallel output. The framework supports ...
Comprehensive Introduction Voice Changer is an open source real-time voice transformation tool that supports a wide range of AI voice models such as MMVC, so-vits-svc, RVC, DDSP-SVC, and Beatrice. The tool is compatible with multiple platforms...
Because Hugging Face cannot be accessed from deployments in mainland China, this guide adapts an existing community deployment scheme so that it can be deployed to Cloudflare Workers. Preparation: 1. Register a Cloudflare account. 2. Register a Hugging Fac...
General Introduction ACE++ is an open source project developed by the ali-vilab team at Alibaba Tongyi Lab. It is based on the FLUX.1-Fill-dev model and aims to achieve image generation and editing through simple textual commands...
General Introduction Oliva is an open source multi-agent assistant tool developed by Deluxer on GitHub. It helps users search for product information in the Qdrant database through the collaboration of multiple AI agents. The main feature is that it supports voice operation...
Comprehensive Introduction MegaTTS3 is an open source speech synthesis tool developed by ByteDance in cooperation with Zhejiang University, focusing on generating high-quality Chinese and English speech. Its core model has only 0.45B parameters, making it lightweight and efficient, and it supports mixed Chinese and English speech generation and voice cloning. The project is hosted on ...
Comprehensive Introduction Vanna is an MIT-licensed open source Python framework focused on generating SQL queries through RAG (Retrieval Augmented Generation) techniques. Users can train RAG models, apply them to their own data, and then ask questions, and Vanna will return the appropriate s...
General Introduction olmOCR is an open source tool developed by the AllenNLP team at the Allen Institute for Artificial Intelligence (AI2) that focuses on converting PDF files...
General Introduction OmniSQL is an open source project developed by the RUCKBReasoning team and hosted on GitHub. Its core function is to transform user-input natural language questions into high-quality SQL query statements to help users easily with the number of ...
Comprehensive Introduction MangaNinjia is an open source project developed by Alibaba Tongyi Visual Intelligence Lab (Ali-Vilab), focusing on the automated colorization of line art. This tool uses deep learning techniques to match colors accurately to a reference image, greatly improving...
General Introduction AI ContentCraft is a versatile content creation tool that integrates text generation, speech synthesis, image generation and more. It helps creators quickly generate stories, podcast scripts, and accompanying audio and video content. The tool supports multiple language conversions and can batch...
General Introduction Memary is an innovative open source project focused on providing long-term memory management solutions for autonomous agents. The project helps agents break through the limitations of traditional context windows to achieve smarter interaction experiences through knowledge graphs and specialized memory modules. Memary adopts...
General Introduction Neural4D is an innovative AI-based platform focused on helping users quickly generate high-quality 3D models and animations with simple text or image input. Developed by DreamTech, it relies on the world's leading end-to-end large-model 3D generation technology...
General Introduction PhotoDoodle is an open source image editing tool, developed by ShowLab, focusing on artistic editing of photos through artificial intelligence technology. Users only need to input simple text prompt words to add cartoon style, 3D effect, light...
General Introduction AI Chatbot Supabase is an open source AI chatbot template built on Next.js and Supabase. Developed by Vercel, the project aims to provide a fully functional and customizable chatbot solution. By ...
Comprehensive Introduction AnyText is a revolutionary multilingual visual text generation and editing tool developed based on the diffusion model. It generates natural, high-quality multilingual text in images and supports flexible text editing features. It was developed by a team of researchers and presented at ICLR 2024...
Comprehensive Introduction No front end; API channels are configured purely through a configuration file. Writing a single file is enough to run your own API station, and the documentation includes a detailed, beginner-friendly configuration guide. uni-api is a project that unifies the management of large model APIs, allowing a unified ...
General Introduction Genesis is a generative physics world designed for general purpose robotics and embodied AI learning. It provides a unified simulation platform that supports the simulation of a wide range of materials and physical phenomena. Genesis aims to unlock generative AI and physics simulation by combining...
General Introduction GraphRAG Visualizer is a web-based tool designed to help users visualize and explore artifacts from Microsoft GraphRAG. By uploading Par...
General Introduction MiMo is an open source large language model project developed by Xiaomi, focusing on mathematical reasoning and code generation. The core product is the MiMo-7B family of models, which contains a base model (Base), a supervised fine-tuning model (SFT), a reinforcement learning model trained from the base model...
General Introduction ChatTTS is a generative speech model designed for conversational scenarios. It generates natural and expressive speech, supports multiple languages and multiple speakers, and is suitable for interactive conversations. The model does this by predicting and controlling fine-grained prosodic features such as laughter, pauses and interjections, sup...
General Introduction LangChain presents Open Canvas, an open source web application designed to enhance the document editing and collaboration experience with built-in dual-agent memory functionality and integrated LangSmith to observe full execution details. The platform is powered by OpenA...
Comprehensive Introduction Easy Dataset is an open source tool designed specifically for fine-tuning large models (LLMs), hosted on GitHub. It provides an easy-to-use interface that allows users to upload files, automatically segment content, generate questions and answers, and ultimately output a suitable...
Comprehensive Introduction insanely-fast-whisper is a combination of OpenAI's Whisper model and various optimization techniques (e.g. Transformers, Optimum, Flash Attention) for audio trans...
Comprehensive Introduction XianyuAutoAgent is an intelligent customer service robot system designed for the Xianyu (Idle Fish) platform, open-sourced by developer shaxiu on GitHub. It uses AI technology to stay on duty automatically around the clock, and helps Xianyu sellers reply...