Synthesis Gaze-LLE is a gaze target prediction tool based on a large-scale learning encoder. It was developed by Fiona Ryan, Ajay Bati, Sangmin Lee, Daniel Bolya, Judy Hoffman, and J...
General Introduction Search-R1 is an open source project, developed by PeterGriffinJin on GitHub, built on the veRL framework. It trains Large Language Models (LLMs) through Reinforcement Learning (RL) techniques, allowing the models to autonomously learn...
Comprehensive Introduction MangaNinjia is an open source project developed by Alibaba Tongyi Visual Intelligence Lab (Ali-Vilab), focusing on the automated processing of line coloring. This tool achieves accurate color matching of reference images through deep learning techniques, greatly improving...
Comprehensive introduction LangGraph CUA is an open source project developed by the LangChain team. It is based on the LangGraph framework, allowing developers to use Python to build AI intelligences that can directly operate the computer. The core of this tool ...
General Introduction BrowserTools MCP is an open source project developed by the AgentDeskAI team. It allows AI to monitor browser activity in real-time through Chrome extensions and Node.js services, including logs, network requests...
General Introduction ClickClickClick is a framework developed by BandarLabs that aims to automate Android and PC operations by using any local or remote Large Language Model (LLM). The project is currently in a highly experimental phase and supports a variety of models such as...
General Introduction Cloud Document Converter is a Chrome extension designed for converting Flying Book cloud documents to Markdown format. Users can easily download or copy Flying Book cloud documents into Markdo...
General Introduction openapi-mcp-server is an open source tool designed to transform OpenAPI v3.1 compliant APIs into AI usable resources. It is maintained by janwilmake and is based on Model Contex...
General Introduction Coding Agent is an intelligent programming assistant developed by AbhinavTheDev, designed to help developers improve their programming efficiency. The tool utilizes artificial intelligence technology to automatically generate code, provide programming suggestions, and assist developers with various coding...
General Introduction One Hub is an OpenAI interface management and distribution system based on the secondary development of the One API. The project was developed by MartialBE to provide broader model support and improved statistical capabilities.One Hub has...
General Introduction Melty is a revolutionary AI code editor that combines chat conversations with Git version control. Developed by Charlie and Jackson from Replicate, this tool aims to solve the pain points of traditional AI coding tools. Its biggest ...
Comprehensive Introduction MegaTTS3 is an open source speech synthesis tool developed by ByteDance in cooperation with Zhejiang University, focusing on generating high-quality Chinese and English speech. Its core model is only 0.45B parameters , lightweight and efficient , support for mixed Chinese and English speech generation and speech cloning . The project is hosted on ...
Comprehensive Introduction Easy Dataset is an open source tool designed specifically for fine-tuning large models (LLMs), hosted on GitHub. It provides an easy-to-use interface that allows users to upload files, automatically segment content, generate questions and answers, and ultimately output a suitable...
General Introduction Magic MCP is an AI-driven tool developed by the 21st.dev team and designed for front-end developers. It generates modern UI components on-the-fly from natural language descriptions, integrating with Cursor, WindSurf and ...
General Introduction Agno is an open source Python library developed by the agno-agi team and hosted on GitHub, dedicated to making it easy for developers to build AI intelligences with memory, knowledge, and tools. It supports multimodal text, image, audio, and video...
General Introduction Research Rabbit is a native LLM (Large Language Model) based web research and summarization assistant. After the user provides a research topic, Research Rabbit generates a search query, obtains relevant web results, and summarizes those results...
General Introduction AiPy is an open source Python command-line tool developed by the Knownsec team. It combines the Large Language Model (LLM) and the Python runtime environment to allow users to automatically generate and run Pytho...
General Introduction Open-Sora is an open source project designed to allow anyone to efficiently generate high quality videos. It is developed by the hpcaitech team to provide tools to generate video from text or images, supporting multiple resolutions and durations. The project is completely open source, with public model weights...
General Introduction CodeArena is a unique platform designed to showcase the best open source code generation models (LLMs) through real-time face-offs. Users can watch different LLMs compete in the same programming tasks and view the best performing models through real-time leaderboards. The platform utilizes Tog...
General Description InsightExpress is a Next.js-based application that generates AI-driven research reports based on user-supplied topics and emails them to users. The application utilizes Langflow's AI ...
General Introduction DisPose is an innovative open source artificial intelligence project focused on controlled character image animation generation. Developed by a team of researchers and open-sourced on GitHub, the project uses advanced deep learning techniques to achieve precise character animation control by decomposing skeletal pose information.D...
Comprehensive Introduction ColorFlow is an image sequence auto-coloring tool developed by Tencent's ARC team to solve the problem of auto-coloring black and white image sequences. The tool utilizes a retrieval-enhanced coloring pipeline to accurately generate the colors of various elements through a pool of reference images, including the character's hair color and service...
Comprehensive Introduction Probly is a spreadsheet tool developed by the PragmaticMachineLearning team and open-sourced on GitHub that combines the functionality of traditional spreadsheets with powerful AI data analysis capabilities. It not only supports the use of ...
General Introduction Gemini Next Chat is an open source project designed to help users easily deploy private Gemini applications. The project supports Gemini 1.5 and Gemini 2.0 multimodal model , users can deploy with one click on Vercel...
General Introduction MuseGAN is a music generation project based on Generative Adversarial Networks (GANs) designed to generate multi-track (multi-instrument) music. The project is capable of generating music from scratch or accompanied by user-supplied tracks.MuseGAN uses Lakh Pianor...
General Introduction Napkins.dev is a free open source project, the core function is to allow users to upload interface screenshots or wireframes to automatically generate runnable front-end code. Users only need to provide a design drawing , the tool will be through the Llama 4 model (by Together ...
Comprehensive Introduction dsRAG is a high-performance retrieval engine designed to handle complex queries on unstructured data. It performs particularly well in handling challenging queries in dense text such as financial reports, legal documents, and academic papers. dsRAG employs three key approaches to improve performance: language...
Comprehensive Introduction Kimi-Audio is an open source audio base model developed by Moonshot AI that focuses on audio understanding, generation and dialog. It supports a wide range of audio processing tasks such as speech recognition, audio Q&A and speech emotion recognition. The model has been tested over 130...
General Introduction AI no jimaku gumi (AI no subtitle group) is a powerful command line video subtitle processing tool focused on automating video subtitle extraction, transcription and translation functions. The tool integrates advanced AI technologies, including Whisper speech...
General Introduction Vercel AI SDK is an open source tool developed by the Vercel team to help developers build AI applications using frameworks such as React, Svelte, Vue and Solid. It supports multiple language model providers...
Comprehensive Introduction DeepWiki-Open is an open source project designed to automatically generate structured documentation for code repositories on GitHub, GitLab and Bitbucket. It uses AI technology to analyze the code structure , file content and logical relationships , rapid generation ...
General Description Whisper Input is an open source voice transcription tool that allows users to start recording voice by pressing the Option button and end the recording by lifting the button. The tool calls Groq Whisper Large V3 Turbo ...
Comprehensive Introduction ComfyUI-WanVideoWrapper is an open source plugin created by developer kijai, designed for the ComfyUI platform. It is based on WanVideo's Wan2.1 model , provides a powerful video ...
General Introduction llm.pdf is an open source project that allows users to run Large Language Models (LLMs) directly in PDF files. Developed by EvanZhouDev and hosted on GitHub, this project demonstrates an innovative approach: by Em...
General Introduction hugo-translator is an automated translation tool designed for Hugo's static site generator, hosted on GitHub and created by developer Rico00121. The tool is designed to help Hugo users translate their blog...
General Introduction Langfuse is an open source LLM (Large Language Model) engineering platform. It helps developers trace, debug, and optimize LLM applications by providing tools for observing calls, managing cue words, running experiments, and evaluating results. The platform is developed by the Langfuse team...
Comprehensive Introduction SuperWeChatPC is an open source WeChat enhancement tool for computers, the core of which is to provide convenience for users and developers. It initially solves the problem that WeChat can only be opened singly, and later added WeChatSDK, so that developers can call WeChat functions, such as sending messages...
General Introduction MOFA-Video is a state-of-the-art image animation generation tool that utilizes generative motion field adaptation techniques to convert static images into dynamic videos. The project was developed in collaboration with the University of Tokyo and Tencent AI Lab, and will be presented at the 2024 European Conference on Computer Vision (E...
Comprehensive Introduction Sim Studio is an open source AI agent workflow building platform focused on helping users quickly design, test, and deploy large-scale language model (LLM) workflows through a lightweight, intuitive visual interface. Users can create complex workflows without deep programming by dragging and dropping...
General Introduction PPTX2MD is an open source tool designed to convert PowerPoint PPTX files to Markdown format. Developed by GitHub user ssine, the tool supports preserving headings, lists, text formatting (e.g., bold, italic, color, and super...
General Introduction Rankify is an open source Python toolkit developed by the Data Science Group at the University of Innsbruck, Austria. It focuses on information retrieval, reordering and retrieval augmentation generation (RAG), providing a unified framework. The toolkit comes with a built-in set of 40 pre-retrieved benchmarks...
General Introduction Yutu is a powerful open source command line tool designed for YouTube users, hosted on GitHub and developed by the eat-pray-ai team. It uses terminal operations to realize the YouTube videos, playlists, frequency...
Comprehensive Introduction AIBot PRO is a .NET 6-based AI aggregation client designed to provide users with a convenient platform for integrating multiple AI products. The client supports senseless switching dialog and integrates ChatGPT, Gemini, Claude, Wenxin Yiyin...
General Introduction AIEvo is Ant Group's open source multi-agent framework designed to efficiently create multi-agent applications. The framework strictly follows the SOP task graph to improve the execution success rate of complex tasks , and through feedback and monitoring mechanisms to ensure high flexibility and scalability.AIEvo has been produced within Ant Group ...
General Introduction HiveChat is an AI chatbot for small to medium sized teams that allows administrators to configure multiple AI models (such as Deepseek, OpenAI, Claude, and Gemini) at once for easy use by team members. It ...
General Introduction MCP Containers is an open source project, hosted on GitHub, focused on providing containerized solutions for Model Context Protocol (MCP) servers. It simplifies through Docker containers...
General Introduction EditorJumper is a plugin designed for JetBrains IDE, developed by GitHub user wanniwa. It allows developers to use the JetBrains IDE (e.g. IntelliJ ...
General Introduction LaWGPT is an open source project supported by the Machine Learning and Data Mining Research Group of Nanjing University, which is dedicated to building a large language model based on Chinese legal knowledge. It is based on generalized Chinese models (such as Chinese-LLaMA and ChatGLM)...
General Introduction The NoneBot DeepSeek plugin is a NoneBot plugin that integrates the DeepSeek model and is designed to provide intelligent dialog and Q&A functionality. By accessing the DeepSeek model, users can use the NoneBot ...
General Introduction Open-Reasoner-Zero is an open source project focused on reinforcement learning (RL) research, developed by the Open-Reasoner-Zero team on GitHub. It aims to provide efficient, scalable and easy-to-use training ...
General Introduction OpenAI Agents SDK is a lightweight development tool from OpenAI designed for building multi-intelligent body workflows. Based on Python, it is easy to use and supports developers to configure Agents, task cut...
General Introduction Basic Memory is a tool for building knowledge graphs by conversing with AI assistants such as Claude. It was developed by Basic Machines and its core feature is to save conversations as Markdown files, save...
Comprehensive Introduction Vexa is an open source real-time meeting transcription and knowledge management platform designed to provide efficient meeting recording and intelligent knowledge extraction services for enterprises and individuals. It automatically joins platforms such as Google Meet, Zoom, etc. through API-driven meeting robots...
General Introduction ai-trend-publish is an open source project hosted on GitHub, developed by the OpenAISpace team, focused on tracking and publishing the latest trends in artificial intelligence in real time. This tool is designed to help developers, tech hobbyists...
Comprehensive Introduction Observers is an open source Python SDK designed to provide comprehensive observability for generative AI APIs. The library enables users to easily track and record interactions with AI models and store these observations in multiple backends. Whether...
General Introduction Ruyi-Models is an open source project designed to generate high quality videos from images. Developed by the IamCreateAI team, the project supports the generation of 768 resolution, 24 frames per second, a total of 5 seconds 120 frames of cinematic video...
General Introduction BlenderMCP is an open source tool that connects Blender to Claude AI via the Model Context Protocol (MCP) protocol. Users can use text commands to directly control ...
General Introduction CAD-MCP is an open source project that allows users to control CAD software drawing operations through natural language commands. It combines natural language processing and CAD automation technology , so that users do not need to manually operate the CAD interface , just enter simple text commands that ...
General Introduction Swarms is an enterprise-grade production-ready multi-agent orchestration framework designed to boost business productivity through efficient agent management and task processing. With support for multiple models, multiple memory systems and custom agent creation, the framework provides a modular design and comprehensive logging capabilities to ensure that the system...
Comprehensive introduction WeChatFerry is an open source WeChat robot underlying framework , created and maintained by the developer lich0821 on GitHub . The project through the WeChat Hook technology , provides a set of powerful SDK, allowing developers to WeChat ...
General Introduction AnimatedDrawings is an open source project developed by Facebook Research to transform children's drawings into animated characters through automation techniques. The project is based on the paper "A Method for A...
Comprehensive Introduction InternVL is an open source multimodal big model project developed by Shanghai Artificial Intelligence Laboratory (OpenGVLab) and hosted on GitHub. It integrates visual and linguistic processing capabilities to support the comprehensive understanding and generation of images, videos and texts.In...
General Introduction GraphCast is an advanced weather forecasting tool developed by Google DeepMind that aims to improve the accuracy of medium-term global weather forecasts through deep learning techniques. The project provides a variety of pre-trained models and sample code that users can utilize to resource...
General Introduction DragAnything is an open source project that aims to realize motion control of arbitrary objects through entity representation. The project is developed by the Showlab team and has been accepted by ECCV 2024.DragAnything provides a way to use ...
Comprehensive Introduction RooFlow is an open source AI-assisted programming tool with the core functionality of saving code, decisions and task progress during development through project logging. It is based on Roo Code extension and integrates five modes: architecture, coding, testing, debugging and Q&A. These modes inter...
General Description DeepSeek Diagrams Extension is a Chrome extension designed to help users render diagrams inline in the DeepSeek website. The extension is based on Mermaid...
Comprehensive Introduction FinRobot is an open source AI intelligence platform developed by AI4Finance Foundation and designed for financial analytics. It not only covers traditional language models, but also incorporates a variety of AI technologies, aiming to provide a comprehensive solution for the financial industry.F...
General Introduction KBLaM is an open source project developed by Microsoft, the full name is "Knowledge Base augmented Language Model" (Knowledge Base Augmented Language Model). It is through the conversion of external knowledge into vectors and embedded in a large model of ...
General Description Deep Chat is an open source AI chat component designed for web developers. It was developed by Ovidijus Parsiunas, is hosted on GitHub, and currently has over 2k stars. Users can simply configure...
General Introduction Go-with-the-Flow is an open source project developed by the Netflix Eyeline Studios research team to control the motion patterns of video diffusion models by distorting noise. The project allows the user to determine how the camera in the scene and...
Comprehensive Introduction LocalPdfChatRAG is an open source project that aims to implement intelligent chat functionality by combining local PDF documents with Retrieval Augmented Generation (RAG) models. The project allows users to upload PDF documents and ask questions through natural language to get from the document to the relative ...
General Introduction TRV is an open source tool, hosted on GitHub, designed to help users quickly convert slides and presentation notes into videos with narration. It automatically generates audio and video content from incoming presentation files through simple command line operations, suitable for those who need to quickly create presentations...
Archon is the world's first "Agenteer" project built by developer Cole Medin (GitHub username coleam00) - an open source framework focused on autonomously building, optimizing, and iterating on AI Intelligence. It is both...
General Introduction MarkPDFDown is an open source tool. It utilizes the Multimodal Large Language Model to convert PDF files into Markdown format. The developer is GitHub user jorben. the goal of this tool is simple: to make PDF documents ...
General Introduction Lumina-mGPT-2.0 is an open source project jointly developed by Shanghai AI Laboratory (Shanghai AI Laboratory), The Chinese University of Hong Kong (CUHK) and other organizations, hosted on GitHub by Alpha...
General Introduction Vision Agent is an open source project developed by LandingAI (Team Enda Wu) and hosted on GitHub, designed to help users quickly generate code to solve computer vision tasks. It utilizes an advanced agent framework and multimodal modeling...
Comprehensive introduction Yuxi-Know is an open source intelligent Q&A platform that combines knowledge graph and RAG (Retrieval Augmented Generation) technology to help users quickly get accurate answers. It is based on Neo4j storage knowledge graph , using FastAPI and VueJS structure ...
Comprehensive Introduction LHM (Large Animatable Human Reconstruction Model) is an open source project which is developed by aigc3d team to quickly generate action-supporting 3D human models from a single image. Core features ...
Comprehensive Introduction Lecca is a powerful AI platform that allows users to configure and deploy Large Language Models (LLMs) with multiple tools and workflows. Users can easily build, customize and automate their AI agents.Lecca offers a wide selection of AI providers and models...
General Introduction Flashcard is an open source language learning tool designed to provide an alternative to Duolingo. Developed by Steven Lynn (GitHub username: stvlynn), the project features a modern user interface and multilingual...
Comprehensive introduction Local-NotebookLM is an open source project that aims to provide locally run intelligent document processing and content generation tools. It is inspired by Google NotebookLM , focusing on helping users to PDF and other documents into a variety of ...
Comprehensive Introduction CHRONOS is a news timeline summarization tool developed by Alibaba NLP team. The tool generates timeline summaries of news events through iterative self-questioning.CHRONOS is not only capable of handling open-domain timeline summarization tasks, but also in terms of efficiency and scalability...
Comprehensive Introduction Potpie AI is an open source platform focused on providing developers with customized AI engineering assistants. It allows AI agents to deeply understand code structure and logic and automate tasks such as debugging, testing, and code generation by building a knowledge graph of the code base. Users can use simple...
General Introduction Minima is an open source RAG (Retrieval-Augmented Generation) solution that supports local deployment and integration with ChatGPT. The project is maintained by dmayboroda and aims ...
General Introduction Audibit is an open source project, the core function is to Hacker News, TechCrunch and other popular technology articles automatically turned into audio podcasts, so that users in the commute, fitness, or busy when listening to information through the Web or mobile. The project makes ...
General Introduction E2B Open Computer Use is an open source project that aims to provide a secure cloud-based Linux computer use experience through the E2B Desktop Sandbox.The E2B Sandbox provides a desktop graphical environment that users can connect to any large...
General Introduction Chat2DB is an open source database management and SQL client tool developed by the CodePhiliaX team , integrated with AI functionality , support for quickly writing SQL queries , managing databases , generating data reports and multi-database interaction . It supports more than 16...
Comprehensive Introduction VACE is an open source project developed by Alitongyi Visual Intelligence Lab (ali-vilab), focusing on video creation and editing. It is an all-in-one tool that integrates a variety of functions, such as generating videos based on references, editing existing video content, localization modifications, and other...
General Introduction LangManus is an open source AI automation framework hosted on GitHub. Developed by a group of former colleagues in their spare time, it is an academically-driven project with the goal of combining language models and specialized tools to accomplish web search, data crawling, and code execution...
Comprehensive Introduction Agent TARS is a multimodal AI intelligence open-sourced by ByteDance.The core feature is to visually understand web content and combine command line and file system operations to help users complete complex computer tasks. Instead of requiring manual operations like traditional tools, it can self...
Comprehensive Introduction Crawl4LLM is an open source project jointly developed by Tsinghua University and Carnegie Mellon University, focusing on optimizing the efficiency of web crawling for pre-training of large models (LLM). It significantly reduces ineffective crawling by intelligently selecting high-quality web page data, claiming to be able to originally need to crawl 1...
General Introduction This is a structured report generation blueprint project co-developed by LangChain and NVIDIA, showcased in a Jupyter notebook tutorial on GitHub. The project utilizes advanced AI technologies, specifically Llama-3.3-7...
Comprehensive Introduction Instructor is a popular Python library designed for processing structured output from Large Language Models (LLMs). Built on Pydantic, it provides a simple, transparent and user-friendly API for managing data...
General Introduction ChatFree is an open source project that aims to free users' AI apps from the constraints of browsers to run locally. Created using GPT API, Copilot is designed to support a wide range of office software such as Office, Word, WPS, and more. The project was developed by ...
Comprehensive Introduction LLManager is an open source intelligent approval management tool, developed based on LangChain's LangGraph framework, focused on automating the processing of approval requests while optimizing decision making with human review. It does this through semantic search, sample less learning and...
Comprehensive Introduction Search-o1 is an open source project that aims to enhance the performance of large-scale reasoning models (LRMs) by integrating advanced search mechanisms. The core idea is to solve the knowledge deficit problem encountered in the reasoning process through dynamic search and knowledge integration. The project was developed by sunn...
General Description LineAvatars is a free and easy to use online tool designed to generate Notion style line avatars. Users can upload a photo or take a photo via webcam and the system will automatically generate a line avatar using AI. This tool...
General Introduction Motia is an open source AI agent framework for software engineers, hosted on GitHub and developed by the MotiaDev team. It allows developers to use familiar programming languages (e.g. Python, TypeScript, Rub...