Comprehensive Introduction Sim Studio is an open source platform for building AI agent workflows. It helps users quickly design, test, and deploy large language model (LLM) workflows through a lightweight, intuitive visual interface. Users can create complex multi-agent applications by drag-and-drop without deep programming...
Comprehensive Introduction Mad Professor (Grumpy Professor Reads Papers) is an open source AI academic tool designed for researchers and students to simplify the reading and analysis of academic papers. It integrates PDF processing, AI translation, RAG search, AI Q&A and voice interaction. Users can import...
Comprehensive Introduction AIstudioProxyAPI is an open source project that uses Node.js and Playwright to expose the Gemini chat functionality of the Google AI Studio web interface as a standard API that emulates the OpenAI API. Developers can use this proxy service...
General Introduction Step1X-Edit is an open source image editing framework developed by the Stepfun AI team and hosted on GitHub. It combines a multimodal large language model (Qwen-VL) with a diffusion transformer (DiT) so that users can edit images with simple natural language commands, such as changing the background, removing objects...
General Introduction Klavis AI is an open source platform focused on simplifying the use and integration of the Model Context Protocol (MCP), an open standard that allows AI applications to dynamically connect with external tools and data sources. Klavis AI offers Slack and Discord clients, hosted MCP servers, and...
Comprehensive Introduction RealtimeVoiceChat is an open source project focused on real-time, natural voice conversations with artificial intelligence. Users speak into a microphone; the system captures the audio through the browser, quickly converts it to text, generates a reply from a large language model (LLM), and then converts the text to speech...
General Introduction MiMo is an open source large language model project developed by Xiaomi, focusing on mathematical reasoning and code generation. The core product is the MiMo-7B family of models, consisting of a base model (Base), a supervised fine-tuned model (SFT), a reinforcement learning model trained from the base model (RL-Zero), and a reinforcement learning model trained from the SFT...
General Introduction Muyan-TTS is an open source text-to-speech (TTS) model designed for podcasting scenarios. It is pre-trained on over 100,000 hours of podcast audio data and supports zero-shot speech synthesis to generate high-quality, natural speech. The model is built on Llama-3.2-3B, combined with SoVITS decoding ...
General Introduction CAD-MCP is an open source project that allows users to control CAD software for drawing operations through natural language commands. It combines natural language processing and CAD automation technologies to allow users to create and modify drawings without having to manually manipulate the CAD interface, just by entering simple text commands. Project ...
Comprehensive Introduction GraphGen is an open source framework developed by OpenScienceLab, an artificial intelligence lab in Shanghai, hosted on GitHub, focused on optimizing supervised fine-tuning of Large Language Models (LLMs) by guiding synthetic data generation through knowledge graphs. It constructs fine-grained knowledge graphs from source text, utilizing pre...
General Introduction ACI.dev is an open source infrastructure platform designed to give AI agents rapid integration with over 600 tools. It ensures that agents have secure access to tools such as Google Calendar, Slack, and Brave Search through multi-tenant authentication and fine-grained permissions management. Developers can...
General Introduction llm.pdf is an open source project that allows users to run large language models (LLMs) directly in PDF files. Developed by EvanZhouDev and hosted on GitHub, this project demonstrates an innovative approach: compiling llama.cpp to asm.js via Emscripten,...
General Introduction Abogen is an open source tool designed to quickly convert ePub, PDF or plain text files to high quality audio. It uses the Kokoro-82M model to generate natural and smooth speech, and also supports synchronized subtitle generation, which is suitable for producing audiobooks, video dubbing or study aids. Use...
General Introduction Local Deep Research is an open source AI research assistant designed to help users conduct deep research and generate detailed reports on complex problems. It runs locally, allowing users to complete research tasks without relying on cloud services. The tool combines local large language models...
General Introduction Trackers is an open source Python library focused on multi-object tracking in video. It integrates several leading tracking algorithms such as SORT and DeepSORT, allowing users to combine them with different object detection models (e.g. YOLO, RT-DETR) for flexible video analysis. Users can ...
Comprehensive Introduction Kimi-Audio is an open source audio foundation model developed by Moonshot AI that focuses on audio understanding, generation, and conversation. It supports a variety of audio processing tasks such as speech recognition, audio Q&A, and speech emotion recognition. The model has been pre-trained on over 13 million hours of audio data,...
General Description Describe Anything is an open source project developed by NVIDIA and several universities, with the Describe Anything Model (DAM) at its core. The tool generates detailed descriptions of regions that the user marks in an image or video using points, boxes, scribbles, or masks. It does not ...
General Introduction Cooragent is an open source AI agent collaboration framework developed by LeapLab at Tsinghua University and hosted on GitHub. It allows users to create intelligent AI agents from a one-sentence description and lets multiple agents collaborate on complex tasks. The framework provides two modes: Agent Factory automatically generates customized...
General Introduction InstantCharacter is an open source project developed by Tencent Hunyuan and the InstantX team, hosted on GitHub. It uses a reference image and a text description to generate consistent-looking character images for a wide range of scenarios and styles. The project is based on diffusion transformation...