General Introduction Agent S is an open source framework developed by Simular AI that lets intelligences operate computers like humans through a graphical user interface (GUI). It uses a multimodal large language model and empirical learning techniques to perform tasks such as browsing the web, editing documents, and using software. The project is on GitHub...
Libra is an innovative tool from Greenbit.ai, whose core function is to generate AI intelligences that can run locally through natural language conversations. Called the "Vibe Agent", it allows users to quickly create their own intelligences by describing their needs in simple terms, performing web searches, data...
Enable Builder Smart Programming Mode, unlimited use of DeepSeek-R1 and DeepSeek-V3, smoother experience than the overseas version. Just enter the Chinese commands, even a novice programmer can write his own apps with zero threshold.
General Introduction Optexity is an open source project on GitHub, developed by the Optexity team. Its core is to use human demonstration data to train AI to complete computer tasks, especially web page operations. The project contains three code libraries : ComputerGYM, AgentAI and Playwright, users can ...
General Introduction RunRabbit is an AI-based tool that allows users to control their browsers to accomplish a variety of tasks through simple voice or text commands. Its best feature is that it understands the user's needs and then automatically manipulates web pages, such as searching for information, filling out forms or performing repetitive tasks. The website ...
Comprehensive introduction LangGraph CUA is an open source project developed by the LangChain team. It is based on the LangGraph framework, allowing developers to use Python to build AI intelligences that can directly operate computers. The core of this tool is "Computer Use Agent" (CUA), can simulate human ...
Comprehensive Introduction Agent TARS is a multimodal AI intelligence open-sourced by ByteDance, with core features that help users complete complex computer tasks by visually understanding web content and combining command line and file system operations. Instead of requiring manual operations like traditional tools, it automatically performs browser...
General Introduction Playwright MCP is an open source tool developed by Microsoft and hosted on GitHub. It allows artificial intelligence models to directly control browsers through the Model Context Protocol (MCP) protocol, performing actions such as opening web pages, clicking on elements, and entering text. The tool is based on Pl...
General Introduction Airtop is an artificial intelligence-based browser automation tool. It allows users to control cloud browsers to perform complex web operations such as logging into websites, crawling data or performing automation tasks through simple natural language commands. It solves the problem that traditional scripting is complicated and prone to...
General Introduction BrowserAgent is a tool that creates and runs AI workflows directly in the browser. It's easy to use and requires no code to be written, the user simply describes the desired workflow and the AI is automatically generated. Its core feature is completely private, all data is processed in your browser, no...
General Introduction Highlight AI is a desktop AI assistant for Windows and macOS (mobile version in development) that helps users to quickly complete tasks in any application through voice commands and screen content analysis. It captures screen content, generates code, answers questions, and works with GitHub...
Comprehensive Introduction autoMate is a local automation tool open-sourced and developed by yuruotong1 on GitHub, with AI+RPA (Artificial Intelligence+Robotic Process Automation) as its core feature. It combines the intelligent understanding of large-scale language models with the process execution capabilities of RPA, users only need to use natural language...
General Introduction Nanobrowser is an open source Chrome extension designed to automate web tasks through an AI-driven multi-agent system. It is a free alternative to OpenAI Operator, which users can use by simply providing their LLM (Large Language Model) API key, supporting o...
Comprehensive Introduction Proxy Lite is an open source, lightweight web automation tool developed by Convergence AI as a mini version of Proxy with an open weight design. It is based on the 3B parameter Visual Language Model (VLM), and is able to autonomously complete web navigation and task execution, such as finding information...
General Introduction Rabbit Android Agent is an innovative AI intelligence developed by Rabbit, designed to help users complete single or multi-step tasks on their Android devices through voice and text commands. The technology is based on Rabbit's LAM (Large Action Model)...
General Introduction Convergence is a company dedicated to helping people regain control of their time using machine learning technologies. By developing large-scale meta-learning models (LMLMs), Convergence's AI agents (browser agents) are able to acquire new skills, take action, and continuously improve in real-time use. Its core ...
General Introduction mac assistant is an AI intelligences project designed specifically for macOS, aiming to simplify user operations by combining native software and web features. The project currently supports the OpenAI and GEMINI APIs, and plans to support a native large language model run by Ollama in the future. mac_assista...
General Introduction Open Operator is an open source project that aims to automate operations in the browser through AI intelligences. Developed by Browserbase, the project combines the technologies of Stagehand and Browserbase to enable users to control the behavior of the browser through natural language commands.Ope...
General Introduction MobileAgent is a powerful mobile device operation assistant designed to improve the efficiency and automation of mobile device operation through multi-agent collaboration and enhanced visual perception modules. Developed by the X-PLUG team, it supports Android and Harmony OS systems, and is capable of working in complex...
General Introduction TankWork is an open source desktop agent framework designed to enable AI to perceive and control your computer through computer vision and system-level interaction. The framework allows agents to directly control computers through voice and text commands, process real-time screen content, and provide continuous audio visual feedback and manipulation...