Desktop Automation Intelligence

Total: 44 articles


Suna: Intelligent Agents for Integrated Browser Operations and Data Analytics

General Introduction Suna is an open source, general-purpose AI agent developed by Kortix AI and hosted on GitHub. It is released under the Apache 2.0 license, so users can download, modify, and self-host it for free. It uses natural language dialog to help users with...

4mos ago


Strawberry: an AI smart browser for automating tasks

General Introduction Strawberry is a smart browser with a built-in AI assistant designed to help users automate their daily tasks and improve efficiency. Unlike traditional browsers, it integrates AI technology that understands web content in real time and performs complex tasks such as quick research, content writing...

Latest AI Resources # Desktop Automation Intelligence

4mos ago


Fellou: a native AI browser for automating tasks

General Introduction Fellou, from Fellou AI, is billed as the world's first action-oriented AI browser. It not only provides the web browsing functionality of a traditional browser, but also automates tasks and enables deep information search through AI technology.

Latest AI Resources # Desktop Automation Intelligence

4mos ago


AiPy: automating the task of running Python code for data analysis

General Introduction AiPy is an open source Python command-line tool developed by the Knownsec team. It combines a large language model (LLM) with the Python runtime environment to let users automatically generate and run Pytho...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

4mos ago


DroidRun: an open source tool for AI to automate Android phones

General Introduction DroidRun is an open source tool that lets AI operate an Android phone like a human. It helps AI automate tasks such as opening apps, sending messages, or browsing the web by extracting interactive on-screen elements such as buttons and input boxes. DroidRun combines...

Latest AI Resources # Desktop Automation Intelligence

4mos ago


Agent S: An Open Source Framework for Agents to Operate Computers Like Humans

General Introduction Agent S is an open-source framework developed by Simular AI that lets agents operate computers like humans through a graphical user interface (GUI). It uses a multimodal large language model and experience-based learning techniques to accomplish tasks such as browsing the web, editing documents, using software...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

4mos ago


Libra: a client for generating local AI agents through dialog (closed beta)

General Introduction Libra is an innovative tool from Greenbit.ai whose core function is to generate AI agents that can run locally through natural language conversations. Called the "Vibe Agent", it allows users to describe their needs in simple terms and quickly create...

Latest AI Resources # Intelligent Body Application # Desktop Automation Intelligence

4mos ago


Optexity: an open-source project to train AI to perform web actions with human demonstrations

General Introduction Optexity is an open source project on GitHub, developed by the Optexity team. Its core idea is to use human demonstration data to train AI to complete computer tasks, especially web page operations. The project contains three repositories: Compute...

Latest AI Resources # AI Java Open Source Project # Large model fine-tuning # Desktop Automation Intelligence

4mos ago


RunRabbit: Using Voice and Text to Direct Agents That Complete Computer Operations

General Introduction RunRabbit is an artificial intelligence-based tool that allows users to control their browsers to accomplish various tasks through simple voice or text commands. Its standout feature is that it understands the user's needs and then automatically operates web pages, for example searching for information, filling out forms or performing repetitive tasks...

Latest AI Resources # Desktop Automation Intelligence

4mos ago


LangGraph CUA: LangGraph-based AI Agent for Operating Computers

Comprehensive Introduction LangGraph CUA is an open source project developed by the LangChain team. It is based on the LangGraph framework and allows developers to use Python to build AI agents that can directly operate the computer. The core of this tool ...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

5mos ago


Agent TARS: An Open Source Agent Using Vision and Commands to Operate Computers

Comprehensive Introduction Agent TARS is a multimodal AI agent open-sourced by ByteDance. Its core feature is to visually understand web content and combine this with command-line and file system operations to help users complete complex computer tasks. Instead of requiring manual operations like traditional tools, it can self...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

5mos ago


Playwright MCP: Browser Automation MCP Service from Microsoft

General Introduction Playwright MCP is an open source tool developed by Microsoft and hosted on GitHub. It enables artificial intelligence models to directly control browsers through the Model Context Protocol (MCP) and complete operations such as opening...

Latest AI Resources # AI Java Open Source Project # MCP services # Desktop Automation Intelligence

5mos ago


Airtop: A Browser Automation Tool Using Natural Language Controls

General Introduction Airtop is an artificial intelligence-based browser automation tool. It lets users control cloud browsers through simple natural language commands to perform complex web operations such as logging in to a website, scraping data, or running automation tasks. It solves the problem of writing traditional scripts that are complex and capacit...

Latest AI Resources # Desktop Automation Intelligence

5mos ago


BrowserAgent: a tool for creating and running AI workflows in the browser

General Introduction BrowserAgent is a tool that creates and runs AI workflows directly in the browser. It is easy to use and requires no code: the user simply describes the desired workflow and the AI generates it automatically. Its core feature is complete privacy: all data is in your browser...

Latest AI Resources # Low-code workflow # Desktop Automation Intelligence

5mos ago


Highlight AI: An AI assistant that uses voice and screen analytics for desktop tasks

General Introduction Highlight AI is a desktop AI assistant for Windows and macOS (mobile version in development) that helps users to quickly complete tasks in any application through voice commands and screen content analysis. It captures screen content, generates generation...

Latest AI Resources # Desktop Automation Intelligence

2mos ago


autoMate: a local tool that combines AI and RPA to automate computer tasks

Comprehensive Introduction autoMate is a local automation tool developed and open-sourced by yuruotong1 on GitHub. Its core feature is AI+RPA (artificial intelligence plus robotic process automation). It combines the intelligent understanding of large language models with RPA...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

5mos ago


Nanobrowser: Multi-Agent Plugin for Task Automation in Browsers

General Introduction Nanobrowser is an open source Chrome extension designed to automate web tasks through an AI-driven multi-agent system. It is a free alternative to OpenAI Operator, where users simply provide their LLM...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

5mos ago


Proxy Lite: A Web Automation Tool Driven by a 3B-Parameter Vision Model

Comprehensive Introduction Proxy Lite is an open source, lightweight web automation tool developed by Convergence AI as an open-weight mini version of Proxy. It is based on a 3B-parameter vision-language model (VLM) and is able to self...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

5mos ago


Rabbit Android Agent: voice-controlled agent for Android applications (not publicly available)

General Introduction Rabbit Android Agent is an innovative AI agent developed by Rabbit, designed to help users complete single- or multi-step tasks on their Android devices through voice and text commands. The technology is based on Rabbit's ...

Latest AI Resources # Desktop Automation Intelligence

6mos ago


Convergence: an AI assistant that automates repetitive tasks in an agent browser

General Introduction Convergence is a company dedicated to helping people regain control of their time using machine learning technology. By developing Large Meta Learning Models (LMLMs), Convergence enables its AI agents (browser agents) to acquire new skills in real time using...

Latest AI Resources # Desktop Automation Intelligence

2mos ago


mac assistant: an AI agent for macOS devices that automates desktop actions

Comprehensive Introduction mac assistant is an AI agent project designed for macOS, aiming to simplify user operations by combining native software and web features. The project currently supports the OpenAI and Gemini APIs, and plans to support future ...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

6mos ago


Open Operator: Performing Automation in Cloud Browsers with AI Agents

General Introduction Open Operator is an open source project that aims to automate operations in the browser through AI agents. Developed by Browserbase, the project combines the technologies of Stagehand and Browserbase...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

7mos ago


MobileAgent: Multi-agent Collaboration Assistant for Mobile Devices

General Introduction MobileAgent is a powerful assistant for operating mobile devices, designed to improve the efficiency and automation of device operation through multi-agent collaboration and enhanced visual perception modules. It is developed by the X-PLUG team and supports Android and ...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

7mos ago


TankWork: an agent that operates computers via voice and text and provides real-time voice feedback

General Introduction TankWork is an open source desktop agent framework designed to enable AI to perceive and control your computer through computer vision and system-level interaction. The framework allows agents to directly control computers through voice and text commands, process real-time screen content, and provide continuous audio visual...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

7mos ago


UI-TARS Desktop: Desktop Agent Application for Controlling Computers Using Natural Language

General Introduction UI-TARS Desktop is a GUI agent application based on UI-TARS, a vision-language model developed by ByteDance. The application allows users to control computers through natural language for more intuitive and efficient human-computer interaction. UI-TAR...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

7mos ago


Shortest: an AI automated testing tool that uses natural language for end-to-end testing

General Introduction Shortest is an AI-powered natural language end-to-end testing framework developed by the Anti-Work team. It is built on Playwright and supports GitHub integration and two-factor authentication (2FA). Shortest's main features are...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

7mos ago


Midscene.js: Open Source Plugin for Automated Browser Testing Driven by AI

General Introduction Midscene.js is an AI-powered browser automation tool that controls web pages, performs assertions and extracts data through natural language commands. It supports Chrome extensions, JavaScript SDKs and YAML scripts, simplifying UI measurement...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

7mos ago


Stagehand: A Framework for Natural Language Implementation of Browser Automation Operations

General Introduction Stagehand is an AI web browsing framework focused on simplicity and extensibility. It is fully compatible with Playwright and provides three simple AI APIs (act, extract, and observe) that are built on the base...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

7mos ago


Eko: Building Agent Workflows for Desktop and Browser Automation with Natural Language

General Introduction Eko is a production-grade JavaScript framework designed to build efficient agent workflows through natural language descriptions. It is designed to enable developers to automate everyday tasks using AI technologies without deep programming knowledge. Eko provides a uni...

Latest AI Resources # AI Java Open Source Project # Low-code workflow # Intelligent Body Application

5mos ago


AutoMouser: Generating Browser Automation Code by Converting Mouse Actions to Selenium Python Scripts via AI

General Introduction AutoMouser is a Chrome extension that intelligently tracks user interactions and automatically generates Selenium test code using OpenAI's GPT models. It does this by recording user browser actions and converting them...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

7mos ago


Browser Use Web UI: an open source framework for running AI agents that browse and automatically operate web pages

Comprehensive Introduction Browser Use Web UI is an innovative open source project that provides a graphical interface for AI agents with browser interaction capabilities. The project builds on the browser-use core framework and is built with Gradio ...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

2mos ago


E2B Open Computer Use: Running an AI operating system safely in the E2B sandbox

General Introduction E2B Open Computer Use is an open source project that aims to provide a secure cloud-based Linux computer use experience through the E2B Desktop Sandbox. The E2B Sandbox provides a desktop graphical environment that users can connect to any large...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

7mos ago


NeoAI: Open source project that lets AI take over remote operation of computers and control them using natural language

General Introduction NeoAI is an innovative open source AI assistant tool that allows users to easily control and manage their computers through natural language conversations. Without writing any code, users can simply use everyday conversations to find files, automate tasks, manage devices, etc. NeoAI...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

7mos ago


CogAgent: Zhipu AI's open source visual language model for automating graphical interfaces

Comprehensive Introduction CogAgent is an open source visual language model developed by the Tsinghua University Data Mining Research Group (THUDM), aiming to automate the operation of cross-platform graphical user interfaces (GUIs). The model is based on CogVLM (GLM-4V-9B) and supports bilingual Chinese and English...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

8mos ago


ClickClickClick: Enable Any LLM to Automate Android and PC Operations

General Introduction ClickClickClick is a framework developed by BandarLabs that aims to automate Android and PC operations by using any local or remote Large Language Model (LLM). The project is currently in a highly experimental phase and supports a variety of models such as...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

8mos ago


Browser-Use: An Intelligent Web Automation Tool That Lets AI Agents Easily Operate Browsers

Comprehensive Introduction Browser-Use is an innovative open source web automation tool specifically designed to enable large language models (LLMs) to interact naturally with websites. It provides a powerful and flexible framework that supports a wide range of mainstream language models, including GPT-4, Claud...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

8mos ago


Project Mariner: browser automation, a research prototype exploring the future of human-computer interaction (unreleased)

General Introduction Project Mariner is a research prototype launched by Google DeepMind to explore the future of human-computer interaction. The project leverages the powerful multimodal understanding and reasoning capabilities of Gemini 2.0 through a browser self...

Latest AI Resources # Desktop Automation Intelligence

7mos ago


Dia Browser: provides an intelligent browsing experience with integrated AI tools to automate tasks in the browser (not yet live)

General Description Dia Browser is a new smart browser developed by The Browser Company that aims to provide users with a more efficient browsing experience by integrating advanced AI tools. The browser is expected to be officially released in early 2025, with key features...

Latest AI Resources # Desktop Automation Intelligence

8mos ago


Clevrr Computer: A Desktop Automation Agent Built on the PyAutoGUI Library

General Introduction Clevrr Computer is an open source project that aims to automate system operations through the use of the PyAutoGUI library. Inspired by Anthropic, the project sets out to design an automation agent that can accurately and efficiently perform operations using ...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

8mos ago


GLM-PC (Zhipu "Bull") officially released for closed beta download: an AI that can actually control the computer

GLM-PC (Bull) Introduction GLM-PC is a desktop application based on the CogAgent model that can perform complex tasks quickly through natural language commands. It is capable of task planning and interface understanding, and can autonomously complete various computer operations according to user instructions. Notes for use...

Latest AI Resources # Desktop Automation Intelligence

8mos ago


Runner H: Automating Web Tasks through Natural Language Commands (closed beta by application)

General Introduction H is a company dedicated to developing cutting-edge action models designed to enhance worker productivity through advanced AI capabilities. Its flagship product, Runner H, is an advanced AI agent designed to help users automate complex, multi-step tasks and reduce re...

Latest AI Resources # Desktop Automation Intelligence

8mos ago


AppAgent: automated smartphone operation using multimodal agents

Comprehensive Introduction AppAgent is a large language model (LLM)-based multimodal agent framework designed to operate smartphone applications. The framework mimics human interactions such as taps and swipes through a simplified action space, thus eliminating the need for system back-end access and extending its use across different app...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

8mos ago


Skyvern: Automating Browser-Based Workflows with LLM and Computer Vision

General Introduction Skyvern is a tool for automating browser workflows using large language models (LLMs) and computer vision techniques. It provides a simple API endpoint for efficiently automating workflows across a large number of websites, replacing automation solutions that are fragile or unreliable...

Latest AI Resources # Intelligent Body Application # Desktop Automation Intelligence

5mos ago


Agent.exe: Let AI control your computer directly, an open source implementation of Claude's computer use

General Description Agent.exe is an open source Electron application that utilizes Anthropic's Claude 3.5 Sonnet API to allow users to control their local computers directly through AI. The project was developed by K...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Intelligence

8mos ago

