AI Personal Learning
and practical guidance

Clevrr Computer: Automating Desktop Manipulation Intelligence with the PyAutoGUI Library

General Introduction

Clevrr Computer is an open source project that aims to automate system operations by using the PyAutoGUI library. The project is supported by Anthropic Clevrr Computer has been inspired to design an automated agent that performs the user's system operation tasks accurately and efficiently.Clevrr Computer is able to automate keyboard, mouse, and screen interactions while ensuring the security and accuracy of each task. The project is currently in beta and users should be aware of the risks associated with its use.

Recommended automated desktop operating intelligences introduced by Smart Spectrum:GLM-PC (Smart Spectrum Bull) officially released for internal download, the real AI that can control the computer


Clevrr Computer: Implementing an Intelligent Body Automation OS Desktop with the PyAutoGUI Library-1

 

Clevrr Computer: Implementing an Intelligent Body Automation OS Desktop with the PyAutoGUI Library-1

 

Function List

  • Automate mouse movements, clicks and keyboard input
  • Screenshots and management windows
  • Handle errors gracefully and provide feedback
  • Execute tasks with maximum precision and avoid unintentional operations

 

Using Help

Installation process

  1. Cloning Warehouse:
    git clone https://github.com/Clevrr-AI/Clevrr-Computer.git
    cd Clevrr-Computer
  1. Install the dependencies:
    pip install -r requirements.txt
    
  2. Setting environment variables:
    commander-in-chief (military) .env_dev Rename the file to .envand add your API key and other configurations:

    AZURE_OPENAI_API_KEY=
    AZURE_OPENAI_ENDPOINT=
    AZURE_OPENAI_API_VERSION=
    AZURE_OPENAI_CHAT_DEPLOYMENT_NAME=
    GOOGLE_API_KEY=
    

Usage

  1. Run the application:
    python main.py
    

    By default, this will use the gemini model and enable floating UI.

  2. Optional Parameters:
    • Selecting a model: You can select a model by passing ---model parameter to specify the model to use. The acceptable parameters are gemini maybe openaiThe
      python main.py --model openai
      
    • Floating UI: By default, the TKinter UI will float and stay at the top of the screen. You can float the UI by passing the --float-ui denote 0 to disable this behavior.
      python main.py --float-ui 0
      

Functional operation flow

Clevrr Computer works through a multimodal AI agent running in the background with a continuous screenshot mechanism to understand what's on the screen and perform the appropriate actions using the PyAutoGUI library. The agent creates a chain of thought based on the task and uses the get_screen_info tool to get screen information. The tool takes a screenshot of the current screen and uses a grid to mark the true coordinates of the screen. The agent then uses a multimodal LLM to understand the screen content and give answers based on the agent's questions. Thought Chain Support get_screen_info tool and the PythonREPLAst tool, which is designed to perform operations using the PyAutoGUI library.

caveat

  • Use dedicated virtual machines or containers that run with minimal privileges to prevent direct system attacks or accidents.
  • Avoid providing sensitive data, such as account login information, to the model to prevent information leakage.
  • Restrict Internet access to only whitelisted domains to minimize exposure to malicious content.
  • Manual confirmation is required for decisions that may have a practical impact and for tasks that require confirmation, such as accepting cookies, executing financial transactions, or agreeing to terms of service.
May not be reproduced without permission:Chief AI Sharing Circle " Clevrr Computer: Automating Desktop Manipulation Intelligence with the PyAutoGUI Library

Chief AI Sharing Circle

Chief AI Sharing Circle specializes in AI learning, providing comprehensive AI learning content, AI tools and hands-on guidance. Our goal is to help users master AI technology and explore the unlimited potential of AI together through high-quality content and practical experience sharing. Whether you are an AI beginner or a senior expert, this is the ideal place for you to gain knowledge, improve your skills and realize innovation.

Contact Us
en_USEnglish