AI Personal Learning and Practical Guidance
Total 44 articles

Tag: desktop automation agents (page 2)

TankWork: an agent that operates computers via voice and text and provides real-time voice feedback

General Introduction: TankWork is an open source desktop agent framework that lets AI perceive and control your computer through computer vision and system-level interaction. Agents can control the computer directly through voice and text commands, process real-time screen content, and provide continuous audiovisual feedback and manipulation...

UI-TARS Desktop: a desktop agent application for controlling computers using natural language

General Introduction: UI-TARS Desktop is a graphical interface agent application based on UI-TARS, a vision-language model developed by ByteDance. The application lets users control computers through natural language for more intuitive and efficient human-computer interaction. UI-TARS Desktop supports cross-platform operation, both...

Eko: build agent workflows in natural language for desktop and browser automation

General Introduction: Eko is a production-grade JavaScript framework for building efficient agent workflows from natural language descriptions. It is designed to let developers automate everyday tasks with AI without extensive programming. Eko provides a unified interface that supports the use of AI in counting...

Browser Use Web UI: an open source framework that runs AI agents to browse the web and operate web pages automatically

Comprehensive Introduction: Browser Use Web UI is an open source project that gives AI agents a graphical interface tool for browser interaction. The project is built on the browser-use core framework and uses Gradio to provide a user-friendly web interface, making it easy for AI agents to ...

NeoAI: an open source project that lets AI take over remote operation of computers and control them using natural language

General Introduction: NeoAI is an open source AI assistant that lets users control and manage their computers through natural language conversations. Without writing any code, users can find files, automate tasks, manage devices, and more through everyday conversation. NeoAI supports Window...

CogAgent: Zhipu's open source vision-language model for automating graphical interface operations

Comprehensive Introduction: CogAgent is an open source vision-language model developed by the Tsinghua University Data Mining Research Group (THUDM) that aims to automate cross-platform graphical user interface (GUI) operations. The model is based on CogVLM (GLM-4V-9B), supports bilingual interaction in English and Chinese, and is able to automate GUI operations through screenshots and natural...

Browser-Use: build intelligent web automation tools that let AI agents operate browsers with ease

Comprehensive Introduction: Browser-Use is an open source web automation tool designed to let large language models (LLMs) interact naturally with websites. It provides a flexible framework that supports a wide range of mainstream language models, including GPT-4, Claude, and others. The tool's most notable feature...

Project Mariner: browser automation, a research prototype exploring the future of human-computer interaction (unreleased)

General Introduction: Project Mariner is a research prototype launched by Google DeepMind to explore the future of human-computer interaction. The project uses the multimodal understanding and reasoning capabilities of Gemini 2.0 to accomplish a variety of tasks through browser automation. Project Mariner is able to reason...

Dia Browser: an intelligent browsing experience with integrated AI tools that automate tasks in the browser (not yet live)

General Description: Dia Browser is a new smart browser developed by The Browser Company that aims to give users a more efficient browsing experience by integrating advanced AI tools. The browser is expected to be officially released in early 2025, with key features including intelligent writing assistance, automated task processing and...
