AppAgent: automated smartphone operation using multimodal intelligences
Comprehensive Introduction AppAgent is a Large Language Model (LLM)-based multimodal agent framework designed to manipulate smartphone applications. The framework mimics human interactions such as taps and swipes through a simplified manipulation space, thus eliminating the need for system back-end access and expanding its use in different applications...