TEN Agent: a real-time multimodal intelligent body framework that supports latency-free voice and video dialog with intelligent bodies.
Comprehensive Introduction TEN Agent is an open source real-time multimodal intelligences framework that integrates the OpenAI Realtime API and RTC to support a variety of functions such as weather querying, web searching, visual processing and RAG (Retrieval Augmented Generation). The framework aims to provide high ...