50 Articles
Tags :AI digital people Page 2
General Introduction Sonic is an innovative platform focused on global audio perception, designed to generate vivid portrait animations driven by audio. Developed by a team of researchers from Tencent and Zhejiang University, the platform utilizes audio information to control facial expressions and head movements to generate natural and smooth animated videos.Sonic ...
Comprehensive Introduction YUE Portrait EMO is a high-quality portrait dynamic video generation tool provided by AliCloud's big model service platform Hundred Refine (Model Studio). The tool generates realistic portrait dynamic video based on portrait images and human voice audio files. YUE Portrait EMO contains two independent model...
General Introduction DH_live is a real-time live digital human project based on sample less learning, aiming to provide users with a smooth and interactive live streaming experience. The project supports NVIDIA 30 and 40 series graphics cards and is capable of running in real-time at 25+ fps. Users can create and use digital in simple steps...
Comprehensive introduction Ruying AI Video Synthesis is an AI video generation platform launched by Shanghai Yuyi Technology Co. Relying on "SenseNova", a large modeling capability of SenseNova, the platform provides a variety of digital human images and timbres to choose from, so that users can generate realistic AI videos by simply inputting the text. This...
General Introduction Cicada is a platform focusing on digital human video creation, utilizing AI technology to simplify the video production process. Users can choose different digital human images, input copy and generate videos with multi-language voiceovers. The platform provides a rich library of templates and materials, which are suitable for a variety of fields such as advertising and marketing, education and training...
General Introduction EchoMimic is an open source project designed to generate realistic portrait animations through audio-driven generation. Developed by Ant Group's Terminal Technologies division, the project utilizes editable marker point conditions that combine audio and facial marker points to generate dynamic portrait videos.EchoMimic is available on multiple public datasets...
Comprehensive Introduction VideoChat is a real-time voice interaction digital human project based on open source technology, supporting end-to-end voice scheme (GLM-4-Voice - THG) and cascade scheme (ASR-LLM-TTS-THG). The project allows users to customize the image and timbre of the digital human, and supports timbre cloning and lip synchronization...
General Introduction Hallo2 is an open source project jointly developed by Fudan University and Baidu to generate high-resolution portrait animations through audio-driven generation. The project utilizes advanced Generative Adversarial Networks (GAN) and time alignment techniques to achieve 4K resolution and up to 1 hour long video generation.Hallo2 also supports...
General Introduction Ultralight Digital Human is an open source project to develop an ultra-lightweight digital human model that can run in real time on mobile devices. The project by optimizing the algorithms and model structure to achieve smooth operation on mobile devices , suitable for social applications, games and virtual...
General Introduction TalkingAvatar is a leading AI avatar platform that provides a complete AI digital person solution. Offering users a revolutionary way to create, edit and personalize video content. With advanced AI technology, users can easily rewrite videos, clone voices, synchronize lips, and create custom...