AI Personal Learning
and practical guidance
豆包Marscode1
Total 61 articles

Tagged with: ai digital people Page 2

AIGCPanel:开源克隆数字人整合系统,一键部署免费数字人客户端-首席AI分享圈

AIGCPanel: open source clone of the digital man integration system, one-click deployment of free digital man client

Comprehensive Introduction AigcPanel is a one-stop AI digital human production system for all users, developed with electron+vue3+typescript technology stack, supporting one-click deployment on Windows systems. The system is designed to be user-friendly as the core, even users with a weak technical foundation can easily master it. Main features ...

Sonic:音频驱动肖像图片生成面部表情生动的数字人口播视频-首席AI分享圈

Sonic: Audio-driven portrait images generate digital demo videos with vivid facial expressions

General Introduction Sonic is an innovative platform focused on global audio perception, designed to generate vivid portrait animations driven by audio. Developed by a team of researchers from Tencent and Zhejiang University, the platform utilizes audio information to control facial expressions and head movements to generate natural and smooth animated videos.Sonic ...

蝉镜:数字人视频创作平台,拥有数百款数字人模板以及克隆专属数字人形象(付费)-首席AI分享圈

Cicada Mirror: digital human video creation platform with hundreds of digital human templates and cloning of exclusive digital human images (paid)

General Introduction Cicada is a platform focusing on digital human video creation, utilizing AI technology to simplify the video production process. Users can choose different digital human images, input copy and generate videos with multi-language voiceovers. The platform provides a rich library of templates and materials, which are suitable for a variety of fields such as advertising and marketing, education and training...

EchoMimic:音频驱动人像照片生成说话视频(EchoMimicV2加速版安装包)-首席AI分享圈

EchoMimic: Audio-driven portrait photos to generate talking videos (EchoMimicV2 accelerated installer)

General Introduction EchoMimic is an open source project designed to generate realistic portrait animations through audio-driven generation. Developed by Ant Group's Terminal Technologies division, the project utilizes editable marker point conditions that combine audio and facial marker points to generate dynamic portrait videos.EchoMimic is available on multiple public datasets...

VideoChat:自定义形象和音色克隆的实时语音交互数字人,支持端到端语音方案和级联方案-首席AI分享圈

VideoChat: real-time voice-interactive digital person with customized image and tone cloning, supporting end-to-end voice solutions and cascading solutions

Comprehensive Introduction VideoChat is a real-time voice interaction digital human project based on open source technology, supporting end-to-end voice scheme (GLM-4-Voice - THG) and cascade scheme (ASR-LLM-TTS-THG). The project allows users to customize the image and timbre of the digital human, and supports timbre cloning and lip synchronization...

Hallo2:音频驱动生成口型/表情同步的肖像视频(Windows一键安装)-首席AI分享圈

Hallo2: audio-driven generation of lip-synchronized/expression-synchronized portrait videos (Windows one-click installation)

General Introduction Hallo2 is an open source project jointly developed by Fudan University and Baidu to generate high-resolution portrait animations through audio-driven generation. The project utilizes advanced Generative Adversarial Networks (GAN) and time alignment techniques to achieve 4K resolution and up to 1 hour long video generation.Hallo2 also supports...

UltraLight Digital Human:开源端侧实时运行的超轻量级数字人,附一键安装包-首席AI分享圈

UltraLight Digital Human: Open source end-side real-time running ultra-lightweight digital human with one-click installation package

General Introduction Ultralight Digital Human is an open source project to develop an ultra-lightweight digital human model that can run in real time on mobile devices. The project by optimizing the algorithms and model structure to achieve smooth operation on mobile devices , suitable for social applications, games and virtual...

RenderNet:锁定面部特征,创建人物一致性的图像、视频运镜到口播视频-首席AI分享圈

RenderNet: targeting facial features to create character-consistent images, video dribbling to spoken word videos

General Introduction RenderNet is a generator tool that focuses on creating images and videos that maintain character consistency based on artificial intelligence technology. Users can generate character-driven images and videos with simple text prompts. The tool supports a wide range of image and video generation options, and users can make their own...

TANGO:语音生成协调手势人像视频的工具,全身像数字人-首席AI分享圈

TANGO: a tool for voice-generated coordinated gesture portrait videos with full-body digital humans

General Introduction TANGO (Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation) is an open source collaborative speech gesture video generation framework jointly developed by the University of Tokyo and CyberAgent AI Labs An open source collaborative speech gesture video generation framework jointly developed by the University of Tokyo and CyberAgent AI Lab. The ...

即创:依托巨量引擎生成电商营销物料,快速发布适合抖音推广的商品讲解视频-首席AI分享圈

That is to create: relying on a huge engine to generate e-commerce marketing materials, rapid release of products suitable for jittery voice promotion of explaining the video

Introduction of Instant Creation Instant Creation is a one-stop intelligent creative production and management platform launched by Jitterbug, aiming to provide efficient, convenient and professional content creation services for creators. The platform integrates a variety of AI functions, such as intelligent filming, AI video scripts, graphic tools, merchandise card tools, AI live backgrounds, AI live scripts...

SadTalker:让照片说话|嘴型同步音频|合成口型同步视频|免费数字人-首席AI分享圈

SadTalker: Make Photos Talk | Mouth Synchronized Audio | Synthesized Mouth Synchronized Video | Free Digital People

General Introduction SadTalker is an open source tool that combines single still portrait photos and audio files to create realistic talking head videos for a wide range of scenarios such as personalized messages, educational content, and more. The revolutionary use of 3D modeling technologies such as ExpNet and PoseVAE excel in capturing the subtle facets...

MuseV+Muse Talk:完整数字人视频生成框架|人像转视频|姿态转视频|唇形同步-首席AI分享圈

MuseV+Muse Talk: Complete Digital Human Video Generation Framework | Portrait to Video | Pose to Video | Lip Synchronization

General Introduction MuseV is a public project on GitHub that aims to enable the generation of avatar videos of unlimited length and high fidelity. It is based on diffusion technology and offers Image2Video, Text2Image2Video, Video2Video and many other features. Provides model structure, use cases, quick start...

en_USEnglish