HunyuanVideo-Avatar - Tencent hybrid open source voice digital human model
HunyuanVideo-Avatar is an advanced voice digital human model jointly launched by Tencent Mixed Yuan team and Tencent Music Tianqin Lab. The model is based on the innovative multimodal diffusion Transformer architecture, which generates a natural expression based on the user's uploaded character image and audio...
HeyGen - AI Digital Human Video Creation Platform with Multi-Language Translation and Dubbing Support
HeyGen is an AI-driven digital human video creation platform that supports a streamlined video production process, allowing users to quickly generate professional-level digital human videos. The platform is based on advanced AI technology, giving users full control over the image and voice of digital people, providing a rich library of material, including diverse background...
Keevx - AI Digital Human Video Creation Platform, One-Click Script and Video Generation
Keevx is a platform for AI digital human video creation, mainly for overseas SMEs and individual creators. Based on AI intelligent script generation and translation functions, with high-quality public portraits and templates, it provides users with one-click digital human marketing video generation services.
Make - AI's no-code automated workflow building platform
Make is an AI-driven no-code automation platform that helps organizations improve efficiency and innovation based on automated processes. The platform offers more than 2,000 pre-built apps that support a variety of business scenarios, such as marketing, sales, finance, etc. Make's core features include no-code visual process creation, AI...
MiMo-VL - Xiaomi's open source multimodal modeling
MiMo-VL is Xiaomi's open source multimodal grand model, consisting of a visual coder, a cross-modal projection layer and a language model. The visual coder is based on Qwen2.5-ViT, which supports native resolution inputs and preserves more details; the language model is Xiaomi's self-developed MiMo-7B, which is designed for complex projections...
Olovka AI - AI academic writing assistance platform that provides accurate writing advice and assistance
Olovka AI is an AI academic writing assistance platform for students, which provides accurate writing advice and assistance based on students' academic level, field of specialization and type of paper. Based on intelligent algorithms, Olovka AI helps students quickly write high-quality academic papers that will be...
Fish Audio - AI Speech Synthesis and Sound Cloning Tool
Fish Audio is a powerful generative AI speech synthesis tool that supports text-to-speech (TTS) and voice cloning. Users only need to input text, the tool supports the conversion to natural and smooth voice, the platform provides multiple languages and voice styles to choose from, to meet different scenarios and user...
SignGemma - Sign Language Translation Model from Google DeepMind
SignGemma is the world's most powerful sign language interpreting AI model introduced by Google DeepMind, supporting the accurate translation of American Sign Language (ASL) into English text. The model is based on multimodal training, combining visual and textual data to capture sign language actions in real time and quickly translate them into text...
FLUX.1 Kontext - Image Generation and Editing Model from Black Forest
FLUX.1 Kontext is an image generation and editing model from Black Forest Labs that provides context-aware image processing techniques. The model understands responses to text and image cues, performs tasks such as object modification, style conversion, and background replacement, while maintaining the corner...
WebAgent - Ali Tongyi Open Source Autonomous Search AI Agent
WebAgent is an open source autonomous search AI Agent from Alibaba's Tongyi Labs, with powerful end-to-end autonomous information retrieval and multi-step reasoning capabilities.WebAgent can actively perceive, decide and act in the network environment like a human being, and is widely used in academic research, business decision...








