AI Sharing Circle

Daily sharing of the latest AI products, projects, frameworks, paper interpretations, etc.~
HunyuanVideo-Avatar - 腾讯混元开源的语音数字人模型

HunyuanVideo-Avatar - Tencent hybrid open source voice digital human model

HunyuanVideo-Avatar is an advanced voice digital human model jointly launched by Tencent Mixed Yuan team and Tencent Music Tianqin Lab. The model is based on the innovative multimodal diffusion Transformer architecture, which generates a natural expression based on the user's uploaded character image and audio...
10mos ago
045.8K
HeyGen - AI 数字人视频创作平台,支持多语言翻译配音

HeyGen - AI Digital Human Video Creation Platform with Multi-Language Translation and Dubbing Support

HeyGen is an AI-driven digital human video creation platform that supports a streamlined video production process, allowing users to quickly generate professional-level digital human videos. The platform is based on advanced AI technology, giving users full control over the image and voice of digital people, providing a rich library of material, including diverse background...
10mos ago
043.9K
Keevx - AI 数字人视频创作平台,一键生成脚本和视频

Keevx - AI Digital Human Video Creation Platform, One-Click Script and Video Generation

Keevx is a platform for AI digital human video creation, mainly for overseas SMEs and individual creators. Based on AI intelligent script generation and translation functions, with high-quality public portraits and templates, it provides users with one-click digital human marketing video generation services.
10mos ago
052.8K
Make - AI无代码自动化工作流搭建平台

Make - AI's no-code automated workflow building platform

Make is an AI-driven no-code automation platform that helps organizations improve efficiency and innovation based on automated processes. The platform offers more than 2,000 pre-built apps that support a variety of business scenarios, such as marketing, sales, finance, etc. Make's core features include no-code visual process creation, AI...
10mos ago
046.2K
MiMo-VL - 小米开源的多模态模型

MiMo-VL - Xiaomi's open source multimodal modeling

MiMo-VL is Xiaomi's open source multimodal grand model, consisting of a visual coder, a cross-modal projection layer and a language model. The visual coder is based on Qwen2.5-ViT, which supports native resolution inputs and preserves more details; the language model is Xiaomi's self-developed MiMo-7B, which is designed for complex projections...
10mos ago
049.5K
Olovka AI - AI学术写作辅助平台,提供精准的写作建议和辅助

Olovka AI - AI academic writing assistance platform that provides accurate writing advice and assistance

Olovka AI is an AI academic writing assistance platform for students, which provides accurate writing advice and assistance based on students' academic level, field of specialization and type of paper. Based on intelligent algorithms, Olovka AI helps students quickly write high-quality academic papers that will be...
10mos ago
045K
Fish Audio - AI 语音合成与声音克隆工具

Fish Audio - AI Speech Synthesis and Sound Cloning Tool

Fish Audio is a powerful generative AI speech synthesis tool that supports text-to-speech (TTS) and voice cloning. Users only need to input text, the tool supports the conversion to natural and smooth voice, the platform provides multiple languages and voice styles to choose from, to meet different scenarios and user...
10mos ago
071K
SignGemma - 谷歌 DeepMind 推出的手语翻译模型

SignGemma - Sign Language Translation Model from Google DeepMind

SignGemma is the world's most powerful sign language interpreting AI model introduced by Google DeepMind, supporting the accurate translation of American Sign Language (ASL) into English text. The model is based on multimodal training, combining visual and textual data to capture sign language actions in real time and quickly translate them into text...
10mos ago
052K
FLUX.1 Kontext - 黑森林推出的图像生成与编辑模型

FLUX.1 Kontext - Image Generation and Editing Model from Black Forest

FLUX.1 Kontext is an image generation and editing model from Black Forest Labs that provides context-aware image processing techniques. The model understands responses to text and image cues, performs tasks such as object modification, style conversion, and background replacement, while maintaining the corner...
10mos ago
042.8K
WebAgent - 阿里通义开源的自主搜索AI Agent

WebAgent - Ali Tongyi Open Source Autonomous Search AI Agent

WebAgent is an open source autonomous search AI Agent from Alibaba's Tongyi Labs, with powerful end-to-end autonomous information retrieval and multi-step reasoning capabilities.WebAgent can actively perceive, decide and act in the network environment like a human being, and is widely used in academic research, business decision...
10mos ago
050.1K