AI Sharing Circle

Daily updates on the latest AI products, projects, frameworks, and paper walkthroughs.
Keevx - AI Digital Human Video Creation Platform with One-Click Script and Video Generation

Keevx is an AI digital human video creation platform aimed mainly at overseas SMEs and individual creators. It combines AI script generation and translation with high-quality public avatars and templates to offer one-click generation of digital human marketing videos.
8mos ago
Make - AI No-Code Platform for Building Automated Workflows

Make is an AI-driven no-code automation platform that helps organizations work more efficiently and innovate through automated processes. The platform offers more than 2,000 pre-built apps covering business scenarios such as marketing, sales, and finance. Make's core features include no-code visual process creation, AI...
8mos ago
MiMo-VL - Xiaomi's Open-Source Multimodal Model

MiMo-VL is Xiaomi's open-source multimodal large model, consisting of a visual encoder, a cross-modal projection layer, and a language model. The visual encoder is based on Qwen2.5-ViT, which supports native-resolution inputs and preserves more detail; the language model is Xiaomi's self-developed MiMo-7B, which is designed for complex reasoning...
8mos ago
Olovka AI - AI Academic Writing Assistance Platform Offering Precise Writing Advice

Olovka AI is an AI academic writing assistance platform for students. It tailors its writing advice and assistance to each student's academic level, field of specialization, and type of paper. Using intelligent algorithms, Olovka AI helps students quickly write high-quality academic papers that will be...
8mos ago
Fish Audio - AI Speech Synthesis and Voice Cloning Tool

Fish Audio is a generative AI speech synthesis tool that supports text-to-speech (TTS) and voice cloning. Users only need to enter text, and the tool converts it into natural, fluent speech. The platform offers multiple languages and voice styles to suit different scenarios and user...
8mos ago
SignGemma - Sign Language Translation Model from Google DeepMind

SignGemma is a sign language translation AI model from Google DeepMind, billed as the most powerful of its kind, that translates American Sign Language (ASL) into English text. The model is trained multimodally, combining visual and textual data to capture signing gestures in real time and quickly translate them into text...
8mos ago
FLUX.1 Kontext - Image Generation and Editing Model from Black Forest Labs

FLUX.1 Kontext is an image generation and editing model from Black Forest Labs that provides context-aware image processing. The model understands both text and image prompts and performs tasks such as object modification, style transfer, and background replacement, while maintaining the character...
8mos ago
WebAgent - Open-Source Autonomous Search AI Agent from Alibaba's Tongyi Lab

WebAgent is an open-source autonomous search AI agent from Alibaba's Tongyi Lab, with strong end-to-end autonomous information retrieval and multi-step reasoning capabilities. WebAgent can actively perceive, decide, and act in a web environment much as a human would, and is widely used in academic research, business decision...
8mos ago
Lingma IDE - AI-Native Development Environment from Tongyi Lingma

Lingma IDE is the AI-native integrated development environment (IDE) launched by Tongyi Lingma. It is deeply adapted to the Qwen3 large model and has a powerful programming agent mode that autonomously completes tasks such as project awareness, code retrieval, and terminal operations. It supports MCP tools and integrates ModelScope MCP Plaza's 3...
8mos ago
BAGEL - Open-Source Multimodal Foundation Model from ByteDance

BAGEL is a multimodal foundation model open-sourced by ByteDance, with 14 billion parameters, of which 7 billion are active. The model is built on a Mixture-of-Transformer-Experts (MoT) architecture and uses two independent encoders to capture the pixel-level and semantic-level features of an image, supporting efficient processing of images, text, video...
8mos ago