AI Sharing Circle

Daily sharing of the latest AI products, projects, frameworks, paper interpretations, etc.~
GLM-4.1V-Thinking - 智谱AI推出的开源视觉语言模型系列

GLM-4.1V-Thinking - A Series of Open Source Visual Language Models from Smart Spectrum AI

GLM-4.1V-Thinking is an open source visual language model introduced by Smart Spectrum AI, designed for complex cognitive tasks.GLM-4.1V-Thinking supports multimodal inputs, covering images, videos and documents. Based on the GLM-4V architecture, the model introduces a chain of thought...
1mos ago
02.1K
ThinkSound - 阿里通义推出的音频生成模型

ThinkSound - Audio Generation Model launched by Ali Tongyi

ThinkSound is the first CoT (Chain Thinking) audio generation model introduced by Ali Tongyi's speech team. The model can generate accurately matched sound effects for video images, based on the introduction of CoT reasoning, to solve the problem of traditional technology is difficult to capture the dynamic details of the screen and spatial relationships.
1mos ago
01.6K
Qwen-TTS - 阿里通义千问推出的语音合成模型

Qwen-TTS - Speech Synthesis Model launched by Ali Tongyi Qianqian

Qwen-TTS is an advanced speech synthesis model introduced by Ali Tongyi. The model can efficiently convert text into natural and smooth speech, supporting multiple languages and dialects, such as Mandarin, English, Beijing dialect, etc., to meet the needs of different regions and scenes. Relying on massive corpus training, the model's speech output is of high quality, rhyming...
1mos ago
02K
MultiAgentPPT - 开源的AI演示文稿生成系统

MultiAgentPPT - Open Source AI Presentation Generation System

MultiAgentPPT is an open source multi-intelligent AI presentation generation system. Users only need to enter the subject , the system is based on multi-intelligent collaboration , automatically complete the outline generation , subject splitting , parallel research and content summarization and other steps to quickly generate high-quality PPT....
1mos ago
02.3K
Ovis-U1 - 阿里推出的多模态统一AI模型

Ovis-U1 - Multimodal Unified AI Model Introduced by Ali

Ovis-U1 is a multimodal unified model introduced by the Ovis team of Alibaba Group with a parameter scale of 3 billion. The model is equipped with three core capabilities: multimodal understanding, text-to-image generation, and image editing. With advanced architectural design and collaborative and unified training methods, the model supports the realization of high-fidelity image...
1mos ago
02K
Doppl - 谷歌推出的AI虚拟试衣应用

Doppl - AI virtual fitting app from Google

Doppl is an AI virtual fitting application launched by Google. After the user uploads a full body photo, the application supports the clothing picture or screenshot "wear" in the digital version of their own body, and can be converted from static pictures to AI-generated video, so that users can more truly feel the effect of clothing on the body.
2mos ago
01.7K
迅雷MCP - 迅雷推出的AI自动下载服务

Xunlei MCP - AI automatic download service launched by Xunlei

Xunlei MCP is launched by Xunlei, an automatic download service based on AI technology. Users in the AI application that supports the service, with voice or text input download demand, AI can automatically search for network resources and start downloading. Xunlei MCP supports PC version of Xunlei and NAS Xunlei, breaking the traditional download mode, allowing...
2mos ago
01.6K
咔皮记账 - 商汤科技推出的智能AI记账应用

Kapi Bookkeeping - Intelligent AI Bookkeeping App by ShangTech

Kapi Bookkeeping is an intelligent AI bookkeeping app launched by Shangtang Technology. The application takes automatic bookkeeping as its core function, automatically recognizes amounts and classifications, and supports voice input, making bookkeeping easy and convenient. Kapi Bookkeeping can intelligently analyze billing data and regularly push personalized consumption summaries and financial advice to help users better...
2mos ago
01.9K
Gemini CLI - 谷歌开源的编程Agent

Gemini CLI - Google Open Source Programming Agent

Gemini CLI is Google's open source AI programming tool based on incorporating the Gemini Big Model into the developer's endpoint to provide developers with powerful AI capabilities. The tool understands code, manipulates files, executes commands, and dynamically troubleshoots problems to help developers efficiently write generation...
2mos ago
01.4K
AnimaTensor - 吐司AI等机构推出的二次元图像生成模型

AnimaTensor - A quadratic image generation model from Toast AI and others

AnimaTensor is a quadratic image generation model from the CagliostroLab team and TensorArt, based on an innovative V-Prediction technique that optimizes noise scheduling by predicting the "speed" of the image generation process....
2mos ago
01.4K