GLM-4.1V-Thinking - A Series of Open Source Vision-Language Models from Zhipu AI
GLM-4.1V-Thinking is an open source vision-language model from Zhipu AI, designed for complex cognitive tasks. It supports multimodal inputs, including images, videos, and documents. Based on the GLM-4V architecture, the model introduces a chain of thought...
ThinkSound - Audio Generation Model from Alibaba Tongyi
ThinkSound is presented as the first CoT (chain-of-thought) audio generation model, introduced by Alibaba Tongyi's speech team. The model generates sound effects that accurately match the video frames. By introducing CoT reasoning, it addresses the difficulty traditional techniques have in capturing on-screen dynamic details and spatial relationships.
Qwen-TTS - Speech Synthesis Model from Alibaba Tongyi Qianwen
Qwen-TTS is an advanced speech synthesis model introduced by Alibaba Tongyi. The model efficiently converts text into natural, fluent speech and supports multiple languages and dialects, such as Mandarin, English, and the Beijing dialect, to meet the needs of different regions and scenarios. Trained on a massive corpus, the model produces high-quality speech output, with prosody...
MultiAgentPPT - Open Source AI Presentation Generation System
MultiAgentPPT is an open source multi-agent AI presentation generation system. Users only need to enter a topic; through multi-agent collaboration, the system automatically completes outline generation, topic splitting, parallel research, content summarization, and other steps to quickly generate a high-quality PPT....
Ovis-U1 - Unified Multimodal AI Model from Alibaba
Ovis-U1 is a unified multimodal model introduced by the Ovis team at Alibaba Group, with 3 billion parameters. The model has three core capabilities: multimodal understanding, text-to-image generation, and image editing. With an advanced architecture and a collaborative, unified training approach, the model supports high-fidelity image...
Doppl - AI virtual try-on app from Google
Doppl is an AI virtual try-on app launched by Google. After a user uploads a full-body photo, the app can "dress" a digital version of the user in clothing taken from a picture or screenshot, and can turn the static image into an AI-generated video, giving users a more realistic sense of how the clothing looks when worn.
Xunlei MCP - AI automatic download service launched by Xunlei
Xunlei MCP is an AI-based automatic download service launched by Xunlei. In AI applications that support the service, users describe what they want to download by voice or text, and the AI automatically searches for online resources and starts the download. Xunlei MCP supports the PC version of Xunlei and Xunlei for NAS, breaking from the traditional download mode, allowing...
Kapi Bookkeeping - Intelligent AI Bookkeeping App from SenseTime
Kapi Bookkeeping is an intelligent AI bookkeeping app launched by SenseTime. Its core function is automatic bookkeeping: it automatically recognizes amounts and categories and supports voice input, making bookkeeping quick and convenient. Kapi Bookkeeping can also analyze billing data and regularly push personalized spending summaries and financial advice to help users better...
Gemini CLI - Google Open Source Programming Agent
Gemini CLI is Google's open source AI programming tool, which brings the Gemini large model into the developer's terminal to provide powerful AI capabilities. The tool understands code, manipulates files, executes commands, and dynamically troubleshoots problems to help developers efficiently write...
AnimaTensor - An anime-style image generation model from TensorArt and others
AnimaTensor is an anime-style image generation model from the CagliostroLab team and TensorArt, based on a V-Prediction technique that optimizes noise scheduling by predicting the "velocity" of the image generation process....