MuseSteamer - Large Video Generation Model Launched by Baidu
MuseSteamer is a large multimodal video generation model launched by Baidu. It quickly generates high-quality dynamic video from a text description or an image supplied by the user, and it comes in several versions that differ in resolution and features to suit different creative scenarios.
Painting Thinking - AI Video Generation Platform Launched by Baidu
Painting Thinking is an AI video generation platform launched by Baidu that helps users easily create personalized videos. It offers an intuitive interface and powerful tools, and its inspiration recommendation feature gives creators ideas to work from. It also supports a one-click "make the same" operation that quickly generates a similar video, simplifying the creative process.
GLM-4.1V-Thinking - A Series of Open Source Visual Language Models from Zhipu AI
GLM-4.1V-Thinking is an open source visual language model introduced by Zhipu AI, designed for complex cognitive tasks. GLM-4.1V-Thinking supports multimodal inputs, covering images, videos and documents. Based on the GLM-4V architecture, the model introduces a chain of thought...
ThinkSound - Audio Generation Model Launched by Ali Tongyi
ThinkSound is the first CoT (chain-of-thought) audio generation model, introduced by Ali Tongyi's speech team. The model generates sound effects that accurately match video footage. By introducing CoT reasoning, it addresses a weakness of traditional techniques, which struggle to capture the dynamic details and spatial relationships in a scene.
Qwen-TTS - Speech Synthesis Model Launched by Ali Tongyi Qianwen
Qwen-TTS is an advanced speech synthesis model introduced by Ali Tongyi. The model efficiently converts text into natural, fluent speech and supports multiple languages and dialects, such as Mandarin, English, and the Beijing dialect, to meet the needs of different regions and scenarios. Trained on a massive corpus, the model produces high-quality speech output, rhyming...
MultiAgentPPT - Open Source AI Presentation Generation System
MultiAgentPPT is an open source multi-agent AI presentation generation system. Users only need to enter a topic; the system relies on multi-agent collaboration to automatically complete outline generation, topic splitting, parallel research, content summarization, and other steps to quickly generate high-quality PPT....
Ovis-U1 - Multimodal Unified AI Model Introduced by Ali
Ovis-U1 is a multimodal unified model introduced by the Ovis team of Alibaba Group with a parameter scale of 3 billion. The model is equipped with three core capabilities: multimodal understanding, text-to-image generation, and image editing. With advanced architectural design and collaborative and unified training methods, the model supports the realization of high-fidelity image...
Doppl - AI Virtual Try-On App from Google
Doppl is an AI virtual try-on application launched by Google. After the user uploads a full-body photo, the application can "dress" a digital version of the user in clothing taken from a picture or screenshot. It can also turn the static image into an AI-generated video, giving users a more realistic sense of how the clothing looks when worn.
Xunlei MCP - AI Automatic Download Service Launched by Xunlei
Xunlei MCP is an AI-based automatic download service launched by Xunlei. In an AI application that supports the service, users state what they want to download by voice or text, and the AI automatically searches for network resources and starts the download. Xunlei MCP supports the PC version of Xunlei and NAS Xunlei, breaking the traditional download mode, allowing...
Kapi Bookkeeping - Intelligent AI Bookkeeping App by SenseTime
Kapi Bookkeeping is an intelligent AI bookkeeping app launched by SenseTime. Automatic bookkeeping is its core function: the app automatically recognizes amounts and categories and supports voice input, making bookkeeping easy and convenient. Kapi Bookkeeping can intelligently analyze billing data and regularly push personalized spending summaries and financial advice to help users better...