InteriorGS - 3D Gaussian Semantic Dataset Launched by Manycore Tech
InteriorGS is a high-quality 3D Gaussian semantic dataset introduced by Manycore Tech. The dataset contains 1,000 3D scenes covering more than 80 types of indoor environments, such as homes, convenience stores, wedding halls and museums. It includes more than 554,000 object instances across 755 categories...
DragonV2.1 - Zero-Shot Speech Synthesis Model from Microsoft
DragonV2.1 is an advanced zero-shot text-to-speech (TTS) model from Microsoft. Based on the Transformer architecture, the model supports multiple languages and zero-shot voice cloning, and generates natural, expressive speech from a voice prompt of only 5 to 90 seconds.
ScreenCoder - Open Source Tool for Generating Front-End Code from UI Screenshots
ScreenCoder is an open source intelligent tool that quickly converts UI design screenshots into high-quality HTML/CSS code. The tool is built on a modular multi-agent architecture and combines visual understanding, layout planning and code synthesis techniques to support the generation of high-precision and semantic front-end ...
Kimi K2 High-Speed Edition - High-Speed Version of the Kimi K2 Language Model Released by Moonshot AI
Kimi K2 High-Speed Edition (kimi-k2-turbo-preview) is a high-performance language model introduced by Moonshot AI, the company behind Kimi. The model is optimized on the basis of Kimi K2; its output speed is greatly increased, and it can generate 40 tokens per second...
dots.ocr - Open Source Multilingual Document Parsing Model from Xiaohongshu hi lab
dots.ocr is a multilingual document parsing model open-sourced by Xiaohongshu hi lab. It is based on a 1.7-billion-parameter vision-language model (VLM) and efficiently performs document layout detection and content recognition while maintaining a good reading order.
HYPIR - A New Large Model for Image Restoration from a Chinese Academy of Sciences Team
HYPIR is a large model for image restoration introduced by Dong Chao's team at the Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences. The model combines the score prior of diffusion models with generative adversarial networks to achieve efficient, high-quality image restoration. HYPIR can quickly restore old photos and improve resolution while keeping text clear...
FLUX.1 Krea [dev] - Text-to-Image Model Jointly Released by Black Forest Labs and Krea AI
FLUX.1 Krea [dev] is a text-to-image model from Black Forest Labs and Krea AI. The model generates high-quality, photorealistic images from input text descriptions, with a unique aesthetic style that avoids traditional A...
Qwen3-Coder-Flash - Open Source High-Performance Programming Model from Alibaba's Tongyi
Qwen3-Coder-Flash is a high-performance programming model introduced by Alibaba's Tongyi Qianwen (Qwen) team. It has excellent agentic programming and tool-calling capabilities and is good at handling complex programming tasks. The model supports long-context understanding of 256K tokens, and can scale to 1M ...
Wide Research - Multi-Agent Collaboration Feature Introduced on the Manus Platform
Wide Research is a feature of the Manus platform designed to handle complex, large-scale tasks. It supports hundreds of general-purpose agents working simultaneously through system-level parallel processing mechanisms and agent collaboration protocols.
Seed Diffusion - Latest Diffusion Language Model from ByteDance
Seed Diffusion is an experimental diffusion language model introduced by ByteDance that handles code generation tasks. The model uses techniques such as two-stage diffusion training, constrained-order learning, and enhanced efficient parallel decoding, which raise inference speed significantly to 2146 tokens/s, which is faster than...