AI Sharing Circle

Daily sharing of the latest AI products, projects, frameworks, paper interpretations, etc.~
InteriorGS - 群核科技推出的3D高斯语义数据集

InteriorGS - 3D Gaussian Semantic Dataset launched by Qunar Technologies

InteriorGS is a high-quality 3D Gaussian semantic dataset introduced by Qunar Technology. The dataset contains 1,000 3D scenes covering more than 80 indoor environments such as homes, convenience stores, wedding halls and museums. The dataset has more than 554,000 object instances in 755 categories...
2mos ago
019.5K
DragonV2.1 - 微软推出的零样本语音合成模型

DragonV2.1 - Zero-Sample Speech Synthesis Model from Microsoft

DragonV2.1 is an advanced zero-sample text-to-speech (TTS) model from Microsoft. Based on the Transformer architecture, the model supports multi-language and zero-sample speech cloning, and generates natural, expressive speech with only 5-90 seconds of voice prompts.
2mos ago
020.4K
ScreenCoder – 开源的UI截图生成前端代码工具

ScreenCoder - Open Source UI Screenshot Generation Front-End Code Tool

ScreenCoder is an open source intelligent tool to quickly convert UI design screenshots into high quality HTML/CSS code. Tools based on modular multi-intelligence architecture , combined with visual understanding , layout planning and code synthesis techniques to support the generation of high-precision and semantic front-end ...
2mos ago
021.1K
Kimi K2 高速版 - 月之暗面Kimi推出的高速版语言模型

Kimi K2 High-Speed Edition - High-Speed Edition of the language model released by Dark Side of the Moon Kimi

Kimi K2 High Speed Edition (kimi-k2-turbo-preview) is a high-performance language model introduced by Kimi, the Dark Side of the Moon. The model is optimized on the basis of Kimi K2, the output speed is greatly increased, and 40 Token per second can be generated...
2mos ago
025.5K
dots.ocr - 小红书hi lab推出的开源多语言文档解析模型

dots.ocr - the open source multilingual document parsing model launched by the Little Red Book hi lab

dots.ocr is a multilingual document parsing model open-sourced by Xiaohongshu hi lab, based on a 1.7 billion-parameter visual language model (VLM), which can efficiently perform document layout detection and content recognition while maintaining a good reading order.
2mos ago
029.2K
HYPIR - 中国科学院团队推出的新型图像复原大模型

HYPIR - A new large model for image restoration introduced by a team from the Chinese Academy of Sciences

HYPIR is a large model for image restoration introduced by Dong Chao's team at Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences. The model combines the fractional prior of diffusion modeling with adversarial generative networks to achieve efficient, high-quality image restoration.HYPIR can quickly restore old photos and improve resolution while keeping text clear...
2mos ago
023.7K
FLUX.1 Krea [dev] - 黑森林和Krea AI联合推出的文生图模型

FLUX.1 Krea [dev] - Black Forest and Krea AI joint venture on Vincennes graph models

FLUX.1 Krea [dev] is a text-generated graph model from Black Forest Labs and Krea AI. The model is capable of generating high-quality, photorealistic images based on input text descriptions with a unique aesthetic style that avoids traditional A...
2mos ago
023.1K
Qwen3-Coder-Flash - 阿里通义推出的开源高性能编程模型

Qwen3-Coder-Flash - an open source high performance programming model from Ali Tongyi

Qwen3-Coder-Flash is a high-performance programming model introduced by Ali Tongyi Thousand Questions team, which has excellent agent-based programming and tool invocation capabilities, and is good at handling complex programming tasks. The model supports 256K tokens of long context understanding, and can scale to 1M ...
2mos ago
019K
Wide Research - Manus平台推出的多智能体协同功能

Wide Research - Multi-Intelligence Collaboration Introduced on the Manus Platform

Wide Research is a powerful feature of the Manus platform designed to handle complex and large-scale tasks. The platform supports hundreds of general-purpose intelligences working simultaneously through system-level parallel processing mechanisms and intelligence collaboration protocols.
2mos ago
018.5K
Seed Diffusion - 字节跳动最新推出的扩散语言模型

Seed Diffusion - the newest diffusion language model from ByteHopper

Seed Diffusion is an experimental diffusion language model introduced by ByteHop that handles code generation tasks. The model is based on techniques such as two-stage diffusion training, constrained sequential learning, and enhanced efficient parallel decoding, which significantly improves inference speed to 2146 tokens/s, which is faster than...
2mos ago
021.5K