AI Sharing Circle

AI is changing the world!
MAI-UI - General-purpose GUI agent foundation model open-sourced by Alibaba Tongyi Lab

MAI-UI is an open source general-purpose GUI agent foundation model from Alibaba Tongyi Lab, with four major capabilities: cross-application operation, fuzzy semantic understanding, proactive user interaction, and multi-step process coordination. It adopts a device-cloud collaboration architecture: a lightweight model resides on the device to handle daily tasks, and complex tasks can call the cloud big...
5mos ago
MiniMax M2.1 - MiniMax's open source coding and agent model

MiniMax M2.1 is MiniMax's open source coding and agent model, with 10 billion activated parameters and support for many major programming languages such as Rust, Java, Golang, C++, Kotlin, Objective-C, TypeS...
5mos ago
InstanceAssemble - Layout-controlled image generation technology open-sourced by Xiaohongshu and Fudan University

InstanceAssemble is a layout-controlled image generation technology jointly open-sourced by Xiaohongshu and Fudan University. Through its "Instance Assemble Attention" mechanism, it generates images accurately across layouts ranging from simple to complex and from sparse to dense. It adopts a two-stage cascade architecture that first generates the image background, and then one by one ...
5mos ago
Zen Browser - Open source AI web browser based on the Firefox engine

Zen Browser is an open source browser based on the Firefox engine that focuses on a simple, efficient browsing experience. Its core features are a vertical tab bar and workspace isolation. Thanks to its sidebar design, it can clearly display the full titles of 50+ tabs, and it supports multi-window split-screen browsing.
5mos ago
QwenLong-L1.5 - Long-context reasoning model open-sourced by Alibaba Tongyi Lab

QwenLong-L1.5 is an open source long-context reasoning model from Alibaba Tongyi Lab that focuses on complex reasoning problems over ultra-long contexts (e.g., 1M-4M tokens). Its core breakthrough lies in three major innovations in the post-training phase: through knowledge graph, SQL parsing and multi-agent...
5mos ago
Infographic - Infographic generation framework open-sourced by Alibaba's AntV team

Infographic is a new-generation open source framework from Alibaba's AntV team, built on G2 and Ant Design. It focuses on rapid generation of high-quality infographics and provides 30+ layout templates, 120+ preset themes, and AI-powered generation capabilities.
5mos ago
opcode - Open source graphical desktop application designed for Claude Code

opcode is an open source graphical desktop application designed for Claude Code, built by the developer winfunc with Tauri 2 + React 18 + Rust. It provides a visual interface for managing Claude Code projects and supports creating ...
5mos ago
TurboDiffusion - Video generation acceleration framework open-sourced by Shengshu Technology, Tsinghua University, and others

TurboDiffusion is a video generation acceleration framework jointly open-sourced by Tsinghua University, Shengshu Technology, and UC Berkeley. It can speed up video generation by 100-200 times while keeping picture quality nearly lossless. Through sparse linear attention, sampling-step distillation and 8-bit...
5mos ago
MedASR - Google's open source medical speech recognition model

MedASR is a 105-million-parameter medical speech recognition model open-sourced by Google. It is fine-tuned on a 5,000-hour de-identified clinical corpus, optimized for drug, dosage, and anatomical terminology, and includes a built-in 6-gram medical language model, with a word error rate of only 4.6 on the private radiology dataset RAD-DICT...
5mos ago
Fun-Audio-Chat-8B - End-to-end large speech interaction model open-sourced by Alibaba Tongyi

Fun-Audio-Chat-8B is an open source, 8-billion-parameter end-to-end large speech model from the Alibaba Tongyi team. It takes speech in and produces speech out directly, with no need to splice together ASR+LLM+TTS, and is fluent in both Chinese and English, with low latency and natural timbre. Using dual-resolution shared LLM with 25Hz...
5mos ago