MAI-UI - Ali Tongyi Labs Open Source Universal GUI Intelligent Body Base Model
MAI-UI is an open source generalized GUI intelligent body base model from Alibaba Tongyi Labs, with four major capabilities: cross-application operation, fuzzy semantic understanding, active user interaction and multi-step process coordination. Adopting end-cloud collaboration architecture, the lightweight model resides in the device to handle daily tasks, and complex tasks can call the cloud big...
MiniMax M2.1 - MiniMax open source coding and agent modeling
MiniMax M2.1 is MiniMax's open source coding and agent model with 10 billion activations and support for many major programming languages such as Rust, Java, Golang, C++, Kotlin, Objective-C, TypeS...
InstanceAssemble - Little Red Book and Fudan University open source layout control generation technology
InstanceAssemble is a layout control generation technology jointly open-sourced by Little Red Book and Fudan University, which realizes accurate image generation from simple to complex and from sparse to dense layout through the mechanism of "Instance Assemble Attention". Adopting a two-stage cascade architecture, Mr. Mr. into the image background, and then one by one ...
Zen Browser - Open source AI web browser based on Firefox kernel
Zen Browser is an open source browser based on the Firefox kernel, focusing on a simple and efficient browsing experience, with the core features of vertical tab bar and workspace isolation. With the sidebar design, it can clearly display the full titles of 50+ tabs and supports multi-window split-screen browsing.
QwenLong-L1.5 - Ali Tongyi Labs open source long text inference model
QwenLong-L1.5 is an open source long text inference model from Alibaba Tongyi Lab, focusing on solving complex inference problems with ultra-long contexts (e.g., 1M-4M tokens). The core breakthrough lies in three major innovations in the post-training phase: through knowledge graph, SQL parsing and multi-intelligence...
Infographic - Ali AntV team open source infographic generation framework
Infographic is a new generation of Ali AntV team open source framework , based on G2 and Ant Design development , focusing on rapid generation of high-quality infographics , providing 30 + layout templates , 120 + preset themes and AI intelligent generation capabilities .
opcode - open source graphical desktop application designed for Claude Code
opcode is designed for Claude Code open source graphical desktop application , the developer winfunc based on Tauri 2 + React 18 + Rust development. Provides a visual interface to manage Claude Code projects , support for creating ...
TurboDiffusion - Raw Digital Technology, Tsinghua and other open source video generation acceleration framework
TurboDiffusion is a video generation acceleration framework jointly open-sourced by Tsinghua University, BioDigital Technology, and UC Berkeley, which is able to improve video generation speed by 100-200 times while maintaining nearly lossless picture quality. Through sparse linear attention, sample step distillation and 8-bit...
MedASR - Google's open source medical speech recognition model
MedASR is a 105 million parameter medical speech recognition model open-sourced by Google, fine-tuned on a 5,000-hour desensitized clinical corpus, optimized for drug, dosage, and anatomical terminology, with a built-in 6-gram medical language model, and a word error rate of only 4.6 on the private radiology dataset RAD-DICT...
Fun-Audio-Chat-8B - Ali Tongyi Open Source End-to-End Speech Interaction Grand Modeling
Fun-Audio-Chat-8B is an open source 8 billion parameter end-to-end speech big model by Ali Tongyi team, direct speech in speech out, no need for ASR+LLM+TTS splicing, bilingual fluent in Chinese and English, with low latency and natural timbre. Using dual-resolution shared LLM with 25Hz...









