QwenLong-L1.5 - Alibaba Tongyi Lab's open-source long-context reasoning model
QwenLong-L1.5 is an open-source long-context reasoning model from Alibaba Tongyi Lab, focused on solving complex reasoning problems over ultra-long contexts (e.g., 1M-4M tokens). Its core breakthrough lies in three major innovations in the post-training phase: through knowledge graph, SQL parsing and multi-agent...
Infographic - open-source infographic generation framework from Alibaba's AntV team
Infographic is a new-generation open-source framework from Alibaba's AntV team, built on G2 and Ant Design. It focuses on rapid generation of high-quality infographics and provides 30+ layout templates, 120+ preset themes, and AI-powered generation capabilities.
opcode - open-source graphical desktop application designed for Claude Code
opcode is an open-source graphical desktop application designed for Claude Code, built by the developer winfunc with Tauri 2 + React 18 + Rust. It provides a visual interface for managing Claude Code projects, with support for creating ...
TurboDiffusion - open-source video generation acceleration framework from Shengshu Technology, Tsinghua, and others
TurboDiffusion is a video generation acceleration framework jointly open-sourced by Tsinghua University, Shengshu Technology, and UC Berkeley. It speeds up video generation by 100-200 times while keeping picture quality nearly lossless. Through sparse linear attention, sampling-step distillation and 8-bit...
MedASR - Google's open-source medical speech recognition model
MedASR is a 105-million-parameter medical speech recognition model open-sourced by Google. It is fine-tuned on a 5,000-hour de-identified clinical corpus, optimized for drug, dosage, and anatomical terminology, ships with a built-in 6-gram medical language model, and reaches a word error rate of only 4.6% on the private radiology dataset RAD-DICT...
Fun-Audio-Chat-8B - Alibaba Tongyi's open-source end-to-end speech interaction large model
Fun-Audio-Chat-8B is an open-source, 8-billion-parameter end-to-end large speech model from the Alibaba Tongyi team. It takes speech in and produces speech out directly, with no need to chain ASR + LLM + TTS, is fluent in both Chinese and English, and offers low latency and natural timbre. It uses a dual-resolution shared LLM with 25Hz...
PromptFill - open-source structured prompt generation tool designed for AI image generation
PromptFill is a structured prompt generation tool designed specifically for AI image generation. It helps users quickly build, manage, and iterate on complex prompts through visual "fill-in-the-blank" interactions, improving the efficiency and quality of AI image generation. PromptFill's core features...
GLM-4.7 - Zhipu AI's latest open-source flagship large model
GLM-4.7 is the latest-generation flagship large model released and open-sourced by Zhipu AI, deeply optimized for AI programming, complex reasoning, and agent tasks. The model supports a 200k context length and a 128k maximum output, with multi-language coding, long-horizon task planning and tool collaboration capabilities...
NitroGen - open-source gaming AI model from NVIDIA with Stanford, Caltech, and others
NitroGen is an open-source gaming AI model developed by NVIDIA together with Stanford University, Caltech, and other institutions, capable of playing over 1,000 different games. The model is based on the GR00T N1.5 architecture, and works by analyzing 40,000 hours of gameplay video data (including controller input annotations)...
Qwen-Image-Layered - open-source AI image editing model from the Alibaba team
Qwen-Image-Layered is an open-source AI image editing model from the Alibaba team that can intelligently decompose ordinary images into independent transparent layers, enabling precise, Photoshop-like editing. The model is released under the Apache 2.0 license and supports flexible control of layers...









