HunyuanImage 3.0 - Tencent open source free multimodal image generation model
HunyuanImage 3.0 (HunyuanImage 3.0) is a native multimodal image generation model released and open-sourced by Tencent. The model parameter size of 80B, is currently the best evaluation results, the largest number of parameters of the open source image generation model. Hybrid Image 3.0 supports real-time image generation, users can side...
Hunyuan3D-Part - Tencent open source free 3D components to generate models
Hunyuan3D-Part (Hybrid 3D-Part) is a 3D generation model released and open-sourced by Tencent. Composed of P3 - SAM and X - Part, it realizes high-precision and controllable component-based 3D generation for the first time, and supports 50 + components to be generated automatically. Users can first use...
AudioFly - KU Xunfei open source text generation sound AI models
AudioFly is KDDI open source AI model for text to generate sound effects. Based on the potential diffusion model architecture, with 1 billion parameters, trained on large-scale, diverse audio text datasets, covering AudioSet, AudioCaps, TUT and other public datasets and internal...
Hunyuan3D-Omni - Tencent Mixed-Year Open Source 3D Model Generation Framework
Hunyuan3D-Omni (Hybrid 3D-Omni) is an open source 3D asset generation framework by Tencent's Hybrid 3D team, which realizes accurate 3D model generation through multiple control signals. Based on Hunyuan3D 2.1 architecture, it introduces a unified control encoder that can handle point...
FLM-Audio - Wisdom Source and Nanyang Polytechnic Open Source Full-Duplex Audio Dialog Modeling
FLM-Audio is a native full-duplex audio dialog grand model released by Beijing Zhiyuan Artificial Intelligence Research Institute in conjunction with Spin Matrix and Nanyang Technological University of Singapore, supporting both Chinese and English. Adopting native full-duplex architecture, it can merge listening, speaking and monologue at each time step...
CWM - Meta FAIR open source code world language model
CWM (Code World Model) is a 32-billion-parameter open-source world language model released by the Meta FAIR team, designed for code generation and reasoning. Introducing the concept of "world model", it can simulate the code execution process, predict the variable state changes, and advance...
Neovate Code - Ant Open Source's Intelligent Programming Assistant
Neovate Code is an open source intelligent programming assistant from Ant Group's Alipay Experience Technology Department, which improves development efficiency through artificial intelligence technology. With conversational development features, developers can describe the requirements through natural language, Neovate Code can understand and generate the corresponding generation...
Audio2Face - NVIDIA open source AI 3D facial animation generation model
Audio2Face is NVIDIA's open source AI tool capable of transforming audio input into realistic 3D facial animation. By analyzing speech features in the audio, such as phonemes and intonation, it generates precise lip synchronization and subtle emotional expressions to give vivid human expressions to virtual characters.
Qwen3-VL - AliCloud Tongyi Qianqian open source multimodal visual language big model
Qwen3-VL is an open source multimodal visual language large model by AliCloud Tongyi Qianqian team, the number of references reaches 235 billion, and the model file is about 471GB.Containing instruction version and thinking version, it adopts enhanced MRope interleaved layout, DeepStack and other technologies, which can effectively utilize the visual transform...
Qwen3Guard - Ali Qwen open source security model
Qwen3Guard is a fine-tuned security protection model based on the Qwen3 base model, designed for security detection. It provides accurate security categorization of prompts and responses, provides risk levels, and supports English, Chinese, and multi-language environments.Qwen3Guard comes with two pro...









