VoxCPM 1.5 - ModelBest Open Source End-to-End Text-to-Speech Model
VoxCPM 1.5 is an open source speech generation model released by ModelBest. It is a tokenizer-free text-to-speech (TTS) model with several innovations and improvements. Built on an end-to-end diffusion autoregressive architecture, it generates continuous speech directly from text, avoiding the limitations of traditional tokenization methods...
Mistral Vibe - Open Source Command Line Coding Assistant from Mistral AI
Mistral Vibe is an open source command line coding assistant from Mistral AI. Built on the Devstral model, it supports natural language interaction for code search, file manipulation, version control, and other tasks. It can automatically scan the project structure and Git status through the @ symbol...
GLM-TTS - Open Source Industrial-Grade Speech Synthesis System from Zhipu AI
GLM-TTS is an open source industrial-grade speech synthesis system with powerful speech synthesis capabilities. It adopts a two-stage generation architecture: the first stage converts text into speech token sequences, and the second stage converts those token sequences into high-quality audio. With only a 3-second voice sample, the system can complete the voice...
Devstral 2 - The Next Generation Family of Programming Models from Mistral AI
Devstral 2 is a family of next-generation programming models from Mistral AI designed for software engineering tasks, consisting of Devstral 2 (123B parameters) and Devstral Small 2 (24B parameters). D...
GLM-ASR - Zhipu AI Open Source High-Performance Speech Recognition Model Series
GLM-ASR is a family of high-performance speech recognition models from Zhipu AI, including the cloud-based model GLM-ASR-2512 and the open source on-device model GLM-ASR-Nano-2512. GLM-ASR-2512 is the world's leading cloud-based speech recognition model, supporting multiple...
OpenAutoGLM - Zhipu AI Open Source Mobile Phone AI Agent Model
OpenAutoGLM is an open source agent model with "phone use" capability. It understands the content of the phone screen through multimodal perception and automatically generates the sequence of operations needed to complete user-specified tasks. Users only need to describe what they want in natural language, such as "open Meituan to search for nearby hot pot ...
SurfSense - Open Source AI Research and Knowledge Management Tool, the Strongest NotebookLM Alternative
SurfSense is an open source AI research and knowledge management tool. It is highly customizable and can connect to search engines, Slack, Jira, Notion, YouTube, GitHub, and many other external data sources, helping users consolidate information. Users can upload a variety of...
GLM-4.6V - Zhipu AI Open Source Multimodal Large Language Model Series
GLM-4.6V is a series of multimodal large language models open-sourced by Zhipu AI. The series contains two versions: GLM-4.6V (106B-A12B), the base version for cloud and high-performance cluster scenarios, with a Mixture-of-Experts (MoE) architecture, a total of about 106 billion parameters, and an activation...
InkSight - Google's open source AI handwriting recognition tool
InkSight is Google's open source AI handwriting recognition tool that converts handwritten notes on paper into editable digital ink files (e.g. SVG format). Unlike traditional OCR, it not only recognizes the text content but also restores the handwriting style, paragraph structure, and emphasis marks, and it supports multi-language processing.
NewBie-image-Exp0.1 - NewBieAI-Lab Open Source Experimental Anime Text-to-Image Model
NewBie-image-Exp0.1 is the first experimental anime text-to-image model open-sourced by the NewBieAI-Lab team. It uses the Next-DiT architecture with 3.5B parameters and is optimized for the anime style. Its anime-style optimization comes from a dual text encoder (GEMMA3-4B...








