AI Sharing Circle

AI is changing the world!
VoxCPM 1.5 - ModelBest's Open Source End-to-End Text-to-Speech Model

VoxCPM 1.5 is an open source speech generation model released by ModelBest. It is a tokenizer-free text-to-speech (TTS) model featuring several innovations and improvements. Adopting an end-to-end diffusion autoregressive architecture, it generates continuous speech waveforms directly from text, avoiding the limitations of traditional discrete tokenization methods...
5mos ago
Mistral Vibe - Open Source Command Line Coding Assistant from Mistral AI

Mistral Vibe is an open source command line coding assistant from Mistral AI. Built on the Devstral model, it supports natural language interaction for code search, file manipulation, version control, and other tasks. It can automatically scan the project structure and Git status through the @ symbol...
5mos ago
GLM-TTS - Open Source Industrial-Grade Speech Synthesis System from Zhipu AI

GLM-TTS is an open source industrial-grade speech synthesis system with powerful speech synthesis capabilities. It adopts a two-stage generation architecture: the first stage converts text into speech token sequences, and the second stage converts those tokens into high-quality audio. The system needs only 3 seconds of voice samples to complete the sound...
5mos ago
Devstral 2 - The Next Generation Family of Programming Models from Mistral AI

Devstral 2 is a family of next-generation programming models from Mistral AI, designed for software engineering tasks and consisting of Devstral 2 (123B parameters) and Devstral Small 2 (24B parameters). D...
5mos ago
GLM-ASR - Zhipu AI's Open Source High-Performance Speech Recognition Model Series

GLM-ASR is a family of high-performance speech recognition models from Zhipu AI, including the cloud-based model GLM-ASR-2512 and the open source on-device model GLM-ASR-Nano-2512. GLM-ASR-2512 is a world-leading cloud-based speech recognition model, supporting multiple...
5mos ago
OpenAutoGLM - Zhipu AI's Open Source Phone AI Agent Model

OpenAutoGLM is an open source agent model with "phone use" capability: it understands the content of the phone screen through multimodal perception and automatically generates the sequence of operations needed to complete user-specified tasks. Users only need to describe what they want in natural language, such as "open Meituan to search for nearby hot pot ...
5mos ago
SurfSense - Open Source AI Research and Knowledge Management Tool, the Strongest NotebookLM Alternative

SurfSense is an open source AI research and knowledge management tool. Highly customizable, it can connect to search engines, Slack, Jira, Notion, YouTube, GitHub, and many other external data sources, helping users integrate information. Users can upload a variety of...
5mos ago
GLM-4.6V - Zhipu AI's Open Source Multimodal Large Language Model Series

GLM-4.6V is a series of multimodal large language models open-sourced by Zhipu AI. The series contains two versions: GLM-4.6V (106B-A12B), the base version for cloud and high-performance cluster scenarios, with a Mixture-of-Experts (MoE) architecture, a total of about 106 billion parameters, and an activation...
5mos ago
InkSight - Google's Open Source AI Handwriting Recognition Tool

InkSight is Google's open source AI handwriting recognition tool that converts handwritten notes on paper into editable digital ink files (e.g. SVG format). Unlike traditional OCR, it not only recognizes the text content but also restores the handwriting style, paragraph structure, and emphasis marks, and it supports multi-language processing.
5mos ago
NewBie-image-Exp0.1 - NewBieAI-Lab's Open Source Experimental Anime Text-to-Image Model

NewBie-image-Exp0.1 is the first experimental anime text-to-image model open-sourced by the NewBieAI-Lab team, using the Next-DiT architecture with 3.5B parameters and optimized for the anime (ACG) style. The model uses a dual text encoder (GEMMA3-4B...
6mos ago