GLM-ASR - Zhipu AI's open-source high-performance speech recognition model series
GLM-ASR is a family of high-performance speech recognition models open-sourced by Zhipu AI, including the cloud-based model GLM-ASR-2512 and the open-source on-device model GLM-ASR-Nano-2512. GLM-ASR-2512 is a world-leading cloud-based speech recognition model, supporting multiple...
OpenAutoGLM - Zhipu AI's open-source mobile phone AI agent model
OpenAutoGLM is an open-source agent model with "phone use" capability: it understands the content of the phone screen through multimodal perception and automatically generates the sequence of operations needed to complete user-specified tasks. Users only need to describe what they want in natural language, such as "open Meituan to search for nearby hot pot ...
SurfSense - Open-source AI research and knowledge management tool, a strong alternative to NotebookLM
SurfSense is an open-source AI research and knowledge management tool. Highly customizable, it connects to search engines, Slack, Jira, Notion, YouTube, GitHub, and many other external data sources, helping users consolidate information. Users can upload a variety of...
GLM-4.6V - Zhipu AI's open-source multimodal large language model series
GLM-4.6V is a series of multimodal large language models open-sourced by Zhipu AI. The series contains two versions: GLM-4.6V (106B-A12B), the base version for cloud and high-performance cluster scenarios, with a Mixture-of-Experts (MoE) architecture, a total of about 106 billion parameters, and an activation...
InkSight - Google's open-source AI handwriting recognition tool
InkSight is Google's open-source AI handwriting recognition tool that converts handwritten notes on paper into editable digital ink files (e.g. SVG format). Unlike traditional OCR, it not only recognizes the text content but also preserves the handwriting style, paragraph structure, and emphasis marks, and it supports multiple languages.
NewBie-image-Exp0.1 - NewBieAI-Lab's open-source experimental anime text-to-image model
NewBie-image-Exp0.1 is the first experimental anime text-to-image model open-sourced by the NewBieAI-Lab team, using the Next-DiT architecture with 3.5B parameters and optimized for the anime (ACG) style. The model uses a dual text encoder (GEMMA3-4B...
LongCat-Image - Open-source image generation and editing model from Meituan's LongCat team
LongCat-Image is an open-source image generation and editing model released by Meituan's LongCat team. Using a hybrid backbone architecture (MM-DiT+Single-DiT) combined with a vision-language model (VLM) conditional encoder, it supports text-to-image generation and multi-round image editing...
VibeVoice-Realtime - Microsoft's open-source lightweight real-time text-to-speech model
VibeVoice-Realtime is Microsoft's open-source lightweight real-time text-to-speech (TTS) model designed for low latency and real-time interaction. It supports streaming text input and can start speaking from the first text token, with a latency of only about 300 milliseconds, making it suitable for dynamic ...
Flowra - AI workflow development tool open-sourced by ModelScope and the WULI team
Flowra is a graph execution engine and node package development tool open-sourced by ModelScope together with the WULI team, and is the core component of FlowBench. It organizes workflows as directed acyclic graphs (DAGs), with intelligent caching, parallel scheduling, distributed support ...
RoboCOIN - A real-world dataset for dual-arm robots open-sourced by BAAI in collaboration with several universities
RoboCOIN is the world's first large-scale real-robot dataset for dual-arm robots, open-sourced by the Beijing Academy of Artificial Intelligence (BAAI) together with a number of companies and universities. It contains 15 types of robot platforms, 180,000 real operation trajectories, and 421 types of task scenarios. Its most notable feature is a hierarchical annotation system that breaks down the task ...









