AI Sharing Circle

AI is changing the world!
SenseNova-SI - 商汤科技开源的空间智能大模型系列

SenseNova-SI - A Family of Open Source Spatial Intelligence Large Models from ShangTech

SenseNova-SI is an open source spatial intelligence grand model released by ShangTech, focusing on improving AI's ability in spatial understanding and reasoning. The model excels in six core dimensions, including spatial measurement, reconstruction, relationship judgment, perspective transformation, deformation analysis, and spatial reasoning, significantly outperforming other...
5mos ago
024.5K
Omnilingual ASR - Meta推出的多语言语音识别框架

Omnilingual ASR - Multilingual Speech Recognition Framework from Meta

Omnilingual ASR is a multilingual speech recognition framework introduced by Meta, covering 1600+ languages, with 78% language character error rate lower than 10%. its 7 billion parameter wav2vec 2.0 encoder combined with CTC and Transformer decoder, support...
5mos ago
028.3K
Frappe Builder - 开源的AI低代码网站构建工具,拖拽组件快速搭建

Frappe Builder - Open source AI low-code website builder, drag-and-drop components for fast building

Frappe Builder is open source low-code website builder, developed by Frappe, the core feature is to provide a Figma-like visual editor that supports drag-and-drop components to build websites quickly. Part of the Frappe ecology (Frappeverse)...
5mos ago
031.1K
DeepOCR - 基于DeepSeek-OCR模型的开源复刻项目

DeepOCR - Open source replica project based on the DeepSeek-OCR model

DeepOCR is an open source replication project that implements the core architecture of DeepSeek-OCR, which efficiently processes textual information through optical compression techniques. The core is DeepEncoder, consisting of SAM-base (processing high-resolution images), 16× convolutional compressor...
5mos ago
027.8K
NocoBase - 免费开源的AI无代码开发平台,可视化构建应用

NocoBase - Free and open source AI no-code development platform to build apps visually

NocoBase is based on AI-driven open-source no-code development platform that supports the rapid construction of business systems, without programming to complete the application development through configuration. The project uses Apache-2.0 protocol , provides private deployment and flexible scalability , suitable for enterprise management , collaboration platforms and other fields ...
5mos ago
028K
UniWorld V2 - 兔展智能联合北大推出的新一代图像编辑模型

UniWorld V2 - A New Generation of Image Editing Models Launched by Rabbit Show Intelligence in Association with Peking University

UniWorld V2 is a new generation of image editing model jointly launched by RabbitZhan Intelligence and UniWorld team of Peking University. It has significant advantages in the field of image editing, especially in Chinese comprehension and execution of complex commands. The model can accurately render artistic Chinese fonts and support fine...
5mos ago
029.8K
SmartResume - 阿里巴巴开源的AI简历解析与优化工具

SmartResume - Alibaba open source AI resume parsing and optimization tool

SmartResume is Alibaba open source intelligent resume parsing and optimization tool , can efficiently extract structured information from PDF, images or Office documents , such as basic information , education and work experience . By integrating OCR technology and PDF metadata...
5mos ago
031.4K
Step-Audio-EditX - 阶跃星辰开源的首个LLM级音频编辑大模型

Step-Audio-EditX - Step-Star's first open source LLM-level audio editing large model

Step-Audio-EditX is an open source audio editing grand model, developed by the Step-Star team, focusing on fine-grained manipulation of audio content through artificial intelligence technology. The model can dynamically adjust the mood of the audio, speaking style (such as petulant, old man accent, etc.) and paralinguistic elements (such as laughter, sigh...
5mos ago
030.6K
Open-o3 Video - 北大联合字节开源的视频推理模型

Open-o3 Video - A Video Reasoning Model Open-Sourced by Peking University United Bytes

Open-o3 Video is an open source video inference model jointly developed by Peking University and ByteDance, focusing on enhancing video inference through temporal and spatial evidence. By explicitly labeling key evidence with timestamps and bounding boxes, it helps the model better understand and interpret video content.
5mos ago
026.9K
Handy - 开源免费的本地AI语音转文字工具

Handy - Open Source Free Native AI Speech to Text Tool

Handy is open source and free local speech to text tool, supporting Windows, MacOS and Linux systems, developed by Rust and React. It is suitable for quick transcription and text input by processing voice data locally without uploading it to the cloud to ensure privacy and security.
5mos ago
058.9K