AI Sharing Circle

AI is changing the world!
Ling-1T - Ant Group's open-source trillion-parameter general-purpose language model

Ling-1T is a trillion-parameter general-purpose language model open-sourced by Ant Group and the flagship of the Ling 2.0 series of its Bailing large models. The model adopts an efficient MoE architecture, supports a 128K context window, and surpasses GPT in 7 benchmarks including code generation, mathematical reasoning, and logic test...
19 hrs ago
02.4K
EchoCare (聆音) - Hong Kong Academy of Sciences' open-source ultrasound foundation model

EchoCare is an ultrasound foundation model developed by the Center for Artificial Intelligence and Robotics Innovation (CAIR) at the Hong Kong Institute of Innovation and Research of the Chinese Academy of Sciences (CAS). It was trained on the world's largest ultrasound image dataset (more than 4.5 million images), covering multi-center, multi-region, multi-ethnicity, and more than 50 individuals...
22 hrs ago
01.5K
Code2Video - Show Lab's open-source AI teaching-video generation framework

Code2Video is an innovative open-source project that automatically converts code snippets into high-quality video content (mp4 format). Following a code-centric paradigm, the project uses the carbon-now-cli tool to render code as polished images and uses ffmpeg on these ...
2 days ago
03.5K
SceneGen - Shanghai Jiao Tong University's open-source framework for generating 3D scenes from a single image

SceneGen is an open-source method from Shanghai Jiao Tong University for generating 3D scenes from a single image. Given a single scene image and a target asset mask, it efficiently generates a complete scene containing multiple 3D assets, including each asset's geometry, texture, and relative spatial position.
2 days ago
02.7K
Ming-UniAudio - Ant Group's open-source unified audio multimodal generation model

Ming-UniAudio is Ant Group's open-source unified audio multimodal generation model that supports mixed input and output of text, audio, image, and video. It uses a multi-scale Transformer and a mixture-of-experts (MoE) architecture, with a modality-aware routing mechanism to efficiently handle cross-modal ...
3 days ago
05.1K
AIMangaStudio - Free AI manga creation tool with a complete creation workflow

AIMangaStudio is a free AI manga creation tool that gives creators a complete manga creation pipeline, including plot generation, storyboard design, and character design, simplifying production from script to manga page. It supports generating comic scripts from natural language, including plot, dialog...
4 days ago
06.3K
FireRedChat - Xiaohongshu's open-source full-duplex voice interaction system

FireRedChat is an open-source full-duplex voice interaction system from Xiaohongshu with real-time bidirectional dialog and support for controlled interruptions. It adopts a modular design, including a transcription control module, an interaction module, and a dialogue manager, and supports both cascade and semi-cascade architectures for flexible deployment.
5 days ago
07.9K
Logics-Parsing - Alibaba's open-source document parsing model

Logics-Parsing is an end-to-end document parsing model open-sourced by Alibaba, based on Qwen2.5-VL-7B. It optimizes document layout analysis and reading-order inference through reinforcement learning and can convert PDF images to structured HTML output, supporting a variety of content ...
7 days ago
010.7K
Ring-1T-preview - Ant Group's open-source trillion-parameter large model

Ring-1T-preview is a trillion-parameter large model open-sourced by Ant Group. It is based on the Ling 2.0 MoE architecture, pre-trained on a 20T corpus, and trained for reasoning ability with the self-developed reinforcement learning system ASystem. In natural language reasoning ...
1 week ago
010.6K
RoboBrain-X0 - BAAI's open-source embodied model with zero-shot cross-embodiment generalization

RoboBrain-X0 is the world's first open-source embodied model that supports zero-shot cross-embodiment generalization, open-sourced by the Beijing Academy of Artificial Intelligence (BAAI), and is of great industrial significance. It can drive multiple real robots of different configurations to complete basic manipulation tasks without fine-tuning, and after fine-tuning on a small number of samples, it demonstrates the ability to replicate ...
1 week ago
08.9K