AI Personal Learning
and practical guidance
CyberKnife Drawing Mirror
Total 908 articles

Tags: ai open source projects Page 14

dsRAG:用于处理非结构化数据和复杂查询的检索引擎-首席AI分享圈

dsRAG: A Retrieval Engine for Unstructured Data and Complex Queries

Comprehensive Introduction dsRAG is a high-performance retrieval engine designed to handle complex queries on unstructured data. It performs particularly well in handling challenging queries in dense text such as financial reports, legal documents, and academic papers. dsRAG employs three key approaches to improve performance: semantic segmentation,...

Graphiti:动态知识图谱构建和查询工具(具有时间感知的长记忆方案)-首席AI分享圈

Graphiti: dynamic knowledge graph construction and query tool (time-aware long memory scheme)

General Introduction Graphiti is a tool developed by getzep for building and querying dynamic, time-aware knowledge graphs. It is capable of representing complex and evolving relationships between entities and querying them through a variety of methods such as temporal, full-text, semantic, and graph algorithms.Graphiti can simultaneously handle non...

中文基于满血 DeepSeek-R1 蒸馏数据集,支持中文R1蒸馏SFT数据集-首席AI分享圈

Chinese based full-blooded DeepSeek-R1 distillation dataset, supports Chinese R1 distillation SFT dataset

Comprehensive Introduction The Chinese DeepSeek-R1 distillation dataset is an open source Chinese dataset containing 110K pieces of data designed to support machine learning and natural language processing research. The dataset is released by Cong Liu's NLP team. The dataset contains not only mathematical data, but also a large amount of general types of data, such as logical reasoning...

MoBA: Kimi 推出的支持长上下文处理的大语言模型-首席AI分享圈

MoBA: A Large Language Model for Long Context Processing by Kimi

Comprehensive Introduction MoBA (Mixture of Block Attention) is an innovative attention mechanism developed by MoonshotAI, designed for large language models (LLMs) with long context processing.MoBA learns to attend to the most relevant KV blocks by dividing the full context into multiple blocks, with each query token learning to attend to the most relevant KV blocks, thus...

AIBot PRO:集成多种AI产品的商业化聚合平台-首席AI分享圈

AIBot PRO: A commercialization aggregation platform integrating multiple AI products

Comprehensive Introduction AIBot PRO is a .NET 6-based AI aggregation client designed to provide users with a convenient platform for integrating multiple AI products. The client supports senseless switching dialog and integrates multiple AI products such as ChatGPT, Gemini, Claude, Wenxin Yiyin, Tongyi Qianqian and Xunfei Starfire.AIBot...

ColossalAI:提供高效大规模AI模型训练解决方案-首席AI分享圈

ColossalAI: Providing Efficient Large-Scale AI Model Training Solutions

Comprehensive Introduction ColossalAI is an open source platform developed by HPC-AI Technologies to provide an efficient and cost-effective solution for large-scale AI model training and inference. By supporting multiple parallelization strategies, heterogeneous memory management, and mixed-precision training, ColossalAI is able to significantly reduce model training and inference...

HealthGPT:支持医学图像分析与诊断问答的医疗大模型-首席AI分享圈

HealthGPT: A Medical Big Model to Support Medical Image Analysis and Diagnostic Q&A

Comprehensive Introduction HealthGPT is a state-of-the-art medical grand visual language model designed to enable unified medical visual understanding and generation capabilities through heterogeneous knowledge adaptation. The goal of the project is to integrate medical vision understanding and generation capabilities into a unified autoregressive framework, significantly enhancing the medical image processing...

MatAnyone: 提取视频指定目标人像的开源工具,生成目标人像视频-首席AI分享圈

MatAnyone: Extract video to specify the target portrait of the open-source tool to generate the target portrait video

General Introduction MatAnyone is an open source project focusing on video keying, developed by a research team at S-Lab, Nanyang Technological University, Singapore and released on GitHub. It provides users with stable and efficient video processing capabilities through consistent memory propagation techniques , especially good at dealing with complex backgrounds...

OmniParser:用户界面截图解析成结构化元素,便于大模型理解和操作-首席AI分享圈

OmniParser: user interface screenshots parsed into structured elements for easy understanding and manipulation by large models

General Introduction OmniParser is a tool developed by Microsoft to parse user interface screenshots into structured and easy-to-understand elements. This tool significantly improves the ability of GPT-4V to generate accurate actions in the corresponding interface area.OmniParser not only supports a wide range of large language models, but also...

Step-Audio:多模态语音交互框架,识别语音并使用克隆语音交流等功能-首席AI分享圈

Step-Audio: a multimodal voice interaction framework that recognizes speech and communicates using cloned speech, among other features

Comprehensive Introduction Step-Audio is an open source intelligent voice interaction framework designed to provide out-of-the-box speech understanding and generation capabilities for production environments. The framework supports multi-language dialog (e.g., Chinese, English, Japanese), emotional speech (e.g., happy, sad), regional dialects (e.g., Cantonese, Szechuan), and can...

en_USEnglish