AI Personal Learning
and practical guidance
Beanbag Marscode1
Total 914 articles

Tags: ai open source projects Page 18

MedRAX: 利用多模态大模型进行胸部X光片分析的智能体-首席AI分享圈

MedRAX: A Smart Body for Chest X-ray Analysis Using Multimodal Large Models

Comprehensive Introduction MedRAX is a state-of-the-art AI intelligence designed specifically for Chest X-ray (CXR) analysis. It integrates state-of-the-art CXR analysis tools and a multimodal large language model to dynamically process complex medical queries without additional training.MedRAX, through its modular design and strong technological base,...

LangBot:开源大模型即时通信机器人,支持多微信、QQ、飞书等多平台部署AI机器人-首席AI分享圈

LangBot: open source large model instant messaging robot, support for multiple WeChat, QQ, Flybook and other multi-platform deployment of AI robots

Comprehensive Introduction LangBot is a large model-based instant messaging bot platform that supports multiple messaging platforms and large models. The platform adapts to QQ, WeChat (enterprise WeChat, personal WeChat), Flybook, Discord, OneBot and other messaging platforms, and supports OpenAI GPT, ChatGPT, DeepSeek, D...

zChunk:基于Llama-70B的通用语义分块策略-首席AI分享圈

zChunk: a generic semantic chunking strategy based on Llama-70B

Comprehensive Introduction zChunk is a novel chunking strategy developed by ZeroEntropy to provide a solution for generic semantic chunking. The strategy is based on the Llama-70B model and optimizes the chunking process of a document by prompting for chunks to be generated, ensuring that a high signal-to-noise ratio is maintained during information retrieval. zChunk is particularly suited for...

Hibiki:实时语音翻译模型,保留原声特点的流式翻译-首席AI分享圈

Hibiki: a real-time speech translation model, streaming translation that preserves the characteristics of the original voice

General Introduction Hibiki is a high-fidelity real-time speech translation model developed by Kyutai Labs. Unlike traditional offline translation, Hibiki is able to generate natural speech translation in the target language and provide text translation in real time while the user is speaking. The model adopts a multi-stream architecture, and is able to simultaneously...

OpenHealthForAll:个人健康数据管理AI助手,上传检查报告定制健康计划-首席AI分享圈

OpenHealthForAll: AI assistant for personal health data management, uploading examination reports to customize health plans

General Introduction OpenHealthForAll is an open source project designed to help users manage and understand their personal health data. By leveraging artificial intelligence technology, OpenHealthForAll provides a locally run health assistant to help users better manage and analyze their health information. The project supports...

OpenPilot:开源自动驾驶系统,为爱车DIY一套自己的智能驾驶系统-首席AI分享圈

OpenPilot: open source autonomous driving system, DIY a set of your own intelligent driving system for your car

General Introduction OpenPilot is an open source autonomous driving system developed by comma.ai to enhance the driving experience and safety of existing vehicles with advanced driver assistance features. Since its first release in 2016, OpenPilot has supported over 275 vehicle models and is constantly updating and optimizing its functionality....

Agentic Security:开源的LLM漏洞扫描工具,提供全面的模糊测试和攻击技术-首席AI分享圈

Agentic Security: open source LLM vulnerability scanning tool that provides comprehensive fuzz testing and attack techniques

General Introduction Agentic Security is an open source LLM (Large Language Model) vulnerability scanning tool designed to provide developers and security professionals with comprehensive fuzzing testing and attack techniques. The tool supports customized rulesets or agent-based attacks, is able to integrate LLM APIs for stress testing, and provides wide...

CogVLM2:开源多模态模型,支持视频理解与多轮对话-首席AI分享圈

CogVLM2: Open Source Multimodal Modeling with Support for Video Comprehension and Multi-Round Dialogue

General Introduction CogVLM2 is an open source multimodal model developed by the Tsinghua University Data Mining Research Group (THUDM), based on the Llama3-8B architecture, and designed to provide performance comparable to or even better than GPT-4V. The model supports image understanding, multi-round dialog, and video understanding, and is capable of handling content up to 8K long...

VisoMaster:强大且易用的图片/视频换脸和编辑软件-首席AI分享圈

VisoMaster: Powerful and easy-to-use photo/video face changing and editing software

General Introduction VisoMaster is a powerful and easy-to-use video face-swapping and editing tool that utilizes artificial intelligence technology to achieve natural and realistic face-swapping effects. Whether it's an image or a video, VisoMaster generates high-quality face swap results with simple operations, suitable for both general users and professionals....

Bilingual Book Maker:使用AI翻译制作双语电子书,全书自动化翻译工具-首席AI分享圈

Bilingual Book Maker: Use AI translation to make bilingual e-books, full book automated translation tool

Comprehensive Introduction Bilingual Book Maker is an open source project designed to help users create multilingual versions of eBooks using AI technology. The tool mainly uses ChatGPT for translation and supports a variety of file formats, including epub, txt and srt.Bilingual Book Maker is designed for translating eBooks that have entered...

Rowfill:批量提取文档结构化信息并自动化分析-首席AI分享圈

Rowfill: Batch Extraction of Structured Information from Documents and Automated Analysis

Comprehensive Introduction Rowfill is an open source document processing platform designed for knowledge workers. It utilizes advanced AI technologies to extract, analyze and process data from complex documents, images and PDFs.Rowfill supports native Large Language Models (LLM) and OpenAI Visual Models to ensure that data is hidden...

en_USEnglish