AI Personal Learning
and practical guidance
Beanbag Marscode1
Total 947 articles

Tags: ai open source projects Page 21

ColiVara:基于视觉嵌入的文档存储与检索服务-首席AI分享圈

ColiVara: Visual Embedding Based Document Storage and Retrieval Service

General Introduction ColiVara is a document storage and retrieval service based on visual embedding technology. It eliminates the need for Optical Character Recognition (OCR) or text extraction and avoids the problem of broken forms or lost images.ColiVara supports over 100 file formats including PDF, DOCX, PPTX, etc. and is able to automatically...

n8n自托管AI入门套件:快速搭建本地AI环境的开源模板-首席AI分享圈

n8n Self-hosted AI Starter Kit: an open source template for quickly building a local AI environment

Comprehensive Introduction The n8n Self-Hosted AI Starter Kit is an open source Docker Compose template designed to quickly initialize a comprehensive local AI and low-code development environment. Crafted by the n8n team, the kit combines the self-hosted n8n platform with a range of compatible AI products and components to help users quickly conceptualize...

bilive:B站无人监守直播录制与自动切片、上传工具-首席AI分享圈

bilive: Unsupervised live recording and automatic slicing and uploading tools for B station

Comprehensive Introduction bilive is a tool designed for B station live recording, providing extremely fast live recording, auto-slicing, pop-up rendering and subtitle generation. The tool is compatible with ultra-low configuration machines, supports 7x24 hours unattended recording, automatically recognizes and renders pop-ups and subtitles, automatically slices and uploads them to B...

R1-V:低成本强化学习实现视觉语言模型泛化能力-首席AI分享圈

R1-V: Low-cost reinforcement learning for visual language model generalization capability

Comprehensive Introduction R1-V is an open source project that aims to achieve breakthroughs in visual language modeling (VLM) through low-cost reinforcement learning (RL). The project utilizes a verifiable reward mechanism to motivate VLMs to learn generalized counting abilities. Amazingly, R1-V's 2B model is able to learn the counting ability in only 100 training steps...

CoT-Lab:探索人机协作迭代思考的实验性对话工具-首席AI分享圈

CoT-Lab: an experimental dialog tool for exploring iterative thinking about human-computer collaboration

CoT-Lab (Collaborative Thinking Laboratory) is an experimental interface for exploring new paradigms in human-computer collaboration. Based on Cognitive Load Theory and Active Learning Principles, CoT-Lab facilitates deep cognitive alignment between humans and Artificial Intelligence (AI) through the creation of "Thinking Partners". The program is designed to slowly output...

PengChengStarling:对比Whisper-Large v3更小、更快的多语言语音转文字工具-首席AI分享圈

PengChengStarling: Smaller and Faster Multilingual Speech-to-Text Tool than Whisper-Large v3

Comprehensive Introduction PengChengStarling (PengCheng Labs) is a multilingual Automatic Speech Recognition (ASR) tool capable of converting speech in different languages into corresponding text. This toolkit is developed based on the icefall project and provides a complete speech recognition process, including data processing, model training,...

en_USEnglish