AI Personal Learning
and practical guidance
CyberKnife Drawing Mirror
Total 908 articles

Tags: ai open source projects Page 44

Retrieval based Voice Conversion WebUI:基于检索的语音转换框架|模拟真人歌声-首席AI分享圈

Retrieval based Voice Conversion WebUI: A Framework for Retrieval-based Voice Conversion | Simulating Real-life Singing Voices

Comprehensive Introduction Retrieval based Voice Conversion WebUI is a simple and easy-to-use VITS-based voice conversion framework, which can realize voice conversion between any speakers, including song covers and real-time voice changing. It features low latency, excellent voice changing effect, small amount of data training...

VoiceCraft:开源零样本语音克隆与文本转语音工具-首席AI分享圈

VoiceCraft: open source zero-sample speech cloning and text-to-speech tool

Comprehensive Introduction VoiceCraft is an open source speech editing and zero-sample speech synthesis tool based on the Neural Codec language model. It employs an innovative coded sequence generation method that enables insertion, deletion and replacement operations on existing speech sequences to generate natural and coherent edited speech. At the same time, ...

CoAI.Dev (Chat Nio):AI聚合应用 一站式 B/C 端解决方案,支持弹性计费和订阅计划模式-首席AI分享圈

CoAI.Dev (Chat Nio): One-stop B/C solution for AI aggregation apps with flexible billing and subscription plan model support

General Introduction CoAI.Dev (formerly Chat Nio) is a chat platform that integrates multiple AI models and supports distributed streaming, image generation, cross-device conversation synchronization and sharing. It implements a subscription and Token billing system, Key transit service and multi-model support, and also includes connected search and AI...

ChatOllama:基于Nuxt 3和Ollama的本地实时聊天应用UI-首席AI分享圈

ChatOllama: Native real-time chat application UI based on Nuxt 3 and Ollama

Comprehensive introduction ChatOllama is an open source online chat application project based on a large language model (LLM) , supporting numerous language models and knowledge base management. Users can use the platform for model management ( list display , download , delete ) , chat with the model and other functions . The project utilizes the Nuxt 3 framework ...

MinerU:PDF文档提取转换为多模态Markdown格式,支持电子书OCR扫描-首席AI分享圈

MinerU: PDF document extraction and conversion to multimodal Markdown format, support e-book OCR scanning

Comprehensive Introduction MinerU is an open source data extraction tool developed by the OpenDataLab team at the Shanghai Artificial Intelligence Laboratory, focusing on efficiently extracting content from complex PDF documents, web pages, and eBooks. It can convert multimodal PDF documents containing images, formulas, tables and other elements into easy-to-analyze M...

Tap4 AI WebUI:开源轻量级AI工具导航项目-首席AI分享圈

Tap4 AI WebUI: open source lightweight AI tool navigation project

Comprehensive introduction Tap4 AI WebUI is an open source lightweight AI tool navigation website project , designed to help users easily build their own AI tool catalog . The project uses Next.js and Supabase technology stack , support for multi- language SEO optimization , to provide AI tools classification filtering , search and detailed display functions ...

CodeFormer:图像与视频面部复原,老照片修复,提供一键部署版-首席AI分享圈

CodeFormer: image and video facial restoration, old photo restoration, offers one-click deployment version

CodeFormer General Introduction CodeFormer is a codebase for robust blind face repair, developed by a team of researchers at S-Lab, Nanyang Technological University and presented at NeurIPS 2022. The project utilizes the Codebook Lookup Transformer technology, which aims to improve...

Moshi:实时语音对话框架,支持多种语言和口音的语音对话基础模型-首席AI分享圈

Moshi: a real-time speech dialog framework with support for multiple languages and accents for speech dialog base models

Comprehensive Introduction Moshi Chat is an end-to-end real-time AI voice assistant from Kyutai, a French non-profit AI lab. It not only listens in real-time, but also engages in natural conversations and supports multimodal interactions, including the ability to see, hear, and speak.Moshi Chat understands the user's intonation and can be used in...

QAnything:高度集成RAG处理流程的本地知识库问答系统-首席AI分享圈

QAnything: Local Knowledge Base Q&A System with Highly Integrated RAG Processing Flow

QAnything General Introduction QAnything (Question and Answer based on Anything) is a local knowledge base Q&A system launched by NetEase, which supports all kinds of file formats and databases and can be installed and used offline. It can handle PDF, Word, PPT, XLS and other formats of documents, support for cross...

OpenSPG:开源知识图谱引擎-首席AI分享圈

OpenSPG: Open Source Knowledge Graph Engine

Comprehensive Introduction OpenSPG is an open source knowledge graph engine developed by Ant Group in collaboration with OpenKG, based on the SPG (Semantic Augmented Programmable Graph) framework. The engine is designed to provide features such as explicit semantic representation, logical rule definition and operational framework to support the construction and management of domain knowledge graphs.OpenSPG combines...

en_USEnglish