AI Sharing Circle

Day arching a pawn and sharing for the king!
1-2-1-MNVTON:高效图像、视频中人物虚拟试穿服装(待开放)

1-2-1-MNVTON: Efficient images, virtual trying on of clothes by people in videos (to be opened)

General Introduction 1-2-1-MNVTON is a GitHub-based open source project that aims to provide "Modality-specific Normalization for Virtual Try-On" (MNVTON) technology through...
1yrs ago
048.9K
Kokoro-ONNX:高效文本转语音工具,支持多语言和多声音选择

Kokoro-ONNX: Efficient Text-to-Speech Tool with Multi-Language and Multi-Voice Support

General Introduction Kokoro-ONNX is an open source text-to-speech (TTS) tool based on ONNX runtime. Developed by thewh1teagle, the project aims to provide efficient and fast speech synthesis solutions.Kokoro-ONNX supports ...
1yrs ago
0103K
Zerox:PDF、DOCX、图像转换为Markdown,视觉模型高精度OCR

Zerox: PDF, DOCX, image conversion to Markdown, visual modeling high-precision OCR

Comprehensive introduction Zerox is an open source project designed to convert PDF, DOCX, images and other documents to Markdown format through visual modeling. The project is developed by getomni-ai team , provides a simple and efficient OCR (Optical Character Recognition) solution.Ze...
1yrs ago
074.6K
AIVLOG:自动剪辑视频精彩片段,轻松制作专业Vlog

AIVLOG: Automatically editing video highlights, easy to make professional Vlogs

Comprehensive Introduction AIVLOG is an AI video editing tool designed for Vlog creators. It can automatically analyze video content and intelligently edit out the highlights, saving users 95% editing time. Whether it's daily life, travel records or conversation videos, AIVLOG can easily...
1yrs ago
063.6K
Charla:基于终端的极简AI聊天工具,本地集成Ollama后端

Charla: a minimalist endpoint-based AI chat tool with native integration to the Ollama backend

General Description Charla is an endpoint-based chat application designed to have conversations with native language models. The application integrates with the Ollama backend, supports context-aware conversations, and saves chat sessions as Markdown files. Users can simply...
1yrs ago
059.3K
Windsurf Wave 2 重大更新:引入网页搜索和自动化记忆功能,并提供企业级混合部署版本

Windsurf Wave 2 Major Update: Introduces Web Search and Automated Memory Features with Enterprise Hybrid Deployment Edition

Codeium recently rolled out the Windsurf Wave 2 update, bringing several important feature upgrades to developers, including Web search, automated memories, and code execution optimization. As a Top 2 AI Coding tool, these updates are designed to provide developers with 20...
1yrs ago
050.1K
谷歌发布 Vertex AI RAG 引擎:一站式构建可靠的检索增强生成应用

Google releases Vertex AI RAG engine: one-stop-shop for building reliable search-enhanced generative applications

Generative AI and Large Language Models (LLMs) are transforming industries, but two key challenges can hinder enterprise adoption: disillusionment (generating incorrect or meaningless information) and limited knowledge beyond their training data. Retrieval-augmented generation (RAG) and grounding ...
1yrs ago
050.7K
MiniRAG:简化检索增强生成框架,实体图索引召回相关文本块

MiniRAG: Simplified Retrieval Enhanced Generation Framework, Entity Graph Index Recall Relevant Text Blocks

Comprehensive Introduction MiniRAG is an extremely simple Retrieval Augmented Generation (RAG) framework that aims to enable good RAG performance even for small models through heterogeneous graph indexing and lightweight topology-enhanced retrieval. It is developed by the Data Science Laboratory of the University of Hong Kong (HKUDS) to address ...
1yrs ago
061.4K
Perplexity AI 提出与美国 TikTok 合并(收购)的竞标方案

Perplexity AI makes bid to merge (acquire) with US-based TikTok

The gist: Perplexity AI submitted a bid to TikTok's parent company ByteDance on Saturday proposing that Perplexity merge with TikTok's U.S. operations, CNBC has learned. A source familiar with the situation revealed...
1yrs ago
040.8K
Omni-RGPT:图像和视频区域级理解多模态大模型,提升视觉内容分析能力

Omni-RGPT: A Multimodal Large Model for Image and Video Region-Level Understanding to Enhance Visual Content Analysis

Comprehensive Introduction Omni-RGPT is a multimodal large language model designed to enable region-level understanding of images and videos. By introducing the Token Mark technique, Omni-RGPT is able to highlight target regions in the visual feature space with region cues (e.g., boxes or...
1yrs ago
061.9K