AI Sharing Circle

Day arching a pawn and sharing for the king!

1-2-1-MNVTON: Efficient images, virtual trying on of clothes by people in videos (to be opened)

General Introduction 1-2-1-MNVTON is a GitHub-based open source project that aims to provide "Modality-specific Normalization for Virtual Try-On" (MNVTON) technology through...

Latest AI Resources # AI Java Open Source Projecct # AI Face Swap and Dress Up

2yrs ago

075K

Kokoro-ONNX: Efficient Text-to-Speech Tool with Multi-Language and Multi-Voice Support

General Introduction Kokoro-ONNX is an open source text-to-speech (TTS) tool based on ONNX runtime. Developed by thewh1teagle, the project aims to provide efficient and fast speech synthesis solutions.Kokoro-ONNX supports ...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

2yrs ago

0136.3K

Zerox: PDF, DOCX, image conversion to Markdown, visual modeling high-precision OCR

Comprehensive introduction Zerox is an open source project designed to convert PDF, DOCX, images and other documents to Markdown format through visual modeling. The project is developed by getomni-ai team , provides a simple and efficient OCR (Optical Character Recognition) solution.Ze...

Latest AI Resources # AI Java Open Source Projecct # Document Extraction and Cleaning

2yrs ago

0102K

AIVLOG: Automatically editing video highlights, easy to make professional Vlogs

Comprehensive Introduction AIVLOG is an AI video editing tool designed for Vlog creators. It can automatically analyze video content and intelligently edit out the highlights, saving users 95% editing time. Whether it's daily life, travel records or conversation videos, AIVLOG can easily...

Latest AI Resources # AI audio/video editor

2yrs ago

098K

Charla: a minimalist endpoint-based AI chat tool with native integration to the Ollama backend

General Description Charla is an endpoint-based chat application designed to have conversations with native language models. The application integrates with the Ollama backend, supports context-aware conversations, and saves chat sessions as Markdown files. Users can simply...

Latest AI Resources # AI Java Open Source Projecct # AI Localized Chat Application

2yrs ago

084.6K

Windsurf Wave 2 重大更新：引入网页搜索和自动化记忆功能，并提供企业级混合部署版本

Windsurf Wave 2 Major Update: Introduces Web Search and Automated Memory Features with Enterprise Hybrid Deployment Edition

Codeium recently rolled out the Windsurf Wave 2 update, bringing several important feature upgrades to developers, including Web search, automated memories, and code execution optimization. As a Top 2 AI Coding tool, these updates are designed to provide developers with 20...

AI News

2yrs ago

071.8K

Google releases Vertex AI RAG engine: one-stop-shop for building reliable search-enhanced generative applications

Generative AI and Large Language Models (LLMs) are transforming industries, but two key challenges can hinder enterprise adoption: disillusionment (generating incorrect or meaningless information) and limited knowledge beyond their training data. Retrieval-augmented generation (RAG) and grounding ...

AI News

2yrs ago

073.2K

MiniRAG: Simplified Retrieval Enhanced Generation Framework, Entity Graph Index Recall Relevant Text Blocks

Comprehensive Introduction MiniRAG is an extremely simple Retrieval Augmented Generation (RAG) framework that aims to enable good RAG performance even for small models through heterogeneous graph indexing and lightweight topology-enhanced retrieval. It is developed by the Data Science Laboratory of the University of Hong Kong (HKUDS) to address ...

Latest AI Resources # AI Java Open Source Projecct # Knowledge Graph # Knowledge Retrieval with RAG Framework

2yrs ago

089.5K

Perplexity AI makes bid to merge (acquire) with US-based TikTok

The gist: Perplexity AI submitted a bid to TikTok's parent company ByteDance on Saturday proposing that Perplexity merge with TikTok's U.S. operations, CNBC has learned. A source familiar with the situation revealed...

AI News

2yrs ago

065K

Omni-RGPT: A Multimodal Large Model for Image and Video Region-Level Understanding to Enhance Visual Content Analysis

Comprehensive Introduction Omni-RGPT is a multimodal large language model designed to enable region-level understanding of images and videos. By introducing the Token Mark technique, Omni-RGPT is able to highlight target regions in the visual feature space with region cues (e.g., boxes or...

Latest AI Resources # AI Java Open Source Projecct

2yrs ago

090.7K

Loading more