General Introduction Kokoro-ONNX is an open source text-to-speech (TTS) tool based on ONNX runtime. Developed by thewh1teagle, the project aims to provide efficient and fast speech synthesis solutions.Kokoro-ONNX supports multiple languages, including English, and plans to support French, Japanese, Korean...
Comprehensive introduction Zerox is an open source project designed to convert PDF, DOCX, images and other documents to Markdown format through visual modeling . The project is developed by getomni-ai team , provides a simple and efficient OCR (Optical Character Recognition) solution.Zerox supports Node and Python programming languages, ...
Enable Builder Smart Programming Mode, unlimited use of DeepSeek-R1 and DeepSeek-V3, smoother experience than the overseas version. Just enter the Chinese commands, even a novice programmer can write his own apps with zero threshold.
Comprehensive Introduction AIVLOG is an AI video editing tool designed for Vlog creators. It can automatically analyze video content and intelligently edit out the highlights, saving users 95% editing time. Whether it's daily life, travel records or conversation videos, AIVLOG can handle it easily. Users do not need to have...
General Description Charla is an endpoint-based chat application designed to have conversations with native language models. The application integrates with the Ollama backend, supports context-aware conversations, and saves chat sessions as Markdown files. Users can launch and enable it through simple command line operations...
Comprehensive Introduction MiniRAG is an extremely simple Retrieval Augmented Generation (RAG) framework that aims to enable good RAG performance even for small models through heterogeneous graph indexing and lightweight topology-enhanced retrieval. It is developed by the Hong Kong University Data Science Laboratory (HKUDS) and focuses on solving the Small Language Model (SLM...
Comprehensive Introduction Omni-RGPT is a multimodal large language model designed to enable region-level understanding of images and videos. By introducing the Token Mark technique, Omni-RGPT is able to highlight target regions in the visual feature space and embed these tokens directly through region cues (e.g., boxes or masks), while placing...
Comprehensive Introduction Bailing (Bailing) is an open source voice conversation assistant designed to engage in natural conversations with users through speech. The project combines speech recognition (ASR), voice activity detection (VAD), large language modeling (LLM) and speech synthesis (TTS) technologies to achieve a GPT-4o-like speech...
Comprehensive Introduction Metaverse AI (open source version) is a project hosted on GitHub, developed by libn-net team. It can clone digital human images and voices through AI technology to generate short videos, and also supports dubbing and subtitling. The tool is available for Windows, Web, H5 and small...
General Introduction WikiChat is an experimental chatbot developed at Stanford University that aims to improve the factuality of large language models by retrieving data from Wikipedia. Large language models (such as ChatGPT and GPT-4) tend to make errors when dealing with up-to-date information or less popular topics.WikiCh...
General Introduction Entretien AI is an online platform focused on helping job seekers improve their interviewing skills. It utilizes artificial intelligence technology to simulate real interview scenarios, providing instant feedback and expert guidance. Users can use this platform for targeted practice to optimize their answering strategies and communication skills. Net...
General Introduction UGC Generator is a platform that utilizes artificial intelligence technology to quickly generate user-generated content (UGC) video ads. Users can generate high-quality UGC-style video ads in minutes by simply uploading product links. The platform provides a clean interface and powerful features to help users...
General Introduction OpenAI Edge TTS is an open source project that provides a native text-to-speech (TTS) API compatible with OpenAI.The project uses Microsoft Edge's online text-to-speech service to allow users to generate high-quality speech output.OpenAI Edge TTS supports a wide range of speech options...
General Description Charts Not Chapters is an AI-based tool focused on converting text and data into compelling infographics. It is unique in that it does not rely on templates, but instead generates each chart from scratch through AI, offering a high degree of customization. Users can create infographics from text, spreadsheets...
Comprehensive Introduction Cure AI is an online platform designed for medical researchers to optimize the scientific process through artificial intelligence technology. The platform provides access to over 26 million PubMed scientific articles and ranks evidence based on the relevance and quality of user queries.Cure AI works by seamlessly guiding...
General Introduction AIEvo is Ant Group's open source multi-agent framework designed to efficiently create multi-agent applications. The framework strictly follows the SOP task graph to improve the execution success rate of complex tasks , and through feedback and monitoring mechanisms to ensure high flexibility and scalability.AIEvo has been verified in the Ant Group internal production environment ...
Comprehensive Introduction Allwyse is an intelligent platform designed specifically for advisor businesses, designed to help advisors optimize client management and scheduling by integrating multiple tools and features. The platform provides automated scheduling, client data management, AI assistant, real-time analytics, and other features to help advisors improve productivity...
General Introduction Bakery is a platform designed for AI startups, machine learning engineers and researchers to provide simple and efficient AI model fine-tuning and monetization services. Users can access community-driven datasets through Bakery, create or upload their own datasets, fine-tune model settings, and market...
Comprehensive Introduction Ragie.ai is a fully managed RAG (Retrieval-Augmented Generation) service platform designed for developers. With Ragie.ai, developers can easily connect applications with user data, utilizing pre-built integration tools such as Google Drive, Gmail,...
General Introduction PPTAgent is an innovative system designed to automatically generate presentations from documents. The system draws on the human approach to creating presentations, using a two-step process to ensure content quality and visualization. In addition, PPTAgent introduces PPTEval, a comprehensive evaluation framework for generating presentations from within...