AI open source project

Total 1020 articles posts

Sorting

Kreuzberg: open source tool to extract text from any document

General Introduction Kreuzberg is a library for simplifying text extraction from PDF files, designed to provide a simple, hassle-free text extraction solution. The library is particularly suitable for RAG (Retrieval-Augmented Generatio...

Latest AI Resources # AI Java Open Source Projecct # Document Extraction and Cleaning

6mos ago

02K

HunyuanVideoGP: A Hybrid Video Generation Model with Support for Running on Low-End GPUs

General Introduction HunyuanVideoGP is a large-scale video generation model developed by DeepBeepMeep and designed for low-end GPU users. The model is an improved version of the original Hunyuan Video model, significantly reducing memory and graphics memory requirements...

Latest AI Resources # AI Image to Video # AI Java Open Source Projecct

6mos ago

01.9K

InspireMusic: Ali's open source unified music, song and audio generation framework

General Introduction InspireMusic is a PyTorch-based open source toolkit focused on music, song, and audio generation. It provides a unified framework for generating high-quality audio with controls for text cues, music structure, and music style.Inspire...

Latest AI Resources # AI Java Open Source Projecct # AI Music

5mos ago

02.6K

Gemini Playground: Serverless Deployment of a Gemini Multimodal Dialog Site

General Introduction Gemini Playground is an open source project designed to help users quickly deploy a multimodal dialog site . The project is developed by technical crawling shrimp , support the use of Gemini API Key in 10 seconds to complete the deployment . Whether the user is ...

Latest AI Resources # AI Java Open Source Projecct # Free Large Model API

6mos ago

02.7K

wdoc: retrieve content and summarize knowledge from massive, multi-source documents

Comprehensive Introduction wdoc is a powerful RAG (Retrieval Augmentation Generation) system designed for processing and analyzing large and diverse documents. It is capable of retrieving from a wide range of document types, including PDFs, web pages, YouTube videos, audio files, etc. wdoc is particularly well suited for processing...

Latest AI Resources # AI Java Open Source Projecct # Knowledge Retrieval with RAG Framework

6mos ago

02.5K

Magic 1-For-1: 高效生成视频的开源项目，号称在一分钟内生成一分钟的视频

Magic 1-For-1: efficient generation of video open source project that claims to generate a minute of video in one minute

Comprehensive Introduction Magic 1-For-1 is an efficient video generation model designed to optimize memory usage and reduce inference latency. The model decomposes the text-to-video generation task into two subtasks: text-to-image generation and image-to-video generation, enabling more efficient training and distillation...

Latest AI Resources # AI Java Open Source Projecct # AI text to video

6mos ago

02.6K

DataLine: AI Data Analysis and Visualization Client for Fast Chart and Report Generation

Comprehensive Introduction DataLine is a powerful AI data analysis and visualization tool designed to help users interact with a variety of data sources through simple operations. Whether it is a CSV file or a mainstream database such as Postgres, MySQL, Snowflake, SQL...

Latest AI Resources # AI Java Open Source Projecct # AI data analysis

6mos ago

02.8K

FinRobot: An Intelligent Body to Improve Financial Data Analysis Efficiency and Investment Research

Comprehensive Introduction FinRobot is an open source AI intelligence platform developed by AI4Finance Foundation and designed for financial analytics. It not only covers traditional language models, but also incorporates a variety of AI technologies, aiming to provide a comprehensive solution for the financial industry.F...

Latest AI Resources # AI Java Open Source Projecct # AI Financial Data Analytics

6mos ago

02.5K

Simba: Knowledge management system for organizing documents, seamlessly integrated into any RAG system

General Introduction Simba is a portable Knowledge Management System (KMS) designed to integrate seamlessly with any Retrieval Augmentation Generation (RAG) system. Created by GitHub user GitHamza0206, the project provides an efficient knowledge management solution for a variety of...

Latest AI Resources # AI Java Open Source Projecct # Knowledge Retrieval with RAG Framework

6mos ago

02.3K

LocalPdfChatRAG: Intelligent Chat Tool to Support Local Multi-Source PDF Document Q&A

Comprehensive Introduction LocalPdfChatRAG is an open source project that aims to implement intelligent chat functionality by combining local PDF documents with Retrieval Augmented Generation (RAG) models. The project allows users to upload PDF documents and ask questions through natural language to get from the document to the relative ...

Latest AI Resources # AI Java Open Source Projecct # Knowledge Retrieval with RAG Framework

6mos ago

02.4K

Deep Searcher: Efficient Retrieval of Enterprise Private Documents and Intelligent Q&A

Comprehensive Introduction Deep Searcher is a tool that combines powerful big language models (such as DeepSeek and OpenAI) and vector databases (such as Milvus) designed to search, evaluate, and reason based on private data, providing highly accurate answers...

Latest AI Resources # AI Java Open Source Projecct # Knowledge Retrieval with RAG Framework

6mos ago

02K

Flashcard：基于Dify构建的单词闪卡外语学习工具，替代多邻国（Duolingo）

Flashcard: a word flashcard foreign language learning tool built on Dify, replacing Duolingo.

General Introduction Flashcard is an open source language learning tool designed to provide an alternative to Duolingo. Developed by Steven Lynn (GitHub username: stvlynn), the project features a modern user interface and multilingual...

Latest AI Resources # AI Java Open Source Projecct # AI Educational Tools

6mos ago

02.3K

LineAvatars: a free tool for generating Notion-style line avatars

General Description LineAvatars is a free and easy to use online tool designed to generate Notion style line avatars. Users can upload a photo or take a photo via webcam and the system will automatically generate a line avatar using AI. This tool...

Latest AI Resources # AI Image Style Control # AI Java Open Source Projecct

6mos ago

02.4K

Goku: Generates detailed and consistent videos, ideal for creating commercials with detailed characters and objects.

Comprehensive Introduction Goku is a federated image and video generation model based on stream transformation techniques designed to achieve industry-grade performance. It integrates advanced high-quality visual generation techniques, including fine-grained data organization, model design, and stream transform formulation.Goku's main contributions include high-quality fine-grained...

Latest AI Resources # AI Image to Video # AI Java Open Source Projecct # AI text to video

6mos ago

03.2K

Gemini Cursor：基于Gemini构建的AI桌面智能助手，能看、能听、能说

Gemini Cursor: an AI desktop smart assistant built on Gemini that can see, hear and speak

General Introduction Gemini Cursor is a desktop intelligent assistant based on Google's Gemini 2.0 Flash (experimental) model. It enables visual, auditory, and voice interactions through a multimodal API, providing real-time low-latency use...

Latest AI Resources # AI Java Open Source Projecct # Multimodal Real-Time Interactive Products

6mos ago

04K

Data Formulator: an AI-driven data visualization tool

General Introduction Data Formulator is an open source AI-driven data visualization tool developed by Microsoft Research. The tool combines a graphical user interface (GUI) and natural language input (NL) to enable users to quickly create and iterate through simple interactions and commands...

Latest AI Resources # AI Java Open Source Projecct # AI data analysis

6mos ago

02.8K

Ai2 OLMoE: An Open Source iOS AI App Based on OLMoE Models Running Offline

General Introduction Ai2 OLMoE is an open source iOS app developed by the Allen Institute for AI (Ai2, Allen Institute for Artificial Intelligence) to provide AI models that run entirely on devices. The app utilizes Ai2's open source ol...

Latest AI Resources # AI Big Model Native Conversation Tool # AI Java Open Source Projecct # AI Localized Chat Application

6mos ago

03.5K

Meetily: an AI assistant for generating meeting minutes, transcribing and generating meeting summaries in real-time

General Description Meetily is an AI-powered meeting assistant developed by Zackriya Solutions that captures meeting audio in real-time, performs voice transcription, and generates meeting summaries. It is unique in that all processing is done locally on the device, ensuring user privacy...

Latest AI Resources # AI Java Open Source Projecct # AI Text and Audio/Video Summarization Tool

6mos ago

03.1K

DeepSeek-VL2: an expert visual language model for advanced multimodal understanding

Comprehensive Introduction DeepSeek-VL2 is a series of advanced Mixture-of-Experts (MoE) visual language models that significantly improve the performance of its predecessor, DeepSeek-VL. The models are useful in visual question and answer, optical character recognition, text...

Latest AI Resources # AI Java Open Source Projecct # Multimodal Real-Time Interactive Products

6mos ago

03K

Zonos: High Quality Speech Synthesis and Speech Cloning Tools

General Introduction Zonos is an open source speech synthesis and speech cloning tool developed by Zyphra.The Zonos-v0.1 version uses an advanced Transformer and blending model to generate high quality speech output. The tool supports multiple languages...

Latest AI Resources # AI Java Open Source Projecct # AI voice cloning

6mos ago

03.2K

ChatGPT Box: Browser plug-in to make ChatGPT work on other web pages

General Introduction ChatGPT Box is an open source browser extension designed to deeply integrate ChatGPT into the user's browser. Developed by josStorer, the tool supports multiple languages and provides a variety of features such as calling chat pairs on any page...

Latest AI Resources # AI Java Open Source Projecct # AI Integrated Multi-Model Dialog Platform # Browser AI Assistant

4mos ago

02.6K

小半 WordPress AI 助手：实现对话、文章生成与翻译的 WordPress AI助手插件

Little Half WordPress AI Assistant: A WordPress AI Assistant Plugin for Conversation, Post Generation and Translation

Comprehensive Introduction WordPress AI Assistant Plugin (wp-ai-chat) is an open source WordPress plugin designed to provide users with a variety of AI features, including AI conversations, article generation, article summarization, article translation and content reading. The plugin supports docking multiple ...

Latest AI Resources # AI Writing # AI Java Open Source Projecct

6mos ago

02.6K

Promptfoo: Providing a Safe and Reliable LLM Application Testing Tool

Comprehensive Introduction promptfoo is an open source command line tool and library dedicated to evaluating and red-teaming test Large Language Model (LLM) applications. It provides developers with a complete set of tools for building reliable prompts, models, and retrieval-based generation (RAGs) with self...

Latest AI Resources # AI Java Open Source Projecct

6mos ago

02.6K

NoneBot DeepSeek 插件：基于 NoneBot&DeepSeek 实现客服智能对话

NoneBot DeepSeek Plugin: Intelligent dialog for customer service based on NoneBot & DeepSeek.

General Introduction The NoneBot DeepSeek plugin is a NoneBot plugin that integrates the DeepSeek model and is designed to provide intelligent dialog and Q&A functionality. By accessing the DeepSeek model, users can use the NoneBot ...

Latest AI Resources # AI Customer Service Robot # AI Java Open Source Projecct

6mos ago

02.5K

Solana Agent Kit: an open source toolkit for connecting AI intelligences to the Solana protocol

General Introduction Solana Agent Kit is an open source toolkit designed to seamlessly connect AI intelligences to the Solana blockchain protocol. The kit allows both AI researchers and cryptocurrency developers to use any model-trained intelligent body to execute over...

Latest AI Resources # AI Java Open Source Projecct # Intelligent Body Development Framework

6mos ago

02.5K

LiberSonora: Audiobook Subtitle Extraction and Multilingual Translation, Audiobook Transcription into Multiple Languages

General Introduction LiberSonora, which means "free sound", is a powerful AI-enabled open source audiobook toolset. The toolset supports intelligent subtitle extraction, AI title generation, multi-language translation, etc., and is capable of batch offline processing under GPU acceleration.LiberSo...

Latest AI Resources # AI Java Open Source Projecct # AI Translation # AI Speech to Text

6mos ago

02.4K

go-stock: AI-enabled stock analysis tool, real-time monitoring of self-picked stock quotes and in-depth analysis based on AI

Comprehensive introduction go-stock is an AI-enabled stock analysis tool built on Wails and NaiveUI. The tool is able to monitor real-time stock quotes, provide cost and profit/loss display and push up/down alarm function. All data is saved locally to ensure that users...

Latest AI Resources # AI Java Open Source Projecct # AI Financial Data Analytics

6mos ago

02.9K

RSS Translator: a tool for subscribing to and translating RSS content in real time

General Introduction RSS Translator is an open source, clean and self-deployable tool designed to help users translate and subscribe to RSS content in real time. The tool supports multiple translation engines including Google Translate, Microsoft Tra...

Latest AI Resources # AI Java Open Source Projecct # AI Translation

6mos ago

02.7K

KTransformers: Large Model Inference Performance Engine: Extreme Acceleration, Flexible Empowerment

KTransformers: A high-performance Python framework designed to break through the bottleneck of large model inference. It is not just a simple model running tool, but also a set of extreme performance optimization engine and flexible interface empowerment platform. KTransf...

Latest AI Resources # AI Java Open Source Projecct

6mos ago

03K

VideoRAG: A RAG framework for understanding ultra-long videos with support for multimodal retrieval and knowledge graph construction

Comprehensive Introduction VideoRAG is a retrieval-enhanced generative framework designed for processing and understanding very long contextual videos. The tool combines a graph-driven textual knowledge base with hierarchical multimodal context encoding to efficiently process on a single NVIDIA RTX 3090 GPU...

Latest AI Resources # AI Java Open Source Projecct # Knowledge Retrieval with RAG Framework

6mos ago

03K

Tifa-Deepsex-14b-CoT: a large model that specializes in roleplaying and ultra-long fiction generation

Comprehensive Introduction Tifa-Deepsex-14b-CoT is a Deepseek-R1-14B deep-optimized macromodel based on Deepseek-R1-14B, focusing on role-playing, fictional text generation, and Chain of Thought (CoT) push...

Latest AI Resources # AI Java Open Source Projecct # AI Role Play

6mos ago

06.6K

Instructor: a Python library to simplify structured output workflows for large language models

Comprehensive Introduction Instructor is a popular Python library designed for processing structured output from Large Language Models (LLMs). Built on Pydantic, it provides a simple, transparent and user-friendly API for managing data...

Latest AI Resources # AI Java Open Source Projecct # Document Extraction and Cleaning

6mos ago

02.4K

MedRAX: A Smart Body for Chest X-ray Analysis Using Multimodal Large Models

Comprehensive Introduction MedRAX is a state-of-the-art AI intelligence designed for chest radiograph (CXR) analysis. It integrates state-of-the-art CXR analysis tools and multimodal large language models to dynamically process complex medical queries without additional training.MedRAX, through its modular design...

Latest AI Resources # AI Java Open Source Projecct # Intelligent Body Application # Visual Target Detection

5mos ago

02.7K

LangBot：开源大模型即时通信机器人，支持多微信、QQ、飞书等多平台部署AI机器人

LangBot: open source large model instant messaging robot, support for multiple WeChat, QQ, Flybook and other multi-platform deployment of AI robots

LangBot is a large model-based instant messaging bot platform that supports multiple messaging platforms and large models. The platform adapts to QQ, WeChat (enterprise WeChat, personal WeChat), Flybook, Discord, OneBot and other messaging platforms, and supports Open...

Latest AI Resources # AI Java Open Source Projecct

6mos ago

02.9K

zChunk: a generic semantic chunking strategy based on Llama-70B

Comprehensive Introduction zChunk is a novel chunking strategy developed by ZeroEntropy that aims to provide a solution for generic semantic chunking. The strategy is based on the Llama-70B model, which optimizes the chunking process of documents by prompting for chunks to be generated, ensuring that information retrieval is maintained at a high...

Latest AI Resources # AI Java Open Source Projecct # Document Extraction and Cleaning

6mos ago

02.3K

Hibiki: a real-time speech translation model, streaming translation that preserves the characteristics of the original voice

General Introduction Hibiki is a high-fidelity real-time speech translation model developed by Kyutai Labs. Unlike traditional offline translation, Hibiki is able to generate natural speech translation in the target language and provide text translation in real time while the user is speaking. The model...

Latest AI Resources # AI Java Open Source Projecct # AI Translation

6mos ago

03.3K

Qwen4Mac: Use Qwen's big models in the Mac menu bar to have conversations on the go!

General Introduction Qwen4Mac is an open source project designed to integrate the Qwen Large Language Model (LLM) into the Mac's menu bar, making it easy for users to call and use at any time. The project is developed and maintained by andreaturchet and provides an easy way for users to...

Latest AI Resources # AI Java Open Source Projecct

6mos ago

02.1K

口袋AI：手机中运行的离线AI助手，适配 DeepSeek-R1 (5.37GB)

Pocket AI: offline AI assistant running in your phone, adapted for DeepSeek-R1 (5.37GB)

General Introduction Pocket AI (PocketPal AI Chinese version) is a powerful offline AI assistant designed to allow users to talk to AI anytime, anywhere. The project is based on Small Language Models (SLMs) and runs on cell phones without internet connection, especially adapted to Chinese user experience. Mouth...

Latest AI Resources # AI Java Open Source Projecct # AI Localized Chat Application

6mos ago

03.1K

Kokoro WebGPU: A Text-to-Speech Service for Offline Operation in Browsers

General Introduction Kokoro WebGPU is a WebGPU version of the Kokoro text-to-speech (TTS) model, provided by WebML Community on the Hugging Face platform. The project utilizes WebGPU technology to enable users to...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

6mos ago

03.4K

OpenHealthForAll：个人健康数据管理AI助手，上传检查报告定制健康计划

OpenHealthForAll: AI assistant for personal health data management, uploading examination reports to customize health plans

General Introduction OpenHealthForAll is an open source project designed to help users manage and understand their personal health data. By utilizing artificial intelligence technology, OpenHealthForAll provides a locally run health assistant to help users better manage...

Latest AI Resources # AI Java Open Source Projecct # AI Life Efficiency Assistant

6mos ago

02K

OpenPilot: open source autonomous driving system, DIY a set of your own intelligent driving system for your car

General Introduction OpenPilot is an open source autonomous driving system developed by comma.ai to enhance the driving experience and safety of existing vehicles with advanced driver assistance features. Since its first release in 2016, OpenPilot has supported over 2...

Latest AI Resources # AI Java Open Source Projecct

6mos ago

02.6K

Agentic Security：开源的LLM漏洞扫描工具，提供全面的模糊测试和攻击技术

Agentic Security: open source LLM vulnerability scanning tool that provides comprehensive fuzz testing and attack techniques

General Introduction Agentic Security is an open source LLM (Large Language Model) vulnerability scanning tool designed to provide developers and security professionals with comprehensive fuzz testing and attack techniques. The tool supports customized rule sets or agent-based attacks and is able to integrate LLM AP...

Latest AI Resources # AI Java Open Source Projecct # prompt jailbreak

6mos ago

02.7K

CogVLM2: Open Source Multimodal Modeling with Support for Video Comprehension and Multi-Round Dialogue

Comprehensive Introduction CogVLM2 is an open source multimodal model developed by the Tsinghua University Data Mining Research Group (THUDM), based on the Llama3-8B architecture, and designed to provide performance comparable to or even better than GPT-4V. The model supports image understanding, multi-round dialogs, and visual ...

Latest AI Resources # AI Java Open Source Projecct # Visual Target Detection

6mos ago

02.4K

VisoMaster: Powerful and easy-to-use photo/video face changing and editing software

General Introduction VisoMaster is a powerful and easy-to-use video face-swapping and editing tool that utilizes artificial intelligence technology to achieve natural and realistic face-swapping effects. Whether it's an image or a video, VisoMaster can generate high-quality face swap results with simple operations, suitable for general...

Latest AI Resources # AI Java Open Source Projecct # AI Face Swap and Dress Up # AI video face swap

6mos ago

04.1K

RAG-based construction of a mini-assistant providing health advice (pilot project)

Comprehensive Introduction LLM-RAG-Longevity-Coach is a chatbot based on Large Language Modeling (LLM) and Retrieval Augmented Generation (RAG) technologies designed to provide users with personalized health and longevity advice. The project was developed by Tyler Burle...

Latest AI Resources # AI Java Open Source Projecct # AI Life Efficiency Assistant

6mos ago

02.7K

Maestro: A tool to simplify the process of fine-tuning mainstream open source visual language models

Comprehensive Introduction Maestro is a tool developed by Roboflow to simplify and accelerate the process of fine-tuning multimodal models, so that everyone can train their own visual macromodels. It provides ready-made recipes for fine-tuning popular visual language models (VLMs) such as F...

Latest AI Resources # AI Java Open Source Projecct # Large model fine-tuning

6mos ago

02.6K

One-Prompt-One-Story: Text Prompts Generate Character Identity Consistent Images

Synthesis One-Prompt-One-Story (1Prompt1Story) is an innovative text-to-image generation tool designed to enable consistent image generation from a single prompt. It was presented by Tao Liu et al. at the ICLR 2025...

Latest AI Resources # AI Image Style Control # AI Java Open Source Projecct

6mos ago

02.1K

Adding a RAG-driven online chat tool to Next.js applications

Comprehensive Introduction The Upstash RAG Chat Component is a React component designed for Next.js apps to provide an AI chat interface based on RAG (Retrieval Augmented Generation) technology. The component combines the Upstash V...

Latest AI Resources # AI Java Open Source Projecct

6mos ago

02.2K

AudioNotes: Quickly Extract Audio and Video Content and Generate Structured Notes

Comprehensive Introduction AudioNotes is an audio/video to structured notes system built on FunASR and Qwen2. It can quickly extract audio/video content and call the big model to organize it and generate a structured Markdown notes, which is convenient for...

Latest AI Resources # AI Java Open Source Projecct # AI Speech to Text

6mos ago

02.6K

Bilingual Book Maker：使用AI翻译制作双语电子书，全书自动化翻译工具

Bilingual Book Maker: Use AI translation to make bilingual e-books, full book automated translation tool

General Introduction Bilingual Book Maker is an open source project designed to help users create multilingual versions of eBooks using AI technology. The tool mainly uses ChatGPT for translation and supports multiple file formats including epub, txt and srt...

Latest AI Resources # AI Java Open Source Projecct # AI Translation

6mos ago

02.7K

Rowfill: Batch Extraction of Structured Information from Documents and Automated Analysis

General Introduction Rowfill is an open source document processing platform designed for knowledge workers. It uses advanced artificial intelligence techniques to extract, analyze and process data from complex documents, images and PDFs.Rowfill supports Native Large Language Model (LLM) and Ope...

Latest AI Resources # AI Java Open Source Projecct # AI data analysis # Document Extraction and Cleaning

6mos ago

02.3K

PRAG: Parameterized Retrieval Augmentation Generation Tool for Improving the Performance of Q&A Systems

Comprehensive Introduction PRAG (Parametric Retrieval-Augmented Generation) is an innovative retrieval-augmented generation tool designed to enhance the generation of large language models (LLMs) by embedding external knowledge directly into the parameter space of...

Latest AI Resources # AI Java Open Source Projecct # Knowledge Retrieval with RAG Framework

6mos ago

03.5K

GPT Researcher: Generate comprehensive, detailed research reports utilizing local and web-based data

Comprehensive Introduction GPT Researcher is an autonomous agent tool based on the Large Language Model (LLM) designed to perform local and web research and generate detailed research reports. The tool provides stable performance and faster speed by parallelizing agent work, ensuring that the information is accurate...

Latest AI Resources # AI Java Open Source Projecct # Generate in-depth research report

4mos ago

02.2K

Linly-Talker：数字人智能对话系统，结合大语言模型与视觉模型，实现互动新体验

Linly-Talker: An Intelligent Dialogue System for Digital People, Combining Big Language Modeling and Visual Modeling for a New Interactive Experience

Comprehensive Introduction Linly-Talker is an innovative digital human dialog system that combines Large Language Models (LLMs) with visual models to create a novel approach to human-computer interaction. The system integrates a variety of technologies such as Whisper, Linly, Micros...

Latest AI Resources # AI Java Open Source Projecct # AI Digital Man

6mos ago

02.6K

Airweave: enabling apps to quickly integrate knowledge bases for intelligent searching

General Introduction Airweave is an open source tool designed to make any application searchable by synchronizing a user's application data, APIs, databases, and websites to graph and vector databases.Airweave simplifies the process of making data searchable, whether it's structured data or...

Latest AI Resources # AI Java Open Source Projecct # Knowledge Retrieval with RAG Framework

6mos ago

02.1K

ai-gradio: Easily Integrate Multiple AI Models and Build Multimodal Applications Based on Gradio

Comprehensive Introduction ai-gradio is an open source Python toolkit designed to help developers easily integrate and use multiple AI models. Built on Gradio, the project provides a unified interface to support multiple AI models and services. Whether it is text, speech or video...

Latest AI Resources # AI Java Open Source Projecct

6mos ago

02.9K

OpenDeepResearcher: automated in-depth research tool to write complete research reports

General Introduction OpenDeepResearcher is an open source automated deep research tool designed to improve research efficiency through artificial intelligence techniques. The project is developed by mshumer and hosted on GitHub.OpenDeepResear...

Latest AI Resources # AI Java Open Source Projecct # Generate in-depth research report

4mos ago

02.7K

ColiVara: Visual Embedding Based Document Storage and Retrieval Service

General Introduction ColiVara is a document storage and retrieval service based on visual embedding technology. It eliminates the need for Optical Character Recognition (OCR) or text extraction and avoids the problems of broken forms or lost images.ColiVara supports more than 100 file formats, including PDF...

Latest AI Resources # AI Java Open Source Projecct # Knowledge Retrieval with RAG Framework

6mos ago

02.6K

Cursor Reset：重置 Cursor 0.45.x 以上版本设备标识的脚本

Cursor Reset: A script to reset the device identifier for Cursor versions 0.45.x and above.

General Introduction Cursor Reset is a PowerShell scripting tool for resetting device identifiers in Cursor IDE, supporting Cursor version 0.45.x. The tool is designed to help users reset device identifiers in the Cursor IDE...

Latest AI Resources # AI Java Open Source Projecct

6mos ago

03.8K

n8n Self-hosted AI Starter Kit: an open source template for quickly building a local AI environment

Comprehensive Introduction The n8n Self-Hosted AI Starter Kit is an open source Docker Compose template designed to quickly initialize a comprehensive local AI and low-code development environment. Crafted by the n8n team, the suite combines the self-hosted n8n platform with a range of compatible AI...

Latest AI Resources # AI Java Open Source Projecct

6mos ago

03.2K

Gemini Teacher: English Speaking Pronunciation Correction Assistant

General Introduction Gemini Teacher is an English speaking practice assistant based on Google Gemini AI. It recognizes the user's English pronunciation in real time and provides instant feedback and correction suggestions. The tool is designed to help users improve their English speaking skills through...

Latest AI Resources # AI Java Open Source Projecct # AI Educational Tools

6mos ago

02.7K

bilive: Unsupervised live recording and automatic slicing and uploading tools for B station

Comprehensive Introduction bilive is a tool designed for B station live recording, providing extremely fast live recording, auto-slicing, pop-up rendering and subtitle generation. The tool is compatible with ultra-low configuration machines, supports 7x24 hours unattended recording, automatically recognizes and renders pop-ups and subtitles, automatically slices and...

Latest AI Resources # AI Java Open Source Projecct # AI audio/video editor

6mos ago

02.7K

R1-V: Low-cost reinforcement learning for visual language model generalization capability

Comprehensive Introduction R1-V is an open source project that aims to achieve breakthroughs in visual language modeling (VLM) through low-cost reinforcement learning (RL). The project utilizes a verifiable reward mechanism to incentivize VLMs to learn generic counting abilities. Amazingly, R1-V's 2B ...

Latest AI Resources # AI Java Open Source Projecct

6mos ago

02.8K

PPTX2MD: Specialized tool for converting PPTX files to Markdown

General Introduction PPTX2MD is an open source tool designed to convert PowerPoint PPTX files to Markdown format. Developed by GitHub user ssine, the tool supports preserving headings, lists, text formatting (e.g., bold, italic, color, and super...

Latest AI Resources # AI Java Open Source Projecct # Document Extraction and Cleaning

6mos ago

02.3K

DSPy Examples: Practical examples demonstrating DSPy functionality

General Introduction The DSPy Example Codebase is a GitHub codebase maintained by the Langtrace AI team that showcases a variety of example AI programs built using DSPy. The codebase is designed to demonstrate the many features of DSPy through real-world examples to help developers better understand...

Latest AI Resources # AI Java Open Source Projecct

6mos ago

02.8K

Go-Proxy: A High Performance Reverse Proxy Server for Docker Integration

Comprehensive Introduction Go-Proxy is a high-performance proxy server developed using the Go language , mainly used to provide proxy services in different network environments . It supports a variety of protocols , including HTTP, HTTPS, SOCKS5, WebSocket, TCP and UDP, can ...

Latest AI Resources # AI Java Open Source Projecct

6mos ago

02.8K

CoT-Lab: an experimental dialog tool for exploring iterative thinking about human-computer collaboration

CoT-Lab is an experimental interface for exploring a new paradigm of human-computer collaboration. Based on Cognitive Load Theory and Active Learning Principles, CoT-Lab facilitates deep cognitive alignment between humans and Artificial Intelligence (AI) through the creation of "thinking partner" relationships. The program aims to...

Latest AI Resources # AI Java Open Source Projecct

6mos ago

02.2K

Browser extension to enable DeepSeek official chat interface to support inline rendering charts

General Description DeepSeek Diagrams Extension is a Chrome extension designed to help users render diagrams inline in the DeepSeek website. The extension is based on Mermaid...

Latest AI Resources # AI Java Open Source Projecct

6mos ago

02.5K

Orate: A Unified API for Integrating Well-Known Speech Generation, Speech Transcription and Voice Change Models

Comprehensive Introduction Orate is an AI toolkit focused on speech generation and transcription. It provides a unified API that seamlessly integrates with leading AI providers such as OpenAI, ElevenLabs, and AssemblyAI to help users create forced...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech # AI Speech to Text

6mos ago

02.9K

Reflex LLM Examples: a collection of AI applications demonstrating practical applications of large language models

Comprehensive Introduction Reflex LLM Examples is an open source project created by the Reflex development team to demonstrate practical applications of the Large Language Model (LLM). The project brings together several AI applications built on Reflex, showcasing applications from Googl...

Latest AI Resources # AI Java Open Source Projecct

6mos ago

02.1K

DeepClaude：融合DeepSeek R1链式推理与Claude创造力的聊天界面

DeepClaude: A Chat Interface Fusing DeepSeek R1 Chained Reasoning and Claude Creativity

Comprehensive Introduction DeepClaude is a high-performance Large Language Model (LLM) inference API and chat interface that integrates the chained inference (CoT) capabilities of DeepSeek R1 with the creativity and code generation of the Anthropic Claude model...

Latest AI Resources # AI Java Open Source Projecct # AI Localized Chat Application

6mos ago

03.5K

BEN2: Deep learning model for fast background removal from images, videos

Comprehensive Introduction BEN2 (Background Erase Network 2) is a deep learning model developed by Prama LLC that specializes in automatically removing the background from an image and generating a foreground image. The model uses an innovative Confiden...

Latest AI Resources # AI Java Open Source Projecct # AI keying to change backgrounds

2mos ago

03.8K

AI Web Operator：浏览器自动化操作，OpenAI Operator的开源实现

AI Web Operator: Browser Automation, an Open Source Implementation of OpenAI Operator

General Introduction AI Web Operator is an open source AI browser operator tool designed to simplify the user experience in the browser by integrating multiple AI technologies and SDKs. The tool is based on Browserbase and Vercel...

Latest AI Resources # AI Java Open Source Projecct # Multimodal Real-Time Interactive Products

6mos ago

02.8K

Exa & Deepseek Chat App：实时Web搜索与智能推理的开源聊天应用

Exa & Deepseek Chat App: Open Source Chat App for Real-Time Web Search and Intelligent Reasoning

Comprehensive Introduction Exa & Deepseek Chat App is an open source intelligent chat application, the main features include real-time web search using Exa's API and intelligent use of Deepseek R1 language model...

Latest AI Resources # AI Java Open Source Projecct # AI search tool

6mos ago

02.6K

LLM API Engine: Rapid API Generation and Deployment through Natural Language

Comprehensive Introduction LLM API Engine is an open source project designed to help developers rapidly build and deploy AI-powered APIs.The project leverages the Large Language Model (LLM) and intelligent web crawling technology to allow users to create custom APIs through natural language descriptions.Its main...

Latest AI Resources # AI Java Open Source Projecct

6mos ago

02.2K

PengChengStarling：对比Whisper-Large v3更小、更快的多语言语音转文字工具

PengChengStarling: Smaller and Faster Multilingual Speech-to-Text Tool than Whisper-Large v3

Comprehensive Introduction PengChengStarling (PengCheng Labs) is a multilingual Automatic Speech Recognition (ASR) tool capable of converting speech in different languages into corresponding text. This toolkit is developed based on the icefall project and provides a complete speech recognition process...

Latest AI Resources # AI Java Open Source Projecct # AI Speech to Text

6mos ago

02.2K

Doc2XAPITranslate：文档全文翻译：快速将英文PDF/MD论文翻译为中文文档

Doc2XAPITranslate: full-text translation of documents: quickly translate English PDF/MD papers into Chinese documents.

Comprehensive Introduction Doc2XAPITranslate is a powerful full-text document translation tool designed for quickly translating English PDF or Markdown papers into Chinese documents. The tool supports a variety of translators, including DeepSeek, OpenAI, O...

Latest AI Resources # AI Java Open Source Projecct # AI Translation

6mos ago

02.6K

SpeechGPT 2.0-preview: an end-to-end anthropomorphic speech dialog grand model for real-time interaction

SpeechGPT 2.0-preview is the first anthropomorphic real-time interaction system introduced by OpenMOSS, which is trained based on millions of hours of speech data. The system is equipped with anthropomorphic spoken expression and 100ms low latency response, supporting natural and smooth real...

Latest AI Resources # AI Java Open Source Projecct # Multimodal Real-Time Interactive Products

6mos ago

02.8K

Goose: open source scalable programming intelligences that automate the full range of programming tasks

General Introduction Goose is an open source AI agent tool developed by Block, Inc. designed to help developers automate everyday development tasks. It supports a wide range of Large Language Models (LLMs) and interacts with users via the command line or desktop application interfaces.Goose can perform a wide range of tasks from agent...

Latest AI Resources # AI Java Open Source Projecct # AI Programming # Intelligent Body Development Framework

6mos ago

04K

Fullmoon: iOS App for Native Large Language Modeling Chats

General Description Fullmoon is an application designed for iOS devices and aims to provide the ability to chat privately with native large language models. The app is optimized for Apple Silicon and is supported on iPhone, iPad and Mac. Users of the chat...

Latest AI Resources # AI Java Open Source Projecct # AI Localized Chat Application

6mos ago

03.3K

Onlook: open source Cursor for front-end design, design and publish code in React applications

General Introduction Onlook is an open source design tool built for designers and developers that allows users to design directly in a running React application and convert design changes to code. The tool provides an intuitive visual editing experience similar to Figma or Webf...

Latest AI Resources # AI Java Open Source Projecct # AI Page Design

6mos ago

02.4K

YuE: Transforms lyrics into a base model of a complete song, supporting a wide range of musical styles

General Introduction YuE is an open source full song generation base model that focuses on transforming lyrics into full songs. Unlike other models that can only generate short snippets of non-vocal music, YuE is capable of generating full songs with lead and backing vocals up to several minutes in length. The model addresses music generation in...

Latest AI Resources # AI Java Open Source Projecct # AI Music

6mos ago

03.2K

PocketPal AI：iOS和Android设备离线使用的小型语言模型聊天工具

PocketPal AI: A Small Language Modeling Chat Tool for Offline Use on iOS and Android Devices

General Introduction PocketPal AI is an open-source mobile app designed to bring Small Language Models (SLMs) directly to your phone for both iOS and Android users...

Latest AI Resources # AI Java Open Source Projecct # AI Localized Chat Application

6mos ago

06.3K

Cog-ComfyUI: Running ComfyUI Workflows with APIs

General Introduction Cog-ComfyUI is an open source project designed to run ComfyUI workflows via an API. Created by GitHub user fofr, the project provides an efficient way to integrate and run ComfyUI workflows.ComfyUI is a ...

Latest AI Resources # AI Image Generation Aids # AI Java Open Source Projecct # ComfyUI

6mos ago

02.8K

Supermemory: Importing bookmarks and web content to build a personal knowledge base

General Introduction Supermemory is an open source project designed to help users build their "second brain". With a powerful Chrome extension and AI technology, it allows users to easily save, organize and retrieve data from web pages, Twitter bookmarks,...

Latest AI Resources # AI Java Open Source Projecct # Knowledge Retrieval with RAG Framework

6mos ago

02.9K

Open NotebookLM: convert PDF to podcasts of open source tools

General Introduction Open NotebookLM is an open source project designed to convert any PDF document into a podcast. The tool utilizes open source Large Language Model (LLM) and Text-to-Speech (TTS) models to process PDF content and generate natural dialog suitable for audio podcasts...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

6mos ago

02.7K

Qwen2.5-VL: an open source multimodal grand model supporting image-video document parsing

Comprehensive Introduction Qwen2.5-VL is an open source multimodal big model developed by Qwen team of Alibaba Cloud (Alibaba Cloud). It can handle text, images, video and documents at the same time , is an upgraded version of Qwen2-VL , based on Qwen2.5...

Latest AI Resources # AI Java Open Source Projecct

5mos ago

02.7K

Lux: command line video downloader that supports almost all video platforms

General Introduction Lux is a fast and simple video download library and command line tool written in Go. It supports downloading videos from multiple websites, including YouTube, Bilibili, Youku, etc. Lux offers a variety of download options and features, such as multi-threaded download...

Latest AI Resources # AI Java Open Source Projecct

6mos ago

02.2K

R1 Overthinker: Forcing DeepSeek R1 Models to Think Longer

General Introduction DeepSeek R1 Overthinker is a tool designed to enhance the depth of thinking of DeepSeek R1 models. By lengthening the model's reasoning process, the tool enables the model to think more deeply, thus improving the quality of its answer...

Latest AI Resources # AI Java Open Source Projecct

6mos ago

03.5K

Langui: an open source library of AI user interface components

Comprehensive Introduction LangbaseInc's Langui is an open source user interface component library designed for generative AI and Large Language Model (LLM) projects. The library is based on Tailwind CSS and provides a collection of pre-built UI components to help developers quickly construct...

Latest AI Resources # AI Java Open Source Projecct # AI Page Design

7mos ago

02.4K

MNN-LLM-Android: MNN Multimodal Language Model for Android Applications

Comprehensive Introduction MNN (Mobile Neural Network) is an efficient, lightweight deep learning framework developed by Alibaba and optimized for mobile devices.MNN is not only capable of fast inference on mobile devices, but also supports multimodal tasks, including text generation...

Latest AI Resources # AI Java Open Source Projecct # AI Localized Chat Application

6mos ago

03.3K

AI RSS Generator: a tool to convert web content into RSS feeds via AI

General Introduction AI RSS is an innovative tool that converts web content into RSS feeds through AI technology. It consists of two main parts: a browser plugin and a server side. The browser plugin allows users to select lists from web pages and generate structured data description (SDD) files...

Latest AI Resources # AI Java Open Source Projecct # AI Life Efficiency Assistant

7mos ago

02.5K

UltraRAG: A One-Stop RAG System Solution to Simplify Data Construction and Model Fine-Tuning

Comprehensive Introduction UltraRAG is a RAG (Retrieval Augmented Generation) system solution jointly proposed by the THUNLP group at Tsinghua University, the NEUIR group at Northeastern University, Modelbest.Inc and the 9#AISoft team. The framework is based on agile deployment and modularized building...

Latest AI Resources # AI Java Open Source Projecct # Knowledge Retrieval with RAG Framework

7mos ago

02.2K

Llasa 1~8B: an open source text-to-speech model for high quality speech generation and cloning

General Introduction Llasa-3B is an open source text-to-speech (TTS) model developed by the Audio Lab of the Hong Kong University of Science and Technology (HKUST Audio). The model is based on the Llama 3.2B architecture, which has been carefully tuned to provide high-quality speech generation that not only supports multiple...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech # AI voice cloning

6mos ago

03.2K

Fast GraphRAG: A Highly Accurate and Low-Cost Graphical Search Enhancement Generation Tool

Comprehensive Introduction Fast GraphRAG is an open source tool developed by Circlemind AI to enable efficient and accurate retrieval augmentation generation (RAG) through knowledge graph and PageRank algorithms. The tool intelligently adapts to the user's use...

Latest AI Resources # AI Java Open Source Projecct # Knowledge Graph # Knowledge Retrieval with RAG Framework

7mos ago

02.4K

TinyZero: A Low-Cost Replication of DeepSeeK-R1 Zero's Epiphany Effect

General Introduction TinyZero is a veRL-based reinforcement learning model designed to replicate the performance of DeepSeeK-R1 Zero in countdown and multiplication tasks. Surprisingly, the project costs only $30 to run (using 2xH2...

Latest AI Resources # AI Java Open Source Projecct

7mos ago

03.9K

Open R1：Hugging Face 复现 DeepSeek-R1 的训练过程

Open R1: Hugging Face Replicates the Training Process of DeepSeek-R1

General Introduction Hugging Face's Open R1 project is a fully open-source DeepSeek-R1 replication project designed to build the missing parts of the R1 pipeline so that everyone can replicate and build upon them. The project is designed to be simple and consists mainly of training and evaluating...

Latest AI Resources # AI Java Open Source Projecct

7mos ago

04K

Open Operator: Performing Automation in Cloud Browsers with AI Intelligence

General Introduction Open Operator is an open source project that aims to automate operations in the browser through AI intelligences. Developed by Browserbase, the project combines the technologies of Stagehand and Browserbase...

Latest AI Resources # AI Java Open Source Projecct # Desktop Automation Intelligence

7mos ago

03.6K

Cerebr: the open source browser plugin that talks to web content

General Description Cerebr is a powerful AI assistant extension for Chrome, designed to enhance your productivity and learning experience.Cerebr was designed from the ground up with the need for a clean, efficient browser AI assistant, with a minimalist design and powerful...

Latest AI Resources # AI Java Open Source Projecct # Browser AI Assistant

4mos ago

02.6K