AI open source project

Total 1020 articles


MCP service for reading and modifying Figma designs using Cursor.

General Introduction Cursor Talk to Figma MCP is an open source project that connects the AI programming tool Cursor with the design software Figm...

1 year ago

095.1K

FinRobot: An AI Agent for Improving Financial Data Analysis Efficiency and Investment Research

Comprehensive Introduction FinRobot is an open source AI agent platform developed by AI4Finance Foundation and designed for financial analytics. It not only covers traditional language models, but also incorporates a variety of AI technologies, aiming to provide a comprehensive solution for the financial industry. F...

Latest AI Resources # AI Java Open Source Project # AI Financial Data Analytics

1 year ago

095K

RoomGPT: Upload room photos and redesign using AI

General Introduction RoomGPT is an open source project developed by GitHub user Nutlope that allows users to upload photos of rooms and generate redesigned versions of them using artificial intelligence technology. The project aims to give users access to professional-grade interior design without expensive designer fees...

Latest AI Resources # AI Image Style Control # AI Java Open Source Project

2 years ago

095K

LangBot: an open source large model instant messaging bot that deploys AI bots across WeChat, QQ, Feishu, and other platforms

LangBot is an instant messaging bot platform built on large models that supports multiple messaging platforms and multiple models. The platform adapts to QQ, WeChat (enterprise WeChat, personal WeChat), Feishu, Discord, OneBot and other messaging platforms, and supports Open...

Latest AI Resources # AI Java Open Source Project

1 year ago

095K

Markdownify MCP Server: Converts various content to Markdown format based on the MCP protocol

General Introduction Markdownify MCP Server is an open source tool based on the Model Context Protocol, hosted on GitHub by developer Zach Caceres ...

Latest AI Resources # AI Java Open Source Project # MCP services # Document Extraction and Cleaning

1 year ago

095K

Kotaemon: an easy-to-deploy open source multimodal document Q&A tool

General Introduction Kotaemon is an open source document Q&A tool designed to provide end-users and developers with Q&A functionality based on Retrieval Augmented Generation (RAG). The project is developed by Cinnamon and supports a variety of LLM API providers (e.g. OpenA...

Latest AI Resources # AI Java Open Source Project # Knowledge Graph # Knowledge Retrieval with RAG Framework

2 years ago

094.7K

Aide: AI helper extension that enhances the VSCode development experience with one-click commenting, code conversion, and UI-to-code generation

General Introduction AIDE (AI-assisted Development Extension) is a powerful VSCode AI-assisted development extension, focusing on providing unique and practical AI programming assistance. It is different from GitHu...

Latest AI Resources # AI Java Open Source Project # AI Programming

2 years ago

094.6K

AnyText: Generate and edit multilingual text in images, with highly controllable generation of multi-line Chinese text

Comprehensive Introduction AnyText is a revolutionary multilingual visual text generation and editing tool developed based on the diffusion model. It generates natural, high-quality multilingual text in images and supports flexible text editing features. It was developed by a team of researchers and presented at ICLR 2024...

Latest AI Resources # AI Image Generation Aids # AI Image Style Control # AI Java Open Source Project

2 years ago

094.6K

PaddlePaddle PP-TableMagic: Structured Information Extraction for Complex Tables

The goal of table recognition is to parse tables in images, accurately identify table structures and cell locations, and restore them to structured table formats (e.g., HTML). In today's information age, a large amount of important tabular data still exists in an unstructured state (e.g., scanned documents with pictures of statistical tables...).

Latest AI Resources # AI Java Open Source Project # Document Extraction and Cleaning

1 year ago

094.6K

TxAgent: the AI tool that helps doctors analyze drug effects and treatment options

Comprehensive Introduction TxAgent is an open-source AI tool developed by Harvard University's Medical and Scientific Artificial Intelligence Team (MIMS) to help physicians analyze drug interactions and develop personalized treatment plans. It combines patient-specific situations through multi-step reasoning and real-time retrieval of biomedical knowledge...

Latest AI Resources # AI Java Open Source Project # Agent Application

1 year ago

094.6K

PandasAI: Data Analytics Dialog Platform for Data Queries and Chart Generation in Natural Language

General Introduction PandasAI is a Python-based open source platform designed to simplify the process of data analysis through natural language processing techniques, enabling users to work conversationally with databases (e.g. SQL, CSV, pandas, polars, mongodb, n...

Latest AI Resources # AI Java Open Source Project # AI data analysis

2 years ago

094.5K

Fish Agent: end-to-end AI voice cloning assistant, real-time voice conversation assistant, Fish Speech spin-off project

Comprehensive Introduction Fish Speech Derivative Project Fish Agent is a revolutionary end-to-end AI speech cloning system developed based on the V0.1 3B model architecture. As a fully end-to-end speech clone processing system, its most important feature is the use of innovative speechless...

Latest AI Resources # AI Java Open Source Project # AI voice cloning # Multimodal Real-Time Interactive Products

2 years ago

094.4K

Eko: Build Agent Workflows in Natural Language for Desktop and Browser Automation

General Introduction Eko is a production-grade JavaScript framework designed to build efficient intelligent agent workflows through natural language descriptions. It is designed to enable developers to automate everyday tasks using AI technologies without deep programming. Eko provides a uni...

Latest AI Resources # AI Java Open Source Project # Low-code workflow # Agent Application

1 year ago

094.2K

R1-V: Low-cost reinforcement learning to improve the generalization of vision-language models

Comprehensive Introduction R1-V is an open source project that aims to achieve breakthroughs in vision-language models (VLMs) through low-cost reinforcement learning (RL). The project utilizes a verifiable reward mechanism to incentivize VLMs to learn general counting abilities. Amazingly, R1-V's 2B ...

Latest AI Resources # AI Java Open Source Project

1 year ago

094.2K

HyperChat: AI Chat Client for Performing Complex Tasks with MCP Agents

General Introduction HyperChat is an open source chat client developed by BigSweetPotatoStudio and hosted on GitHub. It integrates APIs from several large language models (LLMs) such as OpenAI, Cla...

Latest AI Resources # AI Java Open Source Project # AI Localized Chat Application # Agent Application

1 year ago

094.1K

AutoAgent: a framework for rapid creation and deployment of AI agents through natural language

General Introduction AutoAgent is an open source AI agent framework developed by the Data Intelligence Laboratory of the University of Hong Kong (HKUDS) and hosted on GitHub. It allows users to rapidly create and deploy customized AI agents by describing their requirements in purely natural language, without any programming base...

Latest AI Resources # AI Java Open Source Project # No code development # Agent Development Framework

1 year ago

094.1K

uni-api: a lightweight tool that converts large model APIs to the OpenAI interface, with API channels configured in a YAML file

Comprehensive Introduction No front end: API channels are configured purely through a configuration file. Writing a single file is enough to run your own API station, and the documentation includes a detailed, beginner-friendly configuration guide. uni-api is a project for unified management of large model APIs, allowing a unified ...

Latest AI Resources # AI Java Open Source Project

2 years ago

094.1K

openapi-mcp-server: letting AI directly invoke MCP services with open APIs

General Introduction openapi-mcp-server is an open source tool designed to transform OpenAPI v3.1 compliant APIs into AI usable resources. It is maintained by janwilmake and is based on Model Contex...

Latest AI Resources # AI Java Open Source Project # MCP services

1 year ago

094K

VideoRAG: A RAG framework for understanding ultra-long videos with support for multimodal retrieval and knowledge graph construction

Comprehensive Introduction VideoRAG is a retrieval-augmented generation framework designed for processing and understanding extremely long-context videos. The tool combines a graph-driven textual knowledge base with hierarchical multimodal context encoding to efficiently process on a single NVIDIA RTX 3090 GPU...

Latest AI Resources # AI Java Open Source Project # Knowledge Retrieval with RAG Framework

1 year ago

094K

Perplexica: an open source AI search engine that replicates Perplexity AI's features and interface one-to-one

Comprehensive Introduction Perplexica is an open source AI-driven search engine designed to provide answers that delve deep into the Internet. It uses advanced machine learning algorithms, such as similarity search and embedding techniques, to optimize search results and provide clear answers with cited sources. Perple...

Latest AI Resources # AI Java Open Source Project # AI search tool

2 years ago

093.9K

Genesis: open source generative physics engine for real physics-based 4D dynamic world simulation

General Introduction Genesis is a generative physics world designed for general purpose robotics and embodied AI learning. It provides a unified simulation platform that supports the simulation of a wide range of materials and physical phenomena. Genesis aims to unlock generative AI and physics simulation by combining...

Latest AI Resources # AI Java Open Source Project # AI Text & Image to 3D

2 years ago

093.8K

InfiniteYou: a photo generation and editing tool that preserves facial features

General Introduction InfiniteYou is an open source project developed by the ByteDance Intelligent Creation team. It is based on Diffusion Transformers (DiTs) technology and uses the FLUX.1-dev model. Its core function is to allow users to upload a photo and enter a text description, generating...

Latest AI Resources # AI Image Style Control # AI Java Open Source Project

1 year ago

093.8K

Open source tool for real-time speech to text

General Introduction realtime-transcription-fastrtc is an open source project focused on converting speech to text in real time. It uses FastRTC technology to process low-latency audio streams, combined with the local Whisper model to achieve efficient ...

Latest AI Resources # AI Java Open Source Project # AI Speech to Text

1 year ago

093.6K

LM Speed: Rapidly Test Large Model API Performance

Comprehensive Introduction LM Speed is a tool designed specifically for AI developers, along with an online service site, lmspeed.net. Its core function is to test and analyze the performance of language model APIs, helping users to quickly identify speed bottlenecks and optimize calling strategies. This...

Latest AI Resources # AI Java Open Source Project

1 year ago

093.5K

OpenAOE: Large Model Group Chat Framework: Chatting with Multiple Large Language Models Simultaneously

Comprehensive Introduction OpenAOE is an open source large model group chat framework that aims to fill a gap in the current market: the lack of chat frameworks in which multiple models respond in parallel. With OpenAOE, users can talk to multiple Large Language Models (LLMs) at the same time and get parallel output. The framework supports ...

Latest AI Resources # AI Java Open Source Project # AI Integrated Multi-Model Dialog Platform

1 year ago

093.4K

Morphik Core: an open source RAG platform for processing multimodal data

General Introduction Morphik Core is an open source project developed by the morphik-org team and hosted on GitHub. It used to be called DataBridge Core, but is now renamed Morphik Core. This...

Latest AI Resources # AI Java Open Source Project # Knowledge Retrieval with RAG Framework

1 year ago

093.4K

MegaParse: parses all types of documents into LLM-ready data, fully preserving tables, pictures, and all other information in the document

Comprehensive Introduction MegaParse is a powerful and versatile document parsing tool designed to optimize data processing for Large Language Models (LLMs). Whether you are working with text, PDF, PowerPoint presentations or Word documents, MegaParse...

Latest AI Resources # AI Java Open Source Project # Document Extraction and Cleaning

2 years ago

093.3K

AI Chatbot Supabase: an open source AI chatbot built with Next.js and Supabase for rapid deployment to Vercel

General Introduction AI Chatbot Supabase is an open source AI chatbot template built on Next.js and Supabase. Developed by Vercel, the project aims to provide a fully functional and customizable chatbot solution. By ...

Latest AI Resources # AI Java Open Source Project # Knowledge Retrieval with RAG Framework

2 years ago

093.3K

LlamaCoder: Quickly Generate and Publish Small Web Applications Using Prompts

General Introduction LlamaCoder is an open source code generation tool based on Llama 3.1 and Together AI. It can generate small applications with simple prompts and is suitable for developers to quickly realize their ideas. LlamaCoder provides...

Latest AI Resources # AI Java Open Source Project # AI Programming # AI Page Design

2 years ago

093.2K

WeClone: training digital doppelgangers with WeChat chats and voices

Comprehensive Introduction WeClone is an open source project that uses WeChat chat logs and voice messages, combined with large language models and speech synthesis technology, to allow users to create personalized digital doppelgangers. The project can analyze the user's chat habits to train the model, and can also use a small number of voice samples to generate realistic sound...

Latest AI Resources # AI Java Open Source Project # Large model fine-tuning

1 year ago

093.2K

Oliva: a voice-controlled multi-agent product search assistant

General Introduction Oliva is an open source multi-agent assistant tool developed by Deluxer on GitHub. It helps users search for product information in the Qdrant database through the collaboration of multiple AI agents. The main feature is that it supports voice operation...

Latest AI Resources # AI Java Open Source Project # Agent Application # Knowledge Retrieval with RAG Framework

1 year ago

093.2K

X-Kit: Crawling and Analyzing X (Twitter) User Data and Tweets

General Introduction X-Kit is an open source tool designed to crawl and analyze X (formerly Twitter) user data and tweets. Developed by GitHub user xiaoxiunique, the tool is designed to help users automate the process of obtaining basic information and tweets about a given X user and...

Latest AI Resources # AI Java Open Source Project

2 years ago

093.2K

CapsWriter-Offline: Speech Input and Subtitle Transcription Tool for the PC

General Introduction CapsWriter-Offline is a voice input and subtitle transcription tool for PC, hosted on GitHub and built by developer HaujetZhao. It runs completely offline and does not require an Internet connection to realize speech-to-text and audio-visual...

Latest AI Resources # AI Java Open Source Project # AI Speech to Text

1 year ago

093.1K

MyCoder: Command-line AI tool for automating code fixes, test case generation

General Introduction MyCoder is an open source project developed by the drivecore team and hosted on GitHub, aiming to provide developers with intelligent programming assistance through a command line interface. It is based on Anthropic's Claude AP...

Latest AI Resources # AI Java Open Source Project # AI Programming

1 year ago

093.1K

Fixes invalid JSON strings and resolves possible formatting errors in JSON data generated by LLMs.

General Description A module for fixing invalid JSON files, especially for parsing erroneous JSON data output by Large Language Models (LLMs). The module fixes common JSON syntax errors such as missing quotes, incorrect commas, unescaped characters and incomplete key-value...

Latest AI Resources # AI Java Open Source Project

2 years ago

093.1K

Vercel AI SDK: Building AI-powered applications with popular front-end frameworks

General Introduction Vercel AI SDK is an open source tool developed by the Vercel team to help developers build AI applications using frameworks such as React, Svelte, Vue and Solid. It supports multiple language model providers...

Latest AI Resources # AI Java Open Source Project

2 years ago

093.1K

Tarsier: an open source video understanding model for generating high-quality video descriptions

Comprehensive Introduction Tarsier is a family of open-source video-language models developed by ByteDance for generating high-quality video descriptions. It has a simple structure: CLIP-ViT processes video frames, combined with a Large Language Model (LLM) to analyze...

Latest AI Resources # AI Java Open Source Project

1 year ago

093K

DeepClaude: A Chat Interface Fusing DeepSeek R1 Chain-of-Thought Reasoning and Claude Creativity

Comprehensive Introduction DeepClaude is a high-performance Large Language Model (LLM) inference API and chat interface that integrates the chain-of-thought (CoT) reasoning capabilities of DeepSeek R1 with the creativity and code generation of the Anthropic Claude model...

Latest AI Resources # AI Java Open Source Project # AI Localized Chat Application

2 years ago

093K

Kolors: text-to-image model for generating high-quality images, support for generating Chinese posters

Comprehensive Introduction Kolors is a large-scale text-to-image generation model developed by the Kuaishou team, based on latent diffusion techniques. The model is trained on billions of text-image data pairs and is capable of generating high-quality images that are accurate to complex semantics, with support for both Chinese and English input. Kolors in visual quality...

Latest AI Resources # AI Java Open Source Project # AI Self-Deployment Image Generation Tool

2 years ago

092.9K

AgentGPT: An Open Source Project to Create and Run Automated AI Agents

General Introduction AgentGPT is an open source project developed by the Reworkd team and hosted on GitHub, designed to allow users to autonomously create, configure, and deploy AI agents through a browser. Users simply set a goal, and AgentGPT can...

Latest AI Resources # AI Java Open Source Project # Agent Development Framework

1 year ago

092.9K

Probly: an open source spreadsheet tool that uses AI to analyze data and generate charts

Comprehensive Introduction Probly is a spreadsheet tool developed by the PragmaticMachineLearning team and open-sourced on GitHub that combines the functionality of traditional spreadsheets with powerful AI data analysis capabilities. It not only supports the use of ...

Latest AI Resources # AI Java Open Source Project # AI data analysis

1 year ago

092.9K

MegaTTS3: A Lightweight Model for Synthesizing Chinese and English Speech

Comprehensive Introduction MegaTTS3 is an open source speech synthesis tool developed by ByteDance in cooperation with Zhejiang University, focusing on generating high-quality Chinese and English speech. Its core model has only 0.45B parameters, making it lightweight and efficient, and it supports mixed Chinese and English speech generation and speech cloning. The project is hosted on ...

Latest AI Resources # AI Java Open Source Project # AI text-to-speech # AI voice cloning

1 year ago

092.8K

AI2SRT: Create short narrated videos or video summaries for long videos with one click using Gemini models

Comprehensive Introduction AI2SRT is an open source project that utilizes the Gemini large model to generate short narrated videos and video summaries for long videos with one click, while supporting subtitle transcription for audio and video. The project aims to simplify the video content creation process and provide efficient subtitle generation and translation functions. Users can pass...

Latest AI Resources # AI Java Open Source Project # AI audio/video editor

2 years ago

092.8K

MockingBird: Fast Voice Cloning and Model Training, with Text-to-Speech Based on xtts v2

Comprehensive Introduction MockingBird is an open source project designed to achieve rapid speech cloning and text-to-speech through AI technology. Users only need to provide 5 seconds of voice samples to generate any voice content. The project supports a variety of Chinese datasets, and in Windows ...

Latest AI Resources # AI Java Open Source Project # AI voice cloning

2 years ago

092.8K

E2B Open Computer Use: Running an AI operating system safely in the E2B sandbox

General Introduction E2B Open Computer Use is an open source project that aims to provide a secure cloud-based Linux computer use experience through the E2B Desktop Sandbox. The E2B Sandbox provides a desktop graphical environment that users can connect to any large...

Latest AI Resources # AI Java Open Source Project # Desktop Automation Agent

2 years ago

092.8K

LazyLLM: SenseTime's open source low-code development tool for building multi-agent applications

Comprehensive Introduction LazyLLM is an open source tool developed by the LazyAGI team, focusing on simplifying the development process of multi-agent large model applications. It helps developers quickly build complex AI applications through one-click deployment and lightweight gateway mechanisms, saving tedious engineering configuration...

Latest AI Resources # AI Java Open Source Project # Agent Development Framework

1 year ago

092.7K

Orate: A Unified API for Integrating Well-Known Speech Generation, Speech Transcription and Voice Change Models

Comprehensive Introduction Orate is an AI toolkit focused on speech generation and transcription. It provides a unified API that seamlessly integrates with leading AI providers such as OpenAI, ElevenLabs, and AssemblyAI to help users create forced...

Latest AI Resources # AI Java Open Source Project # AI text-to-speech # AI Speech to Text

2 years ago

092.7K

Deploying Hugging Face's free API on Cloudflare with support for interface forwarding

Because domestic deployments cannot access Hugging Face, this project adapts another developer's deployment scheme so that it can be deployed to Cloudflare Workers. Preparation: 1. Register for Cloudflare. 2. Register for Hugging Fac...

Latest AI Resources # AI Java Open Source Project # Free Large Model API

2 years ago

092.7K

OmniGen: Unified Image Generation Model with Multimodal Inputs to Generate Character-Consistent Images

General Introduction OmniGen is a "general purpose" image generation model developed by VectorSpaceLab that allows users to create diverse and contextually rich visuals with simple text prompts or multimodal inputs. It is particularly well suited for applications that need to recognize...

Latest AI Resources # AI online image generation # AI Java Open Source Project

2 years ago

092.7K

TheoremExplainAgent: Generate 5+ minute animated math explainer videos with Manim

General Introduction TheoremExplainAgent is an innovative project developed by TIGER AI Lab to transform complex mathematical and scientific theorems into easy-to-understand video animations using artificial intelligence techniques. The tool is based on the Large Language Model (LLM...

Latest AI Resources # AI Java Open Source Project # AI Educational Tools

1 year ago

092.6K

PhotoDoodle: AI tool for adding artistic doodles to photos with text commands

General Introduction PhotoDoodle is an open source image editing tool, developed by ShowLab, focusing on artistic editing of photos through artificial intelligence technology. Users only need to input simple text prompts to add cartoon style, 3D effect, light...

Latest AI Resources # AI image editing # AI Java Open Source Project

1 year ago

092.6K

Devika: an open-source AI software engineer agent that understands instructions, splits them into subtasks, and writes code

General Introduction Devika is an advanced AI software engineer that understands high-level human instructions, breaks them down into steps, studies the relevant information, and writes code to achieve a given goal. It intelligently develops software using large-scale language models, planning and reasoning algorithms, and web browsing capabilities. D...

Latest AI Resources # AI Java Open Source Project # AI Programming # Agent Application

1 year ago

092.5K

ell: Lightweight Functional Prompt Engineering Framework

General Introduction ell is a lightweight functional language model programming library developed by former OpenAI researcher William Guss. It is designed with the idea of treating prompts as programs, not just strings. ell provides automated version control and serialization...

Latest AI Resources # AI Java Open Source Project # PROMPTS Aids

1 year ago

092.5K

MarkItDown: Microsoft Document Intelligent Conversion Tool to convert various files to Markdown format

General Introduction MarkItDown is a Python tool developed by Microsoft designed to convert various files and office documents to Markdown format. The tool supports a wide range of file types, including PDF, PowerPoint, Word, Excel, diagrams...

Latest AI Resources # AI Java Open Source Project # Document Extraction and Cleaning

2 years ago

092.4K

QAnything: Local Knowledge Base Q&A System with Highly Integrated RAG Processing Flow

Comprehensive Introduction QAnything (Question and Answer based on Anything) is a local knowledge base Q&A system launched by NetEase, which supports all kinds of file formats and databases, and can be installed and used offline...

Latest AI Resources # AI Open Services # AI Java Open Source Project # Knowledge Retrieval with RAG Framework

2 years ago

092.4K

Weebo: a real-time voice chatbot that provides a natural language conversational experience

General Introduction Weebo is an open source real-time voice chatbot that utilizes Whisper Small for speech recognition, Llama 3.2 for natural language generation, and Kokoro-82M for speech synthesis. The project was developed by Aman...

Latest AI Resources # AI Java Open Source Project # Multimodal Real-Time Interactive Products

2 years ago

092.4K

PantoMatrix (EMAGE): a framework for generating full-body 3D gesture animation from audio

Comprehensive Introduction PantoMatrix is an advanced full-body gesture generation framework capable of generating complete human movements from audio and partial gestures, including face, partial body, hand and full-body movements. The framework utilizes the latest multimodal datasets and deep learning techniques to provide high-quality 3D...

Latest AI Resources # AI Java Open Source Project

2 years ago

092.4K

NodeRAG: A Heterogeneous Graph-Based Tool for Accurate Information Retrieval and Generation

Comprehensive Introduction NodeRAG is an open source Retrieval Augmented Generation (RAG) system hosted on GitHub and developed by Terry-Xu-666. It optimizes information retrieval and generation through heterogeneous graph structures, significantly improving retrieval accuracy and contextual relevance. Nod...

Latest AI Resources # AI Java Open Source Project # Knowledge Retrieval with RAG Framework

1 year ago

092.4K

Langui: an open source library of AI user interface components

Comprehensive Introduction LangbaseInc's Langui is an open source user interface component library designed for generative AI and Large Language Model (LLM) projects. The library is based on Tailwind CSS and provides a collection of pre-built UI components to help developers quickly construct...

Latest AI Resources # AI Java Open Source Project # AI Page Design

2 years ago

092.3K

Flow (Laminar): a lightweight task engine for building AI agents with simple, flexible task management

Comprehensive Introduction Flow is a lightweight task engine designed for building AI agents, emphasizing simplicity and flexibility. Unlike traditional node- and edge-based workflows, Flow uses a dynamic task queuing system that supports parallel execution, dynamic scheduling, and intelligent dependency management. Its core concept is ...

Latest AI Resources # AI Java Open Source Project # Low-code workflow

2 years ago

092.2K

Memary: an open-source project to enhance Agent long-term memory using knowledge graphs

General Introduction Memary is an innovative open source project focused on providing long-term memory management solutions for autonomous agents. The project helps agents break through the limitations of traditional context windows to achieve smarter interaction experiences through knowledge graphs and specialized memory modules. Memary adopts...

Latest AI Resources # AI Java Open Source Project # Agent Development Framework # Knowledge Graph

2 years ago

092.1K

Gemini Cursor: an AI desktop smart assistant built on Gemini that can see, hear and speak

General Introduction Gemini Cursor is a desktop intelligent assistant based on Google's Gemini 2.0 Flash (experimental) model. It enables visual, auditory, and voice interactions through a multimodal API, providing real-time low-latency use...

Latest AI Resources # AI Java Open Source Project # Multimodal Real-Time Interactive Products

1 year ago

092.1K

VoiceCraft: open source zero-shot speech cloning and text-to-speech tool

Comprehensive Introduction VoiceCraft is an open source speech editing and zero-shot speech synthesis tool based on a neural codec language model. It employs an innovative coded sequence generation method that enables insertion, deletion and replacement operations on existing speech sequences to generate natural, coherent edited speech...

Latest AI Resources # AI Java Open Source Project # AI voice cloning

2 years ago

092K

ANP: An Open Source Protocol for Secure and Efficient Communication between Intelligent Agents

General Introduction AgentNetworkProtocol (ANP for short) is an open source protocol project, hosted on GitHub, focused on providing secure and efficient communication solutions for intelligent agents (AI Agents). It works through a three-layer architecture - identity and encryption...

Latest AI Resources # AI Java Open Source Project

1 year ago

091.9K

FlashMLA: Optimizing the MLA Decoding Kernel for Hopper GPUs (DeepSeek Open Source Week Day 1)

General Introduction FlashMLA is an efficient MLA (Multi-head Latent Attention) decoding kernel developed by DeepSeek AI, optimized for NVIDIA Hopper architecture GPUs...

Latest AI Resources # AI Java Open Source Project

1 year ago

091.9K

Fast-Agent: Declarative Syntax and MCP Integration for Rapidly Building Multi-Agent Workflows

General Introduction Fast-Agent is an open source tool maintained by the evalstate team on GitHub, designed to help developers quickly define, test and build multi-agent workflows. It is based on a simple declarative syntax, and supports the use of MCP (Mode...

Latest AI Resources # AI Java Open Source Project # Agent Development Framework

1 year ago

091.8K

XianyuAutoAgent: an AI customer service bot that staffs Xianyu (Idle Fish) seller accounts around the clock

Comprehensive Introduction XianyuAutoAgent is an intelligent customer service bot system designed for the Xianyu (Idle Fish) platform, open-sourced by developer shaxiu on GitHub. It uses AI to provide automated 24/7 coverage and helps Xianyu sellers reply...

Latest AI Resources # AI Side Hustle Money Making Programs # AI Customer Service Robot # AI Java Open Source Project

1yrs ago

091.8K

DreamTalk: Generate expressive talking videos with a single avatar image!

Comprehensive Introduction DreamTalk is a diffusion model-driven framework for generating expressive talking heads, jointly developed by Tsinghua University, Alibaba Group and Huazhong University of Science and Technology. It mainly consists of three parts: a denoising network, a style-aware lip expert and a style predictor, and can be based on...

Latest AI Resources # AI Java Open Source Project # AI Digital Human # Lip Synchronization

2yrs ago

091.7K

Zonos: High Quality Speech Synthesis and Speech Cloning Tools

General Introduction Zonos is an open source speech synthesis and voice cloning tool developed by Zyphra. The Zonos-v0.1 release uses an advanced Transformer and hybrid model to generate high quality speech output. The tool supports multiple languages...

Latest AI Resources # AI Java Open Source Project # AI voice cloning

1yrs ago

091.7K

Easy Voice Toolkit: AI Voice Toolkit for Local Deployment

Comprehensive Introduction Easy-Voice-Toolkit is a multifunctional toolkit built on open source speech projects, providing a variety of automated audio tools for speech recognition, speech transcription, speech conversion, dataset creation and model training. Users can selectively use these tools as needed...

Latest AI Resources # AI Java Open Source Project # AI text-to-speech # AI voice cloning

2yrs ago

091.6K

Parler-TTS: a text-to-speech model that generates speech in a specified speaker style from input text

General Introduction Parler-TTS is an open source text-to-speech (TTS) model library developed by Hugging Face, designed to generate high-quality, natural-sounding speech. The model can generate speech from input text in a specific speaker style (e.g. gender, pitch, speaking style...

Latest AI Resources # AI Java Open Source Project # AI text-to-speech

1yrs ago

091.6K

ModelBest: The World's Leading Lightweight, High-Performance On-Device Large Model

General Introduction ModelBest is a company specializing in developing lightweight, high-performance large models, dedicated to bringing advanced AI technologies to mainstream consumer electronics and the various edge devices of daily life. Its MiniCPM series of on-device models is characterized by highly efficient use of compute and memory...

Latest AI Resources # AI Big Model Native Conversation Tool # AI Java Open Source Project

2yrs ago

091.6K

Ichigo (llama3-s): local real-time voice AI assistant, open source version of Siri

General Introduction Ichigo is an open source real-time speech AI project that aims to extend text-based language models with native "listening" capabilities. The project uses early fusion techniques inspired by Meta's Chameleon paper. Ichigo's goal is to become...

Latest AI Resources # AI Java Open Source Project # Multimodal Real-Time Interactive Products

2yrs ago

091.6K

OWL: A multi-agent collaboration framework for automating real-world tasks

Comprehensive Introduction OWL (Optimized Workforce Learning) is an open source framework developed by the CAMEL-AI team, focused on optimizing multi-agent collaboration for automating real-world tasks. Based on the CAMEL-AI framework ...

Latest AI Resources # AI Java Open Source Project # Agent Development Framework

1yrs ago

091.5K

Audiblez: Generate Audiobooks, Convert eBooks to Audiobooks with Kokoro

General Introduction Audiblez is an open source project designed to convert eBooks (e.g. .epub format) into audiobooks (e.g. .m4b format). The project utilizes Kokoro's high-quality speech synthesis technology to support multiple languages and multiple voices. Users can simply...

Latest AI Resources # AI Java Open Source Project # AI text-to-speech

2yrs ago

091.5K

Research Rabbit: Web research and report writing with a local LLM that automatically digs into a user-specified topic and generates a summary

General Introduction Research Rabbit is a web research and summarization assistant based on a local LLM (Large Language Model). After the user provides a research topic, Research Rabbit generates a search query, obtains relevant web results, and summarizes those results...

Latest AI Resources # AI Java Open Source Project # Generate in-depth research report

1yrs ago

091.5K

Flock: low-code workflow orchestration to build chatbots quickly

General Introduction Flock is an open source low-code workflow platform, hosted on GitHub and developed by the Onelevenvy team. It is based on LangChain and LangGraph and is focused on helping users quickly build chatbots...

Latest AI Resources # AI Customer Service Robot # AI Java Open Source Project # Low-code workflow

1yrs ago

091.5K

OASIS: Multi-Agent Simulation of Social Media Interactions Among Up to One Million Users for Studying Complex Social Phenomena

General Introduction OASIS (Open Agent Social Interaction Simulations) is an open source social media simulator capable of simulating the behavior of up to one million users. The platform combines large language models and rule-based...

Latest AI Resources # AI Java Open Source Project # Agent Application

1yrs ago

091.5K

Ant Design X: A toolkit for rapidly building AI chat interfaces, with support for model integration and data flow management

Comprehensive Introduction Ant Design X is a toolkit open-sourced by Ant Group, designed to help developers quickly build AI-driven dialog interfaces. It provides a rich set of components and templates, supports model integration compatible with OpenAI standards, and is suitable for a variety of applications such as intelligent customer service, AI assistants, and other...

Latest AI Resources # AI Java Open Source Project

2yrs ago

091.4K

Data Formulator: an AI-driven data visualization tool

General Introduction Data Formulator is an open source AI-driven data visualization tool developed by Microsoft Research. The tool combines a graphical user interface (GUI) and natural language input (NL) to enable users to quickly create and iterate through simple interactions and commands...

Latest AI Resources # AI Java Open Source Project # AI data analysis

1yrs ago

091.2K

G-Search-MCP: MCP Server for Free Google Search

General Introduction G-Search-MCP is an open source Google search tool hosted on GitHub, adapted by developer jae-jae from google-search. It uses MCP (Model Context...

Latest AI Resources # AI Java Open Source Project # MCP services

1yrs ago

091.2K

Leffa: High-fidelity virtual try-on and pose transfer, Meta's open source model for controllable person image generation

Comprehensive Introduction Leffa is a unified framework for controllable person image generation, enabling precise manipulation of a person's appearance (e.g., virtual try-on) and pose (e.g., pose transfer). The framework significantly reduces distortion of fine-grained details by directing the target query to focus on the correct reference key in the attention layer, with ...

Latest AI Resources # AI Image Style Control # AI Java Open Source Project # AI Face Swap and Dress Up

2yrs ago

091.2K

Optexity: an open-source project to train AI to perform web actions with human demonstrations

General Introduction Optexity is an open source project on GitHub, developed by the Optexity team. Its core idea is to use human demonstration data to train AI to complete computer tasks, especially web page operations. The project contains three code libraries: Compute...

Latest AI Resources # AI Java Open Source Project # Large model fine-tuning # Desktop Automation Intelligence

1yrs ago

091.2K

MiniMind-V: 1 hour training of a 26M parameter visual language model

General Introduction MiniMind-V is an open source project, hosted on GitHub, designed to help users train a lightweight visual language model (VLM) with only 26 million parameters in less than an hour. It is based on the MiniMind language model, with new visual...

Latest AI Resources # AI Java Open Source Project # Large model fine-tuning

1yrs ago

091.2K

Insanely Fast Whisper: an open source project for fast, efficient speech-to-text transcription

Comprehensive Introduction insanely-fast-whisper is a combination of OpenAI's Whisper model and various optimization techniques (e.g. Transformers, Optimum, Flash Attention) for audio trans...

Latest AI Resources # AI Java Open Source Project # AI Speech to Text

2yrs ago

091.1K

Ruyi-Models: open source image-to-video models with camera control and motion amplitude control

General Introduction Ruyi-Models is an open source project designed to generate high quality videos from images. Developed by the IamCreateAI team, the project supports generating cinematic video at 768 resolution and 24 frames per second, 5 seconds (120 frames) in total...

Latest AI Resources # AI Image to Video # AI Java Open Source Project

2yrs ago

091K

OrionChat: Simple Web Chat Interface with Integrated Multi-Platform AI Models (Deployment-Free)

General Introduction OrionChat is a web-based AI chat interface that provides users with a unified platform to interact with multiple mainstream AI models. The project supports a wide range of AI models including Ollama (running locally), OpenAI GPT, Google Gemi...

Latest AI Resources # AI Java Open Source Project # AI Localized Chat Application

2yrs ago

090.9K

RapBank: a model for generating rap vocals directly from lyrics and backing tracks (the dataset is currently released)

General Introduction RapBank is a dataset and toolset designed for rap lyrics generation. The project was created by NZqian to provide researchers and developers with high-quality rap lyrics data by collecting and processing rap songs from YouTube...

Latest AI Resources # AI Java Open Source Project # AI Music

2yrs ago

090.9K

SegAnyMo: open source tool to automatically segment arbitrary moving objects from video

General Introduction SegAnyMo is an open source project developed by a team of researchers at UC Berkeley and Peking University, including members such as Nan Huang. This tool focuses on video processing and can automatically recognize and segment arbitrary moving objects in a video, such as people, animals or...

Latest AI Resources # AI Java Open Source Project # AI keying to change backgrounds # Visual Target Detection

1yrs ago

090.8K

Voice Changer: A real-time voice changer to make your favorite anime characters sing!

Comprehensive Introduction Voice Changer is an open source real-time voice transformation tool that supports a wide range of AI voice models such as MMVC, so-vits-svc, RVC, DDSP-SVC, and Beatrice. The tool is compatible with multiple platforms...

Latest AI Resources # AI Java Open Source Project # AI voice cloning

2yrs ago

090.8K

Bilingual Book Maker: Make bilingual e-books with AI translation, an automated tool for translating whole books

General Introduction Bilingual Book Maker is an open source project designed to help users create multilingual versions of eBooks using AI technology. The tool mainly uses ChatGPT for translation and supports multiple file formats including epub, txt and srt...

Latest AI Resources # AI Java Open Source Project # AI Translation

1yrs ago

090.7K

Fullmoon: iOS App for Chatting with Local Large Language Models

General Description Fullmoon is an application designed for iOS devices that aims to let users chat privately with local large language models. The app is optimized for Apple Silicon and is supported on iPhone, iPad and Mac. Users of the chat...

Latest AI Resources # AI Java Open Source Project # AI Localized Chat Application

2yrs ago

090.7K

TripoSG: Generating high-resolution 3D model assets from a single image

General Introduction TripoSG is an open source project developed by the VAST AI research team to generate high-quality 3D models from a single image. The project uses large-scale rectified flow transformer technology, combined with hybrid supervised training and high-quality datasets, to allow the generated 3D models to have...

Latest AI Resources # AI Java Open Source Project # AI Text & Image to 3D

1yrs ago

090.7K

NodeTool: a node orchestration-based workflow visualization client for AI models

General Introduction NodeTool is an innovative AI authoring platform designed to provide a simple, intuitive interface for AI enthusiasts, developers, data scientists and creatives. Whether you're an artist, developer, or beginner, NodeTool helps you quickly prototype creative...

Latest AI Resources # AI Java Open Source Project # Low-code workflow

2yrs ago

090.6K

SVFR: A Unified Framework for Video Face Restoration, for Repairing Black-and-White and Blurry Old Portrait Videos

Comprehensive Introduction SVFR (Stable Video Face Restoration) is a unified framework for video face restoration that supports Blind Face Restoration (BFR), colorization, inpainting, and combinations of these tasks. The framework utilizes generative and motion priors by unifying ...

Latest AI Resources # AI Image Enlargement and Restoration # AI Java Open Source Project

2yrs ago

090.5K

MIDI-3D: An open source tool to quickly generate multi-object 3D scenes from a single image

General Introduction MIDI-3D is an open source project developed by the VAST-AI-Research team that quickly generates 3D scenes containing multiple objects from a single image, aimed at developers, researchers and creators. The tool is based on a multi-instance diffusion model technique...

Latest AI Resources # AI Java Open Source Project # AI Text & Image to 3D

1yrs ago

090.5K

E2M: Convert multiple file formats to Markdown for easy document formatting unification

General Introduction E2M (Everything to Markdown) is an open source Python library designed to convert a wide range of file formats to Markdown format. The tool supports formats including doc, docx, epub, html, htm, u...

Latest AI Resources # AI Java Open Source Project # Document Extraction and Cleaning

2yrs ago

090.5K

Browse AI: Extracting and Monitoring Structured Data Without Code

General Introduction Browse AI is a no-code, cloud-based web automation tool designed to help users extract and monitor data from any website without programming. You can train a bot to perform data extraction, monitoring and automation tasks with just one mouse point...

Latest AI Resources # AI Open Services # AI Java Open Source Project # No code development

2yrs ago

090.5K

ALog: portable AI voice diary app with speech-to-text support.

General Introduction ALog is an AI-based voice diary application designed to help users record their daily lives by voice. The project is developed by duxins and open-sourced on GitHub. Users can record their diary through voice input, and the app will automatically convert the voice into text...

Latest AI Resources # AI Java Open Source Project # AI Speech to Text

2yrs ago

090.3K