Latest AI Resources

Total 2622 articles posts
Notta:AI会议记录与音频转录工具,自动转录会议、采访或录音

Notta: AI meeting recording and audio transcription tool to automatically transcribe meetings, interviews or recordings

General Description Notta is a powerful AI meeting recording and audio transcription tool designed to help users automatically convert meetings, interviews or audio recordings into searchable text. With Notta, users can easily transcribe, edit, summarize and collaborate to boost productivity.Notta supports...
7mos ago
04.1K
小智 AI 聊天机器人:打造你的AI聊天伴侣,轻松实现语音对话和智能互动

Xiaozhi AI Chatbot: Build your AI chatting companion, easily realize voice conversation and intelligent interaction

Comprehensive Introduction Xiaozhi AI Chatbot is an open source project based on the ESP32 development board, designed to help users build their own AI chat companion. The project was developed by Shrimp and is mainly used for teaching purposes to help more people get started with AI hardware development and to understand how to apply large language models to real...
5mos ago
04.4K
WrenAI:对话式数据分析AI助手,直接获取答案、SQL查询与分析报表

WrenAI: Conversational Data Analytics AI Assistant with Direct Access to Answers, SQL Queries & Analytics Reports

General Introduction WrenAI is an open source SQL AI assistant specifically designed to help data teams, product teams and business teams gain data insights through natural language conversations. It is capable of converting natural language into SQL queries, generating charts, spreadsheets and reports, supporting multilingual...
7mos ago
04.2K
Ryne AI:学术工作与专业报告学习助手,研究并撰写论文,绕过AI检测

Ryne AI: Academic Work & Professional Report Learning Assistant to Research and Write Papers to Bypass AI Detection

General Introduction Ryne AI is an AI tool platform designed for students to enhance learning efficiency and academic performance by providing a wide range of AI tools. The platform's main features include text humanization, AI detection and avoidance, intelligent study assistant, essay writing assistant, and note-taking tools...
7mos ago
03.4K
VITA:开源视觉与语音实时交互的多模态大语言模型

VITA: Open Source Multimodal Large Language Model for Real-Time Interaction between Vision and Speech

General Introduction VITA is a leading open source interactive multimodal large language modeling project, pioneering the ability to achieve true full multimodal interaction. The project launched VITA-1.0 in August 2024, pioneering the first open source interactive fully-modal large language model.2024...
7mos ago
04.4K
TransRouter:基于Gemini多模态模型,实时中英互译的音频转换工具

TransRouter: A Real-Time Audio Conversion Tool for Chinese-to-English Translation Based on Gemini Multimodal Modeling

TransRouter is a real-time voice translation tool based on Google's Gemini model, specifically designed for real-time voice translation between English and Chinese. The tool can be seamlessly integrated into video conferencing software such as Zoom, providing an easy way for cross-language...
7mos ago
03.5K
opensource_notebooklm:基于Deepseek-V3和PlayHT TTS的NotebookLM开源实现

opensource_notebooklm: open source implementation of NotebookLM based on Deepseek-V3 and PlayHT TTS

General Introduction Open Source NotebookLM is an innovative artificial intelligence project that combines Deepseek-V3's language understanding capabilities with PlayHT's speech synthesis technology, aiming to create an intelligent note-taking conversation system. The project was developed by Build Fast w...
7mos ago
03.1K
Vision is All You Need:使用视觉语言模型构建智能文档检索系统(Vision RAG)

Vision is All You Need: Building an Intelligent Document Retrieval System Using Visual Language Models (Vision RAG)

Comprehensive Introduction Vision-is-all-you-need is an innovative visual RAG (Retrieval Augmented Generation) system demonstration project that breaks new ground in applying Visual Language Modeling (VLM) to the document processing domain. Unlike traditional text chunking methods, the system directly makes...
7mos ago
03.7K
Diffbot GraphRAG LLM:依赖外部实时知识图谱数据的LLM推理服务

Diffbot GraphRAG LLM: LLM reasoning service relying on external real-time knowledge graph data

Comprehensive Introduction Diffbot LLM Reasoning Server is an innovative large-scale language modeling system with special optimizations and improvements based on the LLama model architecture. The most important feature of the project is the integration of real-time Knowledge Graph with retrieval-enhanced generation...
7mos ago
03.5K
MetaGPT:多智能体协作框架,构建 AI 软件开发团队实现自然语言编程

MetaGPT: A Multi-Intelligence Collaboration Framework for Building AI Software Development Teams for Natural Language Programming

Comprehensive Introduction MetaGPT is an innovative multi-intelligence body framework designed to model the operations of a complete AI software company. Created by geekan (Alexander Wu), the goal of the project is to combine GPT models with different roles into a collaborative entity...
5mos ago
04.4K
Twelve Labs:理解视频内容的多模态AI解决方案,视频搜索、生成、嵌入API服务

Twelve Labs: multimodal AI solution for understanding video content, video search, generation, embedding API services

General Introduction Twelve Labs is a multimodal AI company focused on video understanding, dedicated to helping users understand and process large amounts of video content through advanced AI technologies. Its core technologies include video search, generation, and embedding, which are able to extract key features from video such as actions, objects...
6mos ago
02.9K
Fish Agent:端到端AI语音克隆助手,实时语音对话助理,Fish Speech衍生项目

Fish Agent: end-to-end AI voice cloning assistant, real-time voice conversation assistant, Fish Speech spin-off project

Comprehensive Introduction Fish Speech Derivative Project Fish Agent is a revolutionary end-to-end AI speech cloning system developed based on the V0.1 3B model architecture. As a fully end-to-end speech clone processing system, its most important feature is the use of innovative speechless...
7mos ago
03.8K
FunClip:智能剪辑视频内容为短片,轻松实现精准视频片段提取/裁剪

FunClip: Intelligent editing of video content into short clips, easy to realize accurate video clip extraction/cropping

Comprehensive Introduction FunClip is a fully open source localized automatic video editing tool developed by TONGYI Speech Lab of Alibaba Dharma Institute. The tool integrates the industrial-grade Paraformer-Large speech recognition model, which can accurately recognize the speech in the video...
7mos ago
04.5K
Dify-WebUI:基于Dify API的桌面智能对话客户端,提供企业级AI对话能力

Dify-WebUI: Desktop Intelligent Conversation Client based on Dify API, providing enterprise-grade AI conversation capabilities

Comprehensive Introduction Dify-WebUI is a modern desktop smart conversation app based on the Dify API, designed to provide enterprises with powerful AI conversation capabilities. The application supports a variety of preset theme colors to meet the personalized needs of enterprises, and has a knowledge base management function to support...
7mos ago
04.3K
小红书AI运营助手:自动生成和发布小红书文章

Xiaohongshu AI operation assistant: automatically generate and publish Xiaohongshu articles

Comprehensive Introduction Xiaohongshu AI Operation Assistant (xhsaipublisher) is an automation tool designed for publishing articles on the Xiaohongshu platform. The program combines a graphical user interface with automation scripts that utilize big model technology to generate content and automatically log in and publish via browser...
7mos ago
04.5K
Doc2X:文档图片公式识别与转换工具,支持多格式转换与高精度翻译

Doc2X: Document image formula recognition and conversion tools, support for multi-format conversion and high-precision translation

Comprehensive introduction Doc2X is a powerful document image formula recognition and conversion tools, is committed to providing efficient and intelligent document processing solutions. Whether it is an academic research paper, a textbook, a corporate document or a financial report, Doc2X can accurately recognize PDF tables and...
6mos ago
03.7K
3MinTop:3分钟AI读书,快速掌握书籍精华培养阅读习惯

3MinTop: 3-minute AI reading, quickly grasp the essence of the book to develop reading habits

Comprehensive Introduction 3MinTop is an AI-driven reading assistant designed to help users master the core content of books in a short period of time, lower the threshold of reading, and develop good reading habits. Whether you are a novice reader or an experienced reader, 3MinTop can summarize and pass through intelligent...
7mos ago
03.3K
Orchestra: Building Smart AI Teams for Easier and More Efficient Multi-Intelligence Collaborative Development

Orchestra: Building Smart AI Teams for Easier and More Efficient Multi-Intelligence Collaborative Development

Comprehensive Introduction Orchestra is an innovative lightweight Python framework that focuses on building multi-intelligence collaborative systems based on the Large Language Model (LLM). It employs a unique method of arranging intelligences so that multiple AI intelligences can work together harmoniously like a symphony orchestra. By modeling ...
7mos ago
02.6K
Harbor:一键部署本地LLM开发环境,轻松管理和运行AI服务的容器化工具集

Harbor: a containerized toolset for easily managing and running AI services with one-click deployment of local LLM development environments

Comprehensive Introduction Harbor is a revolutionary containerized LLM toolset focused on simplifying the deployment and management of local AI development environments. It enables developers with a clean command line interface (CLI) and companion application to launch and manage with a single click, including LLM backends, API interfaces, front...
7mos ago
03.3K
ExtractThinker:提取和分类文档为结构化数据,优化文档处理流程

ExtractThinker: extracting and classifying documents into structured data to optimize the document processing flow

Comprehensive Introduction ExtractThinker is a flexible document intelligence tool that utilizes Large Language Models (LLMs) to extract and classify structured data from documents, providing a seamless ORM-like document processing workflow. It supports a variety of document loaders, including Tess...
7mos ago
03.3K
NeoAI:让AI接管电脑远程操作,使用自然语言控制电脑的开源项目

NeoAI: Open source project that lets AI take over remote operation of computers and control them using natural language

General Introduction NeoAI is an innovative open source AI assistant tool that allows users to easily control and manage their computers through natural language conversations. Without writing any code, users can simply use everyday conversations to find files, automate tasks, manage devices, etc.NeoAI...
7mos ago
04.9K
HtmlRAG:构建高效HTML检索增强生成系统,优化RAG系统中的HTML文档检索与处理

HtmlRAG: Building an Efficient HTML Retrieval Enhanced Generation System, Optimizing HTML Document Retrieval and Processing in RAG Systems

Comprehensive Introduction HtmlRAG is an innovative open source project focused on improving the processing of HTML documents in Retrieval Augmented Generation (RAG) systems. The project presents a novel approach that argues that using HTML formatting in RAG systems is more efficient than plain text. The project contains a complete ...
7mos ago
03.7K
TryOffAnyone:从人物身上提取服装为平铺服装展示图的AI工具

TryOffAnyone: AI tool for extracting garments from a person as a tiled garment display image

Comprehensive Introduction TryOffAnyone is a breakthrough AI image processing tool specialized in solving the challenges of clothing display in the e-commerce field. It is able to intelligently convert photos of clothes in real people's wearing state into lay-flat display effect images, this technology is based on the latest Latent Dif...
7mos ago
03K
ScrapeGraphAI:一个提示词搞定网页抓取,无需编写规则智能网页内容提取工具

ScrapeGraphAI: A single cue word for web crawling, no need to write rules intelligent web content extraction tools

Comprehensive Introduction ScrapeGraphAI is an innovative Python web crawling library that cleverly combines Large Language Modeling (LLM) and Direct Graph Logic to create crawling pipelines for websites and local documents. The uniqueness of this tool lies in its perfect level of simplicity and power...
7mos ago
02.6K
AnkiAIUtils: Anki Flashcard Learning AI Toolset, an intelligent assistant that automatically optimizes memorized cards

AnkiAIUtils: Anki Flashcard Learning AI Toolset, an intelligent assistant that automatically optimizes memorized cards

General Description AnkiAIUtils is a set of AI-enhanced tools designed for the Anki flashcard learning system. Developed by a medical student, the tool is designed to automatically improve cards that users are struggling with during the learning process through AI technology. It can intelligently provide users with personalized...
7mos ago
03.2K
YouMind:专业创作者辅助工具,摘录各类材料并存入知识库辅助写作

YouMind: a professional creator's aid that excerpts all kinds of material and deposits it in a knowledge base to aid in writing.

General Introduction YouMind is an AI authoring system powered by top-notch Large Language Models (LLMs) designed to help users extract and preserve important content from a wide range of materials, focusing on creation rather than simple collection. Whether browsing the web, watching YouTube videos, listening to podcasts...
7mos ago
03.5K
Story-Adapter:根据长篇故事生成连续且风格一致的图像插画

Story-Adapter: generating continuous and consistent graphic illustrations based on a long story

General Introduction Story-Adapter is an innovative story visualization framework that converts textual stories into coherent image sequences. Developed by researchers, this project employs an iterative approach that requires no training to generate high-quality story illustrations. The framework is characterized by its ability to handle long...
7mos ago
03.4K
ElizaOS:构建自主执行的多智能体,功能完备的开源AI智能体开发框架

ElizaOS: Building Autonomously Executing Multi-Intelligents, a Fully Functional Open Source AI Intelligent Body Development Framework

Comprehensive introduction Eliza is an advanced multi-intelligent body (Multi-Agent) development framework , is committed to simplifying the construction and deployment of autonomous intelligent body (Autonomous Agent) process . It supports the deployment of multiple intelligent bodies with different role settings , can realize intelligent ...
7mos ago
04.9K
Aiarty Image Matting:专业AI图像抠图,精准去除背景,免授权安装包

Aiarty Image Matting: professional AI image keying, accurate background removal, license-free installer

Comprehensive Introduction AIARTY AI Image Keying is an advanced AI image processing software designed for e-commerce, design and photography fields. The software utilizes state-of-the-art AI technology to accurately remove image backgrounds, process details such as complex hairs and translucent objects, and achieve foreground and background without...
6mos ago
03K
Memary:利用知识图谱增强Agent长期记忆的开源项目

Memary: an open-source project to enhance Agent long-term memory using knowledge graphs

General Introduction Memary is an innovative open source project focused on providing long-term memory management solutions for autonomous intelligences. The project helps intelligences break through the limitations of traditional context windows to achieve smarter interaction experiences through knowledge graphs and specialized memory modules.Memary adopts...
7mos ago
04.8K
AI reads books:AI逐页阅读PDF书籍,自动提取知识要点并生成总结

AI reads books: AI reads PDF books page by page, automatically extracts the main points of knowledge and generates summaries.

Comprehensive Introduction AI-reads-books-page-by-page is a Python-based development of intelligent PDF book analysis tool, which can automate the page-by-page analysis of PDF books, extract the key knowledge points, and after the specified page interval to generate stage...
7mos ago
04.4K
Boolpic:免费图片编辑和优化工具,去除背景,添加滤镜和动画,图像压缩和放大

Boolpic: free photo editing and optimization tool, remove background, add filters and animations, image compression and enlargement

General Introduction Boolpic is a free AI-driven image editing tool designed to help users efficiently process and optimize images. The platform offers a variety of powerful features including background removal, image effects and filters, image animation, image compression and resizing, etc.Boolpic's...
8mos ago
04.1K
BgSub:消除或替换图像背景,智能优化图像背景和边缘

BgSub: Eliminate or replace image backgrounds, intelligently optimize image backgrounds and edges

General Introduction BgSub is a convenient and easy-to-use online image processing tool that allows users to quickly eliminate or replace image backgrounds without having to upload an image. The platform utilizes advanced artificial intelligence technology to perform all operations within the browser, ensuring user privacy and data security.BgSub can be used in just ...
8mos ago
03.1K
灵办AI:提升工作与学习效率的办公全能AI助手

Spirit Office AI: Office All-in-One AI Assistant to Enhance Work and Learning Efficiency

Comprehensive Introduction Lingban AI is an all-in-one AI assistant designed to enhance users' work and study efficiency. It offers a variety of functions, including translation, dialog, writing, AI search, AI reading, copy rewriting, code generation and correction, and more. Whether you need to translate a foreign language, generate copy, or perform code...
4mos ago
03.2K