AI open source project

1020 articles in total
Agent TARS: An Open Source Agent That Operates Computers Using Vision and Commands

Agent TARS is a multimodal AI agent open-sourced by ByteDance. Its core feature is visually understanding web content and combining that with command-line and file-system operations to help users complete complex computer tasks. Instead of requiring manual operation like traditional tools, it can self...
5mos ago
LiberSonora: Audiobook Subtitle Extraction and Multilingual Translation, Transcribing Audiobooks into Multiple Languages

LiberSonora, which means "free sound", is a powerful AI-enabled open source audiobook toolset. It supports intelligent subtitle extraction, AI title generation, multilingual translation, and more, and can batch-process files offline with GPU acceleration. LiberSo...
6mos ago
LLManager: A Management Tool That Combines Intelligent Automated Approval Workflows with Human Review

LLManager is an open source intelligent approval management tool built on LangChain's LangGraph framework. It focuses on automating the processing of approval requests while using human review to improve decisions. It does this through semantic search, few-shot learning and...
4mos ago
AnimeGamer: An Open Source Tool for Generating Anime Videos and Character Interactions with Language Commands

AnimeGamer is an open source tool launched by Tencent ARC Lab. Users can generate anime videos with simple language commands, such as "Sousuke drives around in a purple car", and can have characters from different anime interact with each other, such as Kiki from Kiki's Delivery Service, and Castle in the Sky...
4mos ago
Agent S: An Open Source Agent Framework That Operates Computers Like a Human

Agent S is an open-source framework developed by Simular AI that lets agents operate computers like humans through a graphical user interface (GUI). It uses multimodal large language models and learning from experience to accomplish tasks such as browsing the web, editing documents, using software...
4mos ago
Abu Quantitative Trading System: A Python-Based Open Source Quantitative Trading Platform

The Abu quantitative trading system is an open source platform developed in Python. It was created by the user "bbfamily" to help investors implement quantitative trading strategies in code. The system supports backtesting and trading of various financial products such as stocks, options, futures and Bitcoin. It...
5mos ago
KrillinAI: A Multilingual Video Globalization Tool with One-Click Translation and Dubbing

KrillinAI is an open-source video processing tool that uses artificial intelligence to help users translate videos and dub them automatically. It covers the whole process, from downloading the video to generating a finished product adapted to different platforms, in just a few clicks. The developers are available on GitHub...
2mos ago
Deep Recall: An Open Source Tool That Provides an Enterprise-Grade Memory Framework for Large Models

Deep Recall is an open source, enterprise-grade memory framework designed for large language models (LLMs). It delivers hyper-personalized responses through efficient context retrieval and integration. The framework uses a three-tier architecture consisting of a memory service, a reasoning service, and a coordinator, supporting...
3mos ago
Agenta: A Tool for Evaluating Prompts and Model Performance, Integrated into AI Applications

Agenta is an open source AI model management tool that helps users experiment with prompts, test model performance and monitor runs. It suits people who want to develop AI applications quickly and provides an easy-to-use platform. You can use it to try the effect of different prompts on...
5mos ago
RLAMA: A Command-Line RAG System for Question Answering over Local Documents

RLAMA is a document question-answering RAG (Retrieval-Augmented Generation) system, developed as open source by DonTizi and hosted on GitHub, whose core feature is that it is operated from the command line. Users can use simple terminal commands to connect to local ...
5mos ago
DeepRant: An Open Source Client for Real-Time Translation of Game Chat Content

DeepRant is an open source translation tool for gamers, designed to overcome language barriers on international servers. It translates in-game text instantly through shortcut keys, supports translation between multiple languages, and lets players quickly understand and reply to chat messages without leaving the game...
5mos ago
Deep Research: An AI-Based Deep Research Assistant That Provides Efficient Research Tools and Report Generation

Deep Research is an AI-based research assistant designed to perform iterative deep research by combining search engines, web crawling, and large language models. The project was released by dzhng on GitHub with the goal of providing an easy-to-use deep research genera...
4mos ago
ScrapeGraphAI: Web Scraping with a Single Prompt, an Intelligent Web Content Extraction Tool with No Rules to Write

ScrapeGraphAI is a Python web scraping library that combines large language models (LLMs) with directed graph logic to create scraping pipelines for websites and local documents. What sets this tool apart is its level of simplicity and power...
7mos ago
An Open Source Service That Automatically Parses PDF Content and Extracts Text and Tables

This service automatically analyzes the layout of PDF documents, identifies text, titles, images, tables, formulas and other elements on each page, and determines their correct order. The tool supports OCR and can convert scanned PDFs to searchable text. It runs on Docker and provides two models...
4mos ago
OpenAlternative: A Curated Selection of Open Source Alternatives to Commonly Used SaaS Products

OpenAlternative is a platform that provides open source software alternatives, aiming to help users find suitable open source tools to replace the commercial SaaS products they use every day. The site helps users save money and improve through a carefully curated collection of open source tools...
8mos ago
CoT-Lab: An Experimental Dialogue Tool for Exploring Iterative Human-AI Collaborative Thinking

CoT-Lab is an experimental interface for exploring a new paradigm of human-computer collaboration. Based on cognitive load theory and active learning principles, CoT-Lab facilitates deep cognitive alignment between humans and artificial intelligence (AI) by creating "thinking partner" relationships. The project aims to...
6mos ago