AI Personal Learning
and practical guidance
TRAE
Total 1020 articles

Tags: ai open source projects Page 45

InstantIR:受损图像修复与图像高清放大开源项目,最低16G显存-首席AI分享圈

InstantIR: damaged image repair and image high-definition zoom open source project, minimum 16G video memory

General Description InstantIR is an innovative single-image restoration model developed by the InstantX team, designed to resurrect your damaged images with extremely high-quality and realistic details, capable of performing high-quality restoration of damaged images. The tool not only restores the details of the image, through additional text tips...

OmniParse:从文档/多媒体中提取任何非结构化数据解析为结构化数据-首席AI分享圈

OmniParse: extract any unstructured data from documents/multimedia and parse it into structured data

Comprehensive Introduction OmniParse is a powerful data parsing and optimization platform designed to transform any unstructured data into structured, actionable data optimized for the GenAI (Generative Artificial Intelligence) framework. Whether you are working with documents, tables, images, videos, audio files or web content,...

tldraw:开源无限画布白板SDK,AI生成简约线框图和UML图-首席AI分享圈

tldraw: open source unlimited canvas whiteboard SDK, AI to generate minimalist wireframe diagrams and UML diagrams

General Description tldraw is a free and instant collaborative drawing tool that provides an unlimited canvas where users can quickly draw graphics, write text and collaborate instantly. Featuring an intuitive interface and excellent performance, it is suitable for team collaboration and remote work. Supported by the open source community, tldraw not only works...

PandasAI:数据分析对话平台,用自然语言完成数据查询与图表生成-首席AI分享圈

PandasAI: Data Analytics Dialog Platform for Data Queries and Chart Generation in Natural Language

General Introduction PandasAI is a Python based open source platform designed to simplify the process of data analysis through natural language processing techniques. Enabling users to interact with databases (e.g. SQL, CSV, pandas, polars, mongodb, noSQL, etc.) in a conversational manner. The platform utilizes large-scale language modeling ...

RD-Agent:自动化数据驱动研发工具,通过AI技术推动以数据为导向的研发过程-首席AI分享圈

RD-Agent: an automated data-driven R&D tool to drive data-driven R&D processes through AI technology

Comprehensive Introduction RD-Agent is an open source tool from Microsoft designed to automate and optimize the research and development (R&D) process. The tool focuses on data-driven scenarios to improve the efficiency of model and data development through artificial intelligence techniques.RD-Agent integrates research (Research) and development (D...

TableGPT2: A Multimodal Model for Tabular Data Integration

Comprehensive Introduction TableGPT2 is a multimodal model developed by a team from Zhejiang University, focusing on the integration and processing of tabular data. The model is pre-trained and fine-tuned to be able to perform well in tabular data related tasks while maintaining strong general-purpose language and coding capabilities.TableGPT2 is innovative in...

VideoChat:自定义形象和音色克隆的实时语音交互数字人,支持端到端语音方案和级联方案-首席AI分享圈

VideoChat: real-time voice-interactive digital person with customized image and tone cloning, supporting end-to-end voice solutions and cascading solutions

Comprehensive Introduction VideoChat is a real-time voice interaction digital human project based on open source technology, supporting end-to-end voice scheme (GLM-4-Voice - THG) and cascade scheme (ASR-LLM-TTS-THG). The project allows users to customize the image and timbre of the digital human, and supports timbre cloning and lip synchronization...

SFT-data-builder:利用免费大模型API生成AI训练数据,0成本大模型训练数据生成-首席AI分享圈

SFT-data-builder: generate AI training data using free big model API, 0 cost big model training data generation

Comprehensive Introduction SFT-data-builder is an open source project designed to generate high-quality SFT training data by combining user's private domain data using the free Big Model API. The tool supports a variety of AI model formats and provides one-click generation, batch generation, flexible editing and local storage to help users quickly...

Aggregator: one-stop agent crawling and aggregation platform, free agent pool (please use in compliance)

Comprehensive introduction Aggregator is an open source project aimed at creating a free proxy pool that can crawl a variety of available proxy nodes. The platform has a flexible plug-in system , the user can according to the special needs of the target site , through plug-ins to achieve specific functions . The project is mainly used to learn crawling techniques , banned ...

OpenHands:AI 驱动的软件开发多智能代理助手,覆盖开发者各类操作-首席AI分享圈

OpenHands: AI-driven Multi-Intelligent Agent Assistant for Software Development, Covering All Types of Developer Operations

General Introduction OpenHands is an open source project developed by the All-Hands-AI team to streamline the software development process through AI technology. Formerly known as OpenDevin and now renamed OpenHands, the platform provides a powerful AI-driven development assistant that executes what human developers can...

Perplexica:1比1复刻 Perplexity AI 功能和界面的开源AI搜索引擎-首席AI分享圈

Perplexica: an open source AI search engine that replicates Perplexity AI's features and interface 1 to 1

General Introduction Perplexica is an open source AI-driven search engine designed to provide answers that reach deep into the Internet. It uses advanced machine learning algorithms, such as similarity search and embedding techniques, to optimize search results and provide clear answers with cited sources.Perplexica is powered by SearxNG ...

Scraperr:自托管网页数据抓取工具-首席AI分享圈

Scraperr: self-hosted web data scraping tool

General Introduction Scraperr is a self-hosted web data scraping tool that allows users to scrape web data by specifying XPath elements. Users submit a URL and corresponding crawl element and the results are displayed in a table and can be downloaded as an Excel file.Scraperr supports user login to manage the crawl...

AppAgent:利用多模态智能体自动操作智能手机-首席AI分享圈

AppAgent: automated smartphone operation using multimodal intelligences

Comprehensive Introduction AppAgent is a Large Language Model (LLM)-based multimodal agent framework designed to manipulate smartphone applications. The framework mimics human interactions such as taps and swipes through a simplified manipulation space, thus eliminating the need for system back-end access and expanding its use in different applications...

en_USEnglish