AI open source projects

Total: 1020 articles
tldraw: open source infinite-canvas whiteboard SDK, with AI generation of minimalist wireframes and UML diagrams

tldraw is a free, real-time collaborative drawing tool that provides an infinite canvas where users can quickly draw shapes, write text, and collaborate live. With an intuitive interface and strong performance, it is well suited to team collaboration and remote work. Supported through the open source community, tldr...
1 yr ago
0 · 39.1K
NVIDIA Garak: Open-source tool to detect LLM vulnerabilities and secure generative AI

NVIDIA Garak is an open source tool that specializes in detecting vulnerabilities in large language models (LLMs). It checks a model for multiple weaknesses, such as hallucination, data leakage, prompt injection, misinformation generation, and harmful content generation, through static, dynamic and adaptive probing...
1 yr ago
0 · 39.1K
RealtimeVoiceChat: low-latency natural spoken conversation with AI

RealtimeVoiceChat is an open source project focused on real-time, natural voice conversations with artificial intelligence. Users speak into a microphone; the system captures the audio in the browser, quickly converts it to text, and a large language model (LLM) generates back...
7 mos ago
0 · 39K
Tencent Hunyuan3D: Generate high-resolution 3D assets, with multiple 3D asset generation workflows

Tencent Hunyuan3D 2.0 is an advanced large-scale 3D synthesis system from Tencent designed to generate high-resolution textured 3D assets. The system consists of two core components: Hunyuan3D-DiT, a large-scale shape generation model, and Hunyuan3D-Paint, a large-scale texture...
10 mos ago
0 · 38.8K
AutoGen: A Multi-Agent Conversation Framework Developed by Microsoft

AutoGen is an open source framework developed by a team of Microsoft researchers that focuses on simplifying the building of large language model (LLM) applications through multi-agent conversations. It allows developers to create AI agents that can talk to each other and collaborate to solve tasks. This approach not only improves the performance of LLM...
10 mos ago
0 · 38.8K
Deep Recall: an open source tool that provides an enterprise-grade memory framework for large models

Deep Recall is an open source, enterprise-grade memory framework designed for large language models (LLMs). It provides hyper-personalized responses through efficient context retrieval and integration. The framework uses a three-tier architecture, including a memory service, a reasoning service, and a coordinator, supporting...
7 mos ago
0 · 38.7K
YuE: a foundation model that turns lyrics into complete songs, supporting a wide range of musical styles

YuE is an open source foundation model for full-song generation that focuses on turning lyrics into complete songs. Unlike other models that can only generate short snippets of non-vocal music, YuE is capable of generating full songs with lead and backing vocals up to several minutes in length. The model addresses music generation in...
10 mos ago
0 · 38.6K
Amurex: open source AI meeting assistant that automatically records meetings and generates summaries

Amurex is an open source AI meeting assistant developed by The Personal AI Company that aims to improve meeting efficiency through intelligent features. Amurex can provide real-time suggestions, generate intelligent summaries, record meeting content, and automatically send follow...
11 mos ago
0 · 38.4K
Agent S: An Open Source Agent Framework That Operates Computers Like a Human

Agent S is an open source framework developed by Simular AI that lets agents operate computers like humans through a graphical user interface (GUI). It uses a multimodal large language model and experience-based learning techniques to accomplish tasks such as browsing the web, editing documents, using software...
8 mos ago
0 · 38.4K
Memary: an open source project that enhances agents' long-term memory using knowledge graphs

Memary is an open source project focused on providing long-term memory management for autonomous agents. Through knowledge graphs and specialized memory modules, the project helps agents break through the limitations of traditional context windows and deliver smarter interactions. Memary adopts...
11 mos ago
0 · 38.4K
AnyText: Generate and edit multilingual text in images, with highly controllable generation of multi-line Chinese text

AnyText is a multilingual visual text generation and editing tool built on diffusion models. It generates natural, high-quality multilingual text in images and supports flexible text editing. It was developed by a team of researchers and presented at ICLR 2024...
11 mos ago
0 · 38.3K
99AI: A commercial-grade web application integrating multimodal AI services (free and open source)

99AI is an open source AI web application project that aims to provide an easy-to-deploy, low-barrier integrated AI service platform. The project supports intelligent chat, multimodal models, an application marketplace, and web search, and integrates AI painting, music and video...
1 yr ago
0 · 38.2K
ModelBest (面壁智能): World-Leading Lightweight, High-Performance On-Device Large Models

ModelBest is a company specializing in lightweight, high-performance large models, dedicated to bringing advanced AI technologies to mainstream consumer electronics and the various edge devices of daily life. Its MiniCPM series of on-device models is characterized by extremely efficient use of compute and memory...
1 yr ago
0 · 38.2K
FinGPT: Open Source Financial Large Language Model Platform for Financial Analysis and Forecasting

FinGPT is an open source financial large language model platform developed by the AI4Finance Foundation, designed for the financial sector to solve complex financial tasks and drive innovation in fintech. FinGPT utilizes lightweight adaptation techniques and reinforcement learning approaches...
10 mos ago
0 · 38.1K
ChatTTS: a speech generation model that mimics natural human speech (ChatTTS one-click accelerator package)

ChatTTS is a generative speech model designed for conversational scenarios. It generates natural and expressive speech, supports multiple languages and multiple speakers, and is suitable for interactive conversations. The model achieves this by predicting and controlling fine-grained prosodic features such as laughter, pauses and interjections, sup...
10 mos ago
0 · 38.1K
Research Rabbit: Web research and report writing with a local LLM that automatically digs into a user-specified topic and generates a summary

Research Rabbit is a web research and summarization assistant based on a local LLM (large language model). After the user provides a research topic, Research Rabbit generates a search query, obtains relevant web results, and summarizes those results...
8 mos ago
0 · 37.9K
WeClone: train a digital avatar from WeChat chat logs and voice messages

WeClone is an open source project that combines WeChat chat logs and voice messages with large language models and speech synthesis technology to let users create a personalized digital avatar. The project can analyze a user's chat habits to train the model, and can also use a small number of voice samples to generate realistic sound...
8 mos ago
0 · 37.9K
MegaTTS3: A Lightweight Model for Synthesizing Chinese and English Speech

MegaTTS3 is an open source speech synthesis tool developed by ByteDance in cooperation with Zhejiang University, focusing on generating high-quality Chinese and English speech. Its core model has only 0.45B parameters, making it lightweight and efficient, and it supports mixed Chinese-English speech generation and voice cloning. The project is hosted on ...
8 mos ago
0 · 37.9K
NarratoAI: Text-Generated Film and TV Narration and Automated Editing Tool

NarratoAI is a fully automated tool that integrates film and TV narration, automated editing, dubbing and subtitle generation. It relies on large language model (LLM) technology to automatically generate copy and edit videos with matching voiceovers and subtitles, providing users with a one-stop...
1 yr ago
0 · 37.8K
Bailing (百聆): a low-latency open source voice conversation assistant for natural spoken dialog

Bailing (百聆) is an open source voice conversation assistant designed to hold natural spoken conversations with users. The project combines automatic speech recognition (ASR), voice activity detection (VAD), a large language model (LLM) and text-to-speech synthesis (TTS) to achieve...
10 mos ago
0 · 37.8K
AI ContentCraft: a versatile AI content creation tool for generating short stories, dialog scripts, voiceovers, and illustrations

AI ContentCraft is a versatile content creation tool that integrates text generation, speech synthesis, image generation and more. It helps creators quickly generate stories, podcast scripts, and accompanying audio and video content. The tool supports conversion between multiple languages and can batch...
10 mos ago
0 · 37.8K
SegAnyMo: open source tool to automatically segment arbitrary moving objects from video

SegAnyMo is an open source project developed by a team of researchers at UC Berkeley and Peking University, including Nan Huang. The tool focuses on video processing and can automatically identify and segment arbitrary moving objects in a video, such as people, animals or...
8 mos ago
0 · 37.7K
UltraRAG: A One-Stop RAG System Solution to Simplify Data Construction and Model Fine-Tuning

UltraRAG is a RAG (Retrieval-Augmented Generation) system solution jointly proposed by the THUNLP group at Tsinghua University, the NEUIR group at Northeastern University, ModelBest Inc. and the 9#AISoft team. The framework is built around agile deployment and modular construction...
10 mos ago
0 · 37.6K
Flow (Laminar): a lightweight task engine for building AI agents that simplifies and flexibly manages tasks

Flow is a lightweight task engine designed for building AI agents, emphasizing simplicity and flexibility. Unlike traditional node-and-edge workflows, Flow uses a dynamic task queue system that supports parallel execution, dynamic scheduling, and intelligent dependency management. Its core concept is ...
12 mos ago
0 · 37.5K
BuffGPT: A Low-Code Development Platform for Enterprise-Grade Generative AI Applications

BuffGPT is an open source AI application development platform based on large language models (LLMs), providing out-of-the-box features such as data processing, model invocation, RAG retrieval, and visual workflow orchestration to help users easily build and operate generative AI applications. The platform supports private...
9 mos ago
0 · 37.5K
MiniRAG: A Simplified Retrieval-Augmented Generation Framework That Recalls Relevant Text Chunks via Entity Graph Indexing

MiniRAG is an extremely simple Retrieval-Augmented Generation (RAG) framework that aims to deliver good RAG performance even with small models, using heterogeneous graph indexing and lightweight topology-enhanced retrieval. It is developed by the Data Science Laboratory of the University of Hong Kong (HKUDS) to address ...
10 mos ago
0 · 37.3K
Goose: an open source, extensible coding agent that automates end-to-end programming tasks

Goose is an open source AI agent tool developed by Block, Inc., designed to help developers automate everyday development tasks. It supports a wide range of large language models (LLMs) and interacts with users via the command line or a desktop application. Goose can perform a wide range of tasks from agent...
10 mos ago
0 · 37.3K
OpenAOE: A Large Model Group Chat Framework for Chatting with Multiple Large Language Models Simultaneously

OpenAOE is an open source large model group chat framework that aims to fill the current lack of chat frameworks in which multiple models respond in parallel. With OpenAOE, users can talk to multiple large language models (LLMs) at the same time and get parallel output. The framework supports ...
10 mos ago
0 · 37.3K
Paper2Code: Automatically Converting Machine Learning Papers into Runnable Code

Paper2Code is an open source project that aims to address the lack of code implementations for machine learning papers. It automatically transforms scientific papers into runnable code repositories using PaperCoder, a multi-agent large language model (LLM) system. The system uses planning ...
7 mos ago
0 · 37.2K
AnkiAIUtils: an AI toolset for Anki flashcard learning that automatically improves hard-to-memorize cards

AnkiAIUtils is a set of AI-enhanced tools designed for the Anki flashcard learning system. Developed by a medical student, it uses AI to automatically improve the cards that users struggle with while studying. It can intelligently provide users with personalized...
11 mos ago
0 · 37.2K