Latest AI Resources

Total: 3105 articles
Yuanzhen Digital Human (元真数字人): digital human livestreaming and talking-head short videos, a commercial AI virtual human livestreaming tool

Yuanzhen Digital Human is an AIGC (AI-generated content) platform that offers one-stop services such as digital human livestreaming, short video production, and an AI assistant. The platform integrates AI synthesis algorithms with GPT-style large models, lets users create their own Q&A models, provides real...
1yr ago
072.5K
5ire: a cross-platform desktop client for large models with local vector knowledge base support

5ire is an open source, cross-platform desktop client for large models that gives users convenient local vector knowledge base management and model interaction. The software parses multiple document formats and stores them as vectors, and it offers retrieval-augmented generation (RAG) capabilities. In addition, 5i...
2yrs ago
072.5K
AICamp: a large-model chat platform for teams; connect your own API or use GPT-4o-mini for free

AICamp is an AI platform designed to simplify the use of various AI tools and models. It provides a shared workspace where team members can collaborate and improve productivity. AICamp offers a wide range of advanced AI features to help organizations bring...
1yr ago
072.4K
MiniRAG: a simplified retrieval-augmented generation framework that recalls relevant text chunks through an entity graph index

MiniRAG is an extremely simple retrieval-augmented generation (RAG) framework that aims to deliver good RAG performance even with small models, through heterogeneous graph indexing and lightweight topology-enhanced retrieval. It is developed by the Data Science Laboratory of the University of Hong Kong (HKUDS) to address ...
1yr ago
072.4K
Fun-ASR - A New-Generation Speech Recognition Model Jointly Launched by DingTalk and Tongyi

Fun-ASR is a large speech recognition model jointly launched by DingTalk and Tongyi Lab. Trained on massive amounts of audio data, it accurately recognizes terminology from many industries, such as internet, technology, and home decoration, significantly improving recognition accuracy. The model draws on DingTalk enterprise information for inference optimization to reduce hallucinations...
10mos ago
072.3K
MindSearch: an open source AI search engine framework for deploying your own Perplexity-style search engine

MindSearch is an open source AI search engine framework launched by Shanghai Artificial Intelligence Laboratory, aiming to simulate the human thought process for complex information gathering and integration. The tool combines large language models (LLMs) with search engines through multi-agent...
2yrs ago
072.3K
WebShaper - An Open Source AI Training Data Synthesis System from Alibaba's Tongyi Lab

WebShaper is an AI training data synthesis system launched by Alibaba's Tongyi Lab. Based on formalized modeling and an agentic expansion mechanism, it generates high-quality, scalable training data to help AI agents improve their complex information retrieval capabilities. The system introduces the concept of "knowledge projection"...
11mos ago
072.3K
Mindstream AI Assistant (心流AI助手): a deep knowledge search tool and professional research assistant with an integrated knowledge base

Mindstream AI Assistant is an intelligent search and knowledge acquisition tool designed to help users efficiently find all kinds of knowledge, from everyday encyclopedic facts to professional academic papers. With Mindstream AI Assistant, users can search content across the Internet, quickly find the information they need, and enter an efficient flow state...
1yr ago
072.2K
Paper2Code: Automatically Converting Machine Learning Papers into Runnable Code

Paper2Code is an open source project that aims to address the lack of code implementations for machine learning papers. It automatically transforms scientific papers into runnable code repositories through PaperCoder, a multi-agent large language model (LLM) system. The system uses planning ...
1yr ago
072.2K
Deep Recall: an open source tool that provides an enterprise-grade memory framework for large models

Deep Recall is an open source, enterprise-grade memory framework designed for large language models (LLMs). It provides hyper-personalized responses through efficient contextual retrieval and integration. The framework uses a three-tier architecture consisting of a memory service, a reasoning service, and a coordinator, supporting...
1yr ago
072.1K
DomoAI: Intelligent Video Art Style Conversion | Image to Video | Text to Video

DomoAI has recently launched its Video to Video feature, which converts existing videos into a completely different art style, letting users easily create visual art in unique styles. Other features on the platform can convert still images to video, text to picture...
2yrs ago
072.1K
Handy - A Free, Open Source Local AI Speech-to-Text Tool

Handy is a free, open source local speech-to-text tool that supports Windows, macOS, and Linux and is built with Rust and React. It processes voice data locally without uploading it to the cloud, which protects privacy, and is suited to quick transcription and text input.
7mos ago
072K
SmartRead: automatically annotates technical PDF documents and provides relevant citation sources

SmartRead is an AI-based open source tool designed for technical documents. It automatically analyzes PDF files and marks key content, such as important terms, titles, or core ideas, to help users quickly understand complex documents. At the same time, it can also provide with the main document...
1yr ago
072K
Image AI: a collection of AI image editing tools with free video face swapping, easy to get started

Image AI is an all-in-one AI image platform that offers a wide range of advanced image tools to help users achieve high-quality visual effects. Whether it's face swapping, image recognition, text-to-image generation, or image de-contextualization, Image AI can meet...
2yrs ago
072K
Agent Zero - A Free AI Agent Framework with Persistent Memory

Agent Zero is an open source AI framework for creating general-purpose, highly customizable intelligent assistants. It learns and evolves dynamically to handle a wide range of tasks, and its persistent memory retains previous experiences and solutions so that it can complete later tasks more efficiently.
1yr ago
072K
AI Test Kitchen: Google's Experimental Platform for Creative Generation and AI Technology

AI Test Kitchen is an experimentation platform launched by Google Labs to explore the combination of artificial intelligence and creativity. The platform lets users try out and give feedback on emerging AI technologies such as LaMDA. It provides a variety of tools to help users transform ideas into real...
2yrs ago
071.9K
AnimeGamer: An Open Source Tool for Generating Anime Videos and Character Interactions with Language Commands

AnimeGamer is an open source tool launched by Tencent ARC Lab. Users can generate anime videos with simple language commands, such as "Sousuke drives around in a purple car", and can have characters from different anime interact with each other, such as Kiki from Kiki's Delivery Service, and Castle in the Sky...
1yr ago
071.8K
opensource_notebooklm: an open source implementation of NotebookLM based on Deepseek-V3 and PlayHT TTS

Open Source NotebookLM is an AI project that combines Deepseek-V3's language understanding capabilities with PlayHT's speech synthesis technology, aiming to create an intelligent note-taking and conversation system. The project was developed by Build Fast w...
1yr ago
071.7K
Bige Design (笔格设计): an online image editor with free image generation tools for easily creating polished pictures

Bige Design is a website that provides online image editing and design services. Users can easily create and edit all kinds of visuals on the platform, including posters, PPT slides, and GIFs. Bige Design provides a wealth of design materials and templates, and supports AI smart tools, such as AI image generation, A...
1yr ago
071.7K
MegaParse: parses all types of documents into LLM-ready data, fully preserving tables, images, and other information in the document

MegaParse is a versatile document parsing tool designed to optimize data processing for large language models (LLMs). Whether you are working with text, PDF, PowerPoint presentations or Word documents, MegaParse...
2yrs ago
071.7K
WiseMind AI: a local document chat and note-taking tool

WiseMind AI is an AI-powered learning assistant that focuses on improving users' learning efficiency and knowledge management. Its core feature is fully local data storage to protect user privacy and security, along with support for importing and processing more than 10 document formats. Whether...
1yr ago
071.7K
CogniWerk: generate images for free with models such as FLUX 1.1, with support for Civitai imports and LoRA training

CogniWerk is a browser-based image ideation platform designed to give professionals access to advanced generative AI image models. The platform helps users create text, image, and video content through a user-friendly interface. The core of CogniWerk...
2yrs ago
071.6K
Devika: an open source AI software engineer agent that understands instructions, splits them into subtasks, and writes code

Devika is an AI software engineer that understands high-level human instructions, breaks them down into steps, researches the relevant information, and writes code to achieve a given goal. It develops software using large language models, planning and reasoning algorithms, and web browsing capabilities. D...
1yr ago
071.6K
RLAMA: a command-line RAG system for Q&A over local documents

RLAMA is an open source document Q&A RAG (retrieval-augmented generation) system developed by DonTizi and hosted on GitHub. Its core feature is that all functionality is driven from the command line. Users can use simple terminal commands to connect to local ...
1yr ago
071.5K
Krita: open source digital painting software with ComfyUI integration that avoids tedious configuration (PS + AI)

Krita is a free, open source professional painting program designed for illustrators, cartoonists, concept artists, and animators. It provides a powerful brush engine, layer management, animation tools, and a wealth of extension resources. Krita supports a variety of painting styles and work...
2yrs ago
071.5K
Lingyu Intelligence (灵宇智能): commercial digital human livestreaming provider | Feiying Digital Human (飞影数字人) | digital human livestream e-commerce

Lingyu Intelligence is a Chinese company specializing in AI products and technologies. It creates intelligent physical digital humans with "souls" and uses AI to analyze data, helping individuals, entrepreneurs, and businesses grow their revenue. They have launched a new product for virtual livestreaming and interactive sales promotion...
2yrs ago
071.4K
VibeVoice - Text-to-Speech Model from Microsoft

VibeVoice is a new text-to-speech (TTS) model from Microsoft. The model generates conversational audio from up to four different speakers and supports up to 90 minutes of continuous voice output, breaking the length limitations of traditional TTS systems.
10mos ago
071.4K
Boolpic: a free image editing and optimization tool for background removal, filters and animations, and image compression and enlargement

Boolpic is a free AI-driven image editing tool designed to help users efficiently process and optimize images. The platform offers features including background removal, image effects and filters, image animation, image compression and resizing, etc. Boolpic's...
1yr ago
071.4K
XRAG: a visual evaluation tool for optimizing retrieval-augmented generation systems

XRAG (eXamining the Core) is a benchmarking framework for evaluating the underlying components of advanced retrieval-augmented generation (RAG) systems. By profiling and analyzing each core module, XRAG provides information on how different configurations and components affect RAG...
1yr ago
071.3K
Leffa: high-fidelity virtual try-on and pose adjustment, Meta's open source controllable person image generation model

Leffa is a unified framework for controllable person image generation that enables precise manipulation of appearance (e.g., virtual try-on) and pose (e.g., pose transfer). The framework significantly reduces distortion of fine-grained details by guiding the target query to attend to the correct reference key in the attention layer, with ...
2yrs ago
071.2K
Tentacle AI (触手AI): a simple, easy-to-use AI drawing tool that supports training your own image style

Tentacle AI is a professional AI creation platform from Jellyfish Intelligence, providing AI painting, online drawing, a large library of models, and other functions. The platform offers minimalist and professional modes, is easy to use, provides a variety of drawing styles and design models along with a rich set of plug-ins, and lets users try AIGC creation online...
2yrs ago
071.2K