Latest AI Resources

Total 2972 articles posts
Memary:利用知识图谱增强Agent长期记忆的开源项目

Memary: an open-source project to enhance Agent long-term memory using knowledge graphs

General Introduction Memary is an innovative open source project focused on providing long-term memory management solutions for autonomous intelligences. The project helps intelligences break through the limitations of traditional context windows to achieve smarter interaction experiences through knowledge graphs and specialized memory modules.Memary adopts...
1yrs ago
063.3K
Research Rabbit:使用本地LLM进行网页研究和报告撰写,自动深入用户指定主题并生成总结。

Research Rabbit: Web research and report writing using native LLM, automatically drilling down into user-specified topics and generating summaries.

General Introduction Research Rabbit is a native LLM (Large Language Model) based web research and summarization assistant. After the user provides a research topic, Research Rabbit generates a search query, obtains relevant web results, and summarizes those results...
11mos ago
063.3K
99AI:集成多模态AI服务的商业化Web应用(免费开源)

99AI: A commercialized web application integrating multimodal AI services (free and open source)

Comprehensive Introduction 99AI is an open source AI web application project that aims to provide an easy-to-deploy, low-threshold integrated AI service platform. The project supports intelligent dialog, multimodal modeling, application plaza, networked search, and integrates AI painting, music and video...
1yrs ago
063.1K
AnyText:生成和编辑多语言图像文本,高可控在图像中生成多行中文

AnyText: Generate and edit multi-language image text, highly controllable to generate multiple lines of Chinese in the image

Comprehensive Introduction AnyText is a revolutionary multilingual visual text generation and editing tool developed based on the diffusion model. It generates natural, high-quality multilingual text in images and supports flexible text editing features. It was developed by a team of researchers and presented at ICLR 2024...
1yrs ago
063K
VibeVoice - 微软推出的文本到语音模型

VibeVoice - Text-to-Speech Model from Microsoft

VibeVoice is a new text-to-speech (TTS) model from Microsoft. The model generates conversational audio from up to four different speakers and supports up to 90 minutes of continuous voice output, breaking the length limitations of traditional TTS systems.
7mos ago
062.7K
AutoGen:微软开发的多智能体对话框架

AutoGen: A Multi-Intelligent Body Dialog Framework Developed by Microsoft

Comprehensive Introduction AutoGen is an open source framework developed by a team of Microsoft researchers focused on simplifying the building of large language model (LLM) applications through multi-intelligent body conversations. It allows developers to create AI agents that can talk to each other and collaborate to solve tasks. This approach not only improves the performance of LLM...
1yrs ago
062.7K
AI ContentCraft:生成短故事、对话脚本、配音、配图的多功能AI内容创作工具

AI ContentCraft: a versatile AI content creation tool for generating short stories, dialog scripts, voiceovers, and graphics

General Introduction AI ContentCraft is a versatile content creation tool that integrates text generation, speech synthesis, image generation and more. It helps creators quickly generate stories, podcast scripts, and accompanying audio and video content. The tool supports multiple language conversions and can batch...
1yrs ago
062.7K
Fun-ASR - 钉钉、通义联合推出的新一代语音识别模型

Fun-ASR - A New Generation of Speech Recognition Models Jointly Launched by Nail and Tongyi

Fun-ASR is a big model of speech recognition jointly launched by Nail and Tongyi Labs. The model has been trained with massive audio data and can accurately recognize multi-industry terminology, such as Internet, technology, home decoration, etc., significantly improving the recognition accuracy. The model combines with Nail enterprise information for inference optimization to reduce the illusion problem...
7mos ago
062.7K
Arcade:录制屏幕操作快速生成产品互动演示视频

Arcade: Record on-screen operations to quickly generate interactive product demo videos.

General Description Arcade is an easy-to-use online platform that helps users quickly create interactive demos. It is suitable for marketers, product managers and sales teams to demonstrate product features. By recording on-screen actions, Arcade automatically generates interactive demo content that users can use in just a few minutes...
12mos ago
062.7K
ColorFlow:漫画着色,黑白图像自动着色,提升图像色彩一致性和质量

ColorFlow: Comic book coloring, automatic coloring of black and white images to improve image color consistency and quality

Comprehensive Introduction ColorFlow is an image sequence auto-coloring tool developed by Tencent's ARC team to solve the problem of auto-coloring black and white image sequences. The tool utilizes a retrieval-enhanced coloring pipeline to accurately generate the colors of various elements through a pool of reference images, including the character's hair color and service...
1yrs ago
062.6K
堆友:AI设计工具箱与创意平台

Heap Friend: AI Design Toolkit and Creative Platform

Comprehensive Introduction PileYou is an online platform built by Alibaba's design team that integrates a variety of AI design tools, designed for designers and creative workers. The platform provides AI generation tools from text to images, including vertical industry design tools, PileYou Camera, Deer Class Marketing Chart, AI Art Characters, Model Change...
1yrs ago
062.5K
Genesis:开源生成式物理引擎,实现基于真实物理的4D动态世界模拟

Genesis: open source generative physics engine for real physics-based 4D dynamic world simulation

General Introduction Genesis is a generative physics world designed for general purpose robotics and embodied AI learning. It provides a unified simulation platform that supports the simulation of a wide range of materials and physical phenomena.Genesis aims to unlock generative AI and physics simulation by combining...
1yrs ago
062.4K
Edraw.AI(亿图):在线协作白板工具,AI生成流程图和多种图表

Edraw.AI: Online collaborative whiteboard tool, AI-generated flowcharts and multiple diagrams

Comprehensive Introduction Edraw.AI is a revolutionary AI-powered online visualization whiteboard collaboration platform that integrates more than 40 intelligent tools and a library of carefully designed templates. The platform uses advanced AI technology to quickly transform users' textual thoughts into professional visual diagrams. The platform supports...
1yrs ago
062.4K
魔音工坊:专业配音与短视频解说创作平台|真人配音|克隆声音|一键成片

Magic Voice Workshop: professional voice-over and short video narration creation platform | real person voice-over | clone voice | one-click into a film

Comprehensive Introduction Magic Voice Workshop is a one-stop short video and AI dubbing platform with information on software dubbing, real-life dubbing, sound libraries, cloning services and more. The platform integrates audio editing, AI copy generation, video editing and collaboration tools for audio-related services and content creation. Users experience the audio editor...
1yrs ago
062.4K
通义千问:阿里推出的多模态大模型,拥有文本回答、图片理解、视频解析能力

Tongyi Thousand Questions: a large multimodal model launched by Ali with text answering, image understanding, and video parsing capabilities

Comprehensive Introduction Tongyi Thousand Questions is an intelligent big model developed by Aliyun, aiming to provide a human-like interaction experience through deep learning and natural language processing technology. It can quickly generate creative copy to add fun to life, and serve as a learning assistant to help users easily learn all kinds of knowledge. With cutting-edge technology and evolving...
1yrs ago
062.2K
逗哥配音:专注短视频解说、创作的智能配音神器

Teaser Dubbing: Intelligent dubbing tool that focuses on short video narration and creation

Comprehensive Introduction Tease Dubbing is a popular AI dubbing software with over 5 million users. The software utilizes advanced AI intelligent dubbing technology to provide professional and realistic dubbing effects, which is applicable to a variety of scenarios such as short videos, advertisement production, education and training. Teaser Dubbing is committed to providing users with fast...
1yrs ago
062.1K
Perplexica:1比1复刻 Perplexity AI 功能和界面的开源AI搜索引擎

Perplexica: an open source AI search engine that replicates Perplexity AI's features and interface 1 to 1

Comprehensive Introduction Perplexica is an open source AI-driven search engine designed to provide answers that delve deep into the Internet. It uses advanced machine learning algorithms, such as similarity search and embedding techniques, to optimize search results and provide clear answers with cited sources.Perple...
1yrs ago
062.1K
Slidesgo:免费PPT模板下载,辅助AI生成演示文稿,提供教育版工具

Slidesgo: free PPT templates to download, assist AI to generate presentations, provide educational version of the tool

General Introduction Slidesgo is a platform that provides a large number of free and customizable Google Slides and PowerPoint presentation templates. Users can pick templates in different styles or colors based on needs, such as business, education or medical topics. The site offers icons, letter...
2yrs ago
062.1K
Flair:AI生成专业摄影效果的商品展示图,产品商拍专用工具

Flair: AI generates professional photographic effect of the product display map, product commercial photography special tools

Comprehensive Introduction Flair is an AI-based online design tool focused on generating high-quality photographic images for e-commerce products. Users can quickly create realistic product scene images through drag-and-drop operations, which greatly improves design efficiency. The platform provides a wealth of templates and 3D elements to support real...
1yrs ago
061.7K
UltraRAG:一站式RAG系统解决方案,简化数据构建与模型微调

UltraRAG: A One-Stop RAG System Solution to Simplify Data Construction and Model Fine-Tuning

Comprehensive Introduction UltraRAG is a RAG (Retrieval Augmented Generation) system solution jointly proposed by the THUNLP group at Tsinghua University, the NEUIR group at Northeastern University, Modelbest.Inc and the 9#AISoft team. The framework is based on agile deployment and modularized building...
1yrs ago
061.7K
MedRAX: 利用多模态大模型进行胸部X光片分析的智能体

MedRAX: A Smart Body for Chest X-ray Analysis Using Multimodal Large Models

Comprehensive Introduction MedRAX is a state-of-the-art AI intelligence designed for chest radiograph (CXR) analysis. It integrates state-of-the-art CXR analysis tools and multimodal large language models to dynamically process complex medical queries without additional training.MedRAX, through its modular design...
1yrs ago
061.7K
NodeRAG:基于异构图的精准信息检索与生成工具

NodeRAG: A Heterogeneous Graph-Based Tool for Accurate Information Retrieval and Generation

A Comprehensive Introduction NodeRAG is an open source Retrieval Augmented Generation (RAG) system hosted on GitHub and developed by Terry-Xu-666. It optimizes information retrieval and generation through heterogeneous graph structures, significantly improving retrieval accuracy and contextual relevance.Nod...
11mos ago
061.6K