Latest AI Resources

Total 2759 articles posts
Flair:AI生成专业摄影效果的商品展示图,产品商拍专用工具

Flair: AI generates professional photographic effect of the product display map, product commercial photography special tools

Comprehensive Introduction Flair is an AI-based online design tool focused on generating high-quality photographic images for e-commerce products. Users can quickly create realistic product scene images through drag-and-drop operations, which greatly improves design efficiency. The platform provides a wealth of templates and 3D elements to support real...
12mos ago
025.3K
MedRAX: 利用多模态大模型进行胸部X光片分析的智能体

MedRAX: A Smart Body for Chest X-ray Analysis Using Multimodal Large Models

Comprehensive Introduction MedRAX is a state-of-the-art AI intelligence designed for chest radiograph (CXR) analysis. It integrates state-of-the-art CXR analysis tools and multimodal large language models to dynamically process complex medical queries without additional training.MedRAX, through its modular design...
7mos ago
025.3K
CogAgent:智谱开源的智能视觉语言模型,实现图形界面自动化操作

CogAgent: Smart Spectrum's open source intelligent visual language model for automating graphical interfaces

Comprehensive Introduction CogAgent is an open source visual language model developed by Tsinghua University Data Mining Research Group (THUDM), aiming to automate the operation of cross-platform graphical user interface (GUI). The model is based on CogVLM (GLM-4V-9B) and supports bilingual Chinese and English...
10mos ago
025.3K
5ire:支持本地向量知识库的跨平台大模型桌面客户端

5ire: cross-platform large model desktop client with support for local vector knowledge bases

General Introduction 5ire is an open source cross-platform big model desktop client designed to provide users with convenient local vector knowledge base management and big model interaction capabilities. The software supports parsing and vectorized storage of multiple document formats with powerful retrieval-enhanced generation (RAG) capabilities. In addition, 5i...
12mos ago
025.3K
BuildIn.AI:适合 Notion 用户的知识管理工具

BuildIn.AI: A Knowledge Management Tool for Notion Users

General Introduction BuildIn.AI is a cloud-based platform focused on real-time collaboration and knowledge management, designed to help users efficiently create, manage and share information. It is suitable for individuals, teams or professionals, providing a digital workplace that integrates document storage, real-time editing and information organization...
8mos ago
025.3K
WebShaper - 阿里通义开源的AI训练数据合成系统

WebShaper - Ali Tongyi's open source AI training data synthesis system

WebShaper is an AI training data synthesis system launched by Alibaba's Tongyi Lab, which is based on formal modeling and intelligence expansion mechanism to generate high-quality and scalable training data to help AI intelligences improve complex information retrieval capabilities. The system introduces the concept of "knowledge projection"...
3mos ago
025.2K
Glama:集成1000+MCP服务的多功能AI聊天工具

Glama: a versatile AI chat tool integrating 1000+ MCP services

General Introduction Glama is a powerful and easy-to-use AI chat tool. It not only supports conversations with a wide range of AI models, but also uploads files, searches the web for information, and even generates professional charts. The website is geared towards users who need to process information and tasks efficiently, such as corporate teams, developers or individual users...
7mos ago
025.2K
Flow(Laminar):构建智能体的轻量级任务引擎,简化并灵活管理任务

Flow (Laminar): a lightweight task engine for building intelligences that simplifies and flexibly manages tasks

Comprehensive Introduction Flow is a lightweight task engine designed for building AI agents, emphasizing simplicity and flexibility. Unlike traditional node- and edge-based workflows, Flow uses a dynamic task queuing system that supports parallel execution, dynamic scheduling, and intelligent dependency management. Its core concept is ...
10mos ago
025.2K
混元文生视频:生成写实镜头感的高质量视频,腾讯开源视频生成大模型

Hybrid Vincennes video: generating realistic footage sense of high-quality video, Tencent open source video generation large model

Comprehensive Introduction Tencent Mixed Yuan Text Generation Video (available in Yuanbao APP) is a video generation platform based on AI technology launched by Tencent. The platform utilizes the Tencent Mixed Yuan Big Model with powerful cross-domain knowledge and natural language understanding to generate high-quality videos based on users' text descriptions...
9mos ago
025.2K
Fay数字人框架:集成语言模型与3D数字角色,支持多种应用场景

Fay Digital Human Framework: Integrated language modeling and 3D digital characters to support multiple application scenarios

Comprehensive Introduction Fay is an open source 3D virtual digital human framework that integrates language models and digital characters for a variety of application scenarios, such as virtual shopping guides, virtual anchors, assistants, waiters, teachers, and voice- or text-based mobile assistants.The Fay framework supports full offline use, providing m...
9mos ago
025.2K
AnimeGamer:用语言指令生成动漫视频和角色互动的开源工具

AnimeGamer: An Open Source Tool for Generating Anime Videos and Character Interactions with Language Commands

AnimeGamer is an open source tool launched by Tencent ARC Lab. Users can generate anime videos with simple language commands, such as "Sousuke drive around in a purple car", as well as allow different anime characters to interact with each other, such as Kiki from The Witch's House, and Sky City...
6mos ago
025.2K
Deep Recall:为大模型提供企业级记忆框架的开源工具

Deep Recall: an open source tool that provides an enterprise-class memory framework for large models

Comprehensive Introduction Deep Recall is an open source, enterprise-class memory framework designed for large-scale language models (LLMs). It provides hyper-personalized responsiveness through efficient contextual retrieval and integration. The framework uses a three-tier architecture, including a memory service, a reasoning service, and a coordinator, supporting...
5mos ago
025.2K
自得语音:智能语音合成平台|语音克隆

Zide Speech: Intelligent Speech Synthesis Platform|Speech Cloning

Comprehensive Introduction Zide Voice is a voice synthesis platform that uses advanced AI technology. Users can simply upload a piece of voice, which can be supplemented with text to generate realistic and emotional voice clips. The platform is equipped with features such as quick character customization, cloud-based voice generation, and anthropomorphic voice synthesis. There is no need to download any software through...
1yrs ago
025.1K
VibeVoice - 微软推出的文本到语音模型

VibeVoice - Text-to-Speech Model from Microsoft

VibeVoice is a new text-to-speech (TTS) model from Microsoft. The model generates conversational audio from up to four different speakers and supports up to 90 minutes of continuous voice output, breaking the length limitations of traditional TTS systems.
2mos ago
025.1K
通义千问:阿里推出的多模态大模型,拥有文本回答、图片理解、视频解析能力

Tongyi Thousand Questions: a large multimodal model launched by Ali with text answering, image understanding, and video parsing capabilities

Comprehensive Introduction Tongyi Thousand Questions is an intelligent big model developed by Aliyun, aiming to provide a human-like interaction experience through deep learning and natural language processing technology. It can quickly generate creative copy to add fun to life, and serve as a learning assistant to help users easily learn all kinds of knowledge. With cutting-edge technology and evolving...
8mos ago
025.1K
Humiris:根据请求自动调用最佳LLM,构建高性能AI应用的基础设施

Humiris: Building an infrastructure for high-performance AI applications by automatically invoking the best LLMs on request

Comprehensive Introduction Humiris AI is a platform focused on delivering next-generation AI infrastructure designed to improve the accuracy and performance of AI applications by blending multiple large-scale language models (LLMs). The platform supports users in utilizing AI technologies at scale without sacrificing quality or control...
9mos ago
025.1K
Morphic:AI驱动的开源搜索引擎,提供智能问答、视频搜索、生成UI代码

Morphic: AI-powered open-source search engine that offers smart Q&A, video search, and generates UI code

General Introduction Morphic is a search engine based on AI technology with a generative user interface designed to provide intelligent Q&A and an efficient search experience. Users can perform a variety of searches with Morphic, including text, video, etc., and can save search history and share search results.Mo...
11mos ago
025.1K
Dream API:oneapi/newapi中转API,针对个人用户提供免费公益API

Dream API: oneapi/newapi transit API, providing free public service API for individual users.

Introduction After recommending many free large model API services in the Chief AI Sharing Circle, I suddenly found an important issue: there are official free small size scales; there are reverse API models; but there has been no free "official conversion" API. The reason why it has not been recommended is that the free "official conversion" API is not available. The reason why I haven't recommended it is that free "official conversion" is not available for large models.
12mos ago
025.1K
AI ContentCraft:生成短故事、对话脚本、配音、配图的多功能AI内容创作工具

AI ContentCraft: a versatile AI content creation tool for generating short stories, dialog scripts, voiceovers, and graphics

General Introduction AI ContentCraft is a versatile content creation tool that integrates text generation, speech synthesis, image generation and more. It helps creators quickly generate stories, podcast scripts, and accompanying audio and video content. The tool supports multiple language conversions and can batch...
9mos ago
025K
Mlion:一站式加密市场洞察AI工具箱,使用AI技术参考历史数据进行预测

Mlion: a one-stop crypto market insight AI toolkit that uses AI technology to make predictions with reference to historical data

General Introduction Mlion is an AI tool platform focused on cryptocurrency market analysis. It offers a variety of features, including price prediction, address analysis, and trading queries, to help users better understand and operate the cryptocurrency market. The platform integrates a variety of AI technologies and aims to provide users with efficient...
9mos ago
025K
AnyText:生成和编辑多语言图像文本,高可控在图像中生成多行中文

AnyText: Generate and edit multi-language image text, highly controllable to generate multiple lines of Chinese in the image

Comprehensive Introduction AnyText is a revolutionary multilingual visual text generation and editing tool developed based on the diffusion model. It generates natural, high-quality multilingual text in images and supports flexible text editing features. It was developed by a team of researchers and presented at ICLR 2024...
10mos ago
025K
opensource_notebooklm:基于Deepseek-V3和PlayHT TTS的NotebookLM开源实现

opensource_notebooklm: open source implementation of NotebookLM based on Deepseek-V3 and PlayHT TTS

General Introduction Open Source NotebookLM is an innovative artificial intelligence project that combines Deepseek-V3's language understanding capabilities with PlayHT's speech synthesis technology, aiming to create an intelligent note-taking conversation system. The project was developed by Build Fast w...
9mos ago
025K
EchoMimic:音频驱动人像照片生成说话视频(EchoMimicV2加速版安装包)

EchoMimic: Audio-driven portrait photos to generate talking videos (EchoMimicV2 accelerated installer)

General Introduction EchoMimic is an open source project designed to generate realistic portrait animations through audio-driven generation. Developed by Ant Group's Terminal Technologies division, the project utilizes editable marker point conditions to generate dynamic portrait videos using a combination of audio and facial marker points.EchoMimic...
9mos ago
025K
STORM:基于Topic搜索网络数据,生成带引用的论文、长文报告

STORM: Search web data based on Topic to generate papers with citations, long paper reports

General Introduction STORM is a knowledge integration and article generation system developed by the Oval team at Stanford University. It focuses on generating exhaustive Wikipedia-like articles (systematic papers) from scratch. The system utilizes large-scale language models for topic research, preparing synopses and simulating actual interconnected...
7mos ago
024.9K
Agent Zero - 免费AI智能体框架,具备持久记忆功能

Agent Zero - Free AI Intelligent Body Framework with Persistent Memory

Agent Zero is an open-source artificial intelligence framework to create general-purpose, highly customizable intelligent assistants. Through dynamic learning and evolution, it is able to handle a wide range of tasks, with persistent memory capabilities that remember previous experiences and solutions to accomplish subsequent tasks more efficiently.
4mos ago
024.9K
Harbor:一键部署本地LLM开发环境,轻松管理和运行AI服务的容器化工具集

Harbor: a containerized toolset for easily managing and running AI services with one-click deployment of local LLM development environments

Comprehensive Introduction Harbor is a revolutionary containerized LLM toolset focused on simplifying the deployment and management of local AI development environments. It enables developers with a clean command line interface (CLI) and companion application to launch and manage with a single click, including LLM backends, API interfaces, front...
10mos ago
024.9K