Latest AI Resources

Total 3103 articles posts
Perplexica:1比1复刻 Perplexity AI 功能和界面的开源AI搜索引擎

Perplexica: an open source AI search engine that replicates Perplexity AI's features and interface 1 to 1

Comprehensive Introduction Perplexica is an open source AI-driven search engine designed to provide answers that delve deep into the Internet. It uses advanced machine learning algorithms, such as similarity search and embedding techniques, to optimize search results and provide clear answers with cited sources.Perple...
2yrs ago
076K
NodeRAG:基于异构图的精准信息检索与生成工具

NodeRAG: A Heterogeneous Graph-Based Tool for Accurate Information Retrieval and Generation

A Comprehensive Introduction NodeRAG is an open source Retrieval Augmented Generation (RAG) system hosted on GitHub and developed by Terry-Xu-666. It optimizes information retrieval and generation through heterogeneous graph structures, significantly improving retrieval accuracy and contextual relevance.Nod...
1yrs ago
075.9K
秘塔AI搜索:提供无广告的高效学术搜索服务,研究模式深度挖掘知识

Secreta AI Search: Providing ad-free and efficient academic search services, research model for deep knowledge mining

General Introduction Secreta AI Search is a technology company dedicated to improving productivity through artificial intelligence technology. The site provides ad-free and efficient academic search services, aiming to provide users with accurate and fast search results. Secret Tower AI Search has a self-developed large language model, MetaLLM, which can...
1yrs ago
075.9K
Midjourney:创造你想象中的图像|Midjourney中文官网介绍|官网开放免费测试

Midjourney: Create the images of your imagination|Midjourney Chinese website introduction|Official website open for free testing

Midjourney Introduction Midjourney is an independent research lab exploring new mediums of thought and expanding the imagination of the human species. It provides an AI service that generates images based on textual descriptions, allowing users to create a variety of art forms, from realistic to abstract wind...
2yrs ago
075.9K
CogAgent:智谱开源的智能视觉语言模型,实现图形界面自动化操作

CogAgent: Smart Spectrum's open source intelligent visual language model for automating graphical interfaces

Comprehensive Introduction CogAgent is an open source visual language model developed by Tsinghua University Data Mining Research Group (THUDM), aiming to automate the operation of cross-platform graphical user interface (GUI). The model is based on CogVLM (GLM-4V-9B) and supports bilingual Chinese and English...
1yrs ago
075.8K
UltraRAG:一站式RAG系统解决方案,简化数据构建与模型微调

UltraRAG: A One-Stop RAG System Solution to Simplify Data Construction and Model Fine-Tuning

Comprehensive Introduction UltraRAG is a RAG (Retrieval Augmented Generation) system solution jointly proposed by the THUNLP group at Tsinghua University, the NEUIR group at Northeastern University, Modelbest.Inc and the 9#AISoft team. The framework is based on agile deployment and modularized building...
1yrs ago
075.8K
TxAgent:帮医生分析药物作用和治疗方案的AI工具

TxAgent: the AI tool that helps doctors analyze drug effects and treatment options

Comprehensive Introduction TxAgent is an open-source AI tool developed by Harvard University's Medical and Scientific Artificial Intelligence Team (MIMS) to help physicians analyze drug interactions and develop personalized treatment plans. It combines patient-specific situations through multi-step reasoning and real-time retrieval of biomedical knowledge...
1yrs ago
075.7K
魔音工坊:专业配音与短视频解说创作平台|真人配音|克隆声音|一键成片

Magic Voice Workshop: professional voice-over and short video narration creation platform | real person voice-over | clone voice | one-click into a film

Comprehensive Introduction Magic Voice Workshop is a one-stop short video and AI dubbing platform with information on software dubbing, real-life dubbing, sound libraries, cloning services and more. The platform integrates audio editing, AI copy generation, video editing and collaboration tools for audio-related services and content creation. Users experience the audio editor...
2yrs ago
075.6K
录咖:一站式音视频处理平台|视频生成|AI字幕|提取音频|语音转文字

Record Cafe: One-stop Audio/Video Processing Platform|Video Generation|AI Subtitle|Audio Extraction|Speech to Text

Comprehensive Introduction Record Cafe is a one-stop audio/video processing platform that provides AI video dialog, AI subtitles and AI speech to text services. Functions include recording screen, editing video, converting GIF/audio, etc., and supports cloud storage and sharing. The interface is intuitive and easy to use, and it also supports multi-screen recording and multi-language smart...
2yrs ago
075.6K
AutoAgent:通过自然语言快速创建并部署AI智能体的框架

AutoAgent: a framework for rapid creation and deployment of AI intelligences through natural language

General Introduction AutoAgent is an open source AI intelligences framework developed by the Data Intelligence Laboratory of the University of Hong Kong (HKUDS) and hosted on GitHub.It allows users to rapidly create and deploy customized AI intelligences by describing their requirements in purely natural language, without any programming base...
1yrs ago
075.4K
堆友:AI设计工具箱与创意平台

Heap Friend: AI Design Toolkit and Creative Platform

Comprehensive Introduction PileYou is an online platform built by Alibaba's design team that integrates a variety of AI design tools, designed for designers and creative workers. The platform provides AI generation tools from text to images, including vertical industry design tools, PileYou Camera, Deer Class Marketing Chart, AI Art Characters, Model Change...
2yrs ago
075.3K
Sana:快速生成高分辨率图像,0.6B超小尺寸模型,低配笔记本GPU运行

Sana: fast generation of high-resolution images, 0.6B ultra-small size model, low-profile laptop GPU operation

General Introduction Sana is an efficient high-resolution image generation framework developed by NVIDIA Labs, capable of generating images up to 4096 × 4096 resolution in a matter of seconds.Sana utilizes a linear diffusion transformer and deep compression self-encoder technology to significantly...
2yrs ago
075.3K
通义听悟:阿里通义音视频内容转录AI助手

Tongyi Listening and Understanding: Ali Tongyi Audio and Video Content Transcription AI Assistant

Comprehensive Introduction Tongyi Listening and Understanding is a work-study AI assistant launched by Aliyun, focusing on transcribing and analyzing audio and video content. It relies on AliCloud's powerful AI models to transcribe audio and video content into text in real time, and provides translation, summarization, positioning and other functions. Tongyi Listening Woo supports multiple languages and scenarios...
2yrs ago
075.2K
VideoLingo:视频转录单词级时间轴字幕,视频字幕翻译和本地化配音开源工具

VideoLingo: video transcription word-level timeline subtitles, video subtitle translation and localized dubbing open source tools

General Description VideoLingo is a one-stop video translation and localization dubbing tool designed to generate Netflix-grade, high-quality subtitles, eliminating raw machine translation and multi-line subtitles, and adding high-quality voiceovers that enable global knowledge to be shared across language barriers. By...
2yrs ago
075.2K
ModelBest(面壁智能):全球领先的轻量高性能端侧大模型

ModelBest: The World's Leading Lightweight, High-Performance End-Side Big Model

General Introduction ModelBest is a company specializing in developing lightweight and high-performance large models, dedicated to applying advanced AI technologies to mainstream consumer electronics and various end devices in daily life. Its MiniCPM series of end-side models are characterized by extreme arithmetic power and memory usage efficiency...
2yrs ago
075.1K
VideoRAG:理解超长视频的RAG框架,支持多模态检索和知识图谱构建

VideoRAG: A RAG framework for understanding ultra-long videos with support for multimodal retrieval and knowledge graph construction

Comprehensive Introduction VideoRAG is a retrieval-enhanced generative framework designed for processing and understanding very long contextual videos. The tool combines a graph-driven textual knowledge base with hierarchical multimodal context encoding to efficiently process on a single NVIDIA RTX 3090 GPU...
1yrs ago
075.1K
Flair:AI生成专业摄影效果的商品展示图,产品商拍专用工具

Flair: AI generates professional photographic effect of the product display map, product commercial photography special tools

Comprehensive Introduction Flair is an AI-based online design tool focused on generating high-quality photographic images for e-commerce products. Users can quickly create realistic product scene images through drag-and-drop operations, which greatly improves design efficiency. The platform provides a wealth of templates and 3D elements to support real...
2yrs ago
074.8K
Flow(Laminar):构建智能体的轻量级任务引擎,简化并灵活管理任务

Flow (Laminar): a lightweight task engine for building intelligences that simplifies and flexibly manages tasks

Comprehensive Introduction Flow is a lightweight task engine designed for building AI agents, emphasizing simplicity and flexibility. Unlike traditional node- and edge-based workflows, Flow uses a dynamic task queuing system that supports parallel execution, dynamic scheduling, and intelligent dependency management. Its core concept is ...
2yrs ago
074.8K
NoCode – 美团推出的零代码AI开发平台

NoCode - Zero-Code AI Development Platform Launched by Meituan

What is NoCode NoCode is a zero-code AI development platform launched by Mission. Users don't need any programming experience, they just need to describe the requirements through natural language to quickly generate website pages, utilities, small games, event pages and other applications.NoCode supports one second generation of 200...
1yrs ago
074.8K
LazyLLM:商汤开源构建多智能体应用的低代码开发工具

LazyLLM: Shangtang's open source low-code development tool for building multi-intelligence body applications

Comprehensive Introduction LazyLLM is an open source tool developed by the LazyAGI team, focusing on simplifying the development process of multi-intelligence large model applications. It helps developers quickly build complex AI applications through one-click deployment and lightweight gateway mechanisms, saving tedious engineering configuration...
1yrs ago
074.7K
Edraw.AI(亿图):在线协作白板工具,AI生成流程图和多种图表

Edraw.AI: Online collaborative whiteboard tool, AI-generated flowcharts and multiple diagrams

Comprehensive Introduction Edraw.AI is a revolutionary AI-powered online visualization whiteboard collaboration platform that integrates more than 40 intelligent tools and a library of carefully designed templates. The platform uses advanced AI technology to quickly transform users' textual thoughts into professional visual diagrams. The platform supports...
1yrs ago
074.7K
Hibiki:实时语音翻译模型,保留原声特点的流式翻译

Hibiki: a real-time speech translation model, streaming translation that preserves the characteristics of the original voice

General Introduction Hibiki is a high-fidelity real-time speech translation model developed by Kyutai Labs. Unlike traditional offline translation, Hibiki is able to generate natural speech translation in the target language and provide text translation in real time while the user is speaking. The model...
1yrs ago
074.7K
ColorFlow:漫画着色,黑白图像自动着色,提升图像色彩一致性和质量

ColorFlow: Comic book coloring, automatic coloring of black and white images to improve image color consistency and quality

Comprehensive Introduction ColorFlow is an image sequence auto-coloring tool developed by Tencent's ARC team to solve the problem of auto-coloring black and white image sequences. The tool utilizes a retrieval-enhanced coloring pipeline to accurately generate the colors of various elements through a pool of reference images, including the character's hair color and service...
2yrs ago
074.6K
逗哥配音:专注短视频解说、创作的智能配音神器

Teaser Dubbing: Intelligent dubbing tool that focuses on short video narration and creation

Comprehensive Introduction Tease Dubbing is a popular AI dubbing software with over 5 million users. The software utilizes advanced AI intelligent dubbing technology to provide professional and realistic dubbing effects, which is applicable to a variety of scenarios such as short videos, advertisement production, education and training. Teaser Dubbing is committed to providing users with fast...
2yrs ago
074.4K
AI2SRT:利用 Gemini模型,一键为长视频创建解说短视频或视频总结

AI2SRT: Create short narrated videos or video summaries for long videos with one click using Gemini models

Comprehensive Introduction AI2SRT is an open source project that utilizes the GeminiAI Big Model to generate short narrated videos and video summaries for long videos with one click, while supporting audio and video transcription subtitles. The project aims to simplify the video content creation process and provide efficient subtitle generation and translation functions. Users can pass...
1yrs ago
074.1K