Latest AI Resources

Total 2759 articles posts
Memora:构建人性化AI记忆模块,保存并更新与人类的互动信息

Memora: building humanized AI memory modules to save and update information about interactions with humans

General Introduction Memora is an agent designed to replicate human memories for each personalized AI. It helps AIs remember details of past interactions, emotions, and shared experiences just like humans do through features like timestamped memories, emotion markers, and multimodal memories.Memora supports multi-tenancy and is capable of handling...
9mos ago
026K
OpenEvidence - AI医学知识助手,解答临床问题、分析症状、推荐治疗方案

OpenEvidence - AI medical knowledge assistant that answers clinical questions, analyzes symptoms, and recommends treatments

OpenEvidence is a medical knowledge assistant platform based on AI technology to provide accurate clinical support for doctors and healthcare workers. The platform is based on small specialized models and multi-model integration architecture to quickly answer clinical questions, analyze symptoms, recommend treatment plans, and provide the latest medical knowledge more...
4mos ago
026K
悠船:Midjourney官方中文版文生图工具,免费生成25张图像

Yo Boat: Midjourney official Chinese version of the text generation tool, free to generate 25 images

General Introduction Midjourney China Lab (YoBoat), a brand of Boat Creative (Shanghai) Network Technology Co., Ltd, is an innovative lab that specializes in generative visual arts. It is committed to promoting the cutting-edge development of visual creation through deep learning and artificial intelligence technology. Its core product Yo Boat picks...
10mos ago
026K
Mistral OCR:94.89%总体精度,1000 页/30秒,只需1美元

Mistral OCR: 94.89% Overall Accuracy, 1000 Pages/30 Seconds, Only $1

In the long history of human civilization, every leap in the way information is acquired and parsed has profoundly driven social progress. From the ancient hieroglyphics, to the portable papyrus, to the later emergence of the printing press and today's wave of digitization, each technological innovation has greatly expanded the paradigm of human knowledge dissemination...
7mos ago
025.9K
UltraRAG:一站式RAG系统解决方案,简化数据构建与模型微调

UltraRAG: A One-Stop RAG System Solution to Simplify Data Construction and Model Fine-Tuning

Comprehensive Introduction UltraRAG is a RAG (Retrieval Augmented Generation) system solution jointly proposed by the THUNLP group at Tsinghua University, the NEUIR group at Northeastern University, Modelbest.Inc and the 9#AISoft team. The framework is based on agile deployment and modularized building...
9mos ago
025.9K
VideoLingo:视频转录单词级时间轴字幕,视频字幕翻译和本地化配音开源工具

VideoLingo: video transcription word-level timeline subtitles, video subtitle translation and localized dubbing open source tools

General Description VideoLingo is a one-stop video translation and localization dubbing tool designed to generate Netflix-grade, high-quality subtitles, eliminating raw machine translation and multi-line subtitles, and adding high-quality voiceovers that enable global knowledge to be shared across language barriers. By...
12mos ago
025.9K
BagelBell:AI文字冒险游戏

BagelBell: AI Text Adventure Game

Comprehensive Introduction BagelBell is an AI character creation and interaction platform owned by ByteDance, known overseas as BagelBell in English.It provides users with a vibrant and creative virtual world in which they can explore stories, create characters, and interact with AI...
1yrs ago
025.9K
shadcn/ui:组件库构建平台

shadcn/ui: component library building platform

General Introduction shadcn/ui is an open source component library building platform that provides beautiful and customizable UI components that users can copy and paste into their applications. The platform supports a variety of front-end frameworks and provides detailed installation and usage guidelines to help developers quickly get started...
1yrs ago
025.9K
WeClone:用微信聊天记录和语音训练数字分身

WeClone: training digital doppelgangers with WeChat chats and voices

Comprehensive introduction WeClone is an open source project that uses WeChat chat logs and voice messages, combined with large language models and speech synthesis technology, to allow users to create personalized digital doppelgangers. The project can analyze the user's chat habits to train the model , but also a small number of voice samples to generate realistic sound...
6mos ago
025.9K
Ultravox:实时端到端语音对话的音频多模态大模型,GPT-4o语音交互的开源实现

Ultravox: an audio multimodal macromodel for real-time end-to-end voice dialog, an open source implementation of GPT-4o voice interaction

Comprehensive Introduction Ultravox is an innovative multimodal Large Language Model (LLM) designed for real-time speech processing. Unlike traditional speech recognition systems, Ultravox eliminates the need for a separate Audio Speech Recognition (ASR) stage, and is able to directly convert audio into high-dimensional space in...
10mos ago
025.9K
LunaAI换脸:开源的秒鸭相机,部署前后端完整的企业级AI换脸小程序(算力服务付费,可二开)

LunaAI face swap: open source second duck camera, deploy front and back-end complete enterprise AI face swap applet (arithmetic service payment, can be two open)

Comprehensive Introduction LunaAI face swap applet is a face swap application developed based on uniapp and Vue framework. The application utilizes technologies such as PHP, MySQL, Nginx and Redis to achieve the function of the user's face changing operation through the applet. Users can use this small...
10mos ago
025.9K
匠邦AI:教师教学辅助AI助手,为老师提供备案教案/PPT课件/课题论文/出题组卷

Artisan AI: Teacher teaching aid AI assistant, providing teachers with filed lesson plans / PPT courseware / subject papers / questions and papers.

Comprehensive Introduction Artisan AI is an intelligent assistant focusing on the field of education, aiming to improve teachers' work efficiency and teaching quality through artificial intelligence technology. The site provides a variety of functions, including lesson plan design, subject report guidance, thesis checking and weight reduction, PPT courseware generation, etc., to help teachers in teaching, research...
9mos ago
025.9K
XRAG:优化检索增强生成系统的可视化评估工具

XRAG: A Visual Evaluation Tool for Optimizing Retrieval Enhancement Generation Systems

Comprehensive Introduction XRAG (eXamining the Core) is a benchmarking framework designed for evaluating the underlying components of advanced retrieval augmentation generation (RAG) systems. By profiling and analyzing each core module, XRAG provides information on how different configurations and components affect RAG...
9mos ago
025.9K
Image AI:集成多类AI图片编辑工具,免费视频换脸,简单上手

Image AI: Integration of multiple types of AI photo editing tools, free video face changing, easy to start!

Comprehensive Introduction Image AI is a remarkable all-in-one AI image platform that offers a wide range of advanced image tools to help users easily achieve high-quality visual effects. Whether it's face swap, image recognition, text to generate images, or image de-contextualization, Image AI can meet...
1yrs ago
025.8K
Easegen:开源数字人课程制作平台,PPT一键生成克隆数字人讲解视频

Easegen: open source digital human course production platform, PPT one-click generation cloning digital human lecture video

Comprehensive Introduction Easegen is an open source digital human course creation platform that aims to improve the efficiency of teaching content production and management through AI technology. The platform provides a one-stop solution from course production, video management to intelligent questioning, which allows users to create digital human-explained video courses...
1yrs ago
025.8K
Class Companion: K12教师设计的课后作业管理系统,为学生提供AI辅导和作业批改

Class Companion: an after-school homework management system designed by K12 teachers to provide AI tutoring and homework correction for students

General Description Class Companion is an online education platform designed for teachers and students that uses artificial intelligence technology to provide instant feedback and personalized tutoring. The platform supports a wide range of subjects and grade levels, helping teachers save time, improve teaching efficiency, and provide students with more practic...
10mos ago
025.8K
PlayAI:提供流畅、富有情感的语音对话和语音合成服务(英文)

PlayAI: providing smooth and emotional voice dialog and speech synthesis services (English)

Comprehensive Introduction PlayAI is an artificial intelligence platform focused on speech generation and speech cloning. It offers a wide range of speech models capable of generating smooth and emotional conversations. Users can use the platform to create personalized voice agents to enhance the interactive experience.PlayAI's technology is applicable...
11mos ago
025.7K
阿里妈妈创意中心:淘宝生态下的智能化营销创意支持平台

AliMama Creative Center: Intelligent Marketing Creative Support Platform under Taobao Ecology

Comprehensive Introduction Alimama Creative Center is Alibaba's intelligent marketing creative support platform, designed to provide merchants on Taobao, Tmall, and other e-commerce platforms with a full range of creative support from graphics to videos to landing pages. By combining AI intelligent copywriting capabilities and massive templates, Creative Center dramatically improves the design efficiency...
1yrs ago
025.7K
NarratoAI:文本生成影视解说与自动化剪辑神器

NarratoAI: Text-Generated Movie and TV Narration and Automated Editing Tool

Comprehensive Introduction NarratoAI is a fully automated tool that integrates movie and TV narration, automated editing, dubbing and subtitle generation. It relies on large-scale language modeling (LLM) technology to automatically generate copy and automatically edit videos with corresponding voiceovers and subtitles, providing users with a one-stop...
1yrs ago
025.7K
TRV:将幻灯片/PPT和讲解备注快速生成演讲视频

TRV: Rapidly Generate Presentation Videos from Slides/PPTs and Explanatory Notes

General Introduction TRV is an open source tool, hosted on GitHub, designed to help users quickly convert slides and presentation notes into videos with narration. It automatically generates audio and video content from incoming presentation files through simple command line operations, suitable for those who need to quickly create presentations...
8mos ago
025.7K
WriteWise:喜马拉雅推出的专业AI小说写作工具

WriteWise: a professional AI novel writing tool from Himalaya

Comprehensive Introduction WriteWise is an online service platform focused on novel creation launched by Himalaya. It provides professional AI writing assistance, covering such things as persona setting, dialogue design and martial arts fighting. In addition, it also provides a computer version for download, supports rich editor format configuration as well as stable...
1yrs ago
025.7K
Hallo2:音频驱动生成口型/表情同步的肖像视频(Windows一键安装)

Hallo2: audio-driven generation of lip-synchronized/expression-synchronized portrait videos (Windows one-click installation)

General Introduction Hallo2 is an open-source project jointly developed by Fudan University and Baidu, aiming to generate high-resolution portrait animations through audio-driven generation. The project utilizes advanced Generative Adversarial Networks (GAN) and time alignment techniques to achieve 4K resolution and up to 1 hour long video generation...
9mos ago
025.7K
XAudioPro:专业在线音频剪辑工具|有声书制作|文字转语音|伴奏分离

XAudioPro: Professional Online Audio Editing Tool|Audiobook Maker|Text to Speech|Accompaniment Separation

General Introduction XAudioPro is an advanced online audio real-time editing and transcoding tool that is both professional and portable. It supports professional audio editing functions such as cutting, cropping, copying, deleting, restoring, and amplitude gain control. It also provides denoising services such as spectral subtraction noise reduction, low-pass...
1yrs ago
025.7K
阿布量化交易系统:基于Python的开源量化交易平台

Abu quantitative trading system: Python based open source quantitative trading platform

Comprehensive introduction Abu quantitative trading system is an open source platform based on Python development. It was created by user "bbfamily" to help investors realize quantitative trading strategies through code. The system supports backtesting and trading of various financial products such as stocks, options, futures and bitcoin. It...
7mos ago
025.7K
析言GBI(XiYan-SQL):Text-to-SQL智能数据分析,轻松实现ChatBI

Analytics GBI (XiYan-SQL): Text-to-SQL Intelligent Data Analytics for ChatBI with Ease

Comprehensive Introduction Analyzing Words GBI is an intelligent data analysis product based on big models launched by AliCloud Hundred Refine. The product utilizes advanced natural language processing technology to help users query and analyze data through natural language without having to master complex SQL syntax. Analytics GBI supports multiple data sources, including...
10mos ago
025.7K
Slidesgo:免费PPT模板下载,辅助AI生成演示文稿,提供教育版工具

Slidesgo: free PPT templates to download, assist AI to generate presentations, provide educational version of the tool

General Introduction Slidesgo is a platform that provides a large number of free and customizable Google Slides and PowerPoint presentation templates. Users can pick templates in different styles or colors based on needs, such as business, education or medical topics. The site offers icons, letter...
1yrs ago
025.7K
通义听悟:阿里通义音视频内容转录AI助手

Tongyi Listening and Understanding: Ali Tongyi Audio and Video Content Transcription AI Assistant

Comprehensive Introduction Tongyi Listening and Understanding is a work-study AI assistant launched by Aliyun, focusing on transcribing and analyzing audio and video content. It relies on AliCloud's powerful AI models to transcribe audio and video content into text in real time, and provides translation, summarization, positioning and other functions. Tongyi Listening Woo supports multiple languages and scenarios...
1yrs ago
025.6K
ModelBest(面壁智能):全球领先的轻量高性能端侧大模型

ModelBest: The World's Leading Lightweight, High-Performance End-Side Big Model

General Introduction ModelBest is a company specializing in developing lightweight and high-performance large models, dedicated to applying advanced AI technologies to mainstream consumer electronics and various end devices in daily life. Its MiniCPM series of end-side models are characterized by extreme arithmetic power and memory usage efficiency...
12mos ago
025.6K
NodeRAG:基于异构图的精准信息检索与生成工具

NodeRAG: A Heterogeneous Graph-Based Tool for Accurate Information Retrieval and Generation

A Comprehensive Introduction NodeRAG is an open source Retrieval Augmented Generation (RAG) system hosted on GitHub and developed by Terry-Xu-666. It optimizes information retrieval and generation through heterogeneous graph structures, significantly improving retrieval accuracy and contextual relevance.Nod...
6mos ago
025.6K
Signs:通过AI技术助力学习和贡献美国手语的互动平台

Signs: an interactive platform for learning and contributing to American Sign Language fueled by AI technology

General Introduction Signs is an innovative online platform designed to help users learn American Sign Language (ASL) and contribute to the Deaf community through artificial intelligence technology. The site is powered by NVIDIA, the American Society for Deaf Children (ASDC), and creative agency Hello Mond...
8mos ago
025.6K
R2R:多模态内容解析并结合知识图谱与混合搜索的先进AI检索(RAG)系统

R2R: An Advanced AI Retrieval (RAG) System for Multimodal Content Parsing and Combining Knowledge Graph with Hybrid Search

Comprehensive Introduction R2R (RAG to Riches) is an advanced AI retrieval system supporting Retrieval Augmented Generation (RAG) functionality with production-ready features. Built on a containerized RESTful API, the system provides multimodal content parsing, hybrid search functionality...
10mos ago
025.6K