AI Personal Learning
and practical guidance
Beanbag Marscode1
Total 910 articles

Tags: ai open source projects Page 46

PhiData:构建拥有记忆、知识和工具的AI智能体-首席AI分享圈

PhiData: Building AI Intelligence with Memory, Knowledge and Tools

Comprehensive Introduction PhiData is a framework designed for developing intelligent AI assistants. It enables AI assistants to conduct long-term conversations, provide accurate business context, and perform various operations by enhancing memory, knowledge integration, and tool invocation capabilities.PhiData not only enhances the intelligence of AI assistants, but also expands...

ChatTTS:模仿真人说话声音的语音生成模型(ChatTTS一键加速包)-首席AI分享圈

ChatTTS: a speech generation model that mimics the voice of a real person speaking (ChatTTS one-click acceleration package)

General Introduction ChatTTS is a generative speech model designed for conversational scenarios. It generates natural and expressive speech, supports multiple languages and multiple speakers, and is suitable for interactive conversations. The model goes beyond large by predicting and controlling fine-grained prosodic features such as laughter, pauses, and interjections...

MoneyPrinterPlus:一键生成短视频的AI工具,免费批量混剪-首席AI分享圈

MoneyPrinterPlus: AI tool for generating short videos with one click, free batch mixing

Comprehensive Introduction MoneyPrinterPlus is an open source project aimed at generating and mixing all kinds of short videos with one click through AI technology, and automatically publishing them to multiple video platforms, such as Jieyin, Shutterbugs, Xiaohongshu, and Video Number. The tool supports local and cloud-based voice models, including chatTTS, fasterwhisper, G...

TF-ID:学术论文表格/图像识别工具-首席AI分享圈

TF-ID: academic paper form/image recognition tool

Comprehensive Introduction TF-ID (Table/Figure IDentifier) is a family of object detection models specialized for extracting tables and images from academic papers. The project was created by Yifei Hu and open-sourced on GitHub.TF-ID models are fine-tuned to recognize and extract tables and images from academic papers...

Chatbot UI:模仿ChatGPT界面和功能的开源AI聊天应用程序-首席AI分享圈

Chatbot UI: an open source AI chat app that mimics ChatGPT's interface and functionality

General Introduction Chatbot UI is an open source project designed to help developers create personalized and intelligent conversational interfaces. The project provides a range of interface components and interactive features that can be easily integrated into the existing Chatbot system to provide users with a smoother and smarter conversation experience.Chatbot UI ...

HivisionIDPhotos:开源智能AI证件照制作工具-首席AI分享圈

HivisionIDPhotos: open source intelligent AI photo ID creation tool

Comprehensive introduction HivisionIDPhotos is an open source lightweight AI document photo production tools, can intelligently identify the user photo scene and keying, to generate a standard document photo in line with a variety of specifications. The tool supports custom background colors and sizes, and in the future will also launch the beauty and intelligent change of formal dress function. With...

SadTalker:让照片说话|嘴型同步音频|合成口型同步视频|免费数字人-首席AI分享圈

SadTalker: Make Photos Talk | Mouth Synchronized Audio | Synthesized Mouth Synchronized Video | Free Digital People

General Introduction SadTalker is an open source tool that combines single still portrait photos and audio files to create realistic talking head videos for a wide range of scenarios such as personalized messages, educational content, and more. The revolutionary use of 3D modeling technologies such as ExpNet and PoseVAE excel in capturing the subtle facets...

MuseV+Muse Talk:完整数字人视频生成框架|人像转视频|姿态转视频|唇形同步-首席AI分享圈

MuseV+Muse Talk: Complete Digital Human Video Generation Framework | Portrait to Video | Pose to Video | Lip Synchronization

General Introduction MuseV is a public project on GitHub that aims to enable the generation of avatar videos of unlimited length and high fidelity. It is based on diffusion technology and offers Image2Video, Text2Image2Video, Video2Video and many other features. Provides model structure, use cases, quick start...

Unstructured:开源预处理非结构化文档,无结构数据处理的利器-首席AI分享圈

Unstructured: open source preprocessing unstructured documents, unstructured data processing tools

Comprehensive Introduction Unstructured-IO provides a range of open source components for processing and preprocessing images and text documents such as PDF, HTML, Word documents, etc. Its main goal is to simplify and optimize data processing workflow , especially for large language model (LLM) applications to provide support.Unstructured...

en_USEnglish