AI Personal Learning
and practical guidance
477 Articles

Tags :AI open source project Page 35

Amphion MaskGCT: Zero-Sample Text-to-Speech Cloning Model (Local One-Click Deployment Package) - Chief AI Sharing Circle

Amphion MaskGCT: Zero-sample text-to-speech cloning model (local one-click deployment package)

Comprehensive Introduction MaskGCT (Masked Generative Codec Transformer) is a completely non-autoregressive Text-to-Speech (TTS) model jointly introduced by Funky Maru Technology and The Chinese University of Hong Kong. The model does not require explicit text-to-speech alignment information and adopts a two-stage generation approach, which first passes ...

PDF to Podcast: Convert PDF to Podcast Utility

General Introduction Inspired by the podcast generation features of Notebook LM and the recent Open Notebook LM open source implementation. In this recipe, we will implement a detailed step-by-step guide on how to build a PDF to podcast pipeline. Given any PDF, we will generate a segment where the host and guest discuss and explain ...

MindSearch: open source AI search engine framework to deploy your own Perplexity search engine! -Chief AI Sharing Circle

MindSearch: open source AI search engine framework to deploy your own Perplexity search engine!

Comprehensive Introduction MindSearch is an open source AI search engine framework launched by Shanghai Artificial Intelligence Laboratory (SAL), which aims to simulate human thought process for complex information gathering and integration. The tool combines the advanced technology of large-scale language modeling (LLM) and search engine with a multi-intelligence body framework to achieve the...

CosyVoice: 3-second rush voice cloning open source project launched by Ali with support for emotionally controlled tags - Chief AI Sharing Circle

CosyVoice: 3-second rush voice cloning open source project launched by Ali with support for emotionally controlled tags

Comprehensive Introduction CosyVoice is a multilingual large-scale speech generation model that provides full-stack capabilities from inference, training to deployment. Developed by FunAudioLLM team, it aims to achieve high quality speech synthesis through advanced autoregressive transformers and ODE-based diffusion models.CosyVoice not only supports...

Fabric: an AI open-source workflow framework that integrates many cue words to efficiently handle a variety of transactions - Chief AI Sharing Circle

Fabric: an AI open source workflow framework that integrates many cue words to efficiently handle a variety of transactions

General Introduction Fabric is an open source AI framework developed by Daniel Miessler to simplify and automate everyday computer tasks and make artificial intelligence easier to use. It helps users efficiently handle a variety of tasks such as content summarization, data extraction through modular design and preset prompt words (Patterns)...

TANGO: A tool for voice-generated coordinated gesture portrait videos with full-body digital humans - Chief AI Sharing Circle

TANGO: a tool for voice-generated coordinated gesture portrait videos with full-body digital humans

General Introduction TANGO (Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation) is an open source collaborative speech gesture video generation framework jointly developed by the University of Tokyo and CyberAgent AI Labs An open source collaborative speech gesture video generation framework jointly developed by the University of Tokyo and CyberAgent AI Lab. The ...

Chief AI Sharing Circle

Chief AI Sharing Circle specializes in AI learning, providing comprehensive AI learning content, AI tools and hands-on guidance. Our goal is to help users master AI technology and explore the unlimited potential of AI together through high-quality content and practical experience sharing. Whether you are an AI beginner or a senior expert, this is the ideal place for you to gain knowledge, improve your skills and realize innovation.

Contact Us
en_USEnglish