AI Personal Learning
and practical guidance
TRAE
Total 992 articles

Tags: ai open source projects Page 45

文本提取API(text-extract-api):视觉提取文本信息,匿名化的PDF提取工具-首席AI分享圈

Text Extraction API (text-extract-api): visual extraction of text information, anonymized PDF extraction tool

General Description Text Extraction API (text-extract-api) is a powerful tool designed to extract and parse content from a variety of document formats (e.g. PDF, Word, PPTX, etc.). The API utilizes state-of-the-art Optical Character Recognition (OCR) technology and Ollama-supported models to be able to take any document or image...

OmniGen:统一图像生成模型,多模态输入生成人物一致性图像-首席AI分享圈

OmniGen: Unified Image Generation Model with Multimodal Inputs to Generate Character-Consistent Images

General Introduction OmniGen is a "general purpose" image generation model developed by VectorSpaceLab that allows users to create diverse and contextually rich visual effects with simple text prompts or multimodal inputs. It is particularly well suited for scenes that require character recognition and consistent character rendering...

PantoMatrix(EMAGE):全身手势生成框架,从音频生成全身手势的3D动画框架-首席AI分享圈

PantoMatrix (EMAGE): full-body gesture generation framework, 3D animation framework for generating full-body gestures from audio

Comprehensive Introduction PantoMatrix is an advanced full-body gesture generation framework capable of generating complete human movements from audio and partial gestures, including face, partial body, hand, and full-body movements. The framework utilizes the latest multimodal datasets and deep learning techniques to provide high quality 3D motion capture data...

Continue:与VS Code集成并自定义模型和embedding的开源AI代码助手-首席AI分享圈

Continue: open source AI code assistant that integrates with VS Code and customizes models and embedding

General Introduction Continue is an open source AI code assistant designed to improve the efficiency of software developers. Its main features include code auto-completion, code optimization and intelligent code suggestions for VS Code and JetBrains IDEs.Continue not only supports multiple language models, but also allows users to customize...

AigoTools:自动收录网站并支持多语言的开源AI工具导航站-首席AI分享圈

AigoTools: automatic inclusion of the site and support for multilingual open source AI tools navigation station

Comprehensive Introduction AigoTools is an open source AI web site navigation , designed to help users quickly create and manage navigation sites . It has built-in site management and AI-based automatic inclusion features , support for multiple languages , dark/light theme switching , and SEO optimization.AigoTools provides a variety of image storage solutions , including this ...

Amphion MaskGCT:零样本文本到语音克隆模型(本地一键部署包)-首席AI分享圈

Amphion MaskGCT: Zero-sample text-to-speech cloning model (local one-click deployment package)

Comprehensive Introduction MaskGCT (Masked Generative Codec Transformer) is a completely non-autoregressive Text-to-Speech (TTS) model jointly introduced by Funky Maru Technology and The Chinese University of Hong Kong. The model does not require explicit text-to-speech alignment information and adopts a two-stage generation approach, which first passes ...

PDF to Podcast: Convert PDF to Podcast Utility

General Introduction Inspired by the podcast generation features of Notebook LM and the recent Open Notebook LM open source implementation. In this recipe, we will implement a detailed step-by-step guide on how to build a PDF to podcast pipeline. Given any PDF, we will generate a segment where the host and guest discuss and explain ...

MindSearch:开源AI搜索引擎框架,部署您自己的 Perplexity 搜索引擎!-首席AI分享圈

MindSearch: open source AI search engine framework to deploy your own Perplexity search engine!

Comprehensive Introduction MindSearch is an open source AI search engine framework launched by Shanghai Artificial Intelligence Laboratory (SAL), which aims to simulate human thought process for complex information gathering and integration. The tool combines the advanced technology of large-scale language modeling (LLM) and search engine with a multi-intelligence body framework to achieve the...

en_USEnglish