AI Personal Learning
and practical guidance
Beanbag Marscode1
Total 908 articles

Tags: ai open source projects Page 42

ModelBest(面壁智能):全球领先的轻量高性能端侧大模型-首席AI分享圈

ModelBest: The World's Leading Lightweight, High-Performance End-Side Big Model

General Introduction ModelBest is a company specializing in developing lightweight and high-performance large models, dedicated to applying advanced AI technologies to mainstream consumer electronics and various end devices in daily life. Its MiniCPM series of end-side models are known for their extreme arithmetic power and memory usage efficiency, with small parameter counts,...

Podcastfy:多源内容转多语言音频对话工具,NotebookLM 播客功能的开源替代方案-首席AI分享圈

Podcastfy: Multi-source Content to Multilingual Audio Conversation Tool, an Open Source Alternative to NotebookLM's Podcasting Capability

General Introduction Podcastfy is an open source Python package that utilizes Generative Artificial Intelligence (GenAI) technology to convert web content, PDF files, text, images, youtube videos, and many other sources into engaging multi-language audio conversations. Unlike traditional user interface-based...

文多多 AiPPT:AI生成PPT,演讲稿生成-首席AI分享圈

Wendo AiPPT: AI Generated PPT, Presentation Generation

Comprehensive Introduction AiPPT is a PPT generation tool based on artificial intelligence technology, designed to help users quickly create professional presentations. It automatically generates content-rich, beautifully-designed slides by entering a theme, uploading a file, or providing a URL, etc. It supports native charts, animations and 3D effects and other complex...

Easegen:开源数字人课程制作平台,PPT一键生成克隆数字人讲解视频-首席AI分享圈

Easegen: open source digital human course production platform, PPT one-click generation cloning digital human lecture video

General Introduction Easegen is an open source digital human course creation platform that aims to improve the efficiency of teaching content production and management through AI technology. The platform provides a one-stop solution from course production, video management to intelligent questioning, which allows users to create digital human-explained video courses and utilize AI ...

MeetingMind:依赖OpenAI Whisper的开源智能会议记录与总结工具-首席AI分享圈

MeetingMind: An Open Source Intelligent Meeting Recording and Summarization Tool Relying on OpenAI Whisper

Comprehensive Introduction MeetingMind is an advanced AI application designed to improve the efficiency of capturing and summarizing business meetings. The app integrates OpenAI's Whisper technology for accurate speech-to-text and uses IBM Watson's AI to analyze and extract key points in the transcribed text....

Coqui TTS(xTTS):文本到语音生成的深度学习工具包,支持多种语言和声音克隆功能-首席AI分享圈

Coqui TTS (xTTS): Deep Learning Toolkit for Text-to-Speech Generation with Multiple Language Support and Voice Cloning Capabilities

Comprehensive Introduction Coqui TTS is an open source advanced text-to-speech (TTS) generation toolkit based on deep learning techniques. It has been battle-tested in both research and production environments, and provides a rich set of features and models that support text-to-speech conversion in multiple languages.Coqui TTS not only supports pre-trained models...

BlinkShot:输入提示词实时生成图像(免费接入Flux Schnell模型)-首席AI分享圈

BlinkShot: real-time image generation by typing prompt words (free access to Flux Schnell model)

General Introduction BlinkShot is an open source, real-time AI image generator that utilizes Together AI and Flux Schnell technology to allow users to generate high-quality images as they enter prompts. The platform is completely free and supports user customization and secondary development for designers, artists and content creation...

FunASR:开源语音识别工具包,说话人分离/ 多人对话语音识别-首席AI分享圈

FunASR: Open Source Speech Recognition Toolkit, Speaker Separation / Multi-Person Conversation Speech Recognition

Comprehensive Introduction FunASR is an open source speech recognition toolkit developed by Alibaba's Dharma Institute to bridge academic research and industrial applications. It supports a wide range of speech recognition features, including speech recognition (ASR), voice endpoint detection (VAD), punctuation recovery, language modeling, speaker verification, speak...

阿布量化交易系统:基于Python的开源量化交易平台-首席AI分享圈

Abu quantitative trading system: Python based open source quantitative trading platform

Comprehensive introduction Abu quantitative trading system is an open source platform based on Python development. It was created by user "bbfamily" to help investors realize quantitative trading strategies through code. The system supports backtesting and trading of various financial products such as stocks, options, futures and bitcoin. It combines machine learning techniques...

Knowledge Table:高效提取与探索结构化数据的开源工具-首席AI分享圈

Knowledge Table: an open source tool for efficient extraction and exploration of structured data

Comprehensive Introduction Knowledge Table (Knowledge Table) is an open source project designed to simplify the process of extracting and exploring structured data from unstructured documents. Users can create structured knowledge representations such as tables and graphs through a natural language query interface. The tool supports customization of extraction rules and formats...

CogView3:智谱轻言开源的级联扩散文本生成图像模型-首席AI分享圈

CogView3: Wisdom Spectrum Light Word open source cascade diffusion text to generate image models

Comprehensive Introduction CogView3 is an advanced text generation image system developed by Tsinghua University and Think Tank Team (Chi Spectrum Qingyan). It is based on the cascading diffusion model and generates high-resolution images through multiple stages.The key features of CogView3 include multi-stage generation, innovative architecture and efficient performance for artistic creation...

en_USEnglish