AI open source project

Total 1020 articles posts

Sorting

Datalab：专用OCR识别AI模型，PDF转Markdown（开源/API）

Datalab: dedicated OCR recognition AI model, PDF to Markdown (open source/API)

Comprehensive Introduction Datalab offers a range of advanced AI models focused on OCR, layout analysis, PDF to Markdown, and more. These models are not only high performing, but also easy to use and open source. The Marker models on the platform can quickly and accurately...

9mos ago

03.3K

ModelBest: The World's Leading Lightweight, High-Performance End-Side Big Model

General Introduction ModelBest is a company specializing in developing lightweight and high-performance large models, dedicated to applying advanced AI technologies to mainstream consumer electronics and various end devices in daily life. Its MiniCPM series of end-side models are characterized by extreme arithmetic power and memory usage efficiency...

Latest AI Resources # AI Big Model Native Conversation Tool # AI Java Open Source Projecct

10mos ago

03K

Podcastfy：多源内容转多语言音频对话工具，NotebookLM 播客功能的开源替代方案

Podcastfy: Multi-source Content to Multilingual Audio Conversation Tool, an Open Source Alternative to NotebookLM's Podcasting Capability

General Introduction Podcastfy is an open source Python package that utilizes Generative Artificial Intelligence (GenAI) technology to convert web content, PDF files, text, images, youtube videos, and many other sources into engaging multilingual...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

10mos ago

03K

One API: Multi-model API Management and Load Balancing, Distribution System

Comprehensive Introduction One API is an open source interface management and distribution system that supports a variety of big models such as OpenAI ChatGPT, Anthropic Claude, Google PaLM 2 & Gemini. The ...

Latest AI Resources # AI Java Open Source Projecct

10mos ago

03.9K

Wendo AiPPT: AI Generated PPT, Presentation Generation

Comprehensive Introduction AiPPT is a PPT generation tool based on artificial intelligence technology, designed to help users quickly create professional presentations. It automatically generates content-rich, beautifully-designed slides by entering a theme, uploading a file or providing a URL, and supports native charts, animations and 3D special...

Latest AI Resources # AI Java Open Source Projecct # AI Generated Presentation/PPT

6mos ago

03.3K

Easegen: open source digital human course production platform, PPT one-click generation cloning digital human lecture video

Comprehensive Introduction Easegen is an open source digital human course creation platform that aims to improve the efficiency of teaching content production and management through AI technology. The platform provides a one-stop solution from course production, video management to intelligent questioning, which allows users to create digital human-explained video courses...

Latest AI Resources # AI Java Open Source Projecct # AI Educational Tools # AI text to video

10mos ago

03.4K

Open Canvas：代码编辑协作画布，开源版OpenAI Canvas/Claude Artifacts

Open Canvas: code editing collaborative canvas, open source version of OpenAI Canvas/Claude Artifacts

General Introduction LangChain presents Open Canvas, an open source web application designed to enhance the document editing and collaboration experience with built-in dual-agent memory functionality and integrated smith to observe full execution details. The platform is powered by OpenA...

Latest AI Resources # AI Writing # AI Java Open Source Projecct # AI Canvas

5mos ago

03.7K

AutoGen Studio: Easy-to-Use Interface Version of the Multi-Agent System AutoGen

General Description AutoGen Studio 2.0 is a user interface powered by AutoGen designed to simplify the process of creating and managing multi-agent solutions. The platform enables users to declaratively define and modify agents and their workflows through an intuitive interface...

Latest AI Resources # AI Java Open Source Projecct # Intelligent Body Development Framework

7mos ago

03.9K

MeetingMind：依赖OpenAI Whisper的开源智能会议记录与总结工具

MeetingMind: An Open Source Intelligent Meeting Recording and Summarization Tool Relying on OpenAI Whisper

General Introduction MeetingMind is an advanced AI application designed to improve the efficiency of capturing and summarizing business meetings. The app integrates OpenAI's Whisper technology for accurate speech-to-text and uses IBM Watso...

Latest AI Resources # AI Java Open Source Projecct # AI Text and Audio/Video Summarization Tool

10mos ago

03.5K

Coqui TTS（xTTS）：文本到语音生成的深度学习工具包，支持多种语言和声音克隆功能

Coqui TTS (xTTS): Deep Learning Toolkit for Text-to-Speech Generation with Multiple Language Support and Voice Cloning Capabilities

Comprehensive Introduction Coqui TTS is an open source advanced text-to-speech (TTS) generation toolkit based on deep learning techniques. It has been battle-tested in both research and production environments, and provides a rich set of features and models that support text-to-speech conversion in multiple languages.Coqui TTS...

Latest AI Resources # AI Java Open Source Projecct # AI voice cloning

6mos ago

03.5K

MemFree: an AI search engine that mixes local knowledge base with search information

General Introduction MemFree is an advanced hybrid AI search engine capable of searching and asking questions through text, images, documents and web pages. It provides one-click access to search results for text, mind maps, images, and videos.MemFree aims to extract information from the user's knowledge base and...

Latest AI Resources # AI Java Open Source Projecct # AI search tool

10mos ago

03.1K

BlinkShot：输入提示词实时生成图像（免费接入Flux Schnell模型）

BlinkShot: real-time image generation by typing prompt words (free access to Flux Schnell model)

General Description BlinkShot is an open source, real-time AI image generator that utilizes Together AI and Flux Schnell technology to allow users to generate high-quality images as they enter prompts. The platform is completely free and supports user customization and secondary open...

Latest AI Resources # AI online image generation # AI Java Open Source Projecct

10mos ago

03.8K

FunASR: Open Source Speech Recognition Toolkit, Speaker Separation / Multi-Person Conversation Speech Recognition

Comprehensive Introduction FunASR is an open source speech recognition toolkit developed by Alibaba's Dharma Institute to bridge academic research and industrial applications. It supports a wide range of speech recognition features, including speech recognition (ASR), voice endpoint detection (VAD), punctuation recovery, language modeling, speaking...

Latest AI Resources # AI Java Open Source Projecct # AI Speech to Text

10mos ago

04.5K

UltraPixel: Revolutionizing ultra-high resolution image generation with rich image details

General Introduction UltraPixel is an advanced ultra-high resolution image generation technology designed to create extremely high-quality, detail-rich images. The project was developed by GitHub user catcathh and presented at NeurIPS 2024.U...

Latest AI Resources # AI online image generation # AI Java Open Source Projecct

10mos ago

02.9K

SiYuan (SiYuan Notes): privacy-first personal knowledge management software with AI writing/Q&A chat support

General: SiYuan Notes (SiYuan) is a privacy-first personal knowledge management software that is fully open source and supports self-hosting. It is written in TypeScript and Golang, provides fine-grained block-level references and Markdown WYSIWYG (WYSIWY...

Latest AI Resources # AI Java Open Source Projecct # AI Notes

6mos ago

03.9K

Abu quantitative trading system: Python based open source quantitative trading platform

Comprehensive introduction Abu quantitative trading system is an open source platform based on Python development. It was created by user "bbfamily" to help investors realize quantitative trading strategies through code. The system supports backtesting and trading of various financial products such as stocks, options, futures and bitcoin. It...

Latest AI Resources # AI Java Open Source Projecct # AI Financial Data Analytics

5mos ago

02.5K

Knowledge Table: an open source tool for efficient extraction and exploration of structured data

Comprehensive Introduction Knowledge Table (Knowledge Table) is an open source project designed to simplify the process of extracting and exploring structured data from unstructured documents. Users can create structured knowledge representations such as tables and graphs through a natural language query interface. The tool supports customizing the extraction ...

Latest AI Resources # AI Java Open Source Projecct # Knowledge Graph

10mos ago

02.8K

CogView3: Wisdom Spectrum Light Word open source cascade diffusion text to generate image models

Comprehensive Introduction CogView3 is an advanced text generation image system developed by Tsinghua University and Think Tank Team (Chi Spectrum Qingyan). It is based on a cascading diffusion model to generate high-resolution images through multiple stages.The key features of CogView3 include multi-stage generation, innovative architecture and efficient performance...

Latest AI Resources # AI online image generation # AI Java Open Source Projecct

10mos ago

03K

RocketNotes：支持文本补全、文档对话、语义搜索的Markdown笔记应用

RocketNotes: Markdown notes app with text completion, document dialog, semantic search support

General Introduction RocketNotes is a web-based Markdown note-taking application that integrates Large Language Model (LLM)-driven text completion, chat, and semantic search. The project uses the 100% serverless RAG (Re...

Latest AI Resources # AI Java Open Source Projecct # AI Notes

9mos ago

03.1K

F5-TTS: Sample less speech cloning to generate smooth and emotionally rich cloned voices

Comprehensive Introduction F5-TTS is a novel non-autoregressive text-to-speech (TTS) system based on a stream-matched Diffusion Transformer (DiT). The system optimizes the text representation by using the ConvNeXt model...

Latest AI Resources # AI Java Open Source Projecct # AI voice cloning

6mos ago

05.1K

AsrTools: speech-to-subtitle tool, lightweight client with built-in interfaces to Cutscene, Racer, and Must-Cut

Comprehensive Introduction AsrTools is an intelligent speech-to-text tool with built-in interfaces from big players such as Cutscene, Racer, Must Cut, etc. It does not require GPU or cumbersome configuration, and supports efficient multi-threaded batch processing. It is based on PyQt5 development, beautiful and user-friendly interface, able to output SRT and TXT format words...

Latest AI Resources # AI Java Open Source Projecct # AI Speech to Text

10mos ago

04.2K

Surya: professional multilingual document OCR tool, open source native deployment

Comprehensive Introduction Surya is an open source multilingual document OCR toolkit that supports text recognition in over 90 languages. It is capable of not only line-by-line text detection, but also layout analysis, reading order detection, and table recognition.Surya's performance rivals that of cloud services for all types of...

Latest AI Resources # AI Java Open Source Projecct # OCR

10mos ago

05.2K

Deploying hugging face's free api on cloudflare to support interface forwarding

Because the domestic deployment can not access hugging face, so in the big brother deployment program based on the transformation to be able to deploy to cloudflare workers. Preparation 1, register cloudflare 2, register hugging fac...

Latest AI Resources # AI Java Open Source Projecct # Free Large Model API

10mos ago

03.1K

Inbox Zero：轻松实现收件箱零邮件，借助 AI 帮助你对邮件进行归类、过滤、处理。

Inbox Zero: Easily achieve zero emails in your inbox, with the help of AI to help you categorize, filter, and process your emails.

General Description Inbox Zero is an open source email management app designed to help users quickly achieve inbox zero emails with an AI assistant. The app offers a variety of features including auto-replying, archiving, labeling and forwarding emails, managing and unsubscribing from newsletters, blocking cold emails, following...

Latest AI Resources # AI Java Open Source Projecct # AI Life Efficiency Assistant

8mos ago

02.3K

xyks: small ape oral math reverse notes, reverse engineering and decryption algorithms

Comprehensive Introduction Ape Mouth Calculator Reverse Notes is an open source project that aims to document and share the process and methods of reverse engineering the Ape Mouth Calculator application. The project contains a variety of reverse tools and techniques to use the instructions , such as Frida, dexdump , etc., to help users understand and crack the little ape oral math add...

Latest AI Resources # AI Java Open Source Projecct # AI Educational Tools

10mos ago

03.1K

XiaoYuanKouSuan_Auto：小猿口算自动答题工具，高效解决口算题目

XiaoYuanKouSuan_Auto: XiaoYuanKouSuan automatic question and answer tool, efficiently solving oral arithmetic questions

Comprehensive introduction Ape Mouth Calculator Automatic Question Answer Tool is a Python based open source project designed to efficiently solve the questions in the Ape Mouth Calculator application through OCR recognition and automation scripts. The tool utilizes technologies such as OpenCV and Tesseract to be able to recognize the questions on the screen in real time...

Latest AI Resources # AI Java Open Source Projecct # AI Educational Tools

10mos ago

02.8K

Telegram GPT Worker：部署在Cloudflare Workers上的多模型AI Telegram机器人

Telegram GPT Worker: a multi-model AI Telegram bot deployed on Cloudflare Workers

General Introduction GPT-Telegram-Worker is a multi-model AI Telegram bot based on Cloudflare Workers with support for multiple APs such as OpenAI, Claude, Azure, and...

Latest AI Resources # AI Java Open Source Projecct # Intelligent Body Application

5mos ago

03.2K

Cloud Document Converter：飞书文档下载插件，飞书云文档转换为本地Markdown格式文档

Cloud Document Converter: Flying Book document download plug-in, Flying Book cloud document conversion to local Markdown format documents

General Introduction Cloud Document Converter is a Chrome extension designed for converting Flying Book cloud documents to Markdown format. Users can easily download or copy Flying Book cloud documents into Markdo...

Latest AI Resources # AI Java Open Source Projecct

9mos ago

02.7K

QuickPiperAudiobook：一键生成自然音质的有声书,支持PDF、epub、docx等格式

QuickPiperAudiobook: a key to generate natural sound quality audiobooks, support for PDF, epub, docx and other formats

Comprehensive Introduction QuickPiperAudiobook is an open source project designed to convert various text formats (e.g. epub, mobi, txt, PDF, HTML, etc.) into natural-sounding audiobooks through a simple one command. The tool uses Pi...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

10mos ago

02.8K

Crawl4AI: open source asynchronous web crawler tool to extract structured data without LLM

Comprehensive Introduction Crawl4AI is an open source asynchronous web crawler tool designed for large-scale language models (LLMs) and artificial intelligence (AI) applications. It simplifies the web crawling and data extraction process, supports efficient web crawling, and provides LLM-friendly output formats for...

Latest AI Resources # AI Java Open Source Projecct

9mos ago

03.8K

Cloudflare Serverless Registry：基于Cloudflare Workers的无服务器容器注册表

Cloudflare Serverless Registry: A Serverless Container Registry Based on Cloudflare Workers

General Introduction Cloudflare Serverless Registry is a serverless container registry based on Cloudflare Workers and R2 storage. It supports push and pull of images and provides username password and...

Latest AI Resources # AI Java Open Source Projecct

10mos ago

02.9K

AIHawk: Intelligent Job Search Assistant, Automated Resume Placement (English only)

General Introduction Auto_Jobs_Applier_AIHawk is a tool to automate job search using artificial intelligence technology. It helps users to automatically deliver a large number of resumes in a short period of time and personalize them according to their personal information and job search intentions. The tool is designed to raise...

Latest AI Resources # AI Java Open Source Projecct # AI Life Efficiency Assistant

8mos ago

03.5K

simple-one-api：一键集成多种免费大模型API，统一对外提供 OpenAI 接口

simple-one-api: one-click integration of multiple free big model APIs, unified external OpenAI interfaces

Comprehensive Introduction simple-one-api is an open source project designed to simplify the integration of multiple big model APIs. It supports OpenAI-compatible APIs such as Thousand Sails Big Model Platform, Xunfei Starfire Big Model, Tencent Mixed Element, and MiniMax and Deep-Seek....

Latest AI Resources # AI Java Open Source Projecct

9mos ago

03.1K

Voice Changer: A real-time voice changer to make your favorite anime characters sing!

Comprehensive Introduction Voice Changer is an open source real-time voice transformation tool that supports a wide range of AI voice models such as MMVC, so-vits-svc, RVC, DDSP-SVC, and Beatrice.The tool is compatible with multiple platforms...

Latest AI Resources # AI Java Open Source Projecct # AI voice cloning

10mos ago

03.1K

VoAPI: High-value AI model forwarding interface management system, the official website provides free API quota on a daily basis

Comprehensive Introduction VoAPI is a new high-color and high-performance AI model interface management and distribution system, which is mainly used for personal or enterprise internal management and distribution channels. Developed based on NewAPI, the system provides rich functional modules and optimized user interface, aiming to enhance...

Latest AI Resources # AI Open Services # AI Java Open Source Projecct

9mos ago

02.8K

MockingBird：快速克隆声音与模型训练，基于 xtts v2 实现的文本转语音

MockingBird: Fast Voice Cloning and Model Training, Text-to-Speech based on xtts v2 Implementation

Comprehensive introduction MockingBird is an open source project designed to achieve rapid speech cloning and text-to-speech through AI technology. Users only need to provide 5 seconds of voice samples to generate any voice content. The project supports a variety of Chinese datasets , and in Windows ...

Latest AI Resources # AI Java Open Source Projecct # AI voice cloning

8mos ago

03.4K

Clone Voice：支持多语言的少样本声音克隆工具，基于xtts_v2提供Windows一键安装包

Clone Voice: Multi-language sample-less voice cloning tool based on xtts_v2 for Windows one-click installer

General Description Clone Voice is an open source sound cloning tool that provides a web-based interface that allows users to clone voices using any sound or personal voice recording. The tool is easy to use, even without an NVIDIA GPU, and can be used with a pre-compiled app...

Latest AI Resources # AI Java Open Source Projecct # AI voice cloning

10mos ago

03.5K

StreamingT2V: A Dynamic and Scalable Generation Technique from Text to Long Video

Comprehensive Introduction StreamingT2V is a public project developed by the Picsart AI research team focused on generating coherent, dynamic and scalable long videos based on textual descriptions. This technology uses an advanced autoregressive approach that guarantees temporal consistency of the video with the description text tightly...

Latest AI Resources # AI Java Open Source Projecct # AI text to video

9mos ago

03K

Text2Video-Zero：Picsart AI Research团队发布的文本到视频零样本生成器

Text2Video-Zero: Text-to-Video Zero Sample Generator Released by the Picsart AI Research Team

General Introduction Text2Video-Zero is an official implementation of a zero-sample text-to-video generator for GitHub developed by the Picsart AI Research team.The project provides a way to use text cues to generate text with temporal consistency and correct...

Latest AI Resources # AI Java Open Source Projecct # AI text to video

10mos ago

03.3K

Retrieval based Voice Conversion WebUI：基于检索的语音转换框架|模拟真人歌声

Retrieval based Voice Conversion WebUI: A Framework for Retrieval-based Voice Conversion | Simulating Real-life Singing Voices

Comprehensive Introduction Retrieval based Voice Conversion WebUI is an easy-to-use VITS-based voice conversion framework that enables voice conversion between any speakers, including song covers and real-time voice changes. It has low ...

Latest AI Resources # AI Java Open Source Projecct # AI voice cloning

10mos ago

03.6K

VoiceCraft: open source zero-sample speech cloning and text-to-speech tool

Comprehensive Introduction VoiceCraft is an open source speech editing and zero-sample speech synthesis tool based on the neural codec language model. It employs an innovative coded sequence generation method that enables insertion, deletion and replacement operations on existing speech sequences to generate natural, coherent edited speech...

Latest AI Resources # AI Java Open Source Projecct # AI voice cloning

10mos ago

03.1K

edge-tts: Text-to-Speech Python Module | Free Text-to-Speech Service

General Description edge-tts is an open source Python module that allows users to use Microsoft Edge's online text-to-speech service in Python code without the need for the Microsoft Edge browser, Windows operating system or API secret...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

10mos ago

04.1K

CoAI.Dev (Chat Nio)：AI聚合应用一站式 B/C 端解决方案，支持弹性计费和订阅计划模式

CoAI.Dev (Chat Nio): One-stop B/C solution for AI aggregation apps with flexible billing and subscription plan model support

General Introduction CoAI.Dev (formerly Chat Nio) is a chat platform that integrates multiple AI models and supports distributed streaming, image generation, cross-device conversation synchronization and sharing. It implements a subscription and Token billing system, Key transit service and multi...

Latest AI Resources # AI Side Hustle Money Making Programs # AI Java Open Source Projecct # AI Localized Chat Application

9mos ago

03.4K

ChatOllama: Native real-time chat application UI based on Nuxt 3 and Ollama

Comprehensive introduction ChatOllama is an open source online chat application project based on a large language model (LLM) , supporting numerous language models and knowledge base management. Users can use the platform for model management ( list display , download , delete ) , chat with the model and other functions . The project utilizes ...

Latest AI Resources # AI Java Open Source Projecct # AI Localized Chat Application # Knowledge Retrieval with RAG Framework

10mos ago

03.3K

MinerU：PDF文档提取转换为多模态Markdown格式，支持电子书OCR扫描

MinerU: PDF document extraction and conversion to multimodal Markdown format, support e-book OCR scanning

Comprehensive Introduction MinerU is an open source data extraction tool developed by the OpenDataLab team at the Shanghai Artificial Intelligence Laboratory, focusing on efficiently extracting content from complex PDF documents, web pages, and eBooks. It can take multimodal PDFs containing images, formulas, tables and other elements...

Latest AI Resources # AI Java Open Source Projecct # OCR # Document Extraction and Cleaning

11mos ago

04.9K

DCT-Net: An Open Source Tool for Transpainting Photos and Videos to Anime Stylization

Comprehensive Introduction DCT-Net is an open source project developed by DAMO Academy and Wang Xuan Institute of Computer Technology, Peking University, aiming at anime stylized transformation of images. The project utilizes deep learning techniques through Domain-Calibrated Translation (Domain-Calibrat...

Latest AI Resources # AI Image Style Control # AI Java Open Source Projecct # AI Video Conversion Style

7mos ago

03.2K

Diffusers Image Outpaint：超强开源AI图像扩展工具，图像外绘（image outpainting）

Diffusers Image Outpaint: super powerful open source AI image extension tool, image outpainting (image outpainting)

General Introduction Diffusers Image Outpaint is a powerful AI image expansion tool created by Hugging Face community member fffiloni. The tool utilizes advanced diffusion modeling techniques that allow images into...

Latest AI Resources # AI Image Enlargement and Restoration # AI Java Open Source Projecct

11mos ago

03.6K

Tap4 AI WebUI: open source lightweight AI tool navigation project

Comprehensive Introduction Tap4 AI WebUI is an open source lightweight AI tool navigation website project , designed to help users easily build their own AI tool catalog. The project uses Next.js and Supabase technology stack , support for multi-language SEO optimization to provide AI...

Latest AI Resources # AI Side Hustle Money Making Programs # AI Java Open Source Projecct

10mos ago

03.2K

CodeFormer: image and video facial restoration, old photo restoration, offers one-click deployment version

CodeFormer General Introduction CodeFormer is a codebase for robust blind face repair, developed by a team of researchers at S-Lab, Nanyang Technological University and presented at NeurIPS 2022. The project utilizes a codebook lookup transformer (C...

Latest AI Resources # AI Side Hustle Money Making Programs # AI Image Enlargement and Restoration # AI Java Open Source Projecct

11mos ago

03.8K

GFPGAN: Tencent's open source face repair algorithm

Comprehensive Introduction GFPGAN (Generative Facial Prior GAN) is an open source face repair algorithm developed by Tencent ARC (Applied Research Center). The algorithm utilizes a pre-trained facial GAN...

Latest AI Resources # AI Image Enlargement and Restoration # AI Java Open Source Projecct

11mos ago

03.2K

Curiosity：使用LangGraph构建类似 Perplexity 的AI搜索工具

Curiosity: building a Perplexity-like AI search tool using LangGraph

General Introduction Curiosity is a project aimed at exploration and experimentation, primarily using the LangGraph and FastHTML technology stacks, with the goal of building a Perplexity AI-like search product. The core of the project is a simple...

Latest AI Resources # AI Java Open Source Projecct # AI search tool

11mos ago

02.4K

Moshi: a real-time speech dialog framework with support for multiple languages and accents for speech dialog base models

General Introduction Moshi Chat is an end-to-end real-time AI voice assistant from Kyutai, a French non-profit AI lab. It not only listens in real-time, but also engages in natural conversations and supports multimodal interactions, including the ability to see, hear, and speak.Moshi Ch...

Latest AI Resources # AI Java Open Source Projecct

11mos ago

03K

QAnything: Local Knowledge Base Q&A System with Highly Integrated RAG Processing Flow

QAnything Comprehensive Introduction QAnything (Question and Answer based on Anything) is a local knowledge base Q&A system launched by NetEase, which supports all kinds of file formats and databases, and can be installed and used offline....

Latest AI Resources # AI Open Services # AI Java Open Source Projecct # Knowledge Retrieval with RAG Framework

11mos ago

03.1K

StickerBaker: Make Personalized Sticker Images with AI

General Description stickerbaker is an open source sticker maker that utilizes artificial intelligence technology to create a variety of interesting stickers. Whether you want a simple cat sticker or you want to make a series of diverse stickers, stickerbaker can fulfill your needs...

Latest AI Resources # AI online image generation # AI Java Open Source Projecct

11mos ago

03.3K

ALog: portable AI voice diary app with speech-to-text support.

General Introduction ALog is an AI-based voice diary application designed to help users record their daily lives by voice. The project is developed by duxins and open-sourced on GitHub. Users can record their diary through voice input, and the app will automatically convert the voice into text...

Latest AI Resources # AI Java Open Source Projecct # AI Speech to Text

7mos ago

03.7K

OpenSPG: Open Source Knowledge Graph Engine

Comprehensive Introduction OpenSPG is an open source knowledge graph engine developed by Ant Group in collaboration with OpenKG, based on the SPG (Semantic Augmented Programmable Graph) framework. The engine is designed to provide features such as explicit semantic representation, logical rule definition and operational framework to support the construction and management of domain knowledge graphs...

Latest AI Resources # AI Java Open Source Projecct # Knowledge Graph

11mos ago

04K

Mem0: an open source project that provides an intelligent memory layer for AI assistants and agents

General Introduction Mem0 (pronounced "mem-zero") is an open source project that provides an intelligent memory layer for AI assistants and agents. It remembers user preferences, adapts to individual needs, and improves over time, making it ideal for customer-supported chatbots, AI assistants, and autonomous system...

Latest AI Resources # AI Java Open Source Projecct

11mos ago

03.9K

Void: open source VSCode-based Cursor alternative

General Introduction Void is an open source Cursor alternative based on a branch of the vscode repository. It provides a powerful development environment designed to provide developers with a more efficient coding experience.Void's goal is to continuously improve its functionality and stability through community contributions and rapid iteration...

Latest AI Resources # AI Java Open Source Projecct # AI Programming

10mos ago

03.7K

GaiaNet node: install and run your own local model online proxy service

General Introduction GaiaNet-AI/gaianet-node is an open source project that allows users to quickly install the default node software stack on Mac, Linux or Windows WSL with a single command. Users can initialize nodes, customize configurations, download...

Latest AI Resources # AI Java Open Source Projecct # Locally Deployed Open Source Large Modeling Tool

11mos ago

02.8K

LlamaCoder: Quickly Generate and Publish Small Web Applications Using Prompt Words

General Introduction LlamaCoder is an open source code generation tool based on Llama 3.1 and Together AI. It can generate small applications with simple prompts and is suitable for developers to quickly realize their ideas.LlamaCoder provides...

Latest AI Resources # AI Java Open Source Projecct # AI Programming # AI Page Design

8mos ago

04.6K

Awesome CursorRules: rule sets to enhance the Cursor AI experience

General Description awesome-cursorrules is a project dedicated to providing custom rules files for Cursor AI.Cursor AI is an AI-powered code editor, and .cursorrules files can be customized...

Latest AI Resources # AI Java Open Source Projecct # PROMPTS Aids

11mos ago

03.5K

MathTranslate: LaTeX Translation Tool for Scientific Papers

General Introduction MathTranslate is an online tool specialized in translating LaTeX documents, especially for scientific papers. The tool is able to keep LaTeX expressions (e.g. mathematical expressions) unchanged and finally compiles LaTeX documents into...

Latest AI Resources # AI Java Open Source Projecct # AI Translation # Thesis

7mos ago

04K

GOT-OCR2.0: end-to-end multimodal OCR model based on QWen2 0.5B

Comprehensive Introduction GOT-OCR2.0 is a StepStar co-proposed de Open Source Optical Character Recognition (OCR) model, which aims to drive OCR technology towards OCR-2.0 through a unified end-to-end model. The model supports a wide range of OCR tasks, including normal text recognition, gr...

Latest AI Resources # AI Java Open Source Projecct # OCR

11mos ago

02.9K

TgWechat: end-to-end encrypted chat plugin for WeChat

General Introduction tgwechat is an open source WeChat plugin developed by developer dplusec. It protects WeChat chat privacy with end-to-end encryption, allowing users to send messages securely. The project went live on GitHub on August 31, 2019 under a GPL v3 license...

Latest AI Resources # AI Java Open Source Projecct

5mos ago

01.9K

OpenSumi Lite: Pure Front-End IDE Solution for Easy Code Viewing and Editing

General Introduction OpenSumi Lite is a pure front-end IDE solution based on the OpenSumi project, designed to provide code viewing and editing capabilities without the need for a Node.js environment. The project is co-developed by Alibaba Group and Ant Group and uses...

AI Answers # AI Java Open Source Projecct

6mos ago

02.5K

FiveThirtyNine: Predicting the probability of future events based on search knowledge

Comprehensive Introduction Forecast AI is a superb forecasting platform based on advanced artificial intelligence technology. It utilizes powerful data analytics and machine learning algorithms to provide users with highly accurate predictions of future events. Whether it's political elections, economic trends or social events, Forecast ...

Latest AI Resources # AI Java Open Source Projecct # AI search tool

11mos ago

03.3K

GPT SoVITS: Revolutionary Speech Generation and Speech Cloning Tools

Comprehensive Introduction GPT-SoVITS is an open source speech conversion and synthesis tool that combines the GPT model and SoVITS voice changer technology. The tool supports on-the-fly text-to-speech conversion with zero and few samples, and voice style migration with only 5 seconds of audio samples. Its features include cross-language ...

Latest AI Resources # AI Java Open Source Projecct # AI voice cloning

11mos ago

03.3K

Fish Speech: Fast and Highly Accurate Cloning of English and Chinese Speech Using Few Samples

General Introduction Fish Speech is an open source text-to-speech (TTS) synthesis tool developed by Fish Audio. The tool is based on cutting-edge AI technologies such as VQ-GAN, Llama and VITS, and is capable of converting text into realistic speech.Fish S...

Latest AI Resources # AI Java Open Source Projecct # AI voice cloning

6mos ago

04.1K

IMS Toucan: Fast and Controllable Multilingual (7000+ languages supported) Text-to-Speech Tool

General Introduction IMS Toucan is a state-of-the-art text-to-speech (TTS) toolkit developed by the Institute for Natural Language Processing (IMS) at the University of Stuttgart, Germany. The toolkit supports more than 7000 languages and is characterized by fast, controllable and low computational resource requirements.IMS...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

6mos ago

03.2K

Automatically generates daily Product Hunt Hot Product lists

General Introduction Product Hunt Daily Chinese Hotlist is an automated tool based on GitHub Actions that generates a daily list of popular products on Product Hunt at regular intervals in the form of a Markdown file...

Latest AI Resources # AI Java Open Source Projecct

11mos ago

02.5K

CrisperWhisper: Accurate Verbatim Speech Transcription Tool

General Description CrisperWhisper is an advanced speech recognition tool based on OpenAI Whisper that focuses on fast, accurate and word-by-word speech transcription. It provides accurate word-level timestamps, even in the case of speech fills and pauses...

Latest AI Resources # AI Java Open Source Projecct # AI Speech to Text

7mos ago

04.2K

PaddleOCR: A multi-language OCR tool library based on Flying Paddle, supporting recognition of more than 80 languages

Comprehensive Introduction PaddleOCR is a multilingual OCR toolkit based on PaddlePaddle, designed to provide a practical and ultra-lightweight OCR system. It supports the recognition of more than 80 languages and provides data annotation and synthesis tools to support the service...

Latest AI Resources # AI Java Open Source Projecct # OCR

8mos ago

03.7K

Deep Live Cam：开源的实时AI换脸工具，一张照片就能实现实时换脸直播

Deep Live Cam: open source real-time AI face-swapping tool, a photo can realize real-time face-swapping live

General Introduction Deep Live Cam is an open source artificial intelligence tool designed to enable real-time face replacement and deep fake video generation from a single photo. The tool utilizes advanced deep learning algorithms to enable real-time face replacement in live streams or video calls, protecting user privacy and adding fun...

Latest AI Resources # AI Java Open Source Projecct # AI video face swap

9mos ago

03.5K

NarratoAI: Text-Generated Movie and TV Narration and Automated Editing Tool

Comprehensive Introduction NarratoAI is a fully automated tool that integrates movie and TV narration, automated editing, dubbing and subtitle generation. It relies on large-scale language modeling (LLM) technology to automatically generate copy and automatically edit videos with corresponding voiceovers and subtitles, providing users with a one-stop...

Latest AI Resources # AI Side Hustle Money Making Programs # AI Java Open Source Projecct # AI text to video

11mos ago

03.3K

Babelfish.ai: Browser-Run Real-Time Speech Transcription and Translation Application

General Introduction Babelfish.ai is a real-time transcription and translation application built on Huggingface Transformer.js and Supabase Realtime. The application can load large models in the browser and...

Latest AI Resources # AI Java Open Source Projecct # AI Speech to Text

11mos ago

03.1K

Vector Vein: Code-Free AI Workflow Building Platform

Comprehensive Introduction Vector Vein is a code-free AI workflow building platform designed to help users easily create intelligent, automated workflows. With no programming foundation required, users can simply connect various functional modules through drag-and-drop operations to build complex AI work...

Latest AI Resources # AI Java Open Source Projecct # Low-code workflow

8mos ago

02.9K

LivePortrait: Animation tool for generating dynamic portraits from still images and videos

General Introduction LivePortrait is an advanced AI dynamic portrait animation tool developed by Racer Technology. It utilizes innovative AI technology to transform still images into vivid video animations. Whether you use real photos, animated styles or artistic portraits, LivePo...

Latest AI Resources # AI Image to Video # AI Java Open Source Projecct # AI Video Conversion Style

9mos ago

03.3K

PhiData: Building AI Intelligence with Memory, Knowledge and Tools

Comprehensive Introduction PhiData is a framework designed for developing intelligent AI assistants. It enables AI assistants to have long conversations, provide accurate business context, and perform various operations through enhanced memory, knowledge integration, and tool invocation capabilities.PhiData not only enhances AI assistant...

Latest AI Resources # AI Java Open Source Projecct # Intelligent Body Application

5mos ago

03.2K

ChatTTS: a speech generation model that mimics the voice of a real person speaking (ChatTTS one-click acceleration package)

General Introduction ChatTTS is a generative speech model designed for conversational scenarios. It generates natural and expressive speech, supports multiple languages and multiple speakers, and is suitable for interactive conversations. The model does this by predicting and controlling fine-grained prosodic features such as laughter, pauses and interjections, sup...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech

6mos ago

03.2K

MoneyPrinterPlus: AI tool for generating short videos with one click, free batch mixing

Comprehensive Introduction MoneyPrinterPlus is an open source project aimed at generating and mixing all kinds of short videos with one click through AI technology, and automatically publishing them to multiple video platforms, such as Jieyin, Shutterbugs, Xiaohongshu, and Video Number. The tool supports local and cloud-based voice models, including chat...

Latest AI Resources # AI Side Hustle Money Making Programs # AI Java Open Source Projecct # AI Video Generation Tool

11mos ago

03.5K

TF-ID: academic paper form/image recognition tool

Comprehensive Introduction TF-ID (Table/Figure IDentifier) is a family of object detection models specialized for extracting tables and images from academic papers. The project was created by Yifei Hu and is open-sourced on GitHub.The TF-ID model was developed by...

Latest AI Resources # AI Java Open Source Projecct

11mos ago

03.3K

Chatbot UI: an open source AI chat app that mimics ChatGPT's interface and functionality

General Introduction Chatbot UI is an open source project designed to help developers create personalized and intelligent conversational interfaces. The project provides a series of interface components and interactive features that can be easily integrated into the existing Chatbot system to provide users with a more fluent and intelligent dialog body...

Latest AI Resources # AI Java Open Source Projecct # AI Localized Chat Application

11mos ago

04.6K

GLIGEN GUI: Precise control of the position of image elements, intuitive graphical interface based on ComfyUI

General Introduction GLIGEN GUI is an intuitive graphical interface based on ComfyUI, designed to simplify the use of the GLIGEN model, a novel text-to-image model that allows precise specification of the position of objects in an image. With GLIGE...

Latest AI Resources # AI Image Generation Aids # AI Java Open Source Projecct

11mos ago

02.8K

Easy Voice Toolkit: AI Voice Toolkit for Local Deployment

Comprehensive Introduction Easy-Voice-Toolkit is a multifunctional toolkit based on the Open Source Speech Project, providing a variety of automated audio tools for speech recognition, speech transcription, speech conversion, dataset creation and model training. Users can selectively use these tools as needed...

Latest AI Resources # AI Java Open Source Projecct # AI text-to-speech # AI voice cloning

11mos ago

03.4K

FaceFusion: Video Face Swap Enhancement Tool | Voice Synchronized Video Mouth Moves

General Introduction FaceFusion is an advanced cloud platform with integrated facial exchange and enhancement features that optimizes the image-to-video and image-to-image exchange process with 5 professional models to ensure flawless output. In addition, it performs facial enhancement with 7 models using 3...

Latest AI Resources # AI Java Open Source Projecct # AI video face swap

6mos ago

05.5K

Kotaemon: simple to deploy open source multimodal document quiz tool

General Introduction Kotaemon is an open source document Q&A tool designed to provide end-users and developers with Q&A functionality based on Retrieval Augmented Generation (RAG). The project is developed by Cinnamon and supports a variety of LLM API providers (e.g. OpenA...

Latest AI Resources # AI Java Open Source Projecct # Knowledge Graph # Knowledge Retrieval with RAG Framework

11mos ago

03.4K

HivisionIDPhotos: open source intelligent AI photo ID creation tool

Comprehensive introduction HivisionIDPhotos is an open source lightweight AI document photo production tool, can intelligently identify the user photo scene and keying, to generate a standard document photo in line with a variety of specifications. The tool supports custom background color and size, the future will also introduce beauty and...

Latest AI Resources # AI Java Open Source Projecct # AI keying to change backgrounds

11mos ago

03.2K

Marker: quickly convert PDF to Markdown open source tools

General Introduction Marker is a deep learning based document processing tool designed to convert PDF files to Markdown format quickly and accurately. It supports a wide range of document types and is especially optimized for conversion of books and scientific papers.Marker is able to remove headers...

Latest AI Resources # AI Java Open Source Projecct # Document Extraction and Cleaning

5mos ago

04.7K

SadTalker: Make Photos Talk | Mouth Synchronized Audio | Synthesized Mouth Synchronized Video | Free Digital People

General Introduction SadTalker is an open source tool that combines a single still portrait photo with an audio file to create realistic talking avatar videos for a variety of scenarios such as personalized messages, educational content, and more. The revolutionary use of 3D modeling technologies such as ExpNet and PoseVA...

Latest AI Resources # AI Java Open Source Projecct # AI Digital Man # Port Synchronization

6mos ago

03.5K

VideoReTalking: Audio-Driven Lip Synchronization and Video Editing System

General Introduction VideoReTalking is an innovative system that allows users to generate lip-synchronized facial videos based on the input audio, producing high-quality and lip-synchronized output videos even with different emotions. The system breaks down this goal into three consecutive tasks: with typical expressions...

Latest AI Resources # AI Java Open Source Projecct # Port Synchronization

8mos ago

03.4K

MuseV+Muse Talk：完整数字人视频生成框架|人像转视频|姿态转视频|唇形同步

MuseV+Muse Talk: Complete Digital Human Video Generation Framework | Portrait to Video | Pose to Video | Lip Synchronization

General Introduction MuseV is a public project on GitHub that aims to enable the generation of avatar videos of unlimited length and high fidelity. It is based on diffusion technology and provides Image2Video, Text2Image2Video, Video2Video...

Latest AI Resources # AI Java Open Source Projecct # AI Digital Man # Port Synchronization

8mos ago

05.6K

Unstructured: open source preprocessing unstructured documents, unstructured data processing tools

Comprehensive Introduction Unstructured-IO provides a set of open source components for processing and pre-processing images and text documents such as PDF, HTML, Word documents, etc. Its main goal is to simplify and optimize the data processing workflow , especially for large language models (LL...

Latest AI Resources # AI Java Open Source Projecct # Document Extraction and Cleaning

11mos ago

03.5K

magic-html：从HTML网址中提取主体数据，输出纯文本/markdown

magic-html: extract body data from HTML URL, output plain text/markdown

General Introduction magic-html is a Python library designed to simplify the process of extracting body region content from HTML. Whether dealing with complex HTML structures or simple web pages, this library aims to provide a convenient and efficient interface for users. It supports multimodal extraction...

Latest AI Resources # AI Java Open Source Projecct

11mos ago

02.8K

WebPilot: Intelligent Web Information Processing Tool, Free API for Web Content Capture

WebPilot General Introduction Webpilot is a free and open source "web assistant" that allows you to communicate freely with any web page or perform automated tasks. You don't need to switch pages or copy and paste, just select text or enter commands, webpilot ...

Latest AI Resources # AI Open Services # AI Java Open Source Projecct # AI search tool

12mos ago

03.8K

DB-GPT: Building AI Native Data Application Development Framework, Integrating Multi-Model Management and Intelligent Data Processing

Comprehensive Introduction DB-GPT is an open source AI native data application development framework built using AWEL (Agentic Workflow Expression Language) and smart body technology. The project aims to build infrastructure in the field of large modeling...

Latest AI Resources # AI Java Open Source Projecct # AI data analysis # Knowledge Retrieval with RAG Framework

5mos ago

03K

DreamTalk: Generate expressive talking videos with a single avatar image!

DreamTalk Comprehensive Introduction DreamTalk is a diffusion model-driven expression talking head generation framework, jointly developed by Tsinghua University, Alibaba Group and Huazhong University of Science and Technology. It mainly consists of three parts: a noise reduction network, a style-aware lip expert and a style predictor, and can be based on...

Latest AI Resources # AI Java Open Source Projecct # AI Digital Man # Port Synchronization

8mos ago

03.3K

InstantID: upload an image and migrate the portrait features to generate different styles of images

Comprehensive Introduction InstantID is an advanced technology focused on generating images with personalized styles or poses in seconds while ensuring a high level of fidelity using a single reference ID picture. The technology employs a diffusion model-based solution by integrating facial images, landmark maps...

Latest AI Resources # AI Image Style Control # AI Java Open Source Projecct # AI Face Swap and Dress Up

12mos ago

02.8K

ComfyUI Portrait Master 中文版：优化肖像生成的提示词工具

ComfyUI Portrait Master Chinese version: Cue word tool to optimize portrait generation

General Introduction ComfyUI Portrait Master Chinese version is a portrait cue word generation tool designed for AI image creators. The tool helps users generate high-quality portraits by optimizing the cue words. Users can choose different lenses according to the demand...

Latest AI Resources # AI Image Generation Aids # AI Java Open Source Projecct # ComfyUI

12mos ago

03.8K

IOPaint: All-around AI image processing tool, erasing, expanding, replacing elements and drawing text.

General Introduction IOPaint is a free and open source AI image processing tool that supports image erasing, repairing and expanding. It uses state-of-the-art AI models to help users easily remove unwanted objects from an image, repair blemishes, add new content, and even expand an image.IOPa...

Latest AI Resources # AI Image Enlargement and Restoration # AI Java Open Source Projecct # AI keying to change backgrounds

10mos ago

015.3K