Latest AI Resources

Total: 2832 articles
Dzine: Controllable AI image generation and canvas design tools, offering hundreds of image styles

General Introduction Dzine (formerly Stylar) is an all-in-one AI design platform that offers an integrated workflow from image generation to editing, with control over image composition and style. Its predefined styles make it easy for users of all skill levels to customize designs without complex...
1 yr ago
MMAudio: Generating synchronized sound effects and soundtracks for video, a video-to-audio tool built on multimodal joint training

General Introduction MMAudio is an open-source project aiming to generate high-quality synchronized audio through joint multimodal training. Developed by Ho Kei Cheng et al. at the University of Illinois Urbana-Champaign and Sony AI, the project's main function is to generate synchronized audio based on video and/or text input. MM...
12 mos ago
AI2SRT: Create short narrated videos or video summaries from long videos in one click using Gemini models

Comprehensive Introduction AI2SRT is an open source project that uses Gemini large models to generate short narrated videos and video summaries from long videos in one click, while also supporting subtitle transcription for audio and video. The project aims to simplify the video content creation process and provide efficient subtitle generation and translation functions. Users can pass...
11 mos ago
opensource_notebooklm: Open source implementation of NotebookLM based on Deepseek-V3 and PlayHT TTS

General Introduction Open Source NotebookLM is an innovative artificial intelligence project that combines Deepseek-V3's language understanding capabilities with PlayHT's speech synthesis technology, aiming to create an intelligent note-taking conversation system. The project was developed by Build Fast w...
11 mos ago
MegaParse: Parses all types of documents into LLM-ready data, fully preserving tables, images, and other information in the document

Comprehensive Introduction MegaParse is a powerful and versatile document parsing tool designed to optimize data processing for large language models (LLMs). Whether you are working with text, PDF, PowerPoint presentations or Word documents, MegaParse...
1 yr ago
Artflow: Create character-consistent animated stories and virtual digital human presenter videos

General Description Artflow is an online platform that enables users to upload photos, train exclusive AI characters, and create character-consistent videos and animated stories. The first training is free, and users can customize their identity to create unique images and videos for a variety of scenarios. Monthly ...
1 yr ago
Easegen: Open source digital human course production platform that generates cloned digital human lecture videos from PPT in one click

Comprehensive Introduction Easegen is an open source digital human course creation platform that aims to improve the efficiency of teaching content production and management through AI technology. The platform provides a one-stop solution covering course production, video management, and intelligent questioning, and allows users to create digital human-explained video courses...
1 yr ago
触手AI (Tentacle AI): Simple, easy-to-use AI drawing tool that supports training your own image styles

Comprehensive Introduction Tentacle AI (触手AI) is a professional AI creation platform under Jellyfish Intelligence, providing AI painting, online drawing, a large model library, and other functions. The platform supports simple and professional modes and is easy to use, provides a variety of drawing styles and design models, rich plug-in options, and allows users to experience AIGC creation capabilities online...
1 yr ago
ExtractThinker: Extracts and classifies documents into structured data to streamline document processing

Comprehensive Introduction ExtractThinker is a flexible document intelligence tool that utilizes Large Language Models (LLMs) to extract and classify structured data from documents, providing a seamless ORM-like document processing workflow. It supports a variety of document loaders, including Tess...
11 mos ago
鬼手剪辑 (Ghost Hand Clips): Video deduplication | short drama commentary | video translation | subtitle removal

Comprehensive Introduction Ghost Hand Clips provides efficient video translation and subtitle removal tools for video creators, merchants and MCN agencies. Using AI technology, Ghost Hand Clips offers intelligent translation of video content, subtitle removal and video personalization, helping users break through the language barrier and easily play...
1 yr ago
Arcade: Record on-screen operations to quickly generate interactive product demo videos

General Description Arcade is an easy-to-use online platform that helps users quickly create interactive demos. It is suitable for marketers, product managers and sales teams to demonstrate product features. By recording on-screen actions, Arcade automatically generates interactive demo content that users can use in just a few minutes...
9 mos ago
Vizard: Automatically edits long videos into viral short videos suited to social media promotion

General Introduction Vizard, from Blue Pulse, is an online tool that utilizes artificial intelligence technology to help users quickly turn long videos into short social media clips. Designed for content creators, marketers, and educators, it automatically recognizes the best moments in a video and generates short clips suitable for...
9 mos ago
CogVLM2: Open source multimodal model with support for video understanding and multi-turn dialogue

Comprehensive Introduction CogVLM2 is an open source multimodal model developed by the Tsinghua University Data Mining Research Group (THUDM), based on the Llama3-8B architecture, and designed to provide performance comparable to or even better than GPT-4V. The model supports image understanding, multi-turn dialogue, and visual ...
10 mos ago
阿里妈妈创意中心 (Alimama Creative Center): Intelligent marketing creative support platform in the Taobao ecosystem

Comprehensive Introduction Alimama Creative Center is Alibaba's intelligent marketing creative support platform, designed to provide merchants on Taobao, Tmall, and other e-commerce platforms with a full range of creative support from graphics to videos to landing pages. By combining AI copywriting capabilities and a large template library, Creative Center dramatically improves the design efficiency...
1 yr ago
XRAG: A visual evaluation tool for optimizing retrieval-augmented generation systems

Comprehensive Introduction XRAG (eXamining the Core) is a benchmarking framework designed for evaluating the underlying components of advanced retrieval-augmented generation (RAG) systems. By profiling and analyzing each core module, XRAG provides information on how different configurations and components affect RAG...
10 mos ago
Hibiki: Real-time speech translation model with streaming translation that preserves the characteristics of the original voice

General Introduction Hibiki is a high-fidelity real-time speech translation model developed by Kyutai Labs. Unlike traditional offline translation, Hibiki is able to generate natural speech translation in the target language and provide text translation in real time while the user is speaking. The model...
10 mos ago
Cardog: Vehicle information research and intelligent analysis of automotive market data

Comprehensive Introduction Cardog is a vehicle research and management platform that combines artificial intelligence technology, aiming to provide users with convenient vehicle-related information query and management services. Users can utilize its AI interface to research vehicle performance, obtain market analysis, view documentation, and even manage personal vehicle information...
9 mos ago
Agent TARS: An open source agent that operates computers using vision and commands

Comprehensive Introduction Agent TARS is a multimodal AI agent open-sourced by ByteDance. Its core feature is to visually understand web content and combine command line and file system operations to help users complete complex computer tasks. Instead of requiring manual operations like traditional tools, it can self...
8 mos ago
EdutorAI: AI-generated test papers and interactive quizzes, with randomized questions generated from scanned book pages

General Introduction EdutorAI is a website that uses artificial intelligence technology to provide solutions for education. It provides teachers, students and parents with a variety of tools, including a question generator, a flashcard maker, a quiz creator, and more, designed to improve learning efficiency and effectiveness. Users can upload text...
1 yr ago
魔音工坊 (Magic Voice Workshop): Professional voice-over and short video narration platform | human voice-over | voice cloning | one-click video creation

Comprehensive Introduction Magic Voice Workshop is a one-stop short video and AI dubbing platform offering software dubbing, human voice-over, sound libraries, voice cloning services and more. The platform integrates audio editing, AI copy generation, video editing and collaboration tools for audio-related services and content creation. Users experience the audio editor...
1 yr ago
MoneyPrinterTurbo: Generate video copy and short HD videos in one click from a video theme

Comprehensive Introduction MoneyPrinterTurbo is an open source project that uses large AI models to generate short HD videos in one click. Users only need to provide a video theme or keywords, and the system automatically generates video copy, video clips, video subtitles and...
9 mos ago
Sana Labs: AI tools for enterprise knowledge management and employee training

General Introduction Sana Labs is a company dedicated to improving the efficiency of knowledge acquisition and learning in organizations through AI technology. Headquartered in Stockholm, Sweden, Sana offers a range of products including a Learning Management System (LMS), a Learning Experience Platform (LXP), an AI assistant, and more...
11 mos ago
Visprex: Quickly visualize CSV files and automatically generate analytical charts, with data processed entirely in the browser

General Introduction Visprex is a lightweight data visualization tool designed to help users analyze and present data quickly and intuitively. The tool runs entirely in the browser, ensuring data privacy and security, and does not send data to any backend servers. Visprex supports a wide range of...
1 yr ago
XAudioPro: Professional online audio editing tool | audiobook production | text to speech | accompaniment separation

General Introduction XAudioPro is an advanced online audio real-time editing and transcoding tool that is both professional and portable. It supports professional audio editing functions such as cutting, cropping, copying, deleting, restoring, and amplitude gain control. It also provides denoising services such as spectral subtraction noise reduction, low-pass...
1 yr ago
Image AI: Integrates multiple AI image editing tools, with free video face swapping and an easy start

Comprehensive Introduction Image AI is an all-in-one AI image platform that offers a wide range of image tools to help users achieve high-quality visual effects. Whether it's face swap, image recognition, text-to-image generation, or image de-contextualization, Image AI can meet...
1 yr ago
AI投资系统 (AI investment system): Automated A-share investment decision system that uses a multi-agent system to analyze market data

Comprehensive Introduction A_Share_investment_Agent is an A-share investment decision aid based on a multi-agent system. The system is designed to analyze market data, calculate the intrinsic value of stocks, and analyze market sentiment and fundamental data through multiple collaborating agents to...
10 mos ago