Latest AI Resources

Total 3094 articles posts
反谱 - AI音乐转谱平台,支持音频文件转五线谱和简谱

AntiScore - AI music transcription platform, supports audio files to pentatonic and simple music.

AntiSpectrum is an innovative online AI music conversion platform, based on advanced AI technology, to convert audio files (such as MP3, FLAC, etc.) into pentatonic and simple scores. AntiSpectrum has a vocal separation function, which separates the vocals from the accompaniment in the music, making it easy for music production and mixing. AntiSpectrum supports converting MIDI files...
11mos ago
071.9K
Perplexica:1比1复刻 Perplexity AI 功能和界面的开源AI搜索引擎

Perplexica: an open source AI search engine that replicates Perplexity AI's features and interface 1 to 1

Comprehensive Introduction Perplexica is an open source AI-driven search engine designed to provide answers that delve deep into the Internet. It uses advanced machine learning algorithms, such as similarity search and embedding techniques, to optimize search results and provide clear answers with cited sources.Perple...
1yrs ago
071.9K
飞桨 PP-TableMagic:复杂表格结构化信息提取神器

Flying Paddle PP-TableMagic: Structured Information Extraction for Complex Tables

The goal of table recognition is to parse tables in images, accurately identify table structures and cell locations, and reduce them to structured table formats (e.g., HTML). In today's information age, a large amount of important tabular data still exists in an unstructured state (e.g., scanned documents with pictures of statistical tables...).
1yrs ago
071.9K
堆友:AI设计工具箱与创意平台

Heap Friend: AI Design Toolkit and Creative Platform

Comprehensive Introduction PileYou is an online platform built by Alibaba's design team that integrates a variety of AI design tools, designed for designers and creative workers. The platform provides AI generation tools from text to images, including vertical industry design tools, PileYou Camera, Deer Class Marketing Chart, AI Art Characters, Model Change...
2yrs ago
071.5K
Hibiki:实时语音翻译模型,保留原声特点的流式翻译

Hibiki: a real-time speech translation model, streaming translation that preserves the characteristics of the original voice

General Introduction Hibiki is a high-fidelity real-time speech translation model developed by Kyutai Labs. Unlike traditional offline translation, Hibiki is able to generate natural speech translation in the target language and provide text translation in real time while the user is speaking. The model...
1yrs ago
071.3K
Sana:快速生成高分辨率图像,0.6B超小尺寸模型,低配笔记本GPU运行

Sana: fast generation of high-resolution images, 0.6B ultra-small size model, low-profile laptop GPU operation

General Introduction Sana is an efficient high-resolution image generation framework developed by NVIDIA Labs, capable of generating images up to 4096 × 4096 resolution in a matter of seconds.Sana utilizes a linear diffusion transformer and deep compression self-encoder technology to significantly...
1yrs ago
071.2K
NoCode – 美团推出的零代码AI开发平台

NoCode - Zero-Code AI Development Platform Launched by Meituan

What is NoCode NoCode is a zero-code AI development platform launched by Mission. Users don't need any programming experience, they just need to describe the requirements through natural language to quickly generate website pages, utilities, small games, event pages and other applications.NoCode supports one second generation of 200...
11mos ago
071.2K
Edraw.AI(亿图):在线协作白板工具,AI生成流程图和多种图表

Edraw.AI: Online collaborative whiteboard tool, AI-generated flowcharts and multiple diagrams

Comprehensive Introduction Edraw.AI is a revolutionary AI-powered online visualization whiteboard collaboration platform that integrates more than 40 intelligent tools and a library of carefully designed templates. The platform uses advanced AI technology to quickly transform users' textual thoughts into professional visual diagrams. The platform supports...
1yrs ago
071.1K
GeekAI:自部署商业化多功能AI助手,完整接入多模型API运营后台

GeekAI: Self-deployed commercialized multi-functional AI assistant with complete access to multi-model API operation backend

Comprehensive introduction GeekAI is a full set of open source solutions for AI assistants based on AI big language model API implementation. The project comes with an operations management backend , out of the box , integrated with ChatGPT, Azure, ChatGLM, Xunfei Starfire, Wenxin Yiyin and many other p...
2yrs ago
071.1K
Flair:AI生成专业摄影效果的商品展示图,产品商拍专用工具

Flair: AI generates professional photographic effect of the product display map, product commercial photography special tools

Comprehensive Introduction Flair is an AI-based online design tool focused on generating high-quality photographic images for e-commerce products. Users can quickly create realistic product scene images through drag-and-drop operations, which greatly improves design efficiency. The platform provides a wealth of templates and 3D elements to support real...
1yrs ago
071.1K
通义听悟:阿里通义音视频内容转录AI助手

Tongyi Listening and Understanding: Ali Tongyi Audio and Video Content Transcription AI Assistant

Comprehensive Introduction Tongyi Listening and Understanding is a work-study AI assistant launched by Aliyun, focusing on transcribing and analyzing audio and video content. It relies on AliCloud's powerful AI models to transcribe audio and video content into text in real time, and provides translation, summarization, positioning and other functions. Tongyi Listening Woo supports multiple languages and scenarios...
2yrs ago
071.1K
NodeRAG:基于异构图的精准信息检索与生成工具

NodeRAG: A Heterogeneous Graph-Based Tool for Accurate Information Retrieval and Generation

A Comprehensive Introduction NodeRAG is an open source Retrieval Augmented Generation (RAG) system hosted on GitHub and developed by Terry-Xu-666. It optimizes information retrieval and generation through heterogeneous graph structures, significantly improving retrieval accuracy and contextual relevance.Nod...
1yrs ago
071K
魔音工坊:专业配音与短视频解说创作平台|真人配音|克隆声音|一键成片

Magic Voice Workshop: professional voice-over and short video narration creation platform | real person voice-over | clone voice | one-click into a film

Comprehensive Introduction Magic Voice Workshop is a one-stop short video and AI dubbing platform with information on software dubbing, real-life dubbing, sound libraries, cloning services and more. The platform integrates audio editing, AI copy generation, video editing and collaboration tools for audio-related services and content creation. Users experience the audio editor...
2yrs ago
071K
Ant Design X:快速构建AI聊天界面的工具包,支持模型集成和数据流管理。

Ant Design X: A toolkit for rapidly building AI chat interfaces with support for model integration and data flow management.

Comprehensive Introduction Ant Design X is a toolkit open-sourced by Ant Group, designed to help developers quickly build AI-driven dialog interfaces. It provides a rich set of components and templates, supports model integration compatible with OpenAI standards, and is suitable for a variety of applications such as intelligent customer service, AI assistants, and other...
1yrs ago
070.9K
Flow(Laminar):构建智能体的轻量级任务引擎,简化并灵活管理任务

Flow (Laminar): a lightweight task engine for building intelligences that simplifies and flexibly manages tasks

Comprehensive Introduction Flow is a lightweight task engine designed for building AI agents, emphasizing simplicity and flexibility. Unlike traditional node- and edge-based workflows, Flow uses a dynamic task queuing system that supports parallel execution, dynamic scheduling, and intelligent dependency management. Its core concept is ...
1yrs ago
070.9K
AI2SRT:利用 Gemini模型,一键为长视频创建解说短视频或视频总结

AI2SRT: Create short narrated videos or video summaries for long videos with one click using Gemini models

Comprehensive Introduction AI2SRT is an open source project that utilizes the GeminiAI Big Model to generate short narrated videos and video summaries for long videos with one click, while supporting audio and video transcription subtitles. The project aims to simplify the video content creation process and provide efficient subtitle generation and translation functions. Users can pass...
1yrs ago
070.9K
Infinity:生成高分辨率图像的比特自回归建模,实现无限制高分辨率图像生成

Infinity: bitwise autoregressive modeling for generating high-resolution images for unlimited high-resolution image generation

General Introduction Infinity is a groundbreaking high-resolution image generation framework developed by the FoundationVision team. The project breaks through the limitations of traditional image generation models through an innovative bit-level visual autoregressive modeling approach.The core features of Infinity...
1yrs ago
070.8K
Tough Tongue AI:与AI对话练习面试与职场沟通技巧

Tough Tongue AI: Practice Interview and Workplace Communication Skills by Talking to an AI

General Introduction Tough Tongue AI is an artificial intelligence platform designed for practicing tough conversations. Users can simulate a variety of complex conversational situations, such as job interviews, salary negotiations, sales presentations, etc. by selecting preset scenarios or creating custom scenarios. The platform provides video and...
1yrs ago
070.7K
Dzine:可控的AI图像生成功能与画布设计工具,提供数百种图像风格样式

Dzine: Controllable AI image generation capabilities and canvas design tools, offering hundreds of image styles and styles

General Introduction Dzine (formerly Stylar) is an all-in-one AI design platform that offers an integrated workflow from image generation to editing, unrivaled image composition and style control. Its predefined styles make it easy for users of all skill levels to customize designs without complex...
2yrs ago
070.6K
TxAgent:帮医生分析药物作用和治疗方案的AI工具

TxAgent: the AI tool that helps doctors analyze drug effects and treatment options

Comprehensive Introduction TxAgent is an open-source AI tool developed by Harvard University's Medical and Scientific Artificial Intelligence Team (MIMS) to help physicians analyze drug interactions and develop personalized treatment plans. It combines patient-specific situations through multi-step reasoning and real-time retrieval of biomedical knowledge...
1yrs ago
070.6K
逗哥配音:专注短视频解说、创作的智能配音神器

Teaser Dubbing: Intelligent dubbing tool that focuses on short video narration and creation

Comprehensive Introduction Tease Dubbing is a popular AI dubbing software with over 5 million users. The software utilizes advanced AI intelligent dubbing technology to provide professional and realistic dubbing effects, which is applicable to a variety of scenarios such as short videos, advertisement production, education and training. Teaser Dubbing is committed to providing users with fast...
2yrs ago
070.5K
MMAudio:为视频画面生成同步音效与配乐,视频到音频的多模态联合训练工具

MMAudio: generating synchronized sound effects and soundtracks for video footage, video-to-audio multimodal co-training tool

General Introduction MMAudio is an open-source project aiming to generate high-quality synchronized audio through joint multimodal training. Developed by Ho Kei Cheng et al. at the Chinese University of Hong Kong, the project's main function is to generate synchronized audio based on video and/or text input.MM...
1yrs ago
070.5K
阿里妈妈创意中心:淘宝生态下的智能化营销创意支持平台

AliMama Creative Center: Intelligent Marketing Creative Support Platform under Taobao Ecology

Comprehensive Introduction Alimama Creative Center is Alibaba's intelligent marketing creative support platform, designed to provide merchants on Taobao, Tmall, and other e-commerce platforms with a full range of creative support from graphics to videos to landing pages. By combining AI intelligent copywriting capabilities and massive templates, Creative Center dramatically improves the design efficiency...
2yrs ago
070.5K