Latest AI Resources

Total 2758 articles posts
Solvely:解决数学(拍照解题)、科学及文科难题的AI学习助手

Solvely: an AI learning assistant for solving math (photo solving), science and liberal arts puzzles

General Introduction Solvely is an AI-based study aid website focused on helping students solve math, science and liberal arts problems. It provides detailed step-by-step explanations by taking a picture and uploading the problem or typing it in directly, covering a wide range of topics from elementary school to college and even graduate school level. No ...
8mos ago
028K
RMBG-2-Studio:批量移除图像和视频背景的开源程序,基于RMBG 2.0优化

RMBG-2-Studio: open source program for batch removal of image and video backgrounds, optimized for RMBG 2.0

General Introduction RMBG-2-Studio is an enhanced background removal and replacement application developed based on the BRIA-RMBG-2.0 model. The application is designed to provide users with efficient and accurate image background processing capabilities for a variety of image types, including e-commerce, gaming and...
10mos ago
028K
Step-Audio:多模态语音交互框架,识别语音并使用克隆语音交流等功能

Step-Audio: a multimodal voice interaction framework that recognizes speech and communicates using cloned speech, among other features

Comprehensive Introduction Step-Audio is an open source intelligent speech interaction framework designed to provide out-of-the-box speech understanding and generation capabilities for production environments. The framework supports multi-language dialog (e.g., Chinese, English, Japanese), emotional speech (e.g., happy, sad), regional dialects (e.g., Cantonese, Szechuan ...
8mos ago
028K
Notta:AI会议记录与音频转录工具,自动转录会议、采访或录音

Notta: AI meeting recording and audio transcription tool to automatically transcribe meetings, interviews or recordings

General Description Notta is a powerful AI meeting recording and audio transcription tool designed to help users automatically convert meetings, interviews or audio recordings into searchable text. With Notta, users can easily transcribe, edit, summarize and collaborate to boost productivity.Notta supports...
9mos ago
027.9K
StreamingT2V:从文本到长视频的动态且可扩展的生成技术

StreamingT2V: A Dynamic and Scalable Generation Technique from Text to Long Video

Comprehensive Introduction StreamingT2V is a public project developed by the Picsart AI research team focused on generating coherent, dynamic and scalable long videos based on textual descriptions. This technology uses an advanced autoregressive approach that guarantees temporal consistency of the video with the description text tightly...
11mos ago
027.9K
ChatTTS:模仿真人说话声音的语音生成模型(ChatTTS一键加速包)

ChatTTS: a speech generation model that mimics the voice of a real person speaking (ChatTTS one-click acceleration package)

General Introduction ChatTTS is a generative speech model designed for conversational scenarios. It generates natural and expressive speech, supports multiple languages and multiple speakers, and is suitable for interactive conversations. The model does this by predicting and controlling fine-grained prosodic features such as laughter, pauses and interjections, sup...
8mos ago
027.9K
腾讯智影:智能视频创作工具|AI数字人、动漫生成套件

Tencent Smart Shadow: Intelligent Video Creation Tool | AI Digital Man, Anime Generation Kit

Comprehensive Introduction Tencent Smart Shadow is an online intelligent video creation platform launched by Tencent, which can support text dubbing, digital human broadcasting, automatic subtitle recognition and other functions through powerful AI tools provided by cloud services.It integrates material search, video editing, rendering export and publishing, bringing users a convenient visual...
1yrs ago
027.9K
Qwen-Agent:基于Qwen的智能代理应用框架,包括工具调用、代码解释器、RAG和Chrome扩展。

Qwen-Agent: Qwen-based framework for intelligent agent applications, including tool calls, code interpreters, RAGs and Chrome extensions.

Comprehensive Introduction Qwen-Agent is an intelligent agent application framework developed based on Qwen 2.0 and above, with capabilities such as command following, tool usage, planning and memorization. The framework provides a variety of sample applications such as browser assistants, code interpreters and custom assistants...
10mos ago
027.9K
Memary:利用知识图谱增强Agent长期记忆的开源项目

Memary: an open-source project to enhance Agent long-term memory using knowledge graphs

General Introduction Memary is an innovative open source project focused on providing long-term memory management solutions for autonomous intelligences. The project helps intelligences break through the limitations of traditional context windows to achieve smarter interaction experiences through knowledge graphs and specialized memory modules.Memary adopts...
10mos ago
027.9K
Excel AI:AI智能函数插件,实现数据提取、批量转换、公式生成、数据分析

Excel AI: AI intelligent function plug-ins, to achieve data extraction, batch conversion, formula generation, data analysis

Comprehensive introduction Excel AI is an Excel plug-in based on artificial intelligence technology , the unique AI function can be automatically populated according to the user description of all kinds of functions . Designed to enhance the efficiency of data processing through intelligent functions and automation tools. Users can use the plug-in to achieve data extraction, transfer...
10mos ago
027.9K
Haiper:AI视频创作工具|文本转视频|图像转视频|视频风格转换|延长视频

Haiper: AI Video Creation Tool|Text to Video|Image to Video|Video Style Converter|Extended Video

Comprehensive Introduction Haiper is an advanced AI video authoring tool dedicated to supporting content creation through perceptual base modeling. Users can use the tool for free to generate high-quality video content from text descriptions or images.Haiper is not only easy to operate, but also has a stable output...
1yrs ago
027.9K
Transkriptor:将音频和视频转为文字的AI智能转录工具

Transkriptor: the AI-smart transcription tool that turns audio and video into text

General Introduction Transkriptor is an AI-driven transcription tool that focuses on quickly converting audio and video to text. It supports over 100 languages with an accuracy rate of up to 99% and is suitable for a wide range of scenarios such as meetings, interviews, classroom notes, and more. Users can upload files, direct...
6mos ago
027.8K
问小白:提供工作和生活帮助的全能AI助手,集成满血DeepSeek-R1

Ask White: an all-around AI assistant that provides work and life help with integrated full-blooded DeepSeek-R1

Comprehensive Introduction AskSeek is an AI intelligent assistant (including web-side and APP-side) developed by Yuanshi Technology, based on the self-developed Yuanshi Big Model, currently integrating the latest DeepSeek-R1 model, aiming to simplify the user's through quick Q&A, intelligent search, text creation, and other...
5mos ago
027.8K
Llasa 1~8B:高品质语音生成和克隆的开源文本转语音模型

Llasa 1~8B: an open source text-to-speech model for high quality speech generation and cloning

General Introduction Llasa-3B is an open source text-to-speech (TTS) model developed by the Audio Lab of the Hong Kong University of Science and Technology (HKUST Audio). The model is based on the Llama 3.2B architecture, which has been carefully tuned to provide high-quality speech generation that not only supports multiple...
8mos ago
027.8K
SciSpace:一站式学术研究与论文写作平台,为学生和研究人员提供一体化 AI 工具

SciSpace: A one-stop academic research and paper writing platform with integrated AI tools for students and researchers

General Introduction SciSpace (formerly Typeset.io) is an AI-powered platform designed for academic research and writing. It provides a wealth of tools and resources to help researchers and students find, understand and write about literature more efficiently. The platform integrates literature management, automatic gr...
11mos ago
027.7K
AI reads books:AI逐页阅读PDF书籍,自动提取知识要点并生成总结

AI reads books: AI reads PDF books page by page, automatically extracts the main points of knowledge and generates summaries.

Comprehensive Introduction AI-reads-books-page-by-page is a Python-based development of intelligent PDF book analysis tool, which can automate the page-by-page analysis of PDF books, extract the key knowledge points, and after the specified page interval to generate stage...
10mos ago
027.7K
AutoAgent:通过自然语言快速创建并部署AI智能体的框架

AutoAgent: a framework for rapid creation and deployment of AI intelligences through natural language

General Introduction AutoAgent is an open source AI intelligences framework developed by the Data Intelligence Laboratory of the University of Hong Kong (HKUDS) and hosted on GitHub.It allows users to rapidly create and deploy customized AI intelligences by describing their requirements in purely natural language, without any programming base...
4mos ago
027.7K
AsrTools:语音转字幕工具,内置剪映、快手、必剪接口的轻量客户端

AsrTools: speech-to-subtitle tool, lightweight client with built-in interfaces to Cutscene, Racer, and Must-Cut

Comprehensive Introduction AsrTools is an intelligent speech-to-text tool with built-in interfaces from big players such as Cutscene, Racer, Must Cut, etc. It does not require GPU or cumbersome configuration, and supports efficient multi-threaded batch processing. It is based on PyQt5 development, beautiful and user-friendly interface, able to output SRT and TXT format words...
1yrs ago
027.7K
Apify:全栈网页抓取与数据提取平台,自动化数据收集,构建自定义爬虫,集成多种API

Apify: full-stack web crawling and data extraction platform, automate data collection, build custom crawlers, integrate multiple APIs

General Introduction Apify is a full-stack web crawling and data extraction platform that provides a variety of tools and services to help users automate data extraction from any website. Users can use off-the-shelf crawling tools or build and distribute their own data extraction tools.Apify supports multiple programming languages and frameworks...
11mos ago
027.7K
ChatFree(ChatAnywhere-2):使用GPT API创建的本地Copilot,支持任意窗口中补全对话

ChatFree (ChatAnywhere-2): Native Copilot created using the GPT API to support complementary conversations in any window.

General Introduction ChatFree is an open source project that aims to free users' AI apps from the constraints of browsers to run locally. Created using GPT API, Copilot is designed to support a wide range of office software such as Office, Word, WPS, and more. The project was developed by ...
10mos ago
027.7K
Media.io:多功能在线媒体处理工具,在线视频、音频、图像编辑器

Media.io: Multi-functional online media processing tool, online video, audio, image editor

General Introduction Media.io is a powerful online AI video editing and media file processing platform. It helps users to enhance, convert, compress, etc. videos, audios and pictures. In addition to the basic editing functions, there are also features like video cartoonization, AI song cover generation, audio desc...
6mos ago
027.6K
JanitorAI:角色扮演与互动故事AI

JanitorAI: Role Playing and Interactive Storytelling AI

General Introduction JanitorAI focuses on providing an innovative online interactive story creation platform that utilizes advanced chatbot technology to help users build and share their stories. With its simple and intuitive interface, it is suitable not only for professional writers, but also for any regular user who loves creating and storytelling...
4mos ago
027.6K
算了么:共享你电脑闲置 GPU 显卡算力赚钱,支持科学研究

Forget it: Share your computer's unused GPUs and graphics cards to earn money and support scientific research!

Comprehensive Introduction Nevermind is a platform that utilizes the arithmetic power of idle graphics cards to perform scientific calculations and earn revenue. Users can share their computer's idle GPU resources to support scientific research and technological progress, while earning a certain financial return. The platform aims to promote scientific progress and solve important scientific research problems...
10mos ago
027.5K
AICamp:适合团队使用的大模型集成聊天平台,接入自有API或免费使用GPT-4o-mini

AICamp: an integrated chat platform for teams with large models, access to its own API or free use of GPT-4o-mini

Comprehensive Introduction AICamp is a comprehensive AI platform designed to simplify the use of various AI tools and models. It provides a shared workspace for teams, facilitating team members to collaborate and improve productivity.AICamp offers a wide range of advanced AI features to help organizations bring...
10mos ago
027.4K
AutoGen:微软开发的多智能体对话框架

AutoGen: A Multi-Intelligent Body Dialog Framework Developed by Microsoft

Comprehensive Introduction AutoGen is an open source framework developed by a team of Microsoft researchers focused on simplifying the building of large language model (LLM) applications through multi-intelligent body conversations. It allows developers to create AI agents that can talk to each other and collaborate to solve tasks. This approach not only improves the performance of LLM...
9mos ago
027.4K
RD-Agent:自动化数据驱动研发工具,通过AI技术推动以数据为导向的研发过程

RD-Agent: an automated data-driven R&D tool to drive data-driven R&D processes through AI technology

Comprehensive Introduction RD-Agent is an open source tool from Microsoft designed to automate and optimize the research and development (R&D) process. The tool focuses on data-driven scenarios to improve the efficiency of model and data development through artificial intelligence techniques.RD-Agent integrates research...
7mos ago
027.4K
秘塔AI搜索:提供无广告的高效学术搜索服务,研究模式深度挖掘知识

Secreta AI Search: Providing ad-free and efficient academic search services, research model for deep knowledge mining

General Introduction Secreta AI Search is a technology company dedicated to improving productivity through artificial intelligence technology. The site provides ad-free and efficient academic search services, aiming to provide users with accurate and fast search results. Secret Tower AI Search has a self-developed large language model, MetaLLM, which can...
9mos ago
027.3K
Refly:基于自由画布上流程编排的AI写作平台,自动化生成文章

Refly: an AI writing platform based on process orchestration on a free canvas for automated article generation

Comprehensive Introduction Refly is a free canvas-based AI native authoring engine designed to help users turn ideas into high-quality content through multi-threaded conversations, knowledge base integration, contextual memory and intelligent search technology. The platform covers over 20 professional scenario templates, including learning...
8mos ago
027.3K
Deep Live Cam:开源的实时AI换脸工具,一张照片就能实现实时换脸直播

Deep Live Cam: open source real-time AI face-swapping tool, a photo can realize real-time face-swapping live

General Introduction Deep Live Cam is an open source artificial intelligence tool designed to enable real-time face replacement and deep fake video generation from a single photo. The tool utilizes advanced deep learning algorithms to enable real-time face replacement in live streams or video calls, protecting user privacy and adding fun...
11mos ago
027.3K
DomoAI:智能视频艺术风格转换|图像转视频|文本转视频

DomoAI: Intelligent Video Art Style Conversion|Image to Video|Text to Video

General Description DomoAI has recently launched its Video to Video feature, which converts existing videos into a completely different art style with amazing results. It allows users to easily create unique styles of visual art. Other features included in the platform can convert still images to motion video, text to picture...
1yrs ago
027.3K
MMAudio:为视频画面生成同步音效与配乐,视频到音频的多模态联合训练工具

MMAudio: generating synchronized sound effects and soundtracks for video footage, video-to-audio multimodal co-training tool

General Introduction MMAudio is an open-source project aiming to generate high-quality synchronized audio through joint multimodal training. Developed by Ho Kei Cheng et al. at the Chinese University of Hong Kong, the project's main function is to generate synchronized audio based on video and/or text input.MM...
10mos ago
027.3K
DeOldify:使用AI技术为黑白照片和视频上色的经典开源工具

DeOldify: the classic open-source tool for colorizing black-and-white photos and videos using AI technology

Comprehensive Introduction DeOldify is an open source project based on deep learning technology, specifically designed for intelligent colorization and restoration of black and white photos and videos. The project uses an innovative NoGAN training method to successfully solve the common defects of traditional GAN networks in the image coloring process...
10mos ago
027.2K
Cerebras:目前全球最快的AI推理、高性能计算平台

Cerebras: the world's fastest AI inference, high-performance computing platform available today

General Introduction Cerebras is a company dedicated to advancing the field of Artificial Intelligence and High Performance Computing. Its core products include the world's fastest AI inference platform and high-performance computing gas pedal.The Cerebras platform is capable of training a wide range of models, from multilingual macromodels to medical chatbots...
1yrs ago
027.2K
NeoAI:让AI接管电脑远程操作,使用自然语言控制电脑的开源项目

NeoAI: Open source project that lets AI take over remote operation of computers and control them using natural language

General Introduction NeoAI is an innovative open source AI assistant tool that allows users to easily control and manage their computers through natural language conversations. Without writing any code, users can simply use everyday conversations to find files, automate tasks, manage devices, etc.NeoAI...
10mos ago
027.2K
tldraw:开源无限画布白板SDK,AI生成简约线框图和UML图

tldraw: open source unlimited canvas whiteboard SDK, AI to generate minimalist wireframe diagrams and UML diagrams

General Description tldraw is a free and instant collaborative drawing tool that provides an unlimited canvas where users can quickly draw graphics, write text and collaborate instantly. Featuring an intuitive interface and excellent performance, it is suitable for team collaboration and remote work. Supported through the open source community, tldr...
11mos ago
027.1K
百聆 (Bailing):低延时的开源语音对话助手,轻松实现自然对话交流

Bailing: a low-latency open source voice dialog assistant that easily realizes natural conversational exchanges

Comprehensive Introduction Bailing (Bailing) is an open source voice conversation assistant designed to engage in natural conversations with users through speech. The project combines speech recognition (ASR), voice activity detection (VAD), large language modeling (LLM) and speech synthesis (TTS) technologies to achieve...
9mos ago
027.1K
Zion(Momen):无代码开发平台,快速搭建个性化AI应用/SaaS应用,支持多端发布绑定自己的域名

Zion (Momen): no-code development platform to quickly build personalized AI apps/SaaS apps with support for multi-site publishing binding your own domain name

Comprehensive introduction Zion is a powerful no-code development platform, users do not need to write code to quickly build websites, WeChat small programs and other applications. The platform provides full visualization of the operation, from application development, deployment on-line to the growth of operations and maintenance, greatly reducing the threshold of development.Zion extensive coverage of business scenarios...
11mos ago
027.1K
SadTalker:让照片说话|嘴型同步音频|合成口型同步视频|免费数字人

SadTalker: Make Photos Talk | Mouth Synchronized Audio | Synthesized Mouth Synchronized Video | Free Digital People

General Introduction SadTalker is an open source tool that combines a single still portrait photo with an audio file to create realistic talking avatar videos for a variety of scenarios such as personalized messages, educational content, and more. The revolutionary use of 3D modeling technologies such as ExpNet and PoseVA...
8mos ago
027.1K
RunPod:专为AI设计的GPU云服务,快速冷启动SD且按秒付费

RunPod: GPU Cloud Service Designed for AI with Fast Cold Start SD and Pay Per Second

Comprehensive Introduction RunPod is a cloud computing platform designed for AI, aiming to provide developers, researchers and enterprises with a one-stop solution for AI model development, training and scaling. The platform integrates on-demand GPU resources, serverless reasoning, and automatic scaling for AI projects across...
11mos ago
027.1K