Latest AI Resources

Total 2759 articles posts
Signs:通过AI技术助力学习和贡献美国手语的互动平台

Signs: an interactive platform for learning and contributing to American Sign Language fueled by AI technology

General Introduction Signs is an innovative online platform designed to help users learn American Sign Language (ASL) and contribute to the Deaf community through artificial intelligence technology. The site is powered by NVIDIA, the American Society for Deaf Children (ASDC), and creative agency Hello Mond...
8mos ago
025.6K
SegAnyMo:从视频中自动分割任意运动物体的开源工具

SegAnyMo: open source tool to automatically segment arbitrary moving objects from video

General Introduction SegAnyMo is an open source project developed by a team of researchers at UC Berkeley and Peking University, including members such as Nan Huang. This tool focuses on video processing and can automatically recognize and segment arbitrary moving objects in a video, such as people, animals or...
7mos ago
025.6K
堆友:AI设计工具箱与创意平台

Heap Friend: AI Design Toolkit and Creative Platform

Comprehensive Introduction PileYou is an online platform built by Alibaba's design team that integrates a variety of AI design tools, designed for designers and creative workers. The platform provides AI generation tools from text to images, including vertical industry design tools, PileYou Camera, Deer Class Marketing Chart, AI Art Characters, Model Change...
1yrs ago
025.6K
Wrtn:优秀简洁的智能写作助手,提供常用写作模板与防御AI检测功能(韩语)

Wrtn: excellent and simple intelligent writing assistant, providing common writing templates and defense AI detection function (Korean)

General Introduction Wrtn is an AI-based content generation platform designed to help users quickly create high-quality text content. Whether it's an academic paper, business document or social media post, Wrtn provides intelligent writing support through its powerful AI technology. Users just need to...
11mos ago
025.5K
魔音工坊:专业配音与短视频解说创作平台|真人配音|克隆声音|一键成片

Magic Voice Workshop: professional voice-over and short video narration creation platform | real person voice-over | clone voice | one-click into a film

Comprehensive Introduction Magic Voice Workshop is a one-stop short video and AI dubbing platform with information on software dubbing, real-life dubbing, sound libraries, cloning services and more. The platform integrates audio editing, AI copy generation, video editing and collaboration tools for audio-related services and content creation. Users experience the audio editor...
1yrs ago
025.5K
AIHawk:智能求职助手,自动化投放简历(限英文)

AIHawk: Intelligent Job Search Assistant, Automated Resume Placement (English only)

General Introduction Auto_Jobs_Applier_AIHawk is a tool to automate job search using artificial intelligence technology. It helps users to automatically deliver a large number of resumes in a short period of time and personalize them according to their personal information and job search intentions. The tool is designed to raise...
10mos ago
025.5K
InstantID:上传一张图片,迁移人像特征来生成不同风格图片

InstantID: upload an image and migrate the portrait features to generate different styles of images

Comprehensive Introduction InstantID is an advanced technology focused on generating images with personalized styles or poses in seconds while ensuring a high level of fidelity using a single reference ID picture. The technology employs a diffusion model-based solution by integrating facial images, landmark maps...
1yrs ago
025.5K
Relevance AI:让企业轻松创建AI助手的无代码平台

Relevance AI: The no-code platform that makes it easy for organizations to create AI assistants

General Introduction Relevance AI is a platform that makes it easy for organizations to create AI assistants. It doesn't require programming and anyone can use it to design AI for everyday tasks such as answering emails, organizing data or generating content. The goal of the website is to help businesses save time and improve efficiency through AI...
7mos ago
025.5K
CogVLM2:开源多模态模型,支持视频理解与多轮对话

CogVLM2: Open Source Multimodal Modeling with Support for Video Comprehension and Multi-Round Dialogue

Comprehensive Introduction CogVLM2 is an open source multimodal model developed by the Tsinghua University Data Mining Research Group (THUDM), based on the Llama3-8B architecture, and designed to provide performance comparable to or even better than GPT-4V. The model supports image understanding, multi-round dialogs, and visual ...
8mos ago
025.5K
ExamFul.AI:智能备考助手,助力AP、IB和A-Level考试,历年真题/论文和AI智能辅导

ExamFul.AI: Intelligent test preparation assistant to help AP, IB and A-Level exams, past exam questions/essays and AI smart tutoring

General Introduction ExamFul is an online learning platform designed for students preparing for AP, IB and A-Level exams. The platform provides rich resources of past exam questions and combines AI intelligent tutoring to help students prepare for exams efficiently. Whether it's consolidating knowledge points or solving difficult problems, Ex...
12mos ago
025.5K
Hyperspace(aiOS):分布式AI算力共享网络,aiOS生成式浏览器,深度知识智能体

Hyperspace (aiOS): distributed AI arithmetic sharing network, aiOS generative browser, deep knowledge intelligences

General Introduction Hyperspace is an innovative generative browser (aiOS) based on the world's largest peer-to-peer AI network, designed to provide users with powerful tools for deep research and analysis. By integrating multiple AI models and data sources, Hyperspace allows users to quickly generate...
7mos ago
025.5K
FliFlik:AI图片处理客户端,一键图像高清化、放大、降噪与水印去除

FliFlik: AI image processing client, one-click image high-definition, enlargement, noise reduction and watermark removal

General Introduction FliFlik is a multimedia solution platform focused on providing efficient and convenient digital processing services. Whether it's photos, audio or video, FliFlik can optimize and enhance them with its advanced AI technology. The platform supports Windows...
10mos ago
025.5K
Newsful:基于AI的金融新闻摘要网站

Newsful: an AI-based financial news summary site

General Introduction Newsful is an online platform that utilizes artificial intelligence technology to provide financial news services, focusing on real-time aggregation of corporate news and market developments from around the world. The site uses natural language processing (NLP) and machine learning technologies to extract information from multiple media sources for the use...
7mos ago
025.5K
Genesis:开源生成式物理引擎,实现基于真实物理的4D动态世界模拟

Genesis: open source generative physics engine for real physics-based 4D dynamic world simulation

General Introduction Genesis is a generative physics world designed for general purpose robotics and embodied AI learning. It provides a unified simulation platform that supports the simulation of a wide range of materials and physical phenomena.Genesis aims to unlock generative AI and physics simulation by combining...
10mos ago
025.5K
卡卡字幕助手(VideoCaptioner):基于LLM的智能字幕助手,一键生成高质量字幕

VideoCaptioner: LLM-based intelligent captioning assistant, generating high-quality captions with one click!

General Introduction Kaka Caption Assistant (VideoCaptioner) is an intelligent video caption processing tool based on the Large Language Model (LLM). It can generate high-quality subtitles in one click without high-performance GPU, and supports the whole process of subtitle generation, sentence breaking, optimization and translation. It supports the whole process of subtitle generation, sentence breaking, optimization and translation...
11mos ago
025.4K
反谱 - AI音乐转谱平台,支持音频文件转五线谱和简谱

AntiScore - AI music transcription platform, supports audio files to pentatonic and simple music.

AntiSpectrum is an innovative online AI music conversion platform, based on advanced AI technology, to convert audio files (such as MP3, FLAC, etc.) into pentatonic and simple scores. AntiSpectrum has a vocal separation function, which separates the vocals from the accompaniment in the music, making it easy for music production and mixing. AntiSpectrum supports converting MIDI files...
4mos ago
025.4K
PDF.ai:解读法律协议、财务报告、书籍、科学论文等复杂的PDF文档

PDF.ai: Interpret complex PDF documents such as legal agreements, financial reports, books, scientific papers, etc.

Comprehensive Introduction PDF.ai is a platform that utilizes artificial intelligence technology to interact with PDF documents. Users can upload PDF files and talk to the documents through AI technology to ask questions, get summaries, find information, and more. The platform is suitable for processing legal agreements, financial reports, books, scientific...
10mos ago
025.4K
AnkiAIUtils: Anki Flashcard Learning AI Toolset, an intelligent assistant that automatically optimizes memorized cards

AnkiAIUtils: Anki Flashcard Learning AI Toolset, an intelligent assistant that automatically optimizes memorized cards

General Description AnkiAIUtils is a set of AI-enhanced tools designed for the Anki flashcard learning system. Developed by a medical student, the tool is designed to automatically improve cards that users are struggling with during the learning process through AI technology. It can intelligently provide users with personalized...
10mos ago
025.4K
HunyuanVideo-Foley - 腾讯推出的开源视频音效生成模型

HunyuanVideo-Foley - Tencent's Open Source Video Sound Generation Model

HunyuanVideo-Foley is an open source video sound generation model by the Tencent Mixed Yuan team that supports adding accurately matched sound effects to silent videos. The model is based on a large-scale dataset training , with a multimodal diffusion transformer architecture , combined with the characterization of the alignment loss function and audio VAE optimization techniques ...
2mos ago
025.4K
MegaParse:解析各类型文档为LLM可用数据,完整保留文档中的表格、图片等所有信息

MegaParse: parses all types of documents into LLM-available data, preserving all information in the document such as tables, pictures, etc. in its entirety

Comprehensive Introduction MegaParse is a powerful and versatile document parsing tool designed to optimize data processing for the Large Language Model (LLM). Whether you are working with text, PDF, PowerPoint presentations or Word documents, MegaParse...
10mos ago
025.4K
AutoGPT:工作流自动化与自主执行任务的智能体构建平台

AutoGPT: Intelligent Body Building Platform for Workflow Automation and Autonomous Task Execution

General Description AutoGPT is a powerful platform designed to help users create, deploy and manage continuously running AI agents and automate complex workflows. Developed by Significant Gravitas, the platform offers a wide range of tools and features that enable users to focus...
10mos ago
025.4K
Waifu2x Extension GUI:深度学习技术放大、修复图像与视频插帧(Windows x64)

Waifu2x Extension GUI: Deep Learning Techniques to Enlarge, Repair Image and Video Interpolation (Windows x64)

Comprehensive Introduction Waifu2x-Extension-GUI is a powerful image and video processing tool that utilizes deep convolutional neural network techniques to achieve super-resolution zoom and video frame interpolation for images, GIFs and videos. The tool supports multiple algorithms and engines, including Wai...
10mos ago
025.4K
MiniRAG:简化检索增强生成框架,实体图索引召回相关文本块

MiniRAG: Simplified Retrieval Enhanced Generation Framework, Entity Graph Index Recall Relevant Text Blocks

Comprehensive Introduction MiniRAG is an extremely simple Retrieval Augmented Generation (RAG) framework that aims to enable good RAG performance even for small models through heterogeneous graph indexing and lightweight topology-enhanced retrieval. It is developed by the Data Science Laboratory of the University of Hong Kong (HKUDS) to address ...
9mos ago
025.4K
Agentic Security:开源的LLM漏洞扫描工具,提供全面的模糊测试和攻击技术

Agentic Security: open source LLM vulnerability scanning tool that provides comprehensive fuzz testing and attack techniques

General Introduction Agentic Security is an open source LLM (Large Language Model) vulnerability scanning tool designed to provide developers and security professionals with comprehensive fuzz testing and attack techniques. The tool supports customized rule sets or agent-based attacks and is able to integrate LLM AP...
8mos ago
025.3K
法行宝:AI法律顾问,人工智能法律咨询,百度AI法律平台

Fa Xing Bao: AI Legal Advisor, Artificial Intelligence Legal Consultation, Baidu AI Legal Platform

Comprehensive Introduction LawXinbao is an intelligent legal service platform launched by Baidu, which integrates advanced artificial intelligence technology with a professional legal knowledge base. The platform is dedicated to providing users with convenient and professional legal intelligent services, including intelligent legal Q&A, case analysis, contract review and other functions. Through deep learning...
9mos ago
025.3K
WeShop唯象:AI商拍平台、服装模特拍摄、商品拍摄

WeShop: AI commercial photography platform, clothing modeling, product photography

Comprehensive Introduction WeShop is the first AI commercial photography platform in China, focusing on the intelligent generation of e-commerce product images. It provides a solution to create professional product images without models, photographers and physical locations, making product display more attractive. Customers are able to realize highly efficient production of product images at low cost...
1yrs ago
025.3K
Devika:开源的AI软件工程师智能体,能够理解、拆分指令为子任务并编写代码

Devika: open-source AI software engineer intelligence that understands, splits instructions into subtasks and writes code

General Introduction Devika is an advanced AI software engineer that understands high-level human instructions, breaks them down into steps, studies the relevant information, and writes code to achieve a given goal. It intelligently develops software using large-scale language models, planning and reasoning algorithms, and web browsing capabilities.D...
7mos ago
025.3K
BuildIn.AI:适合 Notion 用户的知识管理工具

BuildIn.AI: A Knowledge Management Tool for Notion Users

General Introduction BuildIn.AI is a cloud-based platform focused on real-time collaboration and knowledge management, designed to help users efficiently create, manage and share information. It is suitable for individuals, teams or professionals, providing a digital workplace that integrates document storage, real-time editing and information organization...
8mos ago
025.2K
MedRAX: 利用多模态大模型进行胸部X光片分析的智能体

MedRAX: A Smart Body for Chest X-ray Analysis Using Multimodal Large Models

Comprehensive Introduction MedRAX is a state-of-the-art AI intelligence designed for chest radiograph (CXR) analysis. It integrates state-of-the-art CXR analysis tools and multimodal large language models to dynamically process complex medical queries without additional training.MedRAX, through its modular design...
7mos ago
025.2K
5ire:支持本地向量知识库的跨平台大模型桌面客户端

5ire: cross-platform large model desktop client with support for local vector knowledge bases

General Introduction 5ire is an open source cross-platform big model desktop client designed to provide users with convenient local vector knowledge base management and big model interaction capabilities. The software supports parsing and vectorized storage of multiple document formats with powerful retrieval-enhanced generation (RAG) capabilities. In addition, 5i...
12mos ago
025.2K