Latest AI Resources

Total 3105 articles posts
鬼手剪辑:视频去重|短剧解说|视频翻译|去除字幕

Ghost Hand Clips: video de-emphasis|skit commentary|video translation|subtitle removal

Comprehensive Introduction The official website of Ghost Hand Clips is designed to provide efficient video translation and subtitle removal tools for video creators, merchants and MCN organizations. Using powerful AI technology, Ghost Hand Clips is able to achieve intelligent translation of video content, subtitle removal and video personalization, helping users break through the language barrier and easily play...
2yrs ago
070K
AppAgent:利用多模态智能体自动操作智能手机

AppAgent: automated smartphone operation using multimodal intelligences

Comprehensive Introduction AppAgent is a large language model (LLM)-based multimodal agent framework designed to manipulate smartphone applications. The framework mimics human interactions such as taps and swipes through a simplified manipulation space, thus eliminating the need for system back-end access and extending its use across different app...
1yrs ago
070K
天工AI:全能AI助手,助力高效工作与生活

Tiangong AI: All-around AI assistant for efficient work and life

Comprehensive Introduction Tiangong AI is the first all-round AI assistant in China, which integrates various functions such as search, dialog, writing, document analysis, drawing, PPT production and so on. It is able to understand the user's intention, search for information from all over the internet, and summarize, generalize and integrate through advanced AI technology to output high-quality, no...
1yrs ago
070K
COSINE:智能理解代码库,让开发者轻松理解和编写代码的AI工具(内测)

COSINE: Intelligent Understanding Codebase, an AI tool that makes it easy for developers to understand and write code (in beta)

General Introduction Cosine is a revolutionary AI-driven code understanding platform that provides deep codebase understanding and analysis services for modern software developers. Supporting over 50 programming languages, the platform utilizes a unique technical architecture that combines a specialized search engine, vector database, and ...
1yrs ago
069.9K
VideoSeal:先进的开源视频隐藏水印嵌入与提取工具,保护视频版权

VideoSeal: Advanced open source video hidden watermark embedding and extraction tools to protect video copyrights

General Introduction VideoSeal is an open source video watermarking tool developed by Facebook Research, designed to provide efficient video watermark embedding and extraction. The tool supports the latest open source models and contains pre-trained models, training code, inference code and evaluation tools...
1yrs ago
069.8K
Agenta:集成到AI应用的提示词与模型效果评估工具

Agenta: a tool for evaluating the effectiveness of cue words and models integrated into AI applications

Comprehensive Introduction Agenta is an open source AI model management tool specialized in helping users easily experiment with cue words, test model effects and monitor runs. It is suitable for people who want to develop AI applications quickly, providing a platform that is simple to operate. You can use it to try the effect of different cue words on...
1yrs ago
069.8K
ExtractThinker:提取和分类文档为结构化数据,优化文档处理流程

ExtractThinker: extracting and classifying documents into structured data to optimize the document processing flow

Comprehensive Introduction ExtractThinker is a flexible document intelligence tool that utilizes Large Language Models (LLMs) to extract and classify structured data from documents, providing a seamless ORM-like document processing workflow. It supports a variety of document loaders, including Tess...
1yrs ago
069.8K
VideoChat:自定义形象和音色克隆的实时语音交互数字人,支持端到端语音方案和级联方案

VideoChat: real-time voice-interactive digital person with customized image and tone cloning, supporting end-to-end voice solutions and cascading solutions

Comprehensive Introduction VideoChat is a real-time voice interaction digital person project based on open source technology, supporting both end-to-end voice schemes (GLM-4-Voice - THG) and cascade schemes (ASR-LLM-TTS-THG). The project allows users to customize the digital ...
2yrs ago
069.8K
Haiper:AI视频创作工具|文本转视频|图像转视频|视频风格转换|延长视频

Haiper: AI Video Creation Tool|Text to Video|Image to Video|Video Style Converter|Extended Video

Comprehensive Introduction Haiper is an advanced AI video authoring tool dedicated to supporting content creation through perceptual base modeling. Users can use the tool for free to generate high-quality video content from text descriptions or images.Haiper is not only easy to operate, but also has a stable output...
2yrs ago
069.8K
PantoMatrix(EMAGE):全身手势生成框架,从音频生成全身手势的3D动画框架

PantoMatrix (EMAGE): full-body gesture generation framework, 3D animation framework for generating full-body gestures from audio

Comprehensive Introduction PantoMatrix is an advanced full-body gesture generation framework capable of generating complete human movements from audio and partial gestures, including face, partial body, hand and full-body movements. The framework utilizes the latest multimodal datasets and deep learning techniques to provide high-quality 3D...
2yrs ago
069.7K
海绵音乐:智能AI音乐创作平台,文字和图片生成音乐

Sponge Music: Intelligent AI music creation platform, text and image generated music

General Introduction SpongeBob Music is a music creation platform based on artificial intelligence technology. Users only need to enter a sentence of inspiration or upload a picture to generate an exclusive piece of music. The platform provides a variety of music styles and creation tools to help users easily create high-quality music. Whether you are a professional musician or...
2yrs ago
069.6K
EchoMimic:音频驱动人像照片生成说话视频(EchoMimicV2加速版安装包)

EchoMimic: Audio-driven portrait photos to generate talking videos (EchoMimicV2 accelerated installer)

General Introduction EchoMimic is an open source project designed to generate realistic portrait animations through audio-driven generation. Developed by Ant Group's Terminal Technologies division, the project utilizes editable marker point conditions to generate dynamic portrait videos using a combination of audio and facial marker points.EchoMimic...
1yrs ago
069.6K
BRIA:生成式AI图像开放平台|图像去背景|图像元素编辑|RMBG

BRIA: Open Platform for Generative AI Images|Image De-Backgrounding|Image Element Editing|RMBG

BRIA General Introduction BRIA provides a comprehensive visually generated AI business solution with a platform that uses 100% licensed datasets to ensure copyright protection and creator benefits. The platform supports base model access, APIs, SDKs, and web integrations, practicing Responsible AI, taking responsibility for all output...
2yrs ago
069.5K
小悟空:字节跳动推出的多功能AI助手,简单易上手的AI助理

Little Wukong: a versatile, easy-to-use AI assistant from ByteDance

Comprehensive introduction "Little Wukong" is a multi-functional AI dialog assistant and personal assistant tool launched by ByteDance. It integrates more than 200 AI tools, covering a wide range of aspects such as creation generation, learning and enhancement, workplace assistance, professional consultation, virtual character dialog, and leisure and entertainment. Little Wukong is designed...
2yrs ago
069.5K
GLM-PC(智谱牛牛)正式发布内测下载,真正可以控制电脑的AI

GLM-PC (Smart Spectrum Bull) officially released for internal download, the real AI that can control the computer

GLM-PC (Bull) Introduction GLM-PC is a desktop application based on the CogAgent model, which is able to perform complex tasks quickly through natural language commands. It has the ability of task planning and interface understanding, and can autonomously complete various computer operations according to user instructions. Notes for use...
1yrs ago
069.4K
ReCamMaster:从单一视频生成多视角视频的渲染工具

ReCamMaster: Rendering Tool for Generating Multi-View Videos from a Single Video

General Introduction ReCamMaster is an open source video processing tool, the core function is to generate new camera views from a single video. Users can specify the camera track and re-render the video to get a dynamic picture with different angles. It is developed by a team of Zhejiang University and Racer Technology, based on text-to...
1yrs ago
069.4K
Magic 1-For-1: 高效生成视频的开源项目,号称在一分钟内生成一分钟的视频

Magic 1-For-1: efficient generation of video open source project that claims to generate a minute of video in one minute

Comprehensive Introduction Magic 1-For-1 is an efficient video generation model designed to optimize memory usage and reduce inference latency. The model decomposes the text-to-video generation task into two subtasks: text-to-image generation and image-to-video generation, enabling more efficient training and distillation...
1yrs ago
069.4K
Search o1:赋予推理模型主动搜索能力,让大模型边思考边搜索外部知识

Search o1: Empowering inference models to actively search for external knowledge while the larger model is thinking

Comprehensive Introduction Search-o1 is an open source project that aims to enhance the performance of large-scale reasoning models (LRMs) by integrating advanced search mechanisms. The core idea is to solve the knowledge deficit problem encountered in the reasoning process through dynamic search and knowledge integration. The project was developed by sunn...
1yrs ago
069.2K
ChatOllama:基于Nuxt 3和Ollama的本地实时聊天应用UI

ChatOllama: Native real-time chat application UI based on Nuxt 3 and Ollama

Comprehensive introduction ChatOllama is an open source online chat application project based on a large language model (LLM) , supporting numerous language models and knowledge base management. Users can use the platform for model management ( list display , download , delete ) , chat with the model and other functions . The project utilizes ...
2yrs ago
069.2K
TwinMind:免费离线语音转录文字的APP

TwinMind: free offline voice to text transcription app

General Introduction TwinMind is a smart tool developed by ThirdEar AI, Inc. that "helps you remember everything". TwinMind is a smart tool developed by ThirdEar AI, Inc. that "remembers everything for you". It can record conversations, meetings, or lectures in real time and convert them to text in more than 100 languages, even with your cell phone in your pocket...
1yrs ago
069.2K
FliFlik:AI图片处理客户端,一键图像高清化、放大、降噪与水印去除

FliFlik: AI image processing client, one-click image high-definition, enlargement, noise reduction and watermark removal

General Introduction FliFlik is a multimedia solution platform focused on providing efficient and convenient digital processing services. Whether it's photos, audio or video, FliFlik can optimize and enhance them with its advanced AI technology. The platform supports Windows...
2yrs ago
069.1K
MetaLaw:提升法律研究效率的AI助手,类案检索与法律分析

MetaLaw: an AI assistant to improve the efficiency of legal research, class case search and legal analysis

Comprehensive Introduction MetaLaw is an online platform focused on improving the efficiency of legal research. Through advanced AI technology, MetaLaw provides accurate class case search and analysis services to help legal practitioners quickly find relevant cases and conduct in-depth analysis. The platform's AI analysis assistant...
1yrs ago
069.1K
Cerebras:目前全球最快的AI推理、高性能计算平台

Cerebras: the world's fastest AI inference, high-performance computing platform available today

General Introduction Cerebras is a company dedicated to advancing the field of Artificial Intelligence and High Performance Computing. Its core products include the world's fastest AI inference platform and high-performance computing gas pedal.The Cerebras platform is capable of training a wide range of models, from multilingual macromodels to medical chatbots...
2yrs ago
069.1K
MagicArticulate:将静态3D模型生成骨骼结构动画资产

MagicArticulate: generating skeletal structure animation assets from static 3D models

Comprehensive Introduction MagicArticulate is an AI framework developed by ByteDance in collaboration with Nanyang Technological University, focusing on rapidly transforming static 3D models into animation-enabled digital assets. It does this through an advanced autoregressive Transformer and functional diffusion modeling, self...
1yrs ago
069K