Latest AI Resources

Total: 2,620 articles
CogAgent: Zhipu AI's open source visual language model for automating graphical user interface operations

CogAgent is an open source visual language model developed by the Tsinghua University Data Mining Research Group (THUDM) that aims to automate operations on cross-platform graphical user interfaces (GUIs). The model is based on CogVLM (GLM-4V-9B) and supports both Chinese and English...
8 months ago
Offer鸡 (Offer Chicken): an AI assistant for online interviews with real-time speech recognition and support for common remote interview platforms

Offer Chicken is an AI interview assistant designed for young job seekers that runs on Windows and macOS. Using dual-channel audio capture and real-time speech recognition, it provides real-time answer prompts during online interviews, helping job seekers improve their interview form...
8 months ago
YouMind: a tool for professional creators that excerpts all kinds of material and stores it in a knowledge base to support writing

YouMind is an AI writing system powered by leading large language models (LLMs). It helps users extract and preserve important content from a wide range of materials, with a focus on creation rather than simple collection. Whether browsing the web, watching YouTube videos, listening to podcasts...
7 months ago
AIEvo: an efficient framework for building collaborative multi-agent applications

AIEvo is Ant Group's open source multi-agent framework for building multi-agent applications efficiently. The framework strictly follows an SOP task graph to improve the success rate of complex tasks, and uses feedback and monitoring mechanisms to ensure high flexibility and scalability. AIEvo has been put into production within Ant Group ...
7 months ago
5ire: a cross-platform desktop client for large models with support for local vector knowledge bases

5ire is an open source, cross-platform desktop client for large models that gives users convenient local vector knowledge base management and large model interaction. The software parses multiple document formats and stores them as vectors, and offers retrieval-augmented generation (RAG) capabilities. In addition, 5i...
9 months ago
Infography: converts text, links, or documents into polished infographics, suited to Xiaohongshu and other self-media channels

Infography is an online tool designed to help bloggers, marketers, educators, and influencers transform complex blog posts into visually compelling infographics. Using AI technology, Infography can take large amounts of data...
7 months ago
BotSharp: a .NET-based platform for developing and managing multi-agent AI applications

BotSharp is an open source project based on .NET Core that provides a comprehensive toolkit for building AI chatbot platforms. It is written in C#, runs cross-platform, and aims to simplify the application of machine learning algorithms, enabling enterprise-level developers to efficiently ...
7 months ago
Galaxy.ai: a multifunctional platform integrating a library of 1,700+ AI tools, useful for exploring the generative AI tools on the market (paid)

Galaxy.ai is a platform that integrates a wide range of AI tools to give users comprehensive AI solutions. Whether the task is text generation, image processing, video production, or speech synthesis, Galaxy.ai covers a wide range of user needs. The platform offers...
8 months ago
SpeechGPT 2.0-preview: an end-to-end, human-like spoken dialogue large model for real-time interaction

SpeechGPT 2.0-preview is the first human-like real-time interaction system introduced by OpenMOSS, trained on millions of hours of speech data. The system offers human-like spoken expression and low-latency responses of 100 ms, supporting natural and smooth real...
6 months ago
GoEnhance: an AI tool for video-to-video conversion, image enhancement, and upscaling

GoEnhance AI is an artificial intelligence platform that specializes in video-to-video conversion, image enhancement, and upscaling. It can enhance images to extreme detail and simplifies the animation creation process. Users can easily...
8 months ago
XAudioPro: professional online audio editing tool | audiobook production | text to speech | accompaniment separation

XAudioPro is an online tool for real-time audio editing and transcoding that is both professional and portable. It supports professional audio editing functions such as cutting, cropping, copying, deleting, restoring, and amplitude gain control. It also provides denoising features such as spectral subtraction noise reduction, low-pass...
10 months ago
AigoTools: an open source AI tool navigation site with automatic site indexing and multi-language support

AigoTools is an open source AI site directory designed to help users quickly create and manage navigation sites. It has built-in site management and AI-based automatic indexing, and supports multiple languages, dark/light theme switching, and SEO optimization. AigoTools proposes ...
10 months ago
COSINE: an AI tool that understands codebases, making it easy for developers to understand and write code (in beta)

Cosine is an AI-driven code understanding platform that provides in-depth codebase understanding and analysis for software developers. Supporting over 50 programming languages, the platform uses a technical architecture that combines a specialized search engine, vector database, and ...
8 months ago
LazyLLM: SenseTime's open source low-code development tool for building multi-agent applications

LazyLLM is an open source tool developed by the LazyAGI team that simplifies the development of multi-agent large model applications. It helps developers quickly build complex AI applications through one-click deployment and a lightweight gateway mechanism, sparing them tedious engineering configuration...
6 months ago
Edraw.AI (亿图): an online collaborative whiteboard tool with AI-generated flowcharts and many other diagram types

Edraw.AI is an AI-powered online whiteboard platform for visual collaboration that integrates more than 40 intelligent tools and a library of carefully designed templates. It uses AI to quickly turn users' written ideas into professional diagrams. The platform supports...
8 months ago
Baidu Comate (文心快码): an AI programming assistant that draws on Baidu's large body of programming data to generate high-quality code

Baidu Comate is an AI programming assistant developed by Baidu. It is based on Baidu's ERNIE large model and integrates proprietary and open source data to provide next-generation programming assistance. It features code completion, explanation, and debugging to help developers think, write and optimize...
5 months ago
Lumi (炉米): create and share AI models, build workflows, and run LoRA training (closed beta)

Lumi is an AI model sharing community platform launched by ByteDance that aims to provide AI creation tools for creators. The platform allows users to upload and share models, build workflows, and perform LoRA training. Lumi is currently still in closed beta and is open only to whitelisted users...
9 months ago
Moondream: an open source, lightweight visual language model for batch reverse-engineering of prompts from images

Moondream is an open source, lightweight visual language model that provides image description capabilities through deep learning and computer vision techniques. The model runs efficiently on a variety of platforms and is particularly well suited to edge devices. Moondream uses advanced techniques and...
7 months ago
Pyramid Flow: an open source version of "Kling" from Kuaishou, based on SD3 and able to run on GPUs with less than 8 GB of memory (one-click deployment version)

Pyramid Flow is an efficient autoregressive video generation method based on flow matching. It achieves higher computational efficiency when generating and decompressing video content by interpolating between different resolutions and noise levels...
9 months ago
Cardog: vehicle information research and intelligent analysis of automotive market data

Cardog is an AI-assisted vehicle research and management platform that aims to provide convenient lookup and management of vehicle-related information. Users can use its AI interface to research vehicle performance, obtain market analysis, view documentation, and even manage personal vehicle information...
6 months ago
TEN Agent: a real-time multimodal agent framework that supports latency-free voice and video conversations with agents

TEN Agent is an open source real-time multimodal agent framework that integrates the OpenAI Realtime API and RTC and supports functions such as weather queries, web search, visual processing, and RAG (retrieval-augmented generation). The framework aims to provide high ...
9 months ago
字语智能 (Word Intelligence): an intelligent writing platform for improving writing efficiency

Word Intelligence is a comprehensive AI writing platform that provides AI tools such as text rewriting, text continuation, translation (Chinese and English), title recommendation, full-text proofreading, full-text detection, and summary generation. It aims to help users improve writing efficiency and quality. The platform is suitable for administrative, e-commerce...
11 months ago
Arcade: record on-screen actions to quickly generate interactive product demo videos

Arcade is an easy-to-use online platform that helps users quickly create interactive demos. It is suited to marketers, product managers, and sales teams who need to demonstrate product features. By recording on-screen actions, Arcade automatically generates interactive demo content that users can use in just a few minutes...
5 months ago
X-Dyna: animates a static portrait using the poses in a reference video, making portrait photos dance

X-Dyna is an open source project developed by ByteDance that generates dynamic portrait animations using zero-shot diffusion techniques. The project uses the facial expressions and body movements in a driving video to animate a single portrait image, generating realistic and context-aware motion. X-D...
7 months ago
寻光AI (Seeking Light AI): a one-stop platform for script, storyboard, and video creation from DAMO Academy (closed beta)

Seeking Light AI is a one-stop video creation platform launched by DAMO Academy that aims to simplify video production through visual AIGC technology. Users can create videos as easily as making PowerPoint slides, and the platform provides script writing, storyboard design, material editing, and other functions, which greatly improves the video creation...
10 months ago