Latest AI Resources

Total 2981 articles posts
HeyGen - AI 数字人视频创作平台,支持多语言翻译配音

HeyGen - AI Digital Human Video Creation Platform with Multi-Language Translation and Dubbing Support

HeyGen is an AI-driven digital human video creation platform that supports a streamlined video production process, allowing users to quickly generate professional-level digital human videos. The platform is based on advanced AI technology, giving users full control over the image and voice of digital people, providing a rich library of material, including diverse background...
9mos ago
040.3K
Make - AI无代码自动化工作流搭建平台

Make - AI's no-code automated workflow building platform

Make is an AI-driven no-code automation platform that helps organizations improve efficiency and innovation based on automated processes. The platform offers more than 2,000 pre-built apps that support a variety of business scenarios, such as marketing, sales, finance, etc. Make's core features include no-code visual process creation, AI...
9mos ago
042.3K
MiMo-VL - 小米开源的多模态模型

MiMo-VL - Xiaomi's open source multimodal modeling

MiMo-VL is Xiaomi's open source multimodal grand model, consisting of a visual coder, a cross-modal projection layer and a language model. The visual coder is based on Qwen2.5-ViT, which supports native resolution inputs and preserves more details; the language model is Xiaomi's self-developed MiMo-7B, which is designed for complex projections...
9mos ago
045.7K
Fish Audio - AI 语音合成与声音克隆工具

Fish Audio - AI Speech Synthesis and Sound Cloning Tool

Fish Audio is a powerful generative AI speech synthesis tool that supports text-to-speech (TTS) and voice cloning. Users only need to input text, the tool supports the conversion to natural and smooth voice, the platform provides multiple languages and voice styles to choose from, to meet different scenarios and user...
9mos ago
065.5K
SignGemma - 谷歌 DeepMind 推出的手语翻译模型

SignGemma - Sign Language Translation Model from Google DeepMind

SignGemma is the world's most powerful sign language interpreting AI model introduced by Google DeepMind, supporting the accurate translation of American Sign Language (ASL) into English text. The model is based on multimodal training, combining visual and textual data to capture sign language actions in real time and quickly translate them into text...
9mos ago
046.8K
CRIC深度智联 - 克而瑞推出的中国房地产首个AI Agent

CRIC - The First AI Agent for Real Estate in China Launched by CRIC

CRIC Depth Intelligence is the first AI intelligent body of Chinese real estate independently developed by CRIC, based on CRIC's 20 years of experience in the real estate industry and data accumulation and multimodal big model technology, which opens up the whole chain from data integration, intelligent analysis to content generation.
9mos ago
037.8K
WebAgent - 阿里通义开源的自主搜索AI Agent

WebAgent - Ali Tongyi Open Source Autonomous Search AI Agent

WebAgent is an open source autonomous search AI Agent from Alibaba's Tongyi Labs, with powerful end-to-end autonomous information retrieval and multi-step reasoning capabilities.WebAgent can actively perceive, decide and act in the network environment like a human being, and is widely used in academic research, business decision...
9mos ago
045.8K
灵码 IDE - 通义灵码推出 AI 原生开发环境工具

Linguaphone IDE - Tongyi Linguaphone Launches AI Native Development Environment Tools

Spirit Code IDE is the AI native integrated development environment (IDE) launched by Tongyi Spirit Code, which is deeply adapted to the 3 major models of Thousand Questions, and has a powerful programming intelligent body mode to support the autonomous completion of tasks such as project perception, code retrieval, and execution of terminal operations. It supports MCP tools and integrates Magic Hitch MCP Square's 3...
9mos ago
042K
BAGEL - 字节跳动推出的开源多模态基础模型

BAGEL - Open source multimodal base model launched by Wordpress

BAGEL is a multimodal base model open-sourced by ByteDance with 14 billion parameters, of which 7 billion are active. The model base with the Mixed Transformer Expert Architecture (MoT) captures pixel-level and semantic-level features of an image with two independent encoders, respectively, to support efficient processing of images, text, video...
9mos ago
043.3K
可灵 2.1 - 快手推出的AI视频生成模型

Keling 2.1 - AI Video Generation Model Launched by Shutterstock

KeLing 2.1 is an AI video generation model launched by Racer, which is now available on the KeLing AI video platform. The model contains three versions: standard, high quality and master, providing 720P, 1080P and movie and TV level effects to meet different creative needs. The standard version of the generation speed, suitable for rapid production...
10mos ago
047.7K
小云雀 - 剪映推出的智能创作Agent

Little Lark - Smart Creation Agent by Shear Image

Little Lark is an intelligent creation Agent launched by Shear Image, based on AI technology to reshape the boundaries of content creation, making creation simpler, more efficient and more interesting. Little Lark supports zero-threshold creation of videos, digital pop-up videos, design drawings and pictures for backgrounds, users only need to enter a command, AI support efficiently complete...
10mos ago
078.6K
稿定AI社区 - AI创意内容设计平台,多种设计资源满足不同创作需求

Drafting AI Community - AI creative content design platform, a variety of design resources to meet different creative needs

Drafting AI Community is an online AI creative inspiration platform that provides users with a wealth of creative design resources and tools. The platform covers a variety of design fields, including image photos, e-commerce design, holiday themes, 3D illustrations, avatar design, Xiaohongshu materials, portrait design, etc., to meet the needs of different users.
10mos ago
040.8K
NoCode – 美团推出的零代码AI开发平台

NoCode - Zero-Code AI Development Platform Launched by Meituan

What is NoCode NoCode is a zero-code AI development platform launched by Mission. Users don't need any programming experience, they just need to describe the requirements through natural language to quickly generate website pages, utilities, small games, event pages and other applications.NoCode supports one second generation of 200...
10mos ago
059.8K
Sim Studio:开源的AI代理工作流构建工具

Sim Studio: open source workflow builder for AI agents

Comprehensive Introduction Sim Studio is an open source AI agent workflow building platform focused on helping users quickly design, test, and deploy large-scale language model (LLM) workflows through a lightweight, intuitive visual interface. Users can create complex workflows without deep programming by dragging and dropping...
10mos ago
089.4K
RealtimeVoiceChat:低延迟与AI进行自然口语对话

RealtimeVoiceChat: low-latency natural spoken conversation with AI

General Introduction RealtimeVoiceChat is an open source project focused on real-time, natural conversations with artificial intelligence via voice. Users use a microphone to input their voice, and the system captures the audio through a browser, quickly converts it to text, and a large-scale language model (LLM) generates back...
10mos ago
077.6K
Cooragent:一句话构建多智能体任务协作工具

Cooragent: building a multi-intelligence task collaboration tool in one sentence

General Introduction Cooragent is an open source AI agent collaboration framework developed by LeapLab at Tsinghua University and hosted on GitHub.It allows users to create intelligent AI agents with a one-sentence description and supports multiple agents to collaborate on complex tasks. The framework provides two...
10mos ago
055.3K
Claude生成深度研究报告的MCP服务

Claude's MCP service for generating in-depth research reports

Comprehensive Introduction MCP Server Deep Research is an open source tool that automatically generates structured research reports for complex problems through artificial intelligence and web search. Users enter a research question, and the tool breaks down the question, searches for authoritative information, assesses source credibility...
10mos ago
052.2K
Deep Recall:为大模型提供企业级记忆框架的开源工具

Deep Recall: an open source tool that provides an enterprise-class memory framework for large models

Comprehensive Introduction Deep Recall is an open source, enterprise-class memory framework designed for large-scale language models (LLMs). It provides hyper-personalized responsiveness through efficient contextual retrieval and integration. The framework uses a three-tier architecture, including a memory service, a reasoning service, and a coordinator, supporting...
10mos ago
059.2K
Paper2Code:将机器学习论文自动转化为可运行代码

Paper2Code: Automatically Converting Machine Learning Papers into Runnable Code

General Introduction Paper2Code is an open source project that aims to solve the problem of lack of code implementations for machine learning papers. It automatically transforms scientific papers into runnable code repositories through the multi-agent Large Language Modeling (LLM) system PaperCoder. The system uses planning ...
10mos ago
059.8K
Potpie AI:快速创建专属代码库的AI工程助手

Potpie AI: An AI engineering assistant for quickly creating proprietary code bases

Comprehensive Introduction Potpie AI is an open source platform focused on providing developers with customized AI engineering assistants. It allows AI agents to deeply understand code structure and logic and automate tasks such as debugging, testing, and code generation by building a knowledge graph of the code base. Users can use simple...
11mos ago
047.7K
Vexa:实时会议转录与智能知识提取工具

Vexa: a real-time meeting transcription and intelligent knowledge extraction tool

Comprehensive Introduction Vexa is an open source real-time meeting transcription and knowledge management platform designed to provide efficient meeting recording and intelligent knowledge extraction services for enterprises and individuals. It automatically joins platforms such as Google Meet, Zoom, etc. through API-driven meeting robots...
11mos ago
094.5K
LLManager:智能自动化流程审批与人类审核结合的管理工具

LLManager: a management tool that combines intelligent automated process approvals with human reviews

Comprehensive Introduction LLManager is an open source intelligent approval management tool, developed based on LangChain's LangGraph framework, focused on automating the processing of approval requests while optimizing decision making with human review. It does this through semantic search, sample less learning and...
11mos ago
055.3K
NodeRAG:基于异构图的精准信息检索与生成工具

NodeRAG: A Heterogeneous Graph-Based Tool for Accurate Information Retrieval and Generation

A Comprehensive Introduction NodeRAG is an open source Retrieval Augmented Generation (RAG) system hosted on GitHub and developed by Terry-Xu-666. It optimizes information retrieval and generation through heterogeneous graph structures, significantly improving retrieval accuracy and contextual relevance.Nod...
11mos ago
062.1K