Latest AI Resources

Total: 2879 articles
SpeechGPT 2.0-preview: an end-to-end, human-like spoken dialogue large model for real-time interaction

SpeechGPT 2.0-preview is the first human-like real-time interaction system from OpenMOSS, trained on millions of hours of speech data. The system offers human-like spoken expression and 100 ms low-latency responses, supporting natural and smooth real...
11mos ago
035.2K
Goose: an open source, extensible programming agent that automates end-to-end programming tasks

Goose is an open source AI agent tool developed by Block, Inc. to help developers automate everyday development tasks. It supports a wide range of large language models (LLMs) and interacts with users through a command-line or desktop application interface. Goose can perform a wide range of tasks from agent...
11mos ago
045.3K
YuE: a foundation model that turns lyrics into complete songs, supporting a wide range of musical styles

YuE is an open source foundation model for full-song generation that focuses on turning lyrics into complete songs. Unlike other models that can only generate short snippets of non-vocal music, YuE can generate full songs several minutes long, with lead and backing vocals. The model addresses music generation in...
11mos ago
044.4K
Float: a cross-language intelligent search engine for retrieving knowledge in other languages using your native language

FloatSearch AI is an AI-based cross-language intelligent search engine designed to give users a more accurate and efficient search experience. It understands users' natural language queries and provides relevant, accurate answers based on semantic analysis. FloatS...
11mos ago
035.4K
UltraRAG: a one-stop RAG system solution that simplifies data construction and model fine-tuning

UltraRAG is a RAG (Retrieval-Augmented Generation) system solution jointly proposed by the THUNLP group at Tsinghua University, the NEUIR group at Northeastern University, Modelbest.Inc and the 9#AISoft team. The framework is based on agile deployment and modularized building...
11mos ago
043.3K
Llasa 1~8B: an open source text-to-speech model for high-quality speech generation and cloning

Llasa-3B is an open source text-to-speech (TTS) model developed by the Audio Lab of the Hong Kong University of Science and Technology (HKUST Audio). The model is based on the Llama 3.2 3B model, which has been carefully tuned to provide high-quality speech generation that not only supports multiple...
10mos ago
047.1K
FinGPT: an open source financial large language model platform for financial analysis and prediction

FinGPT is an open source financial large language model platform developed by the AI4Finance Foundation, designed to solve complex financial tasks and drive innovation in fintech. FinGPT utilizes lightweight adaptation techniques and reinforcement learning approaches...
11mos ago
045.1K
Kluster.ai: a low-cost AI inference platform offering $100 in DeepSeek-R1 credits, about 167 million tokens

Kluster.ai is an AI inference platform that gives developers efficient and cost-effective large-scale AI processing. The platform uses adaptive inference technology to adjust computational resources dynamically, ensuring efficient batch and real-time processing. Klust...
11mos ago
040.6K
Ragas: evaluating RAG retrieval, QA accuracy, and answer relevance

Ragas is a tool specifically designed to evaluate and optimize Retrieval-Augmented Generation (RAG) systems. It provides a comprehensive set of evaluation metrics by analyzing the relationships between queries, retrieved contexts, and generated answers. These metrics include faithfulness, answer relevance, context relevance, on...
11mos ago
056.6K
AutoGen: a multi-agent conversation framework developed by Microsoft

AutoGen is an open source framework developed by a team of Microsoft researchers that simplifies building large language model (LLM) applications through multi-agent conversations. It allows developers to create AI agents that can talk to each other and collaborate to solve tasks. This approach not only improves the performance of LLM...
11mos ago
044.6K
Fey: a financial market research tool and intelligent assistant for better investment decisions

Fey is an intelligent assistant designed for modern investors, providing real-time market data and personalized investment advice. With a simple and intuitive interface, users can easily access important financial information and market trends. Fey's core features include stock tracking, financial analysis, personalized new...
11mos ago
036.9K
Needle: an AI search and workflow automation platform that connects to private data sources

Needle is an artificial intelligence platform designed to help enterprises improve productivity through efficient information search and automated workflows. The platform can connect various data sources within an organization to provide unified search and data management capabilities. Users can simply...
11mos ago
031.6K
TankWork: an agent that operates computers via voice and text and provides real-time voice feedback

TankWork is an open source desktop agent framework designed to enable AI to perceive and control your computer through computer vision and system-level interaction. The framework allows agents to directly control computers through voice and text commands, process real-time screen content, and provide continuous audio visual...
11mos ago
039.4K
XRAG: a visual evaluation tool for optimizing retrieval-augmented generation systems

XRAG (eXamining the Core) is a benchmarking framework designed for evaluating the underlying components of advanced retrieval-augmented generation (RAG) systems. By profiling and analyzing each core module, XRAG provides information on how different configurations and components affect RAG...
11mos ago
040.3K
CHRONOS: a news timeline summarization tool that improves news retrieval and timeline generation efficiency

CHRONOS is a news timeline summarization tool developed by the Alibaba NLP team. The tool generates timeline summaries of news events through iterative self-questioning. CHRONOS is not only capable of handling open-domain timeline summarization tasks, but also in terms of efficiency and scalability...
11mos ago
033.6K
X-Dyna: generate video from a static portrait and the poses in a reference video, making portrait photos dance

X-Dyna is an open source project developed by ByteDance that generates dynamic portrait animations using zero-shot diffusion techniques. The project uses the facial expressions and body movements in a driving video to animate a single portrait image, generating realistic and context-aware motion effects. X-D...
11mos ago
037.9K
Tencent Hunyuan3D: generate high-resolution 3D assets, with multiple 3D asset generation workflows

Tencent Hunyuan3D (Hunyuan3D 2.0) is an advanced large-scale 3D synthesis system from Tencent designed to generate high-resolution textured 3D assets. The system consists of two core components: Hunyuan3D-DiT, a large-scale shape generation model, and Hunyuan3D-Paint, a large-scale texture...
11mos ago
049K
RAG Web UI: build an intelligent document Q&A system and a private web-based knowledge base with ease

RAG Web UI is an intelligent dialogue system based on RAG (Retrieval-Augmented Generation) technology. It helps organizations and individuals build intelligent Q&A systems on top of their own knowledge bases. By combining document retrieval with large language models, RAG Web UI provides accurate and reliable...
11mos ago
036.9K
Kheish: a multi-role agent that reviews, validates, and formats output to produce high-quality results

Kheish is an open source multi-role agent designed for large language model (LLM) tasks that require structured, step-by-step collaboration. Kheish is more than a simple coordinator; it is an intelligent agent in its own right, requesting modules on demand, integrating user-reversal...
11mos ago
036.2K
AI ContentCraft: a versatile AI content creation tool for generating short stories, dialogue scripts, voiceovers, and illustrations

AI ContentCraft is a versatile content creation tool that integrates text generation, speech synthesis, image generation and more. It helps creators quickly generate stories, podcast scripts, and accompanying audio and video content. The tool supports multiple language conversions and can batch...
11mos ago
043.8K
Unigraph: build a locally running knowledge graph and personal search engine

Unigraph is a local-first, general-purpose knowledge graph and personal search engine that gives users an integrated workspace for managing and searching a wide variety of data in their personal lives. With Unigraph, users can integrate data from different sources into a...
11mos ago
035.7K