AI Personal Learning
and practical guidance
Total 908 articles

Tag: AI open source projects (page 22)

TankWork: an agent that operates computers via voice and text and provides real-time voice feedback

TankWork is an open source desktop agent framework that lets AI perceive and control your computer through computer vision and system-level interaction. Agents built on the framework can control the computer directly through voice and text commands, process screen content in real time, and provide continuous audiovisual feedback and manipulation...

Quantum Swarm: a framework for multi-agent swarm collaboration

Quantum Swarm is an open source artificial intelligence framework focused on developing and researching swarm intelligence. The project is maintained by the Quarm AI team on GitHub and aims to provide a flexible, efficient platform for building and testing multi-agent systems. The Quantum Swarm framework is written primarily in Python...

XRAG: A Visual Evaluation Tool for Optimizing Retrieval-Augmented Generation Systems

XRAG (eXamining the Core) is a benchmarking framework for evaluating the foundational components of advanced retrieval-augmented generation (RAG) systems. By profiling and analyzing each core module, XRAG shows how different configurations and components affect the overall performance of a RAG system. The framework supports ...

CHRONOS: News Timeline Summarization Tool to Improve News Retrieval and Timeline Generation Efficiency

CHRONOS is a news timeline summarization tool developed by the Alibaba NLP team. It generates timeline summaries of news events through iterative self-questioning. CHRONOS not only handles open-domain timeline summarization tasks, but also significantly improves efficiency and scalability in...

Go-with-the-Flow: Control the motion of objects in a video, and add or remove any moving object

Go-with-the-Flow is an open source project from the Netflix Eyeline Studios research team that controls the motion patterns of video diffusion models by warping noise. The project lets users decide how the camera and the objects in a scene move, and can even put a video's motion...

X-Dyna: Animate a Static Portrait with Poses from a Reference Video to Make Photos Dance

X-Dyna is an open source project developed by ByteDance that generates dynamic portrait animations using zero-shot diffusion techniques. It uses the facial expressions and body movements in a driving video to animate a single portrait image, producing realistic, context-aware motion. X-Dyna works by...

Tencent Hunyuan3D: Generate high-resolution 3D assets with multiple 3D asset generation workflows

Tencent Hunyuan3D (Hunyuan3D 2.0) is an advanced large-scale 3D synthesis system from Tencent, designed to generate high-resolution textured 3D assets. The system has two core components: Hunyuan3D-DiT, a large-scale shape generation model, and Hunyuan3D-Paint, a large-scale texture synthesis model. Hunyu...

RAG Web UI: Build an Intelligent Document Q&A System and Easily Set Up a Private Web-Based Knowledge Base

RAG Web UI is an intelligent dialog system based on RAG (Retrieval-Augmented Generation) technology. It helps organizations and individuals build intelligent Q&A systems on top of their own knowledge bases. By combining document retrieval with large language models, RAG Web UI provides accurate and reliable knowledge Q&A services. The system supports...

UI-TARS Desktop: A Desktop Agent Application for Controlling Computers Using Natural Language

UI-TARS Desktop is a GUI agent application based on UI-TARS, a vision-language model developed by ByteDance. The application lets users control their computers through natural language for more intuitive and efficient human-computer interaction. UI-TARS Desktop supports cross-platform operation, both...

Kheish: a multi-role agent that reviews, validates, and formats output to produce high-quality results

Kheish is an open source multi-role agent designed for Large Language Model (LLM) tasks that require structured, step-by-step collaboration. Kheish is more than a simple coordinator; it is an intelligent agent in its own right, requesting modules on demand, integrating user feedback across different...

AI ContentCraft: a versatile AI content creation tool for generating short stories, dialog scripts, voiceovers, and illustrations

AI ContentCraft is a versatile content creation tool that integrates text generation, speech synthesis, image generation, and more. It helps creators quickly generate stories, podcast scripts, and accompanying audio and video content. The tool supports conversion between multiple languages, can process content in batches, and is extremely...
