AI Personal Learning
and practical guidance
Beanbag Marscode1
Total 908 articles

Tags: ai open source projects

VideoMind:视频按时间戳定位内容与问答的开源项目-首席AI分享圈

VideoMind: video by timestamp positioning content and Q&A open source project

General Introduction VideoMind is an open source multimodal AI tool focused on inference, Q&A and summary generation for long videos. It was developed by Ye Liu of the Hong Kong Polytechnic University and a team from Show Lab at the National University of Singapore. The tool mimics the way humans understand video by splitting tasks into planning,...

SegAnyMo:从视频中自动分割任意运动物体的开源工具-首席AI分享圈

SegAnyMo: open source tool to automatically segment arbitrary moving objects from video

General Introduction SegAnyMo is an open source project developed by a team of researchers at UC Berkeley and Peking University, including members such as Nan Huang. This tool focuses on video processing and can automatically recognize and segment arbitrary moving objects in a video, such as people, animals or vehicles. It combines TAP...

GenXD:生成任意3D和4D场景视频的开源框架-首席AI分享圈

GenXD: open source framework for generating videos of arbitrary 3D and 4D scenes

General Introduction GenXD is an open source project, developed by the National University of Singapore (NUS) and Microsoft team. It focuses on generating arbitrary 3D and 4D scenes, to solve the real-world 3D and 4D generation due to insufficient data and model design complexity brought about by the problem. The project analyzes the camera and object motion, kn...

ChatAnyone:从照片生成半身数字人肖像视频的工具-首席AI分享圈

ChatAnyone: a tool for generating half-body digital human portrait videos from photos

General Introduction ChatAnyone is an innovative project developed by the HumanAIGC team. It utilizes artificial intelligence techniques to generate digital human portrait videos with upper body movements from a single photo and audio input. The project is based on a hierarchical motion diffusion model that generates head movements, gestures and expressions for...

II-Researcher:深度搜索与分步推理解答复杂问题-首席AI分享圈

II-Researcher: Deep Search and Stepwise Reasoning to Answer Complex Questions

General Introduction II-Researcher is an open source artificial intelligence research tool developed by the Intelligent-Internet team and hosted on GitHub.It is designed for deep search and complex reasoning, and is capable of answering complex questions through intelligent web searches and multi-step analysis. The project was launched on March 27, 2025...

Cua:让AI代理在macOS/Linux沙盒中安全执行应用-首席AI分享圈

Cua: Enabling AI agents to securely execute applications in macOS/Linux sandboxes

General Introduction Cua is an open source project called Computer-Use Agent (pronounced "koo-ah"), designed for Apple Silicon devices to create and run high-performance macOS and Linux virtual machines at speeds approaching 90% for native devices. It is designed for Apple Silicon devices , can create and run high-performance macOS and Linux virtual machines , speed close to the native device 90%. Cua uses Ap...

Paper to Podcast:把学术论文转换为多人对话播客-首席AI分享圈

Paper to Podcast: Converting Academic Papers to Multi-Person Conversation Podcasts

General Introduction Paper to Podcast is an open source tool that specializes in transforming academic research papers into lively and entertaining podcasts. It makes complex academic content easy to understand by using artificial intelligence technology to turn a PDF-formatted paper into a conversation between three characters - the host, the learner, and the expert. This ...

en_USEnglish