AI open source project

1020 articles in total
Agent S: An Open Source Agent Framework That Operates Computers Like a Human

General Introduction: Agent S is an open-source framework developed by Simular AI that lets agents operate computers like humans through a graphical user interface (GUI). It uses a multimodal large language model and experience-based learning techniques to accomplish tasks such as browsing the web, editing documents, using software...
4mos ago
VideoMind: An Open Source Project for Timestamp-Based Video Content Localization and Q&A

General Introduction: VideoMind is an open source multimodal AI tool focused on reasoning, Q&A, and summary generation for long videos. It was developed by Ye Liu of the Hong Kong Polytechnic University and a team from Show Lab at the National University of Singapore. The tool mimics human understanding of video...
2mos ago
SegAnyMo: An Open Source Tool for Automatically Segmenting Arbitrary Moving Objects in Video

General Introduction: SegAnyMo is an open source project developed by a team of researchers at UC Berkeley and Peking University, including Nan Huang. The tool focuses on video processing and can automatically recognize and segment arbitrary moving objects in a video, such as people, animals or...
4mos ago
GenXD: An Open Source Framework for Generating Videos of Arbitrary 3D and 4D Scenes

General Introduction: GenXD is an open source project developed by a team from the National University of Singapore (NUS) and Microsoft. It focuses on generating arbitrary 3D and 4D scenes, addressing the difficulties that insufficient data and complex model design pose for real-world 3D and 4D generation. The project was developed by ...
4mos ago
MegaTTS3: A Lightweight Model for Synthesizing Chinese and English Speech

Comprehensive Introduction: MegaTTS3 is an open source speech synthesis tool developed by ByteDance in cooperation with Zhejiang University, focusing on generating high-quality Chinese and English speech. Its core model has only 0.45B parameters, making it lightweight and efficient, and it supports mixed Chinese-English speech generation and voice cloning. The project is hosted on ...
4mos ago
AgentIQ: An Open Source Tool for Flexibly Connecting and Managing AI Agents

General Introduction: AgentIQ is an open source tool from NVIDIA designed to help developers efficiently connect and manage AI agents. It enables agents from different frameworks to collaborate seamlessly, connect to enterprise data and tools, and build workflows the way one calls functions. The tool's biggest...
4mos ago
TripoSF: A Practical Tool for Quickly Generating High-Resolution 3D Models

Comprehensive Introduction: TripoSF is an open source project built by the VAST-AI-Research team, designed to quickly generate high-resolution 3D models from a single image. It uses a technique called SparseFlex, which is computationally efficient and is able to generate high-resolution 3D models from a single image in a general...
4mos ago
Rankify: A Python Toolkit for Information Retrieval and Reranking

General Introduction: Rankify is an open source Python toolkit developed by the Data Science Group at the University of Innsbruck, Austria. It focuses on information retrieval, reranking, and retrieval-augmented generation (RAG), providing a unified framework for all three. The toolkit comes with a built-in set of 40 pre-retrieved benchmarks...
5mos ago
Agent TARS: An Open Source Agent That Uses Vision and Commands to Operate Computers

Comprehensive Introduction: Agent TARS is a multimodal AI agent open-sourced by ByteDance. Its core feature is visually understanding web content and combining this with command-line and file-system operations to help users complete complex computer tasks. Instead of requiring manual operations like traditional tools, it can self...
5mos ago
Qlib: An AI Quantitative Investment Research Tool Developed by Microsoft

Comprehensive Introduction: Qlib is an open source platform developed by Microsoft that focuses on using AI technology to help users research quantitative investments. It starts from basic data processing and helps users explore investment ideas and turn them into usable strategies. The platform is simple and easy to use, and is suitable for those who want to use machine learning to improve their investment research...
5mos ago
SmartRead: Automatically Annotate Technical PDF Documents and Provide Relevant Citation Sources

Comprehensive Introduction: SmartRead is an AI-based open source tool designed for technical documents. It can automatically analyze PDF files and mark key content, such as important terms, titles, or core ideas, to help users quickly understand complex documents. At the same time, it can also provide with the main document...
5mos ago
LangManus: An Open Source AI Automation Framework Supporting Multi-Agent Collaboration

General Introduction: LangManus is an open source AI automation framework hosted on GitHub. Developed by a group of former colleagues in their spare time, it is an academically driven project with the goal of combining language models and specialized tools to accomplish web search, data crawling, and code execution...
5mos ago
TxAgent: An AI Tool That Helps Doctors Analyze Drug Effects and Treatment Options

Comprehensive Introduction: TxAgent is an open-source AI tool developed by Harvard University's Medical and Scientific Artificial Intelligence Team (MIMS) to help physicians analyze drug interactions and develop personalized treatment plans. It combines patient-specific circumstances with multi-step reasoning and real-time retrieval of biomedical knowledge...
5mos ago