Comprehensive Introduction mcp-server-qdrant is a Model Context Protocol (MCP) server built on the Qdrant vector search engine. It is mainly used to help AI systems store and retrieve memories, and is especially suited for scenarios that require semantic search. This tool transforms information into vectors by...
General Introduction R1-Omni is an open source project launched on GitHub by the HumanMLLM team. It is the first application of Reinforcement Learning with Verifiable Rewards (RLVR) techniques to a multimodal large language model, focusing on emotion recognition. The project analyzes video and audio data to recognize characters' emotions, such as anger, fast...
Enable Builder Smart Programming Mode, unlimited use of DeepSeek-R1 and DeepSeek-V3, smoother experience than the overseas version. Just enter the Chinese commands, even a novice programmer can write his own apps with zero threshold.
The goal of table recognition is to parse tables in images, accurately identify table structures and cell locations, and reduce them to structured table formats (e.g., HTML). In today's information age, a large amount of important tabular data still exists in an unstructured state (e.g., pictures of information statistics in scanned documents, pd...
General Introduction BlenderMCP is an open source tool that connects Blender to Claude AI via the Model Context Protocol (MCP) protocol. Users can control Blender directly with text commands to quickly create and edit 3D models, scenes and materials. This tool is suitable for 3D...
General Introduction Cloudflare Agents is an open source development framework from Cloudflare designed to help developers build intelligent AI agents on the global edge network. It gives agents the ability to persist state, communicate in real-time, and run autonomously, and the project is currently in active development. Core features include...
General Introduction codemcp is an open source tool designed for Claude Desktop users, developed by Edward Z. Yang on GitHub. It makes Claude Desktop a useful pair programming assistant. Users can talk directly to Claude to implement a local code base in...
General Introduction OpenAI Agents SDK is a lightweight development tool from OpenAI designed for building multi-intelligent body workflows. Based on Python, it is easy to use and supports developers to configure Agents, Handoffs, Guardrails, and other tasks by...
General Introduction AI Toolkit by Ostris is an open source AI toolset focused on supporting Stable Diffusion and FLUX.1 models for training and image generation tasks. Created and maintained by developer Ostris and hosted on GitHub, the toolkit aims to provide researchers and developers with flexible model micro...
Comprehensive Introduction Tencent Turbo S is Tencent's self-developed next-generation fast-thinking model, which has been launched on Tencent Cloud's official website, and will be officially released on February 27, 2025. It is different from traditional slow thinking models (e.g. Deepseek R1, Hybrid T1) in that it can realize "second reply", doubling the speed of spitting, and reducing the delay of the first word...
General Introduction HippoRAG is an open source framework developed by the OSU-NLP group at The Ohio State University, inspired by human long term memory mechanisms. It combines Retrieval Augmented Generation (RAG), Knowledge Graph, and Personalized PageRank techniques to help Large Language Models (LLMs) continuously integrate knowledge from external documents...
General Introduction AgentNetworkProtocol (ANP) is an open source protocol project, hosted on GitHub, focused on providing secure and efficient communication solutions for intelligent agents (AI Agents). It solves agent through a three-layer architecture - identity and encrypted communication layer, meta-protocol layer and application protocol layer...
General Introduction Open-LLM-VTuber is an open source project that allows users to interact with Large Language Models (LLMs) through speech and text, and combines Live2D technology to present dynamic virtual characters. It supports Windows, macOS and Linux, can run completely offline, both web and desktop client models...
Comprehensive Introduction Ovis (Open VISion) is an open source multimodal large language model (MLLM) developed by the AIDC-AI team of Alibaba's International Digital Commerce Group and hosted on GitHub.The model uses an innovative structural embedding alignment technique to efficiently merge visual and textual data, supporting image,...
General Introduction X-R1 is a reinforcement learning framework open-sourced on GitHub by the dhcode-cpp team, aiming to provide developers with a low-cost, efficient tool for training models based on end-to-end reinforcement learning. The project is inspired by DeepSeek-R1 and open-r1 and focuses on building...
Comprehensive Introduction Eino is a Golang-based open source framework launched by the CloudWeGo team, aiming to become the ultimate development tool for large model (LLM) applications. It is designed to be the ultimate development tool for large model (LLM) applications. It draws on the excellent design of open source frameworks such as LangChain and LlamaIndex, and combines the results of cutting-edge research and ByteDance's internal practice with...
General Introduction OpenManus-RL is an open source project jointly developed by UIUC-Ulab and the OpenManus team of the MetaGPT community, hosted on GitHub.The project enhances the reasoning and decision-making capabilities of large language model (LLM) intelligences through reinforcement learning (RL) techniques, based on Deepseek-R1, QwQ-32B ...
General Introduction ANUS (Advanced Neural Understanding System) is an open source AI agent framework hosted on GitHub, generated entirely by user nikmcfly by prompting Manus AI. It aims to provide developers, researchers and AI enthusiasts with a...
Comprehensive Introduction Long-VITA is an open source multimodal macromodel developed by the VITA-MLLM team, focusing on visual and linguistic tasks dealing with very long contexts. It is able to analyze images, videos and texts simultaneously, supports inputs of up to 1 million tokens, and is suitable for video understanding, high-resolution image solving...
General Introduction Meeting Minutes (aka Meetily) is a free and open source AI meeting assistant tool developed by Zackriya Solutions that focuses on capturing meeting audio in real-time, generating transcribed text and automatically extracting meeting summaries. The tool runs entirely on local devices and supports macOS ...