General Introduction NodeTool is an innovative AI authoring platform designed to provide a simple, intuitive interface for AI enthusiasts, developers, data scientists and creatives. Whether you're an artist, developer, or beginner, NodeTool helps you quickly prototype ideas and visualize no...
General Description Porkybank is an open source personal finance management application designed to help users easily track their daily budget. With a simple formula (Income - Expenses) / Days = Cash, users can visualize their financial situation. The project is hosted on GitHub and uses Elixir and P...
Enable Builder Smart Programming Mode, unlimited use of DeepSeek-R1 and DeepSeek-V3, smoother experience than the overseas version. Just enter the Chinese commands, even a novice programmer can write his own apps with zero threshold.
General Introduction CrewAI is an advanced framework designed to orchestrate collaboration between role-playing and autonomous AI agents. By facilitating collaborative intelligence, CrewAI enables agents to work together seamlessly to solve complex tasks. Whether building intelligent assistant platforms, automating customer service teams, or multi-agent research teams, Crew...
General Description Artab is a browser extension designed to showcase the world's greatest works of art every time you open a new tab. The extension is available for Chrome, Edge and Firefox browsers. With Artab, users can enjoy a wide range of classic works of art in their daily browsing, enhancing...
Comprehensive Introduction Leffa is a unified framework for generating controllable character images, enabling precise manipulation of character appearance (e.g., virtual fitting) and pose (e.g., pose transfer). The framework significantly reduces distortion of fine-grained details by directing the target query to focus on the correct reference key in the attention layer, while preserving...
General Introduction MMAudio is an open source project aiming at generating high-quality synchronized audio through joint multimodal training. Developed by Ho Kei Cheng et al. at the Chinese University of Hong Kong, the project's main function is to generate synchronized audio based on video and/or text input.The core innovation of MMAudio is...
General Introduction H2O GPT is an open source project that aims to provide privatized chat and document processing capabilities. The project is based on the Apache 2.0 license , supports a variety of GPT models , including LLaMa2, Mistral, Falcon and so on. Users can use H2O GPT to achieve local documents (such as PDF, E...
General Introduction OpenChat is a user-friendly chatbot console designed to simplify the use of Large Language Models (LLMs). By providing a two-step setup process, OpenChat enables users to easily create and manage multiple custom chatbots. The platform supports GPT-3 and GPT-4 models and...
General Introduction LocalGPT is an open source project designed to allow users to talk to documents on local devices and ensure data privacy. By using various open source models, LocalGPT can process and understand document content without uploading the data to the cloud. The project supports a variety of platforms, including GPU, C...
General Introduction PrivateGPT is an AI project available for production environments that allows users to quiz documents using Large Language Models (LLMs) without an Internet connection. The project ensures data privacy for 100%, and all data is processed in the user's execution environment without disclosure.Priv...
Comprehensive Introduction AutoGPT is a powerful platform designed to help users create, deploy, and manage continuously running AI agents that automate complex workflows. Developed by Significant Gravitas, the platform offers a wide range of tools and features that enable users to focus on important tasks without worrying about technical...
General Introduction DragGAN is an interactive image editing tool based on Generative Adversarial Networks (GAN). The project, presented at SIGGRAPH 2023 by Xingang Pan et al, aims to enable users to intuitively manipulate details in an image through simple point-and-click and drag-and-drop operations.DragGAN combines st...
Comprehensive Introduction Qwen-Agent is an intelligent agent application framework developed based on Qwen 2.0 and above, with capabilities such as command following, tool usage, planning and memorization. The framework provides a variety of sample applications such as browser assistants, code interpreters and custom assistants to help developers quickly construct...
General Introduction Mini-Cover is an open source online cover generation tool designed to generate personalized covers for blogs, short videos and social media platforms. Developed by JLinMr, the tool aims to provide a clean and efficient solution to help users quickly generate covers that meet their needs.Mini-Cove...
General Introduction MarkItDown is a Python tool developed by Microsoft designed to convert various files and office documents to Markdown format. The tool supports a wide range of file types, including PDF, PowerPoint, Word, Excel, images (EXIF metadata and OCR), audio (EXIF metadata and language...
General Introduction Claude Engineer is an interactive command line interface (CLI) developed by Doriandarko that utilizes Anthropic's Claude-3.5-Sonnet model to assist in software development tasks. The framework allows Claude to generate and manage its own tools, continuously extending its capabilities through dialog...
Comprehensive Introduction Swarms is an enterprise-grade production-ready multi-agent orchestration framework designed to boost business productivity through efficient agent management and task processing. With support for multiple models, multiple memory systems and custom agent creation, the framework provides a modular design and comprehensive logging capabilities to ensure system...
General Introduction Sonic is an innovative platform focused on global audio perception, designed to generate vivid portrait animations driven by audio. Developed by a team of researchers from Tencent and Zhejiang University, the platform utilizes audio information to control facial expressions and head movements to generate natural and smooth animated videos.Sonic ...
Comprehensive Introduction Ultravox is an innovative multimodal Large Language Model (LLM) designed for real-time speech processing. Unlike traditional speech recognition systems, Ultravox eliminates the need for a separate Audio Speech Recognition (ASR) stage, and is able to directly convert audio to text in high-dimensional space. This feature makes...