General Introduction Ruyi-Models is an open source project designed to generate high quality videos from images. Developed by the IamCreateAI team, the project supports the generation of cinematic video at 768 resolution, 24 frames per second, totaling 120 frames in 5 seconds.Ruyi-Models supports lens control and motion amplitude control ...
General Introduction Robo Blogger is an innovative blog creation tool designed to simplify the content generation process through speech-to-text technology. Users can record ideas through any speech-to-text application and Robo Blogger transforms those ideas into structured blog content. The tool utilizes LangChain ...
Enable Builder Smart Programming Mode, unlimited use of DeepSeek-R1 and DeepSeek-V3, smoother experience than the overseas version. Just enter the Chinese commands, even a novice programmer can write his own apps with zero threshold.
General Introduction Genesis is a generative physics world designed for general purpose robotics and embodied AI learning. It provides a unified simulation platform that supports the simulation of a wide range of materials and physical phenomena.Genesis aims to unlock an infinite variety of data by combining generative AI and physics simulation to help machine...
Comprehensive Introduction Kolors is a large-scale text-to-image generation model developed by the Racer team, based on potential diffusion techniques. The model is trained on billions of text-image data pairs, and is capable of generating high-quality, complex semantically accurate images with support for both Chinese and English inputs.Kolors is well known for its visual quality, complex semantic accuracy...
Comprehensive Introduction ColorFlow is an image sequence auto-coloring tool developed by Tencent's ARC team to solve the problem of auto-coloring black and white image sequences. The tool utilizes a retrieval-enhanced coloring pipeline to accurately generate the colors of various elements, including the character's hair color and clothing, from a pool of reference images, ensuring that the color...
Comprehensive Introduction BrushEdit is an all-in-one image repair and editing tool developed by Tencent ARC Labs. The tool is based on the latest AI technology and can automatically identify and repair defects in images, while supporting interactive editing by users.BrushEdit combines a variety of advanced image processing algorithms to raise...
Comprehensive Introduction Outlines is an open source library developed by dottxt-ai to enhance the application of Large Language Models (LLMs) through structured text generation. The library supports a wide range of model integrations, including OpenAI, transformers, llama.cpp, etc. It provides simple but powerful cue primitives,...
General Introduction RapBank is a dataset and toolset designed for rap lyrics generation. The project was created by NZqian to provide researchers and developers with a high-quality rap lyrics dataset by collecting and processing rap songs from YouTube.RapBank contains over 9 ...
Comprehensive Introduction R2R (RAG to Riches) is a state-of-the-art AI retrieval system supporting Retrieval Augmented Generation (RAG) functionality with production-ready features. Built on a containerized RESTful API, the system provides multimodal content parsing, hybrid search capabilities, configurable GraphRAG, and comprehensive...
Comprehensive Introduction Infini-Megrez is an edge intelligence solution developed by the unquestioned core dome (Infinigence AI), aiming to achieve efficient multimodal understanding and analysis through hardware and software co-design. The core of the project is the Megrez-3B model, which supports integrated image, text and audio understanding with high accuracy...
General Introduction GenEx is an advanced AI model capable of generating a fully explorable 360° 3D world from a single image. Users can interactively explore this generated world.GenEx pushes the boundaries of figurative AI in imaginative spaces and has the potential to extend these capabilities to present...
Comprehensive Introduction RAGFlow is an open source Retrieval Augmented Generation (RAG) engine based on deep document understanding technology. It provides an efficient RAG workflow for organizations of all sizes, incorporating a large-scale language model (LLM) capable of delivering real-world question-and-answer capabilities based on data in complex formats.RAGFlow...
General Introduction NodeTool is an innovative AI authoring platform designed to provide a simple, intuitive interface for AI enthusiasts, developers, data scientists and creatives. Whether you're an artist, developer, or beginner, NodeTool helps you quickly prototype ideas and visualize no...
General Description Porkybank is an open source personal finance management application designed to help users easily track their daily budget. With a simple formula (Income - Expenses) / Days = Cash, users can visualize their financial situation. The project is hosted on GitHub and uses Elixir and P...
General Introduction CrewAI is an advanced framework designed to orchestrate collaboration between role-playing and autonomous AI agents. By facilitating collaborative intelligence, CrewAI enables agents to work together seamlessly to solve complex tasks. Whether building intelligent assistant platforms, automating customer service teams, or multi-agent research teams, Crew...
General Description Artab is a browser extension designed to showcase the world's greatest works of art every time you open a new tab. The extension is available for Chrome, Edge and Firefox browsers. With Artab, users can enjoy a wide range of classic works of art in their daily browsing, enhancing...
Comprehensive Introduction Leffa is a unified framework for generating controllable character images, enabling precise manipulation of character appearance (e.g., virtual fitting) and pose (e.g., pose transfer). The framework significantly reduces distortion of fine-grained details by directing the target query to focus on the correct reference key in the attention layer, while preserving...
General Introduction MMAudio is an open source project aiming at generating high-quality synchronized audio through joint multimodal training. Developed by Ho Kei Cheng et al. at the Chinese University of Hong Kong, the project's main function is to generate synchronized audio based on video and/or text input.The core innovation of MMAudio is...
General Introduction H2O GPT is an open source project that aims to provide privatized chat and document processing capabilities. The project is based on the Apache 2.0 license , supports a variety of GPT models , including LLaMa2, Mistral, Falcon and so on. Users can use H2O GPT to achieve local documents (such as PDF, E...