Comprehensive Introduction olmOCR is an open source tool, developed by the AllenNLP team at the Allen Institute for Artificial Intelligence (AI2), that converts PDF files to linearized text and is especially suited to preparing datasets for training large language models (LLMs). It ...
General Introduction Coding-Tutor is an open source project hosted on GitHub and created by developer iwangjian to give learners personalized programming instruction. It uses conversational AI to dynamically adjust teaching content based on the user's background knowledge and learning progress, helping...
General Introduction par_scrape is a Python-based open source web scraping tool, published on GitHub by developer Paul Robello, designed to help users intelligently extract data from web pages. It integrates two browser automation technologies, Selenium and Playwright, and combines...
General Introduction Flock is an open source, low-code workflow platform, hosted on GitHub and developed by the Onelevenvy team. Built on LangChain and LangGraph, it focuses on helping users quickly build chatbots and retrieval-augmented generation (RAG) applications, and orchestrate multi-agent groups...
General Introduction TableGPT Agent is an intelligent tool, available as an open source project on GitHub, designed for processing and analyzing tabular data. It relies on the TableGPT2 large language model and uses natural language interaction to let users easily query, manipulate, and understand complex table content. Whether from ...
General Introduction TRV is an open source tool, hosted on GitHub, designed to help users quickly convert slides and lecture notes into narrated videos. It automatically generates audio and video content from input presentation files through simple command line operations, suitable for those who need to quickly create presentation videos for teaching...
General Introduction gibberlink is an open source project on GitHub by developer PennyroyalTea, focused on optimizing communication between two conversational AI agents. When two AI agents talk on the phone and recognize each other as AI, they switch from human language (English) to...
Comprehensive Introduction LazyLLM is an open source tool developed by the LazyAGI team, focused on simplifying the development of multi-agent large model applications. Through one-click deployment and a lightweight gateway mechanism, it helps developers quickly build complex AI applications and save time on tedious engineering configuration. Whether you are a beginner...
General Introduction DeepSeek-RAG-Chatbot is an open source chatbot project built on the DeepSeek R1 model, hosted on GitHub and created by developer SaiAkhil066. It combines Retrieval Augmented Generation (RAG) technology with support for users to upload documents (e.g. PDF, DOCX or TXT ...
Comprehensive Introduction MagicArticulate is an AI framework developed by ByteDance in collaboration with Nanyang Technological University (NTU), focused on rapidly transforming static 3D models into animation-ready digital assets. It automatically generates skeletal structures and skinning weights for models through an autoregressive Transformer and functional diffusion modeling...
General Introduction AingDesk is free, open source software designed to help users easily deploy and run various AI models on their local computers. Whether the model is DeepSeek or Llama, AingDesk enables one-click deployment in a few simple steps. The software supports Windows, Linux...
General Introduction CapsWriter-Offline is a voice input and subtitle transcription tool for PC, hosted on GitHub and built by developer HaujetZhao. It runs completely offline, requiring no internet connection for speech-to-text input or for transcribing audio/video files into subtitles, and supports unlimited hours of recording...
Comprehensive Introduction PDF-Extract-Kit is an open source project developed by the OpenDataLab team, focused on efficiently extracting high-quality content from complex and diverse PDF documents. It integrates advanced document parsing technology, supporting layout detection, formula recognition, table extraction, and OCR functions for ...
General Introduction FlashMLA is an efficient MLA (Multi-head Latent Attention) decoding kernel developed by DeepSeek AI, optimized for NVIDIA Hopper Architecture GPUs, and designed to improve the performance of variable-length sequence processing. The project is open-sourced on GitHub, providing developers with free...
Comprehensive Introduction TPO-LLM-WebUI is a project open-sourced by Airmomo on GitHub that enables real-time optimization of large language models (LLMs) through an intuitive web interface. It uses the TPO (Test-Time Prompt Optimization) framework to do away entirely with the tedious process of traditional fine-tuning ...
Comprehensive Introduction Neural4D is an AI-based platform focused on helping users quickly generate high-quality 3D models and animations from simple text or image input. Developed by DreamTech, it relies on what it describes as a world-leading end-to-end large model for 3D generation; users simply provide a description...
Comprehensive Introduction InternLM-XComposer is an open source vision-language multimodal large model project developed by the InternLM team and hosted on GitHub. Based on the InternLM language model, it can handle text, image, video, and other multimodal data, and is widely used in text-image creation, image understanding, and video sub...
General Introduction Make Sense is a free online image annotation tool designed to help users quickly prepare datasets for computer vision projects. It requires no complicated installation: just open it in a browser. It works across multiple operating systems and is well suited to small deep learning projects. Users can use it to...
General Introduction TreeGPT is an open source chat application based on Next.js, focusing on visualizing conversations with large language models (LLMs, e.g., GPT) through tree graph structures (directed acyclic graphs, DAGs), replacing the traditional linear chatting approach to improve speed and ease of use. The project is hosted on http...