Comprehensive Introduction Surya is an open source OCR toolkit for multilingual documents that supports text recognition in more than 90 languages. It is capable of not only line-by-line text detection, but also layout analysis, reading order detection and table recognition.Surya's performance is comparable to cloud services for a wide range of document types, including p...
Comprehensive Introduction MinerU is an open source data extraction tool developed by the OpenDataLab team at the Shanghai Artificial Intelligence Laboratory, focusing on efficiently extracting content from complex PDF documents, web pages, and eBooks. It can convert multimodal PDF documents containing images, formulas, tables and other elements into easy-to-analyze M...
Enable Builder Smart Programming Mode, unlimited use of DeepSeek-R1 and DeepSeek-V3, smoother experience than the overseas version. Just enter the Chinese commands, even a novice programmer can write his own apps with zero threshold.
General Description PixPin is a powerful screenshot and posting tool designed to enhance users' productivity. Whether for daily office or professional needs, PixPin provides convenient screenshot, paste, long screenshot, text recognition (OCR) and dynamic screenshot functions. Its simple interface and rich features make...
Comprehensive Introduction GOT-OCR2.0 is a StepStar co-proposed de Open Source Optical Character Recognition (OCR) model, which aims to drive OCR technology towards OCR-2.0 through a unified end-to-end model. The model supports a wide range of OCR tasks, including normal text recognition, formatted text recognition, fine-grained OCR, multi...
General Introduction PaddleOCR is a multilingual OCR toolkit based on PaddlePaddle, designed to provide a practical and ultra-lightweight OCR system. It supports the recognition of over 80 languages and provides data annotation and synthesis tools to support on servers, mobile devices, embedded and IoT devices...
Pix2Text General Description Pix2Text (P2T) is an open source, free tool designed to replace Mathpix, providing image text and mathematical formula recognition. Users can use the tool for free via the web version, recognizing up to 10,000 characters per day.P2T supports recognizing text in pictures, tables,...
Umi-OCR General Description Umi-OCR is an open source, free offline OCR software that supports screenshot, batch image import, PDF document recognition, exclude watermarks and headers and footers, scanning and generating QR codes. The software has a built-in multi-language library for Windows and Linux.Umi-OCR requires no installation, un...
TTime General Introduction TTime, a project published on GitHub by InkTimeRecord, is a simple and efficient translation software. It mainly provides input, screenshot, stroke and hoverball translation functions, supports multiple translation sources and text recognition services, allowing users to quickly convert languages and text...