🚀 Invitation to Experience: China's First AI IDE Intelligent Programming Software Trae Chinese version downloadThe DeepSeek-R1 and Doubao-pro are available for unlimited use!

Total 27 articles

Tags: OCR Page 2

Surya: professional multilingual document OCR tool, open source native deployment

Comprehensive Introduction Surya is an open source OCR toolkit for multilingual documents that supports text recognition in more than 90 languages. It is capable of not only line-by-line text detection, but also layout analysis, reading order detection and table recognition.Surya's performance is comparable to cloud services for a wide range of document types, including p...

2024-10-14AI tools AI open source project OCR

MinerU：PDF文档提取转换为多模态Markdown格式，支持电子书OCR扫描-首席AI分享圈

MinerU: PDF document extraction and conversion to multimodal Markdown format, support e-book OCR scanning

Comprehensive Introduction MinerU is an open source data extraction tool developed by the OpenDataLab team at the Shanghai Artificial Intelligence Laboratory, focusing on efficiently extracting content from complex PDF documents, web pages, and eBooks. It can convert multimodal PDF documents containing images, formulas, tables and other elements into easy-to-analyze M...

Trae Chinese Version First Invitation to Download: Unlimited use of DeepSeek-R1 after registration!

Enable Builder Smart Programming Mode, unlimited use of DeepSeek-R1 and DeepSeek-V3, smoother experience than the overseas version. Just enter the Chinese commands, even a novice programmer can write his own apps with zero threshold.

2025-04-26

PixPin: long and dynamic screenshots, built-in native text recognition (OCR)

General Description PixPin is a powerful screenshot and posting tool designed to enhance users' productivity. Whether for daily office or professional needs, PixPin provides convenient screenshot, paste, long screenshot, text recognition (OCR) and dynamic screenshot functions. Its simple interface and rich features make...

2024-09-23AI tools OCR

GOT-OCR2.0：基于 QWen2 0.5B 端到端的多模态OCR模型-首席AI分享圈

GOT-OCR2.0: end-to-end multimodal OCR model based on QWen2 0.5B

Comprehensive Introduction GOT-OCR2.0 is a StepStar co-proposed de Open Source Optical Character Recognition (OCR) model, which aims to drive OCR technology towards OCR-2.0 through a unified end-to-end model. The model supports a wide range of OCR tasks, including normal text recognition, formatted text recognition, fine-grained OCR, multi...

2024-09-15AI tools AI open source project OCR

PaddleOCR: A multi-language OCR tool library based on Flying Paddle, supporting recognition of more than 80 languages

General Introduction PaddleOCR is a multilingual OCR toolkit based on PaddlePaddle, designed to provide a practical and ultra-lightweight OCR system. It supports the recognition of over 80 languages and provides data annotation and synthesis tools to support on servers, mobile devices, embedded and IoT devices...

2024-09-09AI tools AI open source project OCR

Pix2Text: open source free image text recognition tool

Pix2Text General Description Pix2Text (P2T) is an open source, free tool designed to replace Mathpix, providing image text and mathematical formula recognition. Users can use the tool for free via the web version, recognizing up to 10,000 characters per day.P2T supports recognizing text in pictures, tables,...

2024-09-01AI tools OCR

Umi-OCR: open source offline OCR software, batch image recognition and PDF recognition

Umi-OCR General Description Umi-OCR is an open source, free offline OCR software that supports screenshot, batch image import, PDF document recognition, exclude watermarks and headers and footers, scanning and generating QR codes. The software has a built-in multi-language library for Windows and Linux.Umi-OCR requires no installation, un...

2024-09-01AI tools OCR

TTime: Picture Your Text Recognition and Text Translation Software

TTime General Introduction TTime, a project published on GitHub by InkTimeRecord, is a simple and efficient translation software. It mainly provides input, screenshot, stroke and hoverball translation functions, supports multiple translation sources and text recognition services, allowing users to quickly convert languages and text...

2024-08-29AI tools AI translation OCR

preceding page
1
2
Total 2 pages

Tags: OCR Page 2

Surya: professional multilingual document OCR tool, open source native deployment

MinerU: PDF document extraction and conversion to multimodal Markdown format, support e-book OCR scanning

Trae Chinese Version First Invitation to Download: Unlimited use of DeepSeek-R1 after registration!

PixPin: long and dynamic screenshots, built-in native text recognition (OCR)

GOT-OCR2.0: end-to-end multimodal OCR model based on QWen2 0.5B

PaddleOCR: A multi-language OCR tool library based on Flying Paddle, supporting recognition of more than 80 languages

Pix2Text: open source free image text recognition tool

Umi-OCR: open source offline OCR software, batch image recognition and PDF recognition

TTime: Picture Your Text Recognition and Text Translation Software

Can't find AI tools? Try here!

FLUX.1 image generator (supports Chinese input)

Recent AI Hotspots

AI Tools Recommendations

AI Tools Classification