OCR: gscan2pdf

From OnnoWiki
Jump to navigation Jump to search

gscan2pdf gscan2pdf is a free and open source graphical utility that can identify and extract text from a variety of file formats. It can directly work with scanners to scan papers and then export OCR detected text content into PDF files. It also supports multiple OCR engines including Tesseract OCR, GOCR, Ocropus and Cuneiform, as long as packages for these engines are installed on your system. Other than direct scanning of papers, you can also import image files and extract text from them.



Digital Transformation Indonesia Conference & Expo 2023 Digital Transformation Indonesia Conference and Expo (DTICX 2023) akan menghadirkan para pengambil keputusan, ahli teknologi, dan para profesional dari 10 sektor industri penting di Indonesia (Pemerintahan, Jasa, Keuangan, Kesehatan, Telekomunikasi, Infrastruktur, Manufaktur, Transportasi &... SPONSORED BY DIGITAL TRANSFORMATION... REGISTER To install gscan2pdf in Ubuntu, use the command specified below:

$ sudo apt install gscan2pdf gocr cuneiform tesseract-ocr

You can install it in other Linux distributions from default repositories through the package manager. Source code and executable binaries are also available here.


Pranala Menarik