Difference between revisions of "OCR"

From OnnoWiki
Jump to navigation Jump to search
 
(4 intermediate revisions by the same user not shown)
Line 3: Line 3:
 
These OCR software are especially useful for converting and preserving old documents as they can be used to identify text and create digital copies. Sometimes the identified text may not be 100% accurate but OCR software removes the need for manual edits to a great extent by extracting as much text as possible. Manual edits can be made later to improve accuracy further and create one-to-one replicas. Most OCR software can extract text into separate files, though some also support superimposing a hidden text layer on original files. Superimposed text allows you to read content in original print and format but also allows you to select and copy text. This technique is specially used to digitize old documents into PDF format.
 
These OCR software are especially useful for converting and preserving old documents as they can be used to identify text and create digital copies. Sometimes the identified text may not be 100% accurate but OCR software removes the need for manual edits to a great extent by extracting as much text as possible. Manual edits can be made later to improve accuracy further and create one-to-one replicas. Most OCR software can extract text into separate files, though some also support superimposing a hidden text layer on original files. Superimposed text allows you to read content in original print and format but also allows you to select and copy text. This technique is specially used to digitize old documents into PDF format.
  
 +
==Referensi==
 +
 +
* https://linuxhint.com/ocr-apps-linux/
  
 
==Pranala Menarik==
 
==Pranala Menarik==
  
 +
* [[OCR: OCRMyPDF]]
 
* [[OCR: Tesseract OCR]]
 
* [[OCR: Tesseract OCR]]
 
* [[OCR: gImageReader]]
 
* [[OCR: gImageReader]]
 +
* [[OCR: Paperwork]]
 +
* [[OCR: OCRFeeder]]
 +
* [[OCR: gscan2pdf]]

Latest revision as of 09:45, 27 May 2024

This article will cover a list of useful “Optical Character Recognition” software available for Linux. An optical character recognition (OCR) software attempts to detect text content of non-text files whose content cannot be selected or copied but can be viewed or read. For instance, an OCR software can identify text from images, PDF or other scanned documents in digital file formats using various algorithms and AI based solutions.

These OCR software are especially useful for converting and preserving old documents as they can be used to identify text and create digital copies. Sometimes the identified text may not be 100% accurate but OCR software removes the need for manual edits to a great extent by extracting as much text as possible. Manual edits can be made later to improve accuracy further and create one-to-one replicas. Most OCR software can extract text into separate files, though some also support superimposing a hidden text layer on original files. Superimposed text allows you to read content in original print and format but also allows you to select and copy text. This technique is specially used to digitize old documents into PDF format.

Referensi

Pranala Menarik