OCRmyPDF-AppleOCR
Effortlessly integrate OCR with Apple’s Vision Framework.
Pitch

OCRmyPDF-AppleOCR is a plugin that leverages Apple’s Vision Framework for efficient optical character recognition. With support for accurate and fast text detection, this tool enhances the OCRmyPDF experience on macOS, making document processing seamless and reliable.

Description

OCRmyPDF-AppleOCR is a plugin that extends OCRmyPDF with optical character recognition (OCR) based on Apple’s Vision Framework on macOS. It uses Apple’s proprietary OCR implementation, which the project describes as highly accurate and fast, outperforming many other on-device OCR engines, such as Tesseract.

Key Features

  • Integration with Apple Vision Framework: Utilize Apple’s powerful text detection features to achieve high-quality OCR results.
  • Multiple Recognition Modes: Choose from different recognition modes:
    • fast: Suitable for quick scans.
    • accurate: For precise character recognition.
    • livetext: Specially designed for vertical text layouts, particularly useful for East Asian languages.
  • Flexible Language Support: Supports a wide range of languages, ensuring that users can efficiently process documents in multiple linguistic contexts.

Usage

To activate this plugin within OCRmyPDF, pass it with the --plugin option. Specify the desired languages for OCR processing with the -l option. Automatic language detection can be enabled with the --language und option.

ocrmypdf -l jpn --plugin ocrmypdf_appleocr input.pdf output.pdf
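
The same invocation can be assembled from a script. The following is a minimal sketch, not part of the plugin: the helper name build_ocr_command is hypothetical, and it only uses the -l and --plugin options shown above. The actual call is left commented out because it needs macOS with ocrmypdf and the plugin installed.

```python
import shlex
import subprocess
from typing import List


def build_ocr_command(input_pdf: str, output_pdf: str, language: str = "und") -> List[str]:
    """Assemble an ocrmypdf argument list that loads the AppleOCR plugin.

    Hypothetical helper for illustration. "und" is the value the text above
    gives for automatic language detection.
    """
    return [
        "ocrmypdf",
        "-l", language,
        "--plugin", "ocrmypdf_appleocr",
        input_pdf,
        output_pdf,
    ]


if __name__ == "__main__":
    cmd = build_ocr_command("input.pdf", "output.pdf", language="jpn")
    # Show the command as it would be typed in a shell.
    print(shlex.join(cmd))
    # Uncomment to run it (requires macOS, ocrmypdf, and the plugin):
    # subprocess.run(cmd, check=True)
```

Passing the arguments as a list avoids shell quoting problems with file names that contain spaces.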

Options

The plugin provides several customization options:

  • --appleocr-recognition-mode: Select between fast, accurate, or livetext modes (defaults to livetext on macOS 13 and later).
  • --appleocr-disable-correction: Disable language correction in Apple Vision OCR (default: False).
  • --pdf-renderer: Choose between hocr and sandwich for how OCR results are embedded in the PDF.
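
As an illustration of how these options combine, here is a small sketch that turns them into command-line arguments. The helper appleocr_options is hypothetical and not part of the plugin. It assumes --appleocr-disable-correction is a bare switch (suggested by its False default) and checks only the values named in the list above.

```python
from typing import List, Optional

# Values taken from the option descriptions above.
RECOGNITION_MODES = {"fast", "accurate", "livetext"}
PDF_RENDERERS = {"hocr", "sandwich"}


def appleocr_options(
    mode: Optional[str] = None,
    disable_correction: bool = False,
    pdf_renderer: Optional[str] = None,
) -> List[str]:
    """Return extra ocrmypdf arguments; unset options are left to their defaults."""
    args: List[str] = []
    if mode is not None:
        if mode not in RECOGNITION_MODES:
            raise ValueError(f"unknown recognition mode: {mode!r}")
        args += ["--appleocr-recognition-mode", mode]
    if disable_correction:
        # Assumption: the option is a flag that takes no value.
        args.append("--appleocr-disable-correction")
    if pdf_renderer is not None:
        if pdf_renderer not in PDF_RENDERERS:
            raise ValueError(f"unknown PDF renderer: {pdf_renderer!r}")
        args += ["--pdf-renderer", pdf_renderer]
    return args


if __name__ == "__main__":
    extra = appleocr_options(mode="accurate", pdf_renderer="sandwich")
    print(["ocrmypdf", "--plugin", "ocrmypdf_appleocr", *extra, "input.pdf", "output.pdf"])
```

Rejecting unknown values before launching ocrmypdf surfaces typos early, instead of partway through a batch of documents.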

Supported Languages

This plugin enables OCR across various languages, including:

Language Code | Language Name | Fast Mode | Accurate Mode | LiveText
eng           | English       |           |               |
fra           | French        |           |               |
jpn           | Japanese      |           |               |
...           | ...           | ...       | ...           | ...

The plugin provides flexibility to users, ensuring that it fits seamlessly into workflows that require reliable OCR processing. Suitable for developers and technical users, OCRmyPDF-AppleOCR effectively harnesses the capabilities of macOS to deliver outstanding OCR performance.
