Skip to content

Extract (Text + OCR) from PDF

Description

This activity extracts text or data from PDF files using OCR (Optical Character Recognition) and converts it into a structured format.

Input

PDF Files

Output

Extracted data or text

Configuration Fields

  • Start page Specifies the starting page number from which the PDF extraction process will begin.
  • End page Specifies the ending page number at which the PDF extraction process will stop.

Sample Input

Not Applicable

Sample Configuration

alt text

Sample Output

Extracted text or structured data from the specified pages of the PDF document.